-
Deep Learning-based forgery detection and localization for compressed images using a hybrid optimization model Multimedia Syst. (IF 3.9) Pub Date : 2024-04-21 Arundhati Bhowal, Sarmistha Neogy, Ruchira Naskar
-
PathNet: a novel multi-pathway convolutional neural network for few-shot image classification from scratch Multimedia Syst. (IF 3.9) Pub Date : 2024-04-20 Zhonghua Fan, Dongbai Sun, Hongying Yu, Weidong Zhang
-
A visual analysis approach for data transformation via domain knowledge and intelligent models Multimedia Syst. (IF 3.9) Pub Date : 2024-04-20 Haiyang Zhu, Jun Yin, Chengcan Chu, Minfeng Zhu, Yating Wei, Jiacheng Pan, Dongming Han, Xuwei Tan, Wei Chen
-
Linking unknown characters via oracle bone inscriptions retrieval Multimedia Syst. (IF 3.9) Pub Date : 2024-04-15 Feng Gao, Xu Chen, Bang Li, Yongge Liu, Runhua Jiang, Yahong Han
-
A review of deep learning algorithms for modeling drug interactions Multimedia Syst. (IF 3.9) Pub Date : 2024-04-14 Aga Basit Iqbal, Idris Afzal Shah, Injila, Assif Assad, Mushtaq Ahmed, Syed Zubair Shah
-
Link prediction in social networks using hyper-motif representation on hypergraph Multimedia Syst. (IF 3.9) Pub Date : 2024-04-12 ChunYan Meng, Hooman Motevalli
-
Students and teachers learning together: a robust training strategy for neural network pruning Multimedia Syst. (IF 3.9) Pub Date : 2024-04-12 Liyan Xiong, Qingsen Chen, Jiawen Huang, Xiaohui Huang, Peng Huang, Shangfeng Wei
-
Synchronous composition and semantic line detection based on cross-attention Multimedia Syst. (IF 3.9) Pub Date : 2024-04-09 Qinggang Hou, Yongzhen Ke, Kai Wang, Fan Qin, Yaoting Wang
-
Unbinding tensor product representations for image captioning with semantic alignment and complementation Multimedia Syst. (IF 3.9) Pub Date : 2024-04-08 Bicheng Wu, Yan Wo
-
RefinerHash: a new hashing-based re-ranking technique for image retrieval Multimedia Syst. (IF 3.9) Pub Date : 2024-04-08 Farzad Sabahi, M. Omair Ahmad, M.N.S. Swamy
-
Attention U-Net based on multi-scale feature extraction and WSDAN data augmentation for video anomaly detection Multimedia Syst. (IF 3.9) Pub Date : 2024-04-08 Shanzhong Lei, Junfang Song, Tengjiao Wang, Fangxin Wang, Zhuyang Yan
-
CLDE-Net: crowd localization and density estimation based on CNN and transformer network Multimedia Syst. (IF 3.9) Pub Date : 2024-04-08 Yaocong Hu, Yuanyuan Lin, Huicheng Yang, Bingyou Liu, Guoyang Wan, Jinwen Hong, Chao Xie, Wei Wang, Xiaobo Lu
-
An efficient heuristic-aided adaptive autoencoder-based dilated DNN with attention mechanism for enhancing the performance of the MIMO system in 5G communication Multimedia Syst. (IF 3.9) Pub Date : 2024-04-07 Rajalakshmi Jeyapal, Khaled Matrouk, Dass Purushothaman
-
Exploring contactless techniques in multimodal emotion recognition: insights into diverse applications, challenges, solutions, and prospects Multimedia Syst. (IF 3.9) Pub Date : 2024-04-06
Abstract In recent years, emotion recognition has received significant attention, presenting a plethora of opportunities for application in diverse fields such as human–computer interaction, psychology, and neuroscience, to name a few. Although unimodal emotion recognition methods offer certain benefits, they have limited ability to encompass the full spectrum of human emotional expression. In contrast
-
Design and implementation of a real-time face recognition system based on artificial intelligence techniques Multimedia Syst. (IF 3.9) Pub Date : 2024-04-05
Abstract This paper mainly discusses the asymmetric face recognition problem where the number of names in a name list and the number of faces in the photo might not be equal, but each face should be automatically labeled with a name. The motivation for this issue is that there had been many meetings in the past. After each meeting, the participant took group photos. The meeting provided only a corresponding
-
Complementary expert balanced learning for long-tail cross-modal retrieval Multimedia Syst. (IF 3.9) Pub Date : 2024-04-04 Peifang Liu, Xueliang Liu
-
Intelligent-paint: a Chinese painting process generation method based on vision transformer Multimedia Syst. (IF 3.9) Pub Date : 2024-04-03 Zunfu Wang, Fang Liu, Zhixiong Liu, Changjuan Ran, Mohan Zhang
The generation of painting steps can help people understand how artistic works are created and assist beginners in learning through copying. However, this task faces significant challenges: achieving the generation of clear and plausible intermediate painting steps while maintaining consistency with the real painting process. Existing related research mainly focuses on generating painting steps for
-
Indirect visual–semantic alignment for generalized zero-shot recognition Multimedia Syst. (IF 3.9) Pub Date : 2024-04-03 Yan-He Chen, Mei-Chen Yeh
Our paper addresses the challenge of generalized zero-shot learning, where the label of a target image may belong to either a seen or an unseen category. Previous methods for this task typically learn a joint embedding space where image features and their corresponding class prototypes are directly aligned. However, this can be difficult due to the inherent gap between the visual and semantic space
-
Domain-adaptive person re-identification via domain alignment and mutual pseudo-label refinement Multimedia Syst. (IF 3.9) Pub Date : 2024-04-02 Songhao Zhu, Tao Luo
Unsupervised domain-adaptive person re-identification refers to transferring knowledge from labeled to unlabeled datasets, thus alleviating the need for large amounts of labeled data. Existing methods address this problem using clustering methods to generate pseudo-labels. However, the pseudo-labels generated by current existing methods may be unstable and noisy, which will significantly degrade the
-
An efficient black widow optimization-based faster R-CNN for classification of COVID-19 from CT images Multimedia Syst. (IF 3.9) Pub Date : 2024-04-01 S. Vani, P. Malathi, V. Jeya Ramya, B. Sriman, M. Saravanan, R. Srivel
The coronavirus diseases (COVID-19) are transmittable diseases which are caused by Severe Acute Respiratory Syndrome human coronavirus (SARS-CoV). This paper describes the identification of coronavirus disease infections and better treatments based on recent technology. The categorization and projection of COVID-19 from the dataset of the most significant Computed tomography (CT) image features. The
-
Vision transformer models for mobile/edge devices: a survey Multimedia Syst. (IF 3.9) Pub Date : 2024-04-01 Seung Il Lee, Kwanghyun Koo, Jong Ho Lee, Gilha Lee, Sangbeom Jeong, Seongjun O, Hyun Kim
With the rapidly growing demand for high-performance deep learning vision models on mobile and edge devices, this paper emphasizes the importance of compact deep learning-based vision models that can provide high accuracy while maintaining a small model size. In particular, based on the success of transformer models in natural language processing and computer vision tasks, this paper offers a comprehensive
-
Severity of lung infection identification and classification using optimization-enabled deep learning with IoT Multimedia Syst. (IF 3.9) Pub Date : 2024-04-01 P. Vijaya, Satish Chander, Roshan Fernandes, Anisha P. Rodrigues, R. Maheswari
A major disease affecting individuals irrespective of the different ages is lung disease and this problem is a result of different causes. The recent spread of COVID-19 caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has affected a huge community worldwide and has impacted the respiratory system adversely. The infection severity can be determined by inspecting the using X-ray
-
PointSGLN: a novel point cloud classification network based on sampling grouping and local point normalization Multimedia Syst. (IF 3.9) Pub Date : 2024-03-30 Wenbin Zhao, Longbiao Jia, Hanlei Zhai, Shuhang Chai, Penghui Li
The point cloud data structure is characterized by disorder and spatial irregularity, which makes it impossible to apply 2D convolutional neural networks directly to extract features like regular data such as images and text, so the point cloud classification task faces a significant challenge. This study aims to classify the inferior mesenteric artery (IMA) in the form of point clouds, the key of
-
Personalized time-sync comment generation based on a multimodal transformer Multimedia Syst. (IF 3.9) Pub Date : 2024-03-30 Hei-Chia Wang, Martinus Maslim, Wei-Ting Hong
Online video entertainment has attracted large audiences and sustained viewing in various fields. With more than 4.5 billion Internet users worldwide, online video entertainment continues to be the most popular activity for users. Time synchronization comments (TSCs) are a new type of text information in videos. Unlike traditional online video-sharing platforms, where users can only leave comments
-
Imbalance multiclass problem: a robust feature enhancement-based framework for liver lesion classification Multimedia Syst. (IF 3.9) Pub Date : 2024-03-30
Abstract The classification of liver lesions in CT images is essential for the diagnosis and treatment of liver diseases. Since the characteristics of different classes of lesions are similar and the degree of differentiation is not obvious, it is difficult to accurately classify different classes of liver lesions, especially imbalanced multiclass distribution. To this end, we propose a novel feature
-
Visual sentiment analysis using data-augmented deep transfer learning techniques Multimedia Syst. (IF 3.9) Pub Date : 2024-03-29 Haoran Hong, Waneeza Zaheer, Aamir Wali
The use of visual content to express emotions on social media platforms has become increasingly popular. Visual sentiment analysis can be used to understand the sentiment conveyed by the users using images. Compared to text, visual sentiment analysis is a challenging task since images are a more condensed form of data, have ambiguity and do not have explicit textual clues. Recently, a few studies used
-
Image inpainting method based on AU-GAN Multimedia Syst. (IF 3.9) Pub Date : 2024-03-29 Chuangchuang Dong, Huaming Liu, Xiuyou Wang, Xuehui Bi
-
Overcomplete-to-sparse representation learning for few-shot class-incremental learning Multimedia Syst. (IF 3.9) Pub Date : 2024-03-29 Fu Mengying, Liu Binghao, Ma Tianren, Ye Qixiang
-
Reconciling global and local optimal label assignments for heavily occluded pedestrian detection Multimedia Syst. (IF 3.9) Pub Date : 2024-03-29 Chongwei Liu, Haojie Li, Zhihui Wang, Rui Xu
-
An adaptive Bagging algorithm based on lightweight transformer for multi-class imbalance recognition Multimedia Syst. (IF 3.9) Pub Date : 2024-03-28 Junyi Wang, Xuezheng Jiang, Hailian Liu, Haibin Cai, Qinggang Meng
-
Design of integrated interactive system for pre-diagnosis of breast cancer pathological images based on CNN and PyQt5 Multimedia Syst. (IF 3.9) Pub Date : 2024-03-27 Yunkai Yang, Qijia Yang, Weifeng Liu, Baodi Liu
-
Learning shared features from specific and ambiguous descriptions for text-based person search Multimedia Syst. (IF 3.9) Pub Date : 2024-03-27
Abstract Text-based person search endeavors to utilize natural language descriptions for retrieving pedestrian images. Previous studies have primarily focused on leveraging information among pedestrians with distinct identities, overlooking the exploration of data variations within the same identity. Although some have attempted to extract multiple samples for each identity, an appropriate loss function
-
Arbitrary style transfer method with attentional feature distribution matching Multimedia Syst. (IF 3.9) Pub Date : 2024-03-27
Abstract Most arbitrary style transfer methods only consider transferring the features of the style and content images. Although the pixel-wise style transfer is achieved. It is limited to preserving the content structure, the model tends to transfer the style features, and the loss of image information occurs during the transfer process. The model incline to transfer the style features and preservation
-
Multiscale image denoising algorithm based on UNet3+ Multimedia Syst. (IF 3.9) Pub Date : 2024-03-27 Kui Liu, Yu Liu, Benyue Su, Huiping Tang
-
Prior tissue knowledge-driven contrastive learning for brain CT report generation Multimedia Syst. (IF 3.9) Pub Date : 2024-03-27 Yanzhao Shi, Junzhong Ji, Xiaodan Zhang, Ying Liu, Zheng Wang, Huimin Xu
-
Recognize after early fusion: the Chinese food recognition based on the alignment of image and ingredients Multimedia Syst. (IF 3.9) Pub Date : 2024-03-26 Ruoxuan Zhang, Dantong Ouyang, Lili He, Lingjin Kuang, Hongtao Bai
-
Audio splicing detection and localization using multistage filterbank spectral sketches and decision fusion Multimedia Syst. (IF 3.9) Pub Date : 2024-03-25 Zhaopin Su, Ziqi Fang, Chensi Lian, Guofu Zhang, Mengke Li
-
Dual-band low-light image enhancement Multimedia Syst. (IF 3.9) Pub Date : 2024-03-25 Aizhong Mi, Wenhui Luo, Zhanqiang Huo
-
SMA-GCN: a fall detection method based on spatio-temporal relationship Multimedia Syst. (IF 3.9) Pub Date : 2024-03-22 Xuecun Yang, Shanghui Zhang, Wei Ji, Yijing Song, lintao He, Hang Xue
-
Boundary-aware GAN for multiple overlapping objects in layout-to-image generation Multimedia Syst. (IF 3.9) Pub Date : 2024-03-21 Fengnan Quan, Bo Lang
-
360° video quality assessment based on saliency-guided viewport extraction Multimedia Syst. (IF 3.9) Pub Date : 2024-03-21 Fanxi Yang, Chao Yang, Ping An, Xinpeng Huang
-
Gs-DeblurGANv2: a QR code deblurring algorithm based on lightweight network structure Multimedia Syst. (IF 3.9) Pub Date : 2024-03-21
Abstract Currently, QR codes are widely utilized in a variety of industries, including payment, shipping, and the industrial Internet of Things. However, during the detection and recognition process, QR code images are frequently impacted by external elements, including recording equipment, light, and filming angle, which causes fuzzy QR codes that cannot be read to provide accurate information. This
-
A secure video data streaming model using modified firefly and SVD technique Multimedia Syst. (IF 3.9) Pub Date : 2024-03-20 K. Muthulakshmi, K. Valarmathi
-
Virtual human pose estimation in a fire education system for children with autism spectrum disorders Multimedia Syst. (IF 3.9) Pub Date : 2024-03-19 Yangyang Guo, Hongye Liu, Yaojin Sun, Yongjun Ren
-
Iris-LAHNet: a lightweight attention-guided high-resolution network for iris segmentation and localization Multimedia Syst. (IF 3.9) Pub Date : 2024-03-19 Yue Yan, Qi Wang, Hegui Zhu, Wuming Jiang
-
Driver intention prediction based on multi-dimensional cross-modality information interaction Multimedia Syst. (IF 3.9) Pub Date : 2024-03-15 Mengfan Xue, Zengkui Xu, Shaohua Qiao, Jiannan Zheng, Tao Li, Yuerong Wang, Dongliang Peng
-
Zero-shot image classification via Visual–Semantic Feature Decoupling Multimedia Syst. (IF 3.9) Pub Date : 2024-03-15 Xin Sun, Yu Tian, Haojie Li
-
Target aware network architecture search and compression for efficient knowledge transfer Multimedia Syst. (IF 3.9) Pub Date : 2024-03-14 S. H. Shabbeer Basha, Debapriya Tula, Sravan Kumar Vinakota, Shiv Ram Dubey
-
An improved non-local means algorithm for CT image denoising Multimedia Syst. (IF 3.9) Pub Date : 2024-03-13 Huihua Kong, Wenbo Gao, Xiaoshuang Du, Yunxia Di
-
Unsupervised domain adaptation of dynamic extension networks based on class decision boundaries Multimedia Syst. (IF 3.9) Pub Date : 2024-03-13
Abstract In response to the problems of inaccurate feature alignment, loss of source domain information, imbalanced sample distribution, and biased class decision boundaries in traditional unsupervised domain adaptation methods, this paper proposes a class decision boundary-based dynamic expansion network unsupervised domain adaptation method called CDE-Net. Specifically, our method dynamically expands
-
MadFormer: multi-attention-driven image super-resolution method based on Transformer Multimedia Syst. (IF 3.9) Pub Date : 2024-03-12 Beibei Liu, Jing Sun, Bing Zhu, Ting Li, Fuming Sun
-
Scalable image coding with enhancement features for human and machine Multimedia Syst. (IF 3.9) Pub Date : 2024-03-10 Ying Wu, Ping An, Chao Yang, XinPeng Huang
-
Assessing the adoption of the Yavuz Battleship application in the mixed reality environment using the technology acceptance model Multimedia Syst. (IF 3.9) Pub Date : 2024-03-08
Abstract This study concentrates on developing a mixed reality (MR) app for the historic ship Yavuz Battleship, also known as “SMS Goeben,” and assessing its acceptance using the technology acceptance model (TAM). Mixed reality blends real and virtual environments to create novel spatial experiences, bridging the gap between the virtual and real worlds. This technology enables designers to craft immersive
-
Indirect: invertible and discrete noisy image rescaling with enhancement from case-dependent textures Multimedia Syst. (IF 3.9) Pub Date : 2024-03-07 Huu-Phu Do, Yan-An Chen, Nhat-Tuong Do-Tran, Kai-Lung Hua, Wen-Hsiao Peng, Ching-Chun Huang
-
Reducing blind spots in esophagogastroduodenoscopy examinations using a novel deep learning model Multimedia Syst. (IF 3.9) Pub Date : 2024-03-04 Guangquan Wan, Guanghui Lian, Lan Yao
-
FMR-Net: a fast multi-scale residual network for low-light image enhancement Multimedia Syst. (IF 3.9) Pub Date : 2024-03-01
Abstract The low-light image enhancement algorithm aims to solve the problem of poor contrast and low brightness of images in low-light environments. Although many image enhancement algorithms have been proposed, they still face the problems of loss of significant features in the enhanced image, inadequate brightness improvement, and a large number of algorithm-specific parameters. To solve the above
-
Mmy-net: a multimodal network exploiting image and patient metadata for simultaneous segmentation and diagnosis Multimedia Syst. (IF 3.9) Pub Date : 2024-02-29 Renshu Gu, Yueyu Zhang, Lisha Wang, Dechao Chen, Yaqi Wang, Ruiquan Ge, Zicheng Jiao, Juan Ye, Gangyong Jia, Linyan Wang
-
Infant head and brain segmentation from magnetic resonance images using fusion-based deep learning strategies Multimedia Syst. (IF 3.9) Pub Date : 2024-02-26 Helena R. Torres, Bruno Oliveira, Pedro Morais, Anne Fritze, Gabriele Hahn, Mario Rüdiger, Jaime C. Fonseca, João L. Vilaça
-
Same-clothes person re-identification with dual-stream network Multimedia Syst. (IF 3.9) Pub Date : 2024-02-26 Zhiyue Wu, Zirui Hu, Jianwei Ding
-
GHCL: Gaussian heuristic curriculum learning for Brain CT report generation Multimedia Syst. (IF 3.9) Pub Date : 2024-02-23 Qingya Shen, Yanzhao Shi, Xiaodan Zhang, Junzhong Ji, Ying Liu, Huimin Xu