arXiv - CS - Artificial Intelligence期刊最新论文, 计算机, 人工智能类期刊,

GraphMatcher: A Graph Representation Learning Approach for Ontology Matching

arXiv.cs.AI Pub Date : 2024-04-20
Sefika Efeoglu

Ontology matching is defined as finding a relationship or correspondence between two or more entities in two or more ontologies. To solve the interoperability problem of the domain ontologies, semantically similar entities in these ontologies must be found and aligned before merging them. GraphMatcher, developed in this study, is an ontology matching system using a graph attention approach to compute

更新日期：2024-04-24

详情收藏

CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method

arXiv.cs.AI Pub Date : 2024-04-23
Mingbao Lin, Zhihang Lin, Wengyi Zhan, Liujuan Cao, Rongrong Ji

Transforming large pre-trained low-resolution diffusion models to cater to higher-resolution demands, i.e., diffusion extrapolation, significantly improves diffusion adaptability. We propose tuning-free CutDiffusion, aimed at simplifying and accelerating the diffusion extrapolation process, making it more affordable and improving performance. CutDiffusion abides by the existing patch-wise extrapolation

更新日期：2024-04-24

详情收藏

A review of deep learning-based information fusion techniques for multimodal medical image classification

arXiv.cs.AI Pub Date : 2024-04-23
Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, Gwenolé Quellec

Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep learning-based multimodal fusion techniques have emerged as powerful tools for improving medical image classification. This review offers a thorough analysis of the developments

更新日期：2024-04-24

详情收藏

SGFormer: Spherical Geometry Transformer for 360 Depth Estimation

arXiv.cs.AI Pub Date : 2024-04-23
Junsong Zhang, Zisong Chen, Chunyu Lin, Lang Nie, Zhijie Shen, Junda Huang, Yao Zhao

Panoramic distortion poses a significant challenge in 360 depth estimation, particularly pronounced at the north and south poles. Existing methods either adopt a bi-projection fusion strategy to remove distortions or model long-range dependencies to capture global structures, which can result in either unclear structure or insufficient local perception. In this paper, we propose a spherical geometry

更新日期：2024-04-24

详情收藏

Leveraging Speech for Gesture Detection in Multimodal Communication

arXiv.cs.AI Pub Date : 2024-04-23
Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim Pouw, Ivan Toni, Peter Uhrig, Anna Wilson, Judith Holler, Aslı Özyürek, Raquel Fernández

Gestures are inherent to human interaction and often complement speech in face-to-face communication, forming a multimodal communication system. An important task in gesture analysis is detecting a gesture's beginning and end. Research on automatic gesture detection has primarily focused on visual and kinematic information to detect a limited set of isolated or silent gestures with low variability

更新日期：2024-04-24

详情收藏

CoProNN: Concept-based Prototypical Nearest Neighbors for Explaining Vision Models

arXiv.cs.AI Pub Date : 2024-04-23
Teodor Chiaburu, Frank Haußer, Felix Bießmann

Mounting evidence in explainability for artificial intelligence (XAI) research suggests that good explanations should be tailored to individual tasks and should relate to concepts relevant to the task. However, building task specific explanations is time consuming and requires domain expertise which can be difficult to integrate into generic XAI methods. A promising approach towards designing useful

更新日期：2024-04-24

详情收藏

CNN2GNN: How to Bridge CNN with GNN

arXiv.cs.AI Pub Date : 2024-04-23
Ziheng Jiao, Hongyuan Zhang, Xuelong Li

Although the convolutional neural network (CNN) has achieved excellent performance in vision tasks by extracting the intra-sample representation, it will take a higher training expense because of stacking numerous convolutional layers. Recently, as the bilinear models, graph neural networks (GNN) have succeeded in exploring the underlying topological relationship among the graph data with a few graph

更新日期：2024-04-24

详情收藏

Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray

arXiv.cs.AI Pub Date : 2024-04-23
Qiao Deng, Zhongzhen Huang, Yunqi Wang, Zhichuan Wang, Zhao Wang, Xiaofan Zhang, Qi Dou, Yeung Yu Hui, Edward S. Hui

Medical vision-language pre-training has emerged as a promising approach for learning domain-general representations of medical image and text. Current algorithms that exploit the global and local alignment between medical image and text could however be marred by the redundant information in medical data. To address this issue, we propose a grounded knowledge-enhanced medical vision-language pre-training

更新日期：2024-04-24

详情收藏

Cross-Task Multi-Branch Vision Transformer for Facial Expression and Mask Wearing Classification

arXiv.cs.AI Pub Date : 2024-04-22
Armando Zhu, Keqin Li, Tong Wu, Peng Zhao, Wenjing Zhou, Bo Hong

With wearing masks becoming a new cultural norm, facial expression recognition (FER) while taking masks into account has become a significant challenge. In this paper, we propose a unified multi-branch vision transformer for facial expression recognition and mask wearing classification tasks. Our approach extracts shared features for both tasks using a dual-branch architecture that obtains multi-scale

更新日期：2024-04-24

详情收藏

Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report)

arXiv.cs.AI Pub Date : 2024-04-22
Xiang Yin, Potyka Nico, Francesca Toni

Quantitatively explaining the strength of arguments under gradual semantics has recently received increasing attention. Specifically, several works in the literature provide quantitative explanations by computing the attribution scores of arguments. These works disregard the importance of attacks and supports, even though they play an essential role when explaining arguments' strength. In this paper

更新日期：2024-04-23

详情收藏

Mechanistic Interpretability for AI Safety -- A Review

arXiv.cs.AI Pub Date : 2024-04-22
Leonard Bereska, Efstratios Gavves

Understanding AI systems' inner workings is critical for ensuring value alignment and safety. This review explores mechanistic interpretability: reverse-engineering the computational mechanisms and representations learned by neural networks into human-understandable algorithms and concepts to provide a granular, causal understanding. We establish foundational concepts such as features encoding knowledge

更新日期：2024-04-23

详情收藏

Multi-channel Emotion Analysis for Consensus Reaching in Group Movie Recommendation Systems

arXiv.cs.AI Pub Date : 2024-04-21
Adilet Yerkin, Elnara Kadyrgali, Yerdauit Torekhan, Pakizar Shamoi

Watching movies is one of the social activities typically done in groups. Emotion is the most vital factor that affects movie viewers' preferences. So, the emotional aspect of the movie needs to be determined and analyzed for further recommendations. It can be challenging to choose a movie that appeals to the emotions of a diverse group. Reaching an agreement for a group can be difficult due to the

更新日期：2024-04-23

详情收藏

On the Value of Labeled Data and Symbolic Methods for Hidden Neuron Activation Analysis

arXiv.cs.AI Pub Date : 2024-04-21
Abhilekha Dalal, Rushrukh Rayan, Adrita Barua, Eugene Y. Vasserman, Md Kamruzzaman Sarker, Pascal Hitzler

A major challenge in Explainable AI is in correctly interpreting activations of hidden neurons: accurate interpretations would help answer the question of what a deep learning system internally detects as relevant in the input, demystifying the otherwise black-box nature of deep learning systems. The state of the art indicates that hidden node activations can, in some cases, be interpretable in a way

更新日期：2024-04-23

详情收藏

A Survey on the Memory Mechanism of Large Language Model based Agents

arXiv.cs.AI Pub Date : 2024-04-21
Zeyu Zhang, Xiaohe Bo, Chen Ma, Rui Li, Xu Chen, Quanyu Dai, Jieming Zhu, Zhenhua Dong, Ji-Rong Wen

Large language model (LLM) based agents have recently attracted much attention from the research and industry communities. Compared with original LLMs, LLM-based agents are featured in their self-evolving capability, which is the basis for solving real-world problems that need long-term and complex agent-environment interactions. The key component to support agent-environment interactions is the memory

更新日期：2024-04-23

详情收藏

MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering

arXiv.cs.AI Pub Date : 2024-04-19
Avinash Anand, Janak Kapuriya, Chhavi Kirtani, Apoorv Singh, Jay Saraf, Naman Lal, Jatin Kumar, Adarsh Raj Shivam, Astha Verma, Rajiv Ratn Shah, Roger Zimmermann

Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose

更新日期：2024-04-22

详情收藏

A Clean-graph Backdoor Attack against Graph Convolutional Networks with Poisoned Label Only

arXiv.cs.AI Pub Date : 2024-04-19
Jiazhu Dai, Haoyu Sun

Graph Convolutional Networks (GCNs) have shown excellent performance in dealing with various graph structures such as node classification, graph classification and other tasks. However,recent studies have shown that GCNs are vulnerable to a novel threat known as backdoor attacks. However, all existing backdoor attacks in the graph domain require modifying the training samples to accomplish the backdoor

更新日期：2024-04-22

详情收藏

How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples

arXiv.cs.AI Pub Date : 2024-04-19
Dren Fazlija, Arkadij Orlov, Johanna Schrader, Monty-Maximilian Zühlke, Michael Rohs, Daniel Kudenko

With an ever-increasing reliance on machine learning (ML) models in the real world, adversarial examples threaten the safety of AI-based systems such as autonomous vehicles. In the image domain, they represent maliciously perturbed data points that look benign to humans (i.e., the image modification is not noticeable) but greatly mislead state-of-the-art ML models. Previously, researchers ensured the

更新日期：2024-04-22

详情收藏

Learning to Cut via Hierarchical Sequence/Set Model for Efficient Mixed-Integer Programming

arXiv.cs.AI Pub Date : 2024-04-19
Jie Wang, Zhihai Wang, Xijun Li, Yufei Kuang, Zhihao Shi, Fangzhou Zhu, Mingxuan Yuan, Jia Zeng, Yongdong Zhang, Feng Wu

Cutting planes (cuts) play an important role in solving mixed-integer linear programs (MILPs), which formulate many important real-world applications. Cut selection heavily depends on (P1) which cuts to prefer and (P2) how many cuts to select. Although modern MILP solvers tackle (P1)-(P2) by human-designed heuristics, machine learning carries the potential to learn more effective heuristics. However

更新日期：2024-04-22

详情收藏

GluMarker: A Novel Predictive Modeling of Glycemic Control Through Digital Biomarkers

arXiv.cs.AI Pub Date : 2024-04-19
Ziyi Zhou, Ming Cheng, Xingjian Diao, Yanjun Cui, Xiangling Li

The escalating prevalence of diabetes globally underscores the need for diabetes management. Recent research highlights the growing focus on digital biomarkers in diabetes management, with innovations in computational frameworks and noninvasive monitoring techniques using personalized glucose metrics. However, they predominantly focus on insulin dosing and specific glucose values, or with limited attention

更新日期：2024-04-22

详情收藏

Reinforcement Learning Approach for Integrating Compressed Contexts into Knowledge Graphs

arXiv.cs.AI Pub Date : 2024-04-19
Ngoc Quach, Qi Wang, Zijun Gao, Qifeng Sun, Bo Guan, Lillian Floyd

The widespread use of knowledge graphs in various fields has brought about a challenge in effectively integrating and updating information within them. When it comes to incorporating contexts, conventional methods often rely on rules or basic machine learning models, which may not fully grasp the complexity and fluidity of context information. This research suggests an approach based on reinforcement

更新日期：2024-04-22

详情收藏

Centralized vs. Decentralized Multi-Agent Reinforcement Learning for Enhanced Control of Electric Vehicle Charging Networks

arXiv.cs.AI Pub Date : 2024-04-18
Amin Shojaeighadikolaei, Zsolt Talata, Morteza Hashemi

The widespread adoption of electric vehicles (EVs) poses several challenges to power distribution networks and smart grid infrastructure due to the possibility of significantly increasing electricity demands, especially during peak hours. Furthermore, when EVs participate in demand-side management programs, charging expenses can be reduced by using optimal charging control policies that fully utilize

更新日期：2024-04-22

详情收藏

The collective use and evaluation of generative AI tools in digital humanities research: Survey-based results

arXiv.cs.AI Pub Date : 2024-04-18
Meredith Dedema, Rongqian Ma

The advent of generative artificial intelligence (GenAI) technologies has revolutionized research, with significant implications for Digital Humanities (DH), a field inherently intertwined with technological progress. This article investigates how digital humanities scholars adopt, practice, as well as critically evaluate, GenAI technologies such as ChatGPT in the research process. Drawing on 76 responses

更新日期：2024-04-22

详情收藏

DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era

arXiv.cs.AI Pub Date : 2024-04-18
David Restrepo, Chenwei Wu, Constanza Vásquez-Venegas, Luis Filipe Nakayama, Leo Anthony Celi, Diego M López

In the big data era, integrating diverse data modalities poses significant challenges, particularly in complex fields like healthcare. This paper introduces a new process model for multimodal Data Fusion for Data Mining, integrating embeddings and the Cross-Industry Standard Process for Data Mining with the existing Data Fusion Information Group model. Our model aims to decrease computational costs