-
On the Value of Head Labels in Multi-Label Text Classification ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Haobo Wang, Cheng Peng, Hede Dong, Lei Feng, Weiwei Liu, Tianlei Hu, Ke Chen, Gang Chen
A formidable challenge in the multi-label text classification (MLTC) context is that the labels often exhibit a long-tailed distribution, which typically prevents deep MLTC models from obtaining satisfactory performance. To alleviate this problem, most existing solutions attempt to improve tail performance by means of sampling or introducing extra knowledge. Data-rich labels, though more trustworthy
-
Incorporating Multi-Level Sampling with Adaptive Aggregation for Inductive Knowledge Graph Completion ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Kai Sun, Huajie Jiang, Yongli Hu, Baocai Yin
In recent years, Graph Neural Networks (GNNs) have achieved unprecedented success in handling graph-structured data, thereby driving the development of numerous GNN-oriented techniques for inductive knowledge graph completion (KGC). A key limitation of existing methods, however, is their dependence on pre-defined aggregation functions, which lack the adaptability to diverse data, resulting in suboptimal
-
SA2E-AD: A Stacked Attention Autoencoder for Anomaly Detection in Multivariate Time Series ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Mengyao Li, Zhiyong Li, Zhibang Yang, Xu Zhou, Yifan Li, Ziyan Wu, Lingzhao Kong, Ke Nai
Anomaly detection for multivariate time series is an essential task in the modern industrial field. Although several methods have been developed for anomaly detection, they usually fail to effectively exploit the metrical-temporal correlation and the other dependencies among multiple variables. To address this problem, we propose a stacked attention autoencoder for anomaly detection in multivariate
-
Automatically Inspecting Thousands of Static Bug Warnings with Large Language Model: How Far Are We? ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Cheng Wen, Yuandao Cai, Bin Zhang, Jie Su, Zhiwu Xu, Dugang Liu, Shengchao Qin, Zhong Ming, Cong Tian
Static analysis tools for capturing bugs and vulnerabilities in software programs are widely employed in practice, as they have the unique advantages of high coverage and independence from the execution environment. However, existing tools for analyzing large codebases often produce a great deal of false warnings over genuine bug reports. As a result, developers are required to manually inspect and
-
Dual Homogeneity Hypergraph Motifs with Cross-view Contrastive Learning for Multiple Social Recommendations ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Jiadi Han, Yufei Tang, Qian Tao, Yuhan Xia, LiMing Zhang
Social relations are often used as auxiliary information to address data sparsity and cold-start issues in social recommendations. In the real world, social relations among users are complex and diverse. Widely used graph neural networks (GNNs) can only model pairwise node relationships and are not conducive to exploring higher-order connectivity, while hypergraph provides a natural way to model high-order
-
X-FSPMiner: A Novel Algorithm for Frequent Similar Pattern Mining ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Ansel Y. Rodríguez-González, Ramón Aranda, Miguel Á. Álvarez-Carmona, Angel Díaz-Pacheco, Rosa María Valdovinos Rosas
Frequent similar pattern mining (FSP mining) allows for finding frequent patterns hidden from the classical approach. However, the use of similarity functions implies more computational effort, necessitating the development of more efficient algorithms for FSP mining. This work aims to improve the efficiency of mining all FSPs when using Boolean and non-increasing monotonic similarity functions. A
-
Multi-Instance Learning with One Side Label Noise ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Tianxiang Luan, Shilin Gu, Xijia Tang, Wenzhang Zhuge, Chenping Hou
Multi-instance Learning (MIL) is a popular learning paradigm arising from many real applications. It assigns a label to a set of instances, which is called a bag, and the bag’s label is determined by the instances within it. A bag is positive if and only if it has at least one positive instance. Since labeling bags is more complicated than labeling each instance, we will often face the mislabeling
-
Math Word Problem Generation via Disentangled Memory Retrieval ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Wei Qin, Xiaowei Wang, Zhenzhen Hu, Lei Wang, Yunshi Lan, Richang Hong
The task of math word problem (MWP) generation, which generates an MWP given an equation and relevant topic words, has increasingly attracted researchers’ attention. In this work, we introduce a simple memory retrieval module to search related training MWPs, which are used to augment the generation. To retrieve more relevant training data, we also propose a disentangled memory retrieval module based
-
Towards Differential Privacy in Sequential Recommendation: A Noisy Graph Neural Network Approach ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Wentao Hu, Hui Fang
With the increasing frequency of high-profile privacy breaches on various online platforms, users are becoming more concerned about their privacy. Because the recommender system is the core component of online platforms for providing personalized service, its privacy preservation has attracted great attention. As the gold standard of privacy protection, differential privacy has been widely adopted
-
Local Community Detection in Multiple Private Networks ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Li Ni, Rui Ye, Wenjian Luo, Yiwen Zhang
Individuals are often involved in multiple online social networks. Considering that owners of these networks are unwilling to share their networks, some global algorithms combine information from multiple networks to detect all communities in multiple networks without sharing their edges. When data owners are only interested in the community containing a given node, it is unnecessary and computationally
-
Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Yongduo Sui, Wenyu Mao, Shuyao Wang, Xiang Wang, Jiancan Wu, Xiangnan He, Tat-Seng Chua
In graph classification, attention- and pooling-based graph neural networks (GNNs) predominate to extract salient features from the input graph and support the prediction. They mostly follow the paradigm of “learning to attend,” which maximizes the mutual information between the attended graph and the ground-truth label. However, this paradigm causes GNN classifiers to indiscriminately absorb all statistical
-
A Taxonomy for Learning with Perturbation and Algorithms ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Rujing Yao, Ou Wu
Weighting strategy prevails in machine learning. For example, a common approach in robust machine learning is to exert low weights on samples which are likely to be noisy or quite hard. This study summarizes another less-explored strategy, namely, perturbation. Various incarnations of perturbation have been utilized but it has not been explicitly revealed. Learning with perturbation is called perturbation
-
Generation-based Multi-view Contrast for Self-supervised Graph Representation Learning ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Yuehui Han
Graph contrastive learning has made remarkable achievements in the self-supervised representation learning of graph-structured data. By employing a perturbation function (i.e., perturbation on the nodes or edges of the graph), most graph contrastive learning methods construct contrastive samples on the original graph. However, the perturbation-based data augmentation methods randomly change the inherent
-
Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising Strategies ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Kuldeep Singh, Bhaskar Biswas
High utility itemsets (HUIs) mining is an emerging area of data mining which discovers sets of items generating a high profit from transactional datasets. In recent years, several algorithms have been proposed for this task. However, most of them do not consider the on-shelf time period of items and negative utility of items. High on-shelf utility itemset (HOUIs) mining is more difficult than traditional
-
Conditional Generative Adversarial Network for Early Classification of Longitudinal Datasets Using an Imputation Approach ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Sharon Torao Pingi, Richi Nayak, Md Abul Bashar
Early classification of longitudinal data remains an active area of research today. The complexity of these datasets and the high rates of missing data caused by irregular sampling present data-level challenges for the Early Longitudinal Data Classification (ELDC) problem. Coupled with the algorithmic challenge of optimising the opposing objectives of early classification (i.e., earliness and accuracy)
-
Scalable and Inductive Semi-supervised Classifier with Sample Weighting Based on Graph Topology ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-26 Fadi Dornaika, Zoulfikar Ibrahim, Alirezah Bosaghzadeh
Recently, graph-based semi-supervised learning (GSSL) has garnered significant interest in the realms of machine learning and pattern recognition. Although some of the proposed methods have made some progress, there are still some shortcomings that need to be overcome. There are three main limitations. First, the graphs used in these approaches are usually predefined regardless of the task at hand
-
Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-22 Hong Zhao, Zhengyu Li, Wenwei He, Yan Zhao
Existing methods based on transfer learning leverage auxiliary information to help tail generalization and improve the performance of the tail classes. However, they cannot fully exploit the relationships between auxiliary information and tail classes and bring irrelevant knowledge to the tail classes. To solve this problem, we propose a hierarchical CNN with knowledge complementation, which regards
-
Multi-Source and Multi-modal Deep Network Embedding for Cross-Network Node Classification ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-20 Hongwei Yang, Hui He, Weizhe Zhang, Yan Wang, Lin Jing
In recent years, to address the issue of networked data sparsity in node classification tasks, cross-network node classification (CNNC) leverages the richer information from a source network to enhance the performance of node classification in the target network, which typically has sparser information. However, in real-world applications, labeled nodes may be collected from multiple sources with multiple
-
NOODLE: Joint Cross-View Discrepancy Discovery and High-Order Correlation Detection for Multi-View Subspace Clustering ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-20 Zhibin Gu, Songhe Feng, Zhendong Li, Jiazheng Yuan, Jun Liu
Benefiting from the effective exploration of the valuable topological pair-wise relationship of data points across multiple views, multi-view subspace clustering (MVSC) has received increasing attention in recent years. However, we observe that existing MVSC approaches still suffer from two limitations that need to be further improved to enhance the clustering effectiveness. Firstly, previous MVSC
-
Representative and Back-In-Time Sampling from Real-World Hypergraphs ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-19 Minyoung Choe, Jaemin Yoo, Geon Lee, Woonsung Baek, U Kang, Kijung Shin
Graphs are widely used for representing pairwise interactions in complex systems. Since such real-world graphs are large and often evergrowing, sampling subgraphs is useful for various purposes, including simulation, visualization, stream processing, representation learning, and crawling. However, many complex systems consist of group interactions (e.g., collaborations of researchers and discussions
-
A Dual Perspective Framework of Knowledge-correlation for Cross-domain Recommendation ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-18 Yuhan Wang, Qing Xie, Mengzi Tang, Lin Li, Jingling Yuan, Yongjian Liu
Recommender System provides users with online services in a personalized way. The performance of traditional recommender systems may deteriorate because of problems such as cold-start and data sparsity. Cross-domain Recommendation System utilizes the richer information from auxiliary domains to guide the task in the target domain. However, direct knowledge transfer may lead to a negative impact due
-
Semi-supervised Multi-view Clustering based on Nonnegative Matrix Factorization with Fusion Regularization ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-18 Guosheng Cui, Ruxin Wang, Dan Wu, Ye Li
Multi-view clustering has attracted significant attention and application. Nonnegative matrix factorization is one popular feature learning technology in pattern recognition. In recent years, many semi-supervised nonnegative matrix factorization algorithms are proposed by considering label information, which has achieved outstanding performance for multi-view clustering. However, most of these existing
-
FulBM: Fast fully batch maintenance for landmark-based 3-hop cover labeling ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-15 Wentai Zhang, HaiHong E, HaoRan Luo, Mingzhi Sun
Landmark-based 3-hop cover labeling is a category of approaches for shortest distance/path queries on large-scale complex networks. It pre-computes an index offline to accelerate the online distance/path query. Most real-world graphs undergo rapid changes in topology, which makes index maintenance on dynamic graphs necessary. So far, the majority of index maintenance methods can handle only one edge
-
DeepMeshCity: A Deep Learning Model for Urban Grid Prediction ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-15 Chi Zhang, Linhao Cai, Meng Chen, Xiucheng Li, Gao Cong
Urban grid prediction can be applied to many classic spatial-temporal prediction tasks such as air quality prediction, crowd density prediction, and traffic flow prediction, which is of great importance to smart city building. In light of its practical values, many methods have been developed for it and have achieved promising results. Despite their successes, two main challenges remain open: a) how
-
Node Embedding Preserving Graph Summarization ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-08 Houquan Zhou, Shenghua Liu, Huawei Shen, Xueqi Cheng
Graph summarization is a useful tool for analyzing large-scale graphs. Some works tried to preserve original node embeddings encoding rich structural information of nodes on the summary graph. However, their algorithms are designed heuristically and not theoretically guaranteed. In this paper, we theoretically study the problem of preserving node embeddings on summary graph. We prove that three matrix-factorization
-
Adaptive Content-Aware Influence Maximization via Online Learning to Rank ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-08 Konstantinos Theocharidis, Panagiotis Karras, Manolis Terrovitis, Spiros Skiadopoulos, Hady W. Lauw
How can we adapt the composition of a post over a series of rounds to make it more appealing in a social network? Techniques that progressively learn how to make a fixed post more influential over rounds have been studied in the context of the Influence Maximization (IM) problem, which seeks a set of seed users that maximize a post’s influence. However, there is no work on progressively learning how
-
Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-06 Derun Song, Enneng Yang, Guibing Guo, Li Shen, Linying Jiang, Xingwei Wang
Multi-scenario and multi-task recommendation can use various feedback behaviors of users in different scenarios to learn users’ preferences and then make recommendations, which has attracted attention. However, the existing work ignores feature interactions and the fact that a pair of feature interactions will have differing levels of importance under different scenario-task pairs, leading to sub-optimal
-
SsAG: Summarization and Sparsification of Attributed Graphs ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-06 Sarwan Ali, Muhammad Ahmad, Maham Anwer Beg, Imdad Ullah Khan, Safiullah Faizullah, Muhammad Asad Khan
Graph summarization has become integral for managing and analyzing large-scale graphs in diverse real-world applications, including social networks, biological networks, and communication networks. Existing methods for graph summarization often face challenges, being either computationally expensive, limiting their applicability to large graphs, or lacking the incorporation of node attributes. In response
-
Building Shortcuts between Distant Nodes with Biaffine Mapping for Graph Convolutional Networks ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-03-01 Acong Zhang, Jincheng Huang, Ping Li, Kai Zhang
Multiple recent studies show a paradox in graph convolutional networks (GCNs), that is, shallow architectures limit the capability of learning information from high-order neighbors, while deep architectures suffer from over-smoothing or over-squashing. To enjoy the simplicity of shallow architectures and overcome their limits of neighborhood extension, in this work, we introduce Biaffine technique
-
EffCause: Discover Dynamic Causal Relationships Efficiently from Time-Series ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Yicheng Pan, Yifan Zhang, Xinrui Jiang, Meng Ma, Ping Wang
Since the proposal of Granger causality, many researchers have followed the idea and developed extensions to the original algorithm. The classic Granger causality test aims to detect the existence of the static causal relationship. Notably, a fundamental assumption underlying most previous studies is the stationarity of causality, which requires the causality between variables to keep stable. However
-
Asymmetric Learning for Graph Neural Network based Link Prediction ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Kai-Lang Yao, Wu-Jun Li
Link prediction is a fundamental problem in many graph-based applications, such as protein-protein interaction prediction. Recently, graph neural network (GNN) has been widely used for link prediction. However, existing GNN-based link prediction (GNN-LP) methods suffer from scalability problem during training for large-scale graphs, which has received little attention from researchers. In this paper
-
Multi-Task Learning with Sequential Dependence Toward Industrial Applications: A Systematic Formulation ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Xiaobo Guo, Mingming Ha, Xuewen Tao, Shaoshuai Li, Youru Li, Zhenfeng Zhu, Zhiyong Shen, Li Ma
Multi-task learning (MTL) is widely used in the online recommendation and financial services for multi-step conversion estimation, but current works often overlook the sequential dependence among tasks. In particular, sequential dependence multi-task learning (SDMTL) faces challenges in dealing with complex task correlations and extracting valuable information in real-world scenarios, leading to negative
-
Package Arrival Time Prediction via Knowledge Distillation Graph Neural Network ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Lei Zhang, Yong Liu, Zhiwei Zeng, Yiming Cao, Xingyu Wu, Yonghui Xu, Zhiqi Shen, Lizhen Cui
Accurately estimating packages’ arrival time in e-commerce can enhance users’ shopping experience and improve the placement rate of products. This problem is often formalized as an Origin-Destination (OD)-based ETA (i.e., estimated time of arrival) prediction task, where the delivery time is estimated mainly based on sender and receiver addresses and other context information. One inherent challenge
-
Correlation-aware Graph Data Augmentation with Implicit and Explicit Neighbors ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Chuan-Wei Kuo, Bo-Yu Chen, Wen-Chih Peng, Chih-Chieh Hung, Hsin-Ning Su
In recent years, there has been a significant surge in commercial demand for citation graph-based tasks, such as patent analysis, social network analysis, and recommendation systems. Graph Neural Networks (GNNs) are widely used for these tasks due to their remarkable performance in capturing topological graph information. However, GNNs’ output results are highly dependent on the composition of local
-
Attacking Click-through Rate Predictors via Generating Realistic Fake Samples ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Mingxing Duan, Kenli Li, Weinan Zhang, Jiarui Qin, Bin Xiao
How to construct imperceptible (realistic) fake samples is critical in adversarial attacks. Due to the sample feature diversity of a recommender system (containing both discrete and continuous features), traditional gradient-based adversarial attack methods may fail to construct realistic fake samples. Meanwhile, most recommendation models adopt click-through rate (CTR) predictors, which usually utilize
-
Prerequisite-Enhanced Category-Aware Graph Neural Networks for Course Recommendation ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Jianshan Sun, Suyuan Mei, Kun Yuan, Yuanchun Jiang, Jie Cao
The rapid development of Massive Open Online Courses (MOOCs) platforms has created an urgent need for an efficient personalized course recommender system that can assist learners of all backgrounds and levels of knowledge in selecting appropriate courses. Currently, most existing methods utilize a sequential recommendation paradigm that captures the user’s learning interests from their learning history
-
TaSPM: Targeted Sequential Pattern Mining ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Gengsen Huang, Wensheng Gan, Philip S. Yu
Sequential pattern mining (SPM) is an important technique in the field of pattern mining, which has many applications in reality. Although many efficient SPM algorithms have been proposed, there are few studies that can focus on targeted tasks. Targeted querying of the concerned sequential patterns can not only reduce the number of patterns generated, but also increase the efficiency of users in performing
-
Networked Time-series Prediction with Incomplete Data via Generative Adversarial Network ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Yichen Zhu, Bo Jiang, Haiming Jin, Mengtian Zhang, Feng Gao, Jianqiang Huang, Tao Lin, Xinbing Wang
A networked time series (NETS) is a family of time series on a given graph, one for each node. It has a wide range of applications from intelligent transportation to environment monitoring to smart grid management. An important task in such applications is to predict the future values of a NETS based on its historical values and the underlying graph. Most existing methods require complete data for
-
CoBjeason: Reasoning Covered Object in Image by Multi-Agent Collaboration Based on Informed Knowledge Graph ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Huan Rong, Minfeng Qian, Tinghuai Ma, Di Jin, Victor S. Sheng
Object detection is a widely studied problem in existing works. However, in this paper, we turn to a more challenging problem of “Covered Object Reasoning”, aimed at reasoning the category label of target object in the given image particularly when it has been totally covered (or invisible). To resolve this problem, we propose CoBjeason to seize the opportunity when visual reasoning meets the knowledge
-
Graph Time-series Modeling in Deep Learning: A Survey ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Hongjie Chen, Hoda Eldardiry
Time-series and graphs have been extensively studied for their ubiquitous existence in numerous domains. Both topics have been separately explored in the field of deep learning. For time-series modeling, recurrent neural networks or convolutional neural networks model the relations between values across timesteps, while for graph modeling, graph neural networks model the inter-relations between nodes
-
MoMENt: Marked Point Processes with Memory-Enhanced Neural Networks for User Activity Modeling ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-29 Sherry Sahebi, Mengfan Yao, Siqian Zhao, Reza Feyzi Behnagh
Marked temporal point process models (MTPPs) aim to model event sequences and event markers (associated features) in continuous time. These models have been applied to various application domains where capturing event dynamics in continuous time is beneficial, such as education systems, social networks, and recommender systems. However, current MTPPs suffer from two major limitations, i.e., inefficient
-
DP-GCN: Node Classification by Connectivity and Local Topology Structure on Real-World Network ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Zhe Chen, Aixin Sun
Node classification is to predict the class label of a node by analyzing its properties and interactions in a network. We note that many existing solutions for graph-based node classification only consider node connectivity but not node’s local topology structure. However, nodes residing in different parts of a real-world network may share similar local topology structures. For example, local topology
-
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-28 Jingfeng Yang, Hongye Jin, Ruixiang Tang, Xiaotian Han, Qizhang Feng, Haoming Jiang, Shaochen Zhong, Bing Yin, Xia Hu
This paper presents a comprehensive and practical guide for practitioners and end-users working with Large Language Models (LLMs) in their downstream natural language processing (NLP) tasks. We provide discussions and insights into the usage of LLMs from the perspectives of models, data, and downstream tasks. Firstly, we offer an introduction and brief summary of current language models. Then, we discuss
-
A Fully Test-Time Training Framework for Semi-Supervised Node Classification on Out-of-Distribution Graphs ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-26 Jiaxin Zhang, Yiqi Wang, Xihong Yang, En Zhu
Graph neural networks (GNNs) have shown great potential in representation learning for various graph tasks. However, the distribution shift between the training and test sets poses a challenge to the efficiency of GNNs. To address this challenge, HomoTTT proposes a fully test-time training (FTTT) framework for GNNs to enhance the model’s generalization capabilities for node classification tasks. Specifically
-
FiFrauD: Unsupervised Financial Fraud Detection in Dynamic Graph Streams ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Samira Khodabandehlou, Alireza Hashemi Golpayegani
Given a stream of financial transactions between traders in an e-market, how can we accurately detect fraudulent traders and suspicious behaviors in real time? Despite the efforts made in detecting these fraudsters, this field still faces serious challenges, including the ineffectiveness of existing methods for the complex and streaming environment of e-markets. As a result, it is still difficult to
-
Domain Generalization in Time Series Forecasting ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Songgaojun Deng, Olivier Sprangers, Ming Li, Sebastian Schelter, Maarten de Rijke
Domain generalization aims to design models that can effectively generalize to unseen target domains by learning from observed source domains. Domain generalization poses a significant challenge for time series data, due to varying data distributions and temporal dependencies. Existing approaches to domain generalization are not designed for time series data, which often results in suboptimal or unstable
-
X-distribution: Retraceable Power-law Exponent of Complex Networks ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Pradumn Kumar Pandey, Aikta Arya, Akrati Saxena
Network modeling has been explored extensively by means of theoretical analysis as well as numerical simulations for Network Reconstruction (NR). The network reconstruction problem requires the estimation of the power-law exponent (γ) of a given input network. Thus, the effectiveness of the NR solution depends on the accuracy of the calculation of γ. In this article, we re-examine the degree distribution-based
-
Supervised Clustering of Persian Handwritten Images Using Regularization and Dimension Reduction Methods ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Sajedeh Moradnia, Mousa Golalizadeh
Clustering, as a fundamental exploratory data technique, not only is used to discover patterns and structures in complex datasets but also is utilized to group variables in high-dimensional data analysis. Dimension reduction through clustering helps identify important variables and reduce data dimensions without losing significant information. High-dimensional image datasets, such as Persian handwritten
-
A Survey on AutoML Methods and Systems for Clustering ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-27 Yannis Poulakis, Christos Doulkeridis, Dimosthenis Kyriazis
Automated Machine Learning (AutoML) aims to identify the best-performing machine learning algorithm along with its input parameters for a given dataset and a specific machine learning task. This is a challenging problem, as the process of finding the best model and tuning it for a particular problem at hand is both time-consuming for a data scientist and computationally expensive. In this survey, we
-
Fairness-Aware Graph Neural Networks: A Survey ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-24 April Chen, Ryan A. Rossi, Namyong Park, Puja Trivedi, Yu Wang, Tong Yu, Sungchul Kim, Franck Dernoncourt, Nesreen K. Ahmed
Graph Neural Networks (GNNs) have become increasingly important due to their representational power and state-of-the-art predictive performance on many fundamental learning tasks. Despite this success, GNNs suffer from fairness issues that arise as a result of the underlying graph data and the fundamental aggregation mechanism that lies at the heart of the large class of GNN models. In this article
-
BapFL: You can Backdoor Personalized Federated Learning ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-23 Tiandi Ye, Cen Chen, Yinggui Wang, Xiang Li, Ming Gao
In federated learning (FL), malicious clients could manipulate the predictions of the trained model through backdoor attacks, posing a significant threat to the security of FL systems. Existing research primarily focuses on backdoor attacks and defenses within the generic federated learning scenario, where all clients collaborate to train a single global model. A recent study conducted by Qin et al
-
Citation Forecasting with Multi-Context Attention-Aided Dependency Modeling ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-23 Taoran Ji, Nathan Self, Kaiqun Fu, Zhiqian Chen, Naren Ramakrishnan, Chang-Tien Lu
Forecasting citations of scientific patents and publications is a crucial task for understanding the evolution and development of technological domains and for foresight into emerging technologies. By construing citations as a time series, the task can be cast into the domain of temporal point processes. Most existing work on forecasting with temporal point processes, both conventional and neural network-based
-
Learning to Generate Temporal Origin-destination Flow Based on Urban Regional Features and Traffic Information ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-20 Can Rong, Zhicheng Liu, Jingtao Ding, Yong Li
Origin-destination (OD) flow contains population mobility information between every two regions in the city, which is of great value in urban planning and transportation management. Nevertheless, the collection of OD flow data is extremely difficult due to the hindrance of privacy issues and collection costs. Significant efforts have been made to generate OD flow based on urban regional features, e
-
Dynamic Environment Responsive Online Meta-Learning with Fairness Awareness ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-20 Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Feng Chen
The fairness-aware online learning framework has emerged as a potent tool within the context of continuous lifelong learning. In this scenario, the learner’s objective is to progressively acquire new tasks as they arrive over time, while also guaranteeing statistical parity among various protected sub-populations, such as race and gender, when it comes to the newly introduced tasks. A significant limitation
-
ProtoMGAE: Prototype-aware Masked Graph Auto-Encoder for Graph Representation Learning ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-20 Yimei Zheng, Caiyan Jia
Graph self-supervised representation learning has gained considerable attention and demonstrated remarkable efficacy in extracting meaningful representations from graphs, particularly in the absence of labeled data. Two representative methods in this domain are graph auto-encoding and graph contrastive learning. However, the former methods primarily focus on global structures, potentially overlooking
-
Dual-side Adversarial Learning based Fair Recommendation for Sensitive Attribute Filtering ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-19 Shenghao Liu, Yu Zhang, Lingzhi Yi, Xianjun Deng, Laurence T. Yang, Bang Wang
With the development of recommendation algorithms, researchers are paying increasing attention to fairness issues such as user discrimination in recommendations. To address these issues, existing works often filter users’ sensitive information that may cause discrimination during the process of learning user representations. However, these approaches overlook the latent relationship between items’
-
PU-Detector: A PU Learning-based Framework for Real Money Trading Detection in MMORPG ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-13 Yilin Wang, Sha Zhao, Shiwei Zhao, Runze Wu, Yuhong Xu, Jianrong Tao, Tangjie Lv, Shijian Li, Zhipeng Hu, Gang Pan
Massively multiplayer online role-playing games (MMORPGs) have become one of the most popular and exciting genres of online games. In recent years, a cheating phenomenon called real money trading (RMT) has arisen and damaged the fantasy world in many ways. RMT is the sale of in-game items, currency, or even characters to earn real money, breaking the balance of the game economy ecosystem and damaging the
-
HITS-based Propagation Paradigm for Graph Neural Networks ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-13 Mehak Khan, Gustavo B. M. Mello, Laurence Habib, Paal Engelstad, Anis Yazidi
In this article, we present a new propagation paradigm based on the principle of the Hyperlink-Induced Topic Search (HITS) algorithm. The HITS algorithm utilizes the concept of a “self-reinforcing” relationship between authorities and hubs. Using HITS, the centrality of nodes is determined via repeated updates of authority-hub scores that converge to a stationary distribution. Unlike PageRank-based propagation methods
-
Do we really need imputation in AutoML predictive modeling? ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-16 George Paterakis, Stefanos Fafalios, Paulos Charonyktakis, Vassilis Christophides, Ioannis Tsamardinos
Many real-world datasets contain missing values, whereas most Machine Learning (ML) algorithms assume complete datasets. For this reason, several imputation algorithms have been proposed to predict and fill in the missing values. Given the advances in predictive modeling algorithms tuned in an AutoML setting, a question that naturally arises is to what extent sophisticated imputation algorithms
-
Intricate Spatiotemporal Dependency Learning for Temporal Knowledge Graph Reasoning ACM Trans. Knowl. Discov. Data (IF 3.6) Pub Date : 2024-02-16 Xuefei Li, Huiwei Zhou, Weihong Yao, Wenchu Li, Baojie Liu, Yingyu Lin
Knowledge Graph (KG) reasoning has been an interesting topic in recent decades. Most current research focuses on predicting the missing facts in incomplete KGs. Nevertheless, Temporal KG (TKG) reasoning, which forecasts future facts, still faces a dilemma due to the complex interactions between entities over time. This paper proposes a novel intricate Spatiotemporal Dependency learning