样式: 排序: IF: - GO 导出 标记为已读
-
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-19 Cunhang Fan, Mingming Ding, Jianhua Tao, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Zhao Lv
-
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-18 Bengt J. Borgström, Michael S. Brandstein
-
Distance Metric-Based Open-Set Domain Adaptation for Speaker Verification IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Jianchen Li, Jiqing Han, Fan Qian, Tieran Zheng, Yongjun He, Guibin Zheng
-
Graph Neural Networks for Contextual ASR With the Tree-Constrained Pointer Generator IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Guangzhi Sun, Chao Zhang, Philip C. Woodland
-
A Large-Scale Evaluation of Speech Foundation Models IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee
-
Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino
-
Cost-Effective Acoustic Feedback Cancellers for Digital Hearing Aids IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Yusuf Eren, Buket Çolak Güvenç, Engin Cemal Mengüç
-
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16 Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie
-
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-12 Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li
-
Multi-resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09 Lei Zhao, Wenbo Zhu, Shengqiang Li, Hong Luo, Xiao-Lei Zhang, Susanto Rahardja
-
Learning with an Open Horizon in Ever-Changing Dialogue Circumstances IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09 Christian Geishauser, Carel van Niekerk, Nurul Lubis, Hsien-chin Lin, Michael Heck, Shutong Feng, Benjamin Ruppik, Renato Vukovic, Milica Gašić
-
Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09 Yang Ai, Zhen-Hua Ling
-
Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-08 Yanxiong Li, Jialong Li, Yongjie Si, Jiaxin Tan, Qianhua He
-
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-01 Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li
-
Dynamic Higher-Order Stereophony IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27 Jacob Hollebon, Filippo Maria Fazi
-
Speaker Distance Estimation in Enclosures from Single-Channel Audio IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27 Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen
-
Large-scale unsupervised audio pre-training for video-to-speech synthesis IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27 Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic
-
Adjustable Coherent-to-Diffuse Power Estimator for Binaural Speech Enhancement in Multi-Talker Environments IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27 Reza Ghanavi, Craig T. Jin
-
A non-invasive speech quality evaluation algorithm for hearing aids with multi-head self-attention and audiogram-based features IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-26 Ruiyu Liang, Yue Xie, Jiaming Cheng, Cong Pang, Björn Schuller
-
FA-ExU-Net: the simultaneous training of an embedding extractor and enhancement model for a speaker verification system robust to short noisy utterances IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-22 Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu
-
Information Dropping Data Augmentation for Machine Translation Quality Estimation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-22 Shuo Li, Xiaojun Bi, Tao Liu, Zheng Chen
-
Joint Dual Learning with Mutual Information Maximization for Natural Language Understanding and Generation in Dialogues IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-21 Shang-Yu Su, Yung-Sung Chung, Yun-Nung Chen
-
Self-Supervised Learning of Multi-level Audio Representations for Music Segmentation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20 Morgan Buisson, Brian McFee, Slim Essid, Hélène C. Crayencour
-
SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20 Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei
-
Detecting the Presence of Sperm Whales' Echolocation Clicks in Noisy Environments IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20 Guy Gubnitky, Roee Diamant
-
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20 Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki
-
Leveraging Diverse Modeling Contexts with Collaborating Learning for Neural Machine Translation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20 Yusheng Liao, Yanfeng Wang, Yu Wang
-
MusicECAN: An Automatic Denoising Network for Music Recordings with Efficient Channel Attention IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-19 Haonan Cheng, Shulin Liu, Zhicheng Lian, Long Ye, Qin Zhang
-
Phrase-Aware Financial Sentiment Analysis Based on Constituent Syntax IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-19 Chunli Xiang, Junchi Zhang, Jun Zhou, Fei Li, Chong Teng, Donghong Ji
-
Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18 Jiahong Li, Chenda Li, Yifei Wu, Yanmin Qian
-
How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18 Peter Leer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaar, Lars Bramsløw
-
Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18 Florian Schmid, Khaled Koutini, Gerhard Widmer
-
Dual-Channel Target Speaker Extraction Based on Conditional Variational Autoencoder and Directional Information IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-14 Rui Wang, Li Li, Tomoki Toda
-
Active Discovering New Slots for Task-Oriented Conversation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13 Yuxia Wu, Tianhao Dai, Zhedong Zheng, Lizi Liao
-
Unsupervised Disentanglement Learning Model for Exemplar-Guided Paraphrase Generation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13 Linjian Li, Yi Cai, Xin Wu
-
Question-Directed Reasoning With Relation-Aware Graph Attention Network for Complex Question Answering Over Knowledge Graph IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13 Geng Zhang, Jin Liu, Guangyou Zhou, Kunsong Zhao, Zhiwen Xie, Bo Huang
-
HRTF upsampling with a generative adversarial network using a gnomonic equiangular projection IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-11 Aidan O. T. Hogg, Mads Jenkins, He Liu, Isaac Squires, Samuel J. Cooper, Lorenzo Picinali
-
KGAgent: Learning a Deep Reinforced Agent for Keyphrase Generation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-11 Yu Yao, Peng Yang, Guangzhen Zhao, Guoshun Yin
-
BaSFormer: A Balanced Sparsity Regularized Attention Network for Transformer IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06 Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan, Xiangping Wu
-
Let Topic Flow: A Unified Topic-guided Segment-wise Dialogue Summarization Framework IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06 Qinyu Han, Zhihao Yang, Hongfei Lin, Tian Qin
-
Reverberant Source Separation using NTF with Delayed Subsources and Spatial Priors IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06 Mieszko Fraś, Konrad Kowalczyk
-
A User-centric Approach for Deep Residual-Echo Suppression in Double-talk IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06 Amir Ivry, Israel Cohen, Baruch Berdugo
-
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab and Convolutional Recurrent Neural Networks IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04 Yingming Gao, Peter Birkholz, Ya Li
-
Envelope-Based Multichannel Noise Reduction for Cochlear Implant Applications IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04 Luciana M. X. de Souza, Márcio H. Costa, Renata C. Borges
-
Decomposed Meta-Learning for Few-Shot Sequence Labeling IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04 Tingting Ma, Qianhui Wu, Huiqiang Jiang, Jieru Lin, Börje F. Karlsson, Tiejun Zhao, Chin-Yew Lin
-
On Local Temporal Embedding for Semi-Supervised Sound Event Detection IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-28 Lijian Gao, Qirong Mao, Ming Dong
-
Hierarchical Multi-granularity Interaction Graph Convolutional Network for Long Document Classification IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-28 Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin
-
Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Yoshiki Masuyama, Kouei Yamaoka, Takao Kawamura, Nobutaka Ono
-
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas
-
Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Taihui Wang, Feiran Yang, Jun Yang
-
Time-domain Speech Super-resolution with GAN based Modeling for Telephony Speaker Verification IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Żelasko, Najim Dehak
-
EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Chenfeng Miao, Qingying Zhu, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao
-
Acoustic Imaging with Circular Microphone Array: a new Approach for Sound Field Analysis IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Marco Olivieri, Amy Bastine, Mirco Pezzoli, Fabio Antonacci, Thushara Abhayapala, Augusto Sarti
-
Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23 Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, Hiroshi Saruwatari
-
Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-22 Nursadul Mamun, John H. L. Hansen
-
R 2: A Novel Recall & Ranking Framework for Legal Judgment Prediction IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19 Yuquan Le, Zhe Quan, Jiawei Wang, Da Cao, Kenli Li
-
Please donate to save a Life: Inducing Politeness to handle Resistance in Persuasive Dialogue Agents IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19 Kshitij Mishra, Mauajama Firdaus, Asif Ekbal
-
NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19 Adrián Barahona-Ríos, Tom Collins
-
Accented Text-to-Speech Synthesis with Limited Data IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-16 Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li
-
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-16 Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian