IEEE/ACM Transactions on Audio, Speech, and Language Processing期刊最新论文, 电子通信, 广播电视类期刊,

Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-19
Cunhang Fan, Mingming Ding, Jianhua Tao, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Zhao Lv

更新日期：2024-04-19

详情收藏

A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-18
Bengt J. Borgström, Michael S. Brandstein

更新日期：2024-04-18

详情收藏

Distance Metric-Based Open-Set Domain Adaptation for Speaker Verification

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Jianchen Li, Jiqing Han, Fan Qian, Tieran Zheng, Yongjun He, Guibin Zheng

更新日期：2024-04-16

详情收藏

Graph Neural Networks for Contextual ASR With the Tree-Constrained Pointer Generator

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Guangzhi Sun, Chao Zhang, Philip C. Woodland

更新日期：2024-04-16

详情收藏

A Large-Scale Evaluation of Speech Foundation Models

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

更新日期：2024-04-16

详情收藏

Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

更新日期：2024-04-16

详情收藏

Cost-Effective Acoustic Feedback Cancellers for Digital Hearing Aids

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Yusuf Eren, Buket Çolak Güvenç, Engin Cemal Mengüç

更新日期：2024-04-16

详情收藏

Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-16
Kun Wei, Bei Li, Hang Lv, Quan Lu, Ning Jiang, Lei Xie

更新日期：2024-04-16

详情收藏

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-12
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

更新日期：2024-04-12

详情收藏

Multi-resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09
Lei Zhao, Wenbo Zhu, Shengqiang Li, Hong Luo, Xiao-Lei Zhang, Susanto Rahardja

更新日期：2024-04-09

详情收藏

Learning with an Open Horizon in Ever-Changing Dialogue Circumstances

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09
Christian Geishauser, Carel van Niekerk, Nurul Lubis, Hsien-chin Lin, Michael Heck, Shutong Feng, Benjamin Ruppik, Renato Vukovic, Milica Gašić

更新日期：2024-04-09

详情收藏

Low-Latency Neural Speech Phase Prediction based on Parallel Estimation Architecture and Anti-Wrapping Losses for Speech Generation Tasks

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-09
Yang Ai, Zhen-Hua Ling

更新日期：2024-04-09

详情收藏

Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-08
Yanxiong Li, Jialong Li, Yongjie Si, Jiaxin Tan, Qianhua He

更新日期：2024-04-08

详情收藏

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-04-01
Rui Liu, Berrak Sisman, Guanglai Gao, Haizhou Li

更新日期：2024-04-01

详情收藏

Dynamic Higher-Order Stereophony

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27
Jacob Hollebon, Filippo Maria Fazi

更新日期：2024-03-27

详情收藏

Speaker Distance Estimation in Enclosures from Single-Channel Audio

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27
Michael Neri, Archontis Politis, Daniel Krause, Marco Carli, Tuomas Virtanen

更新日期：2024-03-27

详情收藏

Large-scale unsupervised audio pre-training for video-to-speech synthesis

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27
Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

更新日期：2024-03-27

详情收藏

Adjustable Coherent-to-Diffuse Power Estimator for Binaural Speech Enhancement in Multi-Talker Environments

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-27
Reza Ghanavi, Craig T. Jin

更新日期：2024-03-27

详情收藏

A non-invasive speech quality evaluation algorithm for hearing aids with multi-head self-attention and audiogram-based features

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-26
Ruiyu Liang, Yue Xie, Jiaming Cheng, Cong Pang, Björn Schuller

更新日期：2024-03-26

详情收藏

FA-ExU-Net: the simultaneous training of an embedding extractor and enhancement model for a speaker verification system robust to short noisy utterances

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-22
Ju-ho Kim, Jungwoo Heo, Hyun-seo Shin, Chan-yeong Lim, Ha-Jin Yu

更新日期：2024-03-22

详情收藏

Information Dropping Data Augmentation for Machine Translation Quality Estimation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-22
Shuo Li, Xiaojun Bi, Tao Liu, Zheng Chen

更新日期：2024-03-22

详情收藏

Joint Dual Learning with Mutual Information Maximization for Natural Language Understanding and Generation in Dialogues

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-21
Shang-Yu Su, Yung-Sung Chung, Yun-Nung Chen

更新日期：2024-03-21

详情收藏

Self-Supervised Learning of Multi-level Audio Representations for Music Segmentation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20
Morgan Buisson, Brian McFee, Slim Essid, Hélène C. Crayencour

更新日期：2024-03-20

详情收藏

SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Lirong Dai, Jinyu Li, Furu Wei

更新日期：2024-03-20

详情收藏

Detecting the Presence of Sperm Whales' Echolocation Clicks in Noisy Environments

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20
Guy Gubnitky, Roee Diamant

更新日期：2024-03-20

详情收藏

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20
Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki

更新日期：2024-03-20

详情收藏

Leveraging Diverse Modeling Contexts with Collaborating Learning for Neural Machine Translation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-20
Yusheng Liao, Yanfeng Wang, Yu Wang

更新日期：2024-03-20

详情收藏

MusicECAN: An Automatic Denoising Network for Music Recordings with Efficient Channel Attention

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-19
Haonan Cheng, Shulin Liu, Zhicheng Lian, Long Ye, Qin Zhang

更新日期：2024-03-19

详情收藏

Phrase-Aware Financial Sentiment Analysis Based on Constituent Syntax

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-19
Chunli Xiang, Junchi Zhang, Jun Zhou, Fei Li, Chong Teng, Donghong Ji

更新日期：2024-03-19

详情收藏

Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18
Jiahong Li, Chenda Li, Yifei Wu, Yanmin Qian

更新日期：2024-03-18

详情收藏

How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18
Peter Leer, Jesper Jensen, Zheng-Hua Tan, Jan Østergaar, Lars Bramsløw

更新日期：2024-03-18

详情收藏

Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-18
Florian Schmid, Khaled Koutini, Gerhard Widmer

更新日期：2024-03-18

详情收藏

Dual-Channel Target Speaker Extraction Based on Conditional Variational Autoencoder and Directional Information

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-14
Rui Wang, Li Li, Tomoki Toda

更新日期：2024-03-14

详情收藏

Active Discovering New Slots for Task-Oriented Conversation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13
Yuxia Wu, Tianhao Dai, Zhedong Zheng, Lizi Liao

更新日期：2024-03-13

详情收藏

Unsupervised Disentanglement Learning Model for Exemplar-Guided Paraphrase Generation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13
Linjian Li, Yi Cai, Xin Wu

更新日期：2024-03-13

详情收藏

Question-Directed Reasoning With Relation-Aware Graph Attention Network for Complex Question Answering Over Knowledge Graph

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-13
Geng Zhang, Jin Liu, Guangyou Zhou, Kunsong Zhao, Zhiwen Xie, Bo Huang

更新日期：2024-03-13

详情收藏

HRTF upsampling with a generative adversarial network using a gnomonic equiangular projection

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-11
Aidan O. T. Hogg, Mads Jenkins, He Liu, Isaac Squires, Samuel J. Cooper, Lorenzo Picinali

更新日期：2024-03-11

详情收藏

KGAgent: Learning a Deep Reinforced Agent for Keyphrase Generation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-11
Yu Yao, Peng Yang, Guangzhen Zhao, Guoshun Yin

更新日期：2024-03-11

详情收藏

BaSFormer: A Balanced Sparsity Regularized Attention Network for Transformer

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06
Shuoran Jiang, Qingcai Chen, Yang Xiang, Youcheng Pan, Xiangping Wu

更新日期：2024-03-06

详情收藏

Let Topic Flow: A Unified Topic-guided Segment-wise Dialogue Summarization Framework

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06
Qinyu Han, Zhihao Yang, Hongfei Lin, Tian Qin

更新日期：2024-03-06

详情收藏

Reverberant Source Separation using NTF with Delayed Subsources and Spatial Priors

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06
Mieszko Fraś, Konrad Kowalczyk

更新日期：2024-03-06

详情收藏

A User-centric Approach for Deep Residual-Echo Suppression in Double-talk

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-06
Amir Ivry, Israel Cohen, Baruch Berdugo

更新日期：2024-03-06

详情收藏

Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab and Convolutional Recurrent Neural Networks

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04
Yingming Gao, Peter Birkholz, Ya Li

更新日期：2024-03-04

详情收藏

Envelope-Based Multichannel Noise Reduction for Cochlear Implant Applications

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04
Luciana M. X. de Souza, Márcio H. Costa, Renata C. Borges

更新日期：2024-03-04

详情收藏

Decomposed Meta-Learning for Few-Shot Sequence Labeling

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-03-04
Tingting Ma, Qianhui Wu, Huiqiang Jiang, Jieru Lin, Börje F. Karlsson, Tiejun Zhao, Chin-Yew Lin

更新日期：2024-03-04

详情收藏

On Local Temporal Embedding for Semi-Supervised Sound Event Detection

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-28
Lijian Gao, Qirong Mao, Ming Dong

更新日期：2024-02-28

详情收藏

Hierarchical Multi-granularity Interaction Graph Convolutional Network for Long Document Classification

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-28
Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin

更新日期：2024-02-28

详情收藏

Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Yoshiki Masuyama, Kouei Yamaoka, Takao Kawamura, Nobutaka Ono

更新日期：2024-02-23

详情收藏

Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Théo Mariotte, Anthony Larcher, Silvio Montrésor, Jean-Hugh Thomas

更新日期：2024-02-23

详情收藏

Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Taihui Wang, Feiran Yang, Jun Yang

更新日期：2024-02-23

详情收藏

Time-domain Speech Super-resolution with GAN based Modeling for Telephony Speaker Verification

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Żelasko, Najim Dehak

更新日期：2024-02-23

详情收藏

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Chenfeng Miao, Qingying Zhu, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao

更新日期：2024-02-23

详情收藏

Acoustic Imaging with Circular Microphone Array: a new Approach for Sound Field Analysis

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Marco Olivieri, Amy Bastine, Mirco Pezzoli, Fabio Antonacci, Thushara Abhayapala, Augusto Sarti

更新日期：2024-02-23

详情收藏

Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-23
Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, Hiroshi Saruwatari

更新日期：2024-02-23

详情收藏

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-22
Nursadul Mamun, John H. L. Hansen

更新日期：2024-02-22

详情收藏

R 2: A Novel Recall & Ranking Framework for Legal Judgment Prediction

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19
Yuquan Le, Zhe Quan, Jiawei Wang, Da Cao, Kenli Li

更新日期：2024-02-19

详情收藏

Please donate to save a Life: Inducing Politeness to handle Resistance in Persuasive Dialogue Agents

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19
Kshitij Mishra, Mauajama Firdaus, Asif Ekbal

更新日期：2024-02-19

详情收藏

NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-19
Adrián Barahona-Ríos, Tom Collins

更新日期：2024-02-19

详情收藏

Accented Text-to-Speech Synthesis with Limited Data

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-16
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li

更新日期：2024-02-16

详情收藏

Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer

IEEE ACM Trans. Audio Speech Lang. Process. (IF 5.4) Pub Date : 2024-02-16
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian

更新日期：2024-02-16

详情收藏