Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Yijing Chu, Sipei Zhao, Feng Niu, Yongzheng Dong, Yuezhe Zhao |
A New Diffusion Filtered-X Affine Projection Algorithm: Performance Analysis and Application in Windy Environment. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yoav Vered, Stephen J. Elliott |
A Parallel Analog and Digital Adaptive Feedforward Controller for Active Noise Control. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Matan Karo, Arie Yeredor, Itshak Lapidot |
Compact Time-Domain Representation for Logical Access Spoofed Audio. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Wenmeng Xiong, Changchun Bao, Jing Zhou, Maoshen Jia, José Picheral |
Joint DOA Estimation and Dereverberation Based on Multi-Channel Linear Prediction Filtering and Azimuth Sparsity. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Varun Krishna, Tarun Sai, Sriram Ganapathy |
Representation Learning With Hidden Unit Clustering for Low Resource Speech Applications. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yoshiki Masuyama, Kouei Yamaoka, Yuma Kinoshita, Taishi Nakashima, Nobutaka Ono |
Causal and Relaxed-Distortionless Response Beamforming for Online Target Source Extraction. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xu Wang, Hainan Zhang, Shuai Zhao 0001, Hongshen Chen, Zhuoye Ding, Zhiguo Wan, Bo Cheng 0001, Yanyan Lan |
Debiasing Counterfactual Context With Causal Inference for Multi-Turn Dialogue Reasoning. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Alejandro Santorum Varela, Svetlana Stoyanchev, Simon Keizer, Rama Doddipatla, Kate Knill |
Entity Resolution in Situated Dialog With Unimodal and Multimodal Transformers. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Reza Varzandeh, Simon Doclo, Volker Hohmann |
Speech-Aware Binaural DOA Estimation Utilizing Periodicity and Spatial Features in Convolutional Neural Networks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Shekhar Kumar Yadav, Nithin V. George |
Joint Dereverberation and Beamforming With Blind Estimation of the Shape Parameter of the Desired Source Prior. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yaru Zhao, Bo Cheng 0001, Yakun Huang, Zhiguo Wan |
FluGCF: A Fluent Dialogue Generation Model With Coherent Concept Entity Flow. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Han Wu 0004, Kun Xu 0005, Linqi Song |
Structure-Aware Dialogue Modeling Methods for Conversational Semantic Role Labeling. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xun Gong 0005, Yu Wu 0012, Jinyu Li 0001, Shujie Liu 0001, Rui Zhao 0017, Xie Chen 0001, Yanmin Qian |
Advanced Long-Content Speech Recognition With Factorized Neural Transducer. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Lior Frenkel, Shlomo E. Chazan, Jacob Goldberger |
Domain Adaptation Using Suitable Pseudo Labels for Speech Enhancement and Dereverberation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Davide Berghi, Philip J. B. Jackson |
Leveraging Visual Supervision for Array-Based Active Speaker Detection and Localization. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | David Thulke, Nico Daheim, Christian Dugast, Hermann Ney |
Task-Oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Shouhui Wang, Biao Qin |
A Novel Joint Training Model for Knowledge Base Question Answering. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Congcong Jiang, Tieyun Qian, Bing Liu 0001 |
One General Teacher for Multi-Data Multi-Task: A New Knowledge Distillation Framework for Discourse Relation Analysis. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Shiwen Ni, Jiawen Li 0003, Min Yang 0007, Hung-Yu Kao |
DropAttack: A Random Dropped Weight Attack Adversarial Training for Natural Language Understanding. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Penghui Ma, Jianfeng Li 0001, Jingjing Pan, Xiaofei Zhang 0001, Roberto Gil-Pita |
Coherent Signal DOA Estimation With Coprime Array: Exploiting Signal Subspace Reconstructing Strategy. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Lijian Gao, Qirong Mao, Ming Dong 0001 |
On Local Temporal Embedding for Semi-Supervised Sound Event Detection. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Changkai Lin, Hongju Cheng, Qiang Rao, Yang Yang 0026 |
M$^{3}$SA: Multimodal Sentiment Analysis Based on Multi-Scale Feature Extraction and Multi-Task Learning. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Haisheng Lu, Jiangnan Liang, Chuang Shi |
Comments on "Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction". |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yile Wang, Yue Zhang 0004, Peng Li 0030, Yang Liu 0005 |
Gradual Syntactic Label Replacement for Language Model Pre-Training. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xuehao Zhou, Mingyang Zhang 0003, Yi Zhou 0020, Zhizheng Wu 0001, Haizhou Li 0001 |
Accented Text-to-Speech Synthesis With Limited Data. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiaotong Jiang, Peiwen You, Chen Chen 0090, Zhongqing Wang, Guodong Zhou |
Exploring Scope Detection for Aspect-Based Sentiment Analysis. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Alkis Koudounas, Eliana Pastor, Giuseppe Attanasio, Vittorio Mazzia, Manuel Giollo, Thomas Gueudré, Elisa Reale, Luca Cagliero, Sandro Cumani, Luca de Alfaro, Elena Baralis, Daniele Amberti |
Towards Comprehensive Subgroup Performance Analysis in Speech Models. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Han Zhu, Gaofeng Cheng, Jindong Wang 0001, Wenxin Hou, Pengyuan Zhang, Yonghong Yan 0002 |
Boosting Cross-Domain Speech Recognition With Self-Supervision. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Thomas Haubner, Andreas Brendel, Walter Kellermann |
End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Marco Olivieri, Amy Bastine, Mirco Pezzoli, Fabio Antonacci, Thushara D. Abhayapala, Augusto Sarti |
Acoustic Imaging With Circular Microphone Array: A New Approach for Sound Field Analysis. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yigitcan Özer, Meinard Müller |
Source Separation of Piano Concertos Using Musically Motivated Augmentation Techniques. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Shu Jiang, Zuchao Li, Hai Zhao 0001, Weiping Ding 0001 |
Entity-Relation Extraction as Full Shallow Semantic Dependency Parsing. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Taihui Wang, Feiran Yang, Jun Yang 0004 |
Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Vibhav Agarwal, Sourav Ghosh, Harichandana B. S. S, Himanshu Arora, Barath Raj Kandur Raja |
TrICy: Trigger-Guided Data-to-Text Generation With Intent Aware Attention-Copy. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiao Li, Ruirui Liu, Huichou Huang, Qingyao Wu |
Contrastive Learning for Target Speaker Extraction With Attention-Based Fusion. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jungwoo Lim, Taesun Whang, Dongyub Lee, Heuiseok Lim |
Adaptive Multi-Domain Dialogue State Tracking on Spoken Conversations. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jiahao Zhao, Wenji Mao, Daniel Dajun Zeng |
Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jingsong Yan, Piji Li, Haibin Chen, Junhao Zheng, Qianli Ma 0001 |
Does the Order Matter? A Random Generative Way to Learn Label Hierarchy for Hierarchical Text Classification. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Linhui Sun, Shuo Yuan, Aifei Gong, Lei Ye, Eng Siong Chng |
Dual-Branch Modeling Based on State-Space Model for Speech Enhancement. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yanxiong Li, Zhongjie Jiang, Qisheng Huang, Wenchang Cao, Jialong Li |
Lightweight Speaker Verification Using Transformation Module With Feature Partition and Fusion. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Songbin Li, Jingang Wang, Peng Liu 0046, Ke Shi |
SANet: A Compressed Speech Encoder and Steganography Algorithm Independent Steganalysis Deep Neural Network. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xinfa Zhu, Yi Lei, Tao Li, Yongmao Zhang, Hongbin Zhou, Heng Lu, Lei Xie 0001 |
METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Qiangqiang Zhang, Dongyuan Lin, Yingying Xiao, Yunfei Zheng, Shiyuan Wang |
Error Reused Filtered-X Least Mean Square Algorithm for Active Noise Control. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Huang He, Hua Lu 0014, Siqi Bao, Fan Wang, Hua Wu 0003, Zheng-Yu Niu, Haifeng Wang 0001 |
Learning to Select External Knowledge With Multi-Scale Negative Sampling. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gaël Richard, Florence d'Alché-Buc |
Tackling Interpretability in Audio Classification Networks With Non-negative Matrix Factorization. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yuhan Dai, Zhirui Zhang, Yichao Du, Shengcai Liu, Lemao Liu, Tong Xu 0001 |
Datastore Distillation for Nearest Neighbor Machine Translation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jiapu Wang, Boyue Wang, Junbin Gao, Simin Hu, Yongli Hu, Baocai Yin |
Multi-Level Interaction Based Knowledge Graph Completion. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiang Chen 0016, Lei Li 0040, Yuqi Zhu, Shumin Deng, Chuanqi Tan, Fei Huang 0004, Luo Si, Ningyu Zhang 0001, Huajun Chen |
Sequence Labeling as Non-Autoregressive Dual-Query Set Generation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Alexander Bohlender, Ann Spriet, Wouter Tirry, Nilesh Madhu |
Spatially Selective Speaker Separation Using a DNN With a Location Dependent Feature Extraction. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter |
Disentangling Prosody Representations With Unsupervised Speech Reconstruction. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Myeonghun Jeong, Minchan Kim, Byoung Jin Choi, Jaesam Yoon, Won Jang, Nam Soo Kim |
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Vinay Kothapally, John H. L. Hansen |
Monaural Speech Dereverberation Using Deformable Convolutional Networks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yongwei Zhou, Junwei Bao 0001, Youzheng Wu, Xiaodong He 0001, Tiejun Zhao |
Operation-Augmented Numerical Reasoning for Question Answering. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yuchen Hu, Chen Chen 0075, Qiushi Zhu, Eng Siong Chng |
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yile Wang, Yue Zhang 0004 |
Lost in Context? On the Sense-Wise Variance of Contextualized Word Embeddings. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Federico Miotello, Mirco Pezzoli, Luca Comanducci, Fabio Antonacci, Augusto Sarti |
Deep Prior-Based Audio Inpainting Using Multi-Resolution Harmonic Convolutional Neural Networks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Hao-Chen Pei, Hao Fang 0010, Xin Luo 0006, Xin-Shun Xu |
Gradformer: A Framework for Multi-Aspect Multi-Granularity Pronunciation Assessment. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yun Zhao, Dexi Liu, Changxuan Wan, Xiping Liu, Jian-Yun Nie, Jiaming Liu |
JMS-QA: A Joint Hierarchical Architecture for Mental Health Question Answering. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Wei Wang, Yanmin Qian |
Universal Cross-Lingual Data Generation for Low Resource ASR. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiaobo Liang, Runze Mao, Lijun Wu, Juntao Li, Min Zhang 0005, Qing Li 0001 |
Enhancing Low-Resource NLP by Consistency Training With Data and Model Perturbations. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Changhao Ding, Zhangjie Fu, Zhongliang Yang, Qi Yu, Daqiu Li, Yongfeng Huang 0001 |
Context-Aware Linguistic Steganography Model Based on Neural Machine Translation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Srdan Kitic, Jérôme Daniel |
Blind Identification of Ambisonic Reduced Room Impulse Response. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Mathias Bach Pedersen, Søren Holdt Jensen, Zheng-Hua Tan, Jesper Jensen 0001 |
Data-Driven Non-Intrusive Speech Intelligibility Prediction Using Speech Presence Probability. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Hua Lu 0014, Zhen Guo, Chanjuan Li, Yunyi Yang, Huang He, Siqi Bao |
Towards Building an Open-Domain Dialogue System Incorporated With Internet Memes. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin |
Hierarchical Multi-Granularity Interaction Graph Convolutional Network for Long Document Classification. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Szymon Drgas, Lars Bramsløw, Archontis Politis, Gaurav Naithani, Tuomas Virtanen |
Dynamic Processing Neural Network Architecture for Hearing Loss Compensation. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe 0001, Shinnosuke Takamichi, Hiroshi Saruwatari |
Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Orel Ben Zaken, Anurag Kumar 0003, Vladimir Tourbabin, Boaz Rafaely |
Neural-Network-Based Direction-of-Arrival Estimation for Reverberant Speech - The Importance of Energetic, Temporal, and Spatial Information. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Koichiro Yoshino, Yun-Nung Chen, Paul A. Crook, Satwik Kottur, Jinchao Li, Behnam Hedayatnia, Seungwhan Moon, Zhengcong Fei, Zekang Li, Jinchao Zhang, Yang Feng, Jie Zhou 0016, Seokhwan Kim, Yang Liu 0004, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan 0001, Dilek Hakkani-Tur, Babak Damavandi, Alborz Geramifard, Chiori Hori, Ankit Shah, Chen Zhang, Haizhou Li 0001, João Sedoc, Luis F. D'Haro, Rafael E. Banchs, Alexander Rudnicky |
Overview of the Tenth Dialog System Technology Challenge: DSTC10. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Shiyao Cui, Jiangxia Cao, Xin Cong, Jiawei Sheng, Quangang Li, Tingwen Liu, Jinqiao Shi |
Enhancing Multimodal Entity and Relation Extraction With Variational Information Bottleneck. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Etienne Thuillier, Craig T. Jin, Vesa Välimäki |
HRTF Interpolation Using a Spherical Neural Process Meta-Learner. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Chin-Po Chen, Ho-Hsien Pan, Susan Shur-Fen Gau, Chi-Chun Lee |
Using Measures of Vowel Space for Autistic Traits Characterization. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xian Li, Nian Shao, Xiaofei Li 0001 |
Self-Supervised Audio Teacher-Student Transformer for Both Clip-Level and Frame-Level Tasks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Saurabh Kataria, Jesús Villalba 0001, Laureano Moro-Velázquez, Piotr Zelasko, Najim Dehak |
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Dong Zhou, Fang Lei, Lin Li 0001, Yongmei Zhou, Aimin Yang |
Cross-Modal Interaction via Reinforcement Feedback for Audio-Lyrics Retrieval. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Ruiying Lu, Bo Chen 0001, Dandan Guo, Dongsheng Wang 0003, Mingyuan Zhou |
Hierarchical Topic-Aware Contextualized Transformers. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Anurenjan Purushothaman, Debottam Dutta, Rohit Kumar, Sriram Ganapathy |
Speech Dereverberation With Frequency Domain Autoregressive Modeling. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Zhibin Quan, Chi-Man Vong, Weili Zeng, Wankou Yang |
The MorPhEMe Machine: An Addressable Neural Memory for Learning Knowledge-Regularized Deep Contextualized Chinese Embedding. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yehav Alkaher, Israel Cohen |
Howling Detection and Gain Control for Speech Reinforcement in a Noisy Car Cabin Environment. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki, Shoji Makino |
Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Cristian Lucian Stanciu, Jacob Benesty, Constantin Paleologu, Ruxandra-Liana Costea, Laura-Maria Dogariu, Silviu Ciochina |
Decomposition-Based Wiener Filter Using the Kronecker Product and Conjugate Gradient Method. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Femke B. Gelderblom, Tron V. Tronstad, Torbjørn Svendsen, Tor André Myrvoll |
On the Predictive Power of Objective Intelligibility Metrics for the Subjective Performance of Deep Complex Convolutional Recurrent Speech Enhancement Networks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Yoshiki Masuyama, Kouei Yamaoka, Takao Kawamura, Nobutaka Ono |
Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Bing Han, Zhengyang Chen, Yanmin Qian |
Self-Supervised Learning With Cluster-Aware-DINO for High-Performance Robust Speaker Verification. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely |
Analysis and Design of Head-Tracked Compensation for Bilateral Ambisonics. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Seong-Gyun Leem, Daniel Fulford, Jukka-Pekka Onnela, David Gard, Carlos Busso |
Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiuying Chen, Shen Gao, Mingzhe Li, Qingqing Zhu, Xin Gao 0001, Xiangliang Zhang 0001 |
Write Summary Step-by-Step: A Pilot Study of Stepwise Summarization. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Lei Liu, Li Liu 0036, Haizhou Li 0001 |
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Emma Hamel, Nickvash Kani |
Factors That Influence Automatic Recognition of African-American Vernacular English in Machine-Learning Models. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Tarek Kanan, Amani AbedAlghafer, Shadi AlZu'bi, Bilal Hawashin, Ala Mughaid, Ghassan Kanaan, M. M. Kamruzzaman |
An Intelligent Health Care System for Detecting Drug Abuse in Social Media Platforms Based on Low Resource Language. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Ritujoy Biswas, Karan Nathwani, Vinayak Abrol |
Statistically Guided Near-End Speech Intelligibility Improvement Through Voice Transformation and Transfer Learning. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Zhibo Man, Zengcheng Huang, Yujie Zhang, Yu Li, Yuanmeng Chen, Yufeng Chen 0005, Jinan Xu |
WDSRL: Multi-Domain Neural Machine Translation With Word-Level Domain-Sensitive Representation Learning. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Daniel Aleksander Krause 0001, Guillermo García-Barrios, Archontis Politis, Annamaria Mesaros |
Binaural Sound Source Distance Estimation and Localization for a Moving Listener. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos |
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Xiaotong Jiang, Ruirui Bai, Zhongqing Wang, Guodong Zhou |
Cross-Domain Aspect-Based Sentiment Classification With Tripartite Graph Modeling. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jiaming Xu 0001, Jian Cui, Yunzhe Hao, Bo Xu 0002 |
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Zhengyang Chen, Bing Han, Shuai Wang 0016, Yanmin Qian |
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Puning Zhang, Rongjian Zhao, Boran Yang, Yuexian Li, Zhigang Yang |
Integrated Syntactic and Semantic Tree for Targeted Sentiment Classification Using Dual-Channel Graph Convolutional Network. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Jun Kong, Jin Wang 0008, Xuejie Zhang 0002 |
Adaptive Ensemble Self-Distillation With Consistent Gradients for Fast Inference of Pretrained Language Models. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
1 | Hoang Ngoc Chau, Tien Dat Bui, Huu Binh Nguyen, Thanh Thi Hien Duong, Quoc-Cuong Nguyen |
A Novel Approach to Multi-Channel Speech Enhancement Based on Graph Neural Networks. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|