|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 1123 publication records. Showing 1123 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | David Qiu, Shaojin Ding, Yanzhang He |
The Role of Feature Correlation on Quantized Neural Networks. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yongmao Zhang, Guanghou Liu, Yi Lei, Yunlin Chen, Hao Yin, Lei Xie 0001, Zhifei Li |
Promptspeaker: Speaker Generation Based on Text Descriptions. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | William Chen, Jiatong Shi, Brian Yan, Dan Berrebbi, Wangyou Zhang, Yifan Peng, Xuankai Chang, Soumi Maiti, Shinji Watanabe 0001 |
Joint Prediction and Denoising for Large-Scale Multilingual Self-Supervised Learning. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Rajeev Rajan, Noumida Abdul Kareem, Sreelakshmi S |
Paraconsistent Feature Analysis for the Competency Evaluation of Voice Impersonation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Junfeng Hou, Peiyao Wang, Jincheng Zhang, Meng Yang, Minwei Feng, Jingcheng Yin |
CTC Blank Triggered Dynamic Layer-Skipping for Efficient Ctc-Based Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jason Clarke, Yoshihiko Gotoh, Stefan Goetze |
Improving Audiovisual Active Speaker Detection in Egocentric Recordings with the Data-Efficient Image Transformer. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke |
Generative Speech Recognition Error Correction With Large Language Models and Task-Activating Prompting. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Varun Krishna, Sriram Ganapathy |
Pseudo-Label Based Supervised Contrastive Loss for Robust Speech Representations. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yosuke Higuchi, Andrew Rosenberg, Yuan Wang, Murali Karthick Baskar, Bhuvana Ramabhadran |
Mask-Conformer: Augmenting Conformer with Mask-Predict Decoder. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Zihan Zhang, Jiayao Sun, Xianjun Xia, Ziqian Wang, Xiaopeng Yan, Yijian Xiao, Lei Xie 0001 |
An Exploration of Task-Decoupling on Two-Stage Neural Post Filter for Real-Time Personalized Acoustic Echo Cancellation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Ke Hu, Tara N. Sainath, Bo Li 0028, Yu Zhang, Yong Cheng, Tao Wang, Yujing Zhang, Frederick Liu |
Improving Multilingual and Code-Switching ASR Using Large Language Model Generated Text. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Anusha Prakash 0001, Srinivasan Umesh, Hema A. Murthy |
Towards Developing State-of-The-Art TTS Synthesisers for 13 Indian Languages with Signal Processing Aided Alignments. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Ching-Feng Yeh, Po-Yao Huang 0001, Vasu Sharma, Shang-Wen Li 0001, Gargi Ghosh |
Flap: Fast Language-Audio Pre-Training. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yuewei Zhang, Huanbin Zou, Jie Zhu |
Vsanet: Real-Time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang 0007, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma 0010, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar 0003, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe 0001, Yangyang Shi, Yumeng Tao |
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Peikun Chen, Fan Yu, Yuhao Liang, Hongfei Xue, Xucheng Wan, Naijun Zheng, Huan Zhou 0004, Lei Xie 0001 |
BA-MoE: Boundary-Aware Mixture-of-Experts Adapter for Code-Switching Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Gan Song, Zelin Wu, Golan Pundak, Angad Chandorkar, Kandarp Joshi, Xavier Velez, Diamantino Caseiro, Ben Haynor, Weiran Wang, Nikhil Siddhartha, Pat Rondon, Khe Chai Sim |
Contextual Spelling Correction with Large Language Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Robin Netzorg, Ajil Jalal, Luna McNulty, Gopala Krishna Anumanchipalli |
Permod: Perceptually Grounded Voice Modification With Latent Diffusion Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Lakshmi Rajendram Bashyam, Alexander Blatt, Dietrich Klakow |
Enabling Noisy Label Usage for Out-of-Airspace Data in Read-Back Error Detection. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Lahiru Samarakoon, Samuel J. Broughton, Marc Härkönen, Ivan Fung |
Transformer Attractors for Robust and Efficient End-To-End Neural Diarization. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yuewei Zhang, Huanbin Zou, Jie Zhu |
Magnitude-and-Phase-Aware Speech Enhancement With Parallel Sequence Modeling. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Takatomo Kano, Atsunori Ogawa, Marc Delcroix, Kohei Matsuura, Takanori Ashihara, William Chen, Shinji Watanabe 0001 |
Summarize While Translating: Universal Model With Parallel Decoding for Summarization and Translation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Minchan Kim, Myeonghun Jeong, Byoung Jin Choi, Dongjune Lee, Nam Soo Kim |
Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Tzu-Quan Lin, Hung-Yi Lee, Hao Tang 0002 |
MelHuBERT: A Simplified Hubert on Mel Spectrograms. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Armand Stricker, Patrick Paroubek |
Enhancing Task-Oriented Dialogues With Chitchat: A Comparative Study Based on Lexical Diversity And Divergence. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jun-You Wang, Chon-In Leong, Yu-Chen Lin, Li Su, Jyh-Shing Roger Jang |
Adapting Pretrained Speech Model for Mandarin Lyrics Transcription and Alignment. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Zili Qi, Xinhui Hu, Wangjin Zhou, Sheng Li 0010, Hao Wu, Jian Lu, Xinkang Xu |
LE-SSL-MOS: Self-Supervised Learning MOS Prediction with Listener Enhancement. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu |
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba 0001, Sanjeev Khudanpur, Najim Dehak |
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Daichi Hayakawa, Takehiko Kagoshima, Kenji Iwata, Norbert Braunschweiler, Rama Doddipatla |
Robust Recognition of Speaker Emotion With Difference Feature Extraction Using a Few Enrollment Utterances. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Wen-Chin Huang, Lester Phillip Violeta, Songxiang Liu, Jiatong Shi, Tomoki Toda |
The Singing Voice Conversion Challenge 2023. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Maliha Jahan, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak, Jesús Villalba 0001 |
Model-Based Fairness Metric for Speaker Verification. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Meng Yu 0003, Yong Xu 0004, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu 0001 |
Neuralecho: Hybrid of Full-Band and Sub-Band Recurrent Neural Network For Acoustic Echo Cancellation and Speech Enhancement. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Hillary Ngai, Rohan Agrawal, Neeraj Gaur, W. Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar |
Audio-Adapterfusion: A Task-Id-Free Approach for Efficient and Non-Destructive Multi-Task Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Da-Hee Yang, Joon-Hyuk Chang |
Towards Robust Packet Loss Concealment System With ASR-Guided Representations. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jae-Hong Lee, Do-Hee Kim, Joon-Hyuk Chang |
AWMC: Online Test-Time Adaptation Without Mode Collapse for Continual Adaptation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba 0001, Sanjeev Khudanpur, Najim Dehak |
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Hsin-Tien Chiang, Kuo-Hsuan Hung, Szu-Wei Fu, Heng-Cheng Kuo, Ming-Hsueh Tsai, Yu Tsao 0001 |
Study on the Correlation Between Objective Evaluations and Subjective Speech Quality and Intelligibility. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jin Qiu, Lu Huang, Boyu Li, Jun Zhang 0066, Lu Lu 0015, Zejun Ma |
Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jarod Duret, Benjamin O'Brien, Yannick Estève, Titouan Parcollet |
Enhancing Expressivity Transfer in Textless Speech-to-Speech Translation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Kailai Shen, Diqun Yan, Li Dong 0006, Ying Ren, Xiaoxun Wu, Jing Hu |
SQAT-LD: SPeech Quality Assessment Transformer Utilizing Listener Dependent Modeling for Zero-Shot Out-of-Domain MOS Prediction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel 0001, Jee-Weon Jung, Soumi Maiti, Shinji Watanabe 0001 |
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Erica Cooper, Wen-Chin Huang, Yu Tsao 0001, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi |
The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Chung-Ming Chien, Mingjiamei Zhang, Ju-Chieh Chou, Karen Livescu |
Few-Shot Spoken Language Understanding Via Joint Speech-Text Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Huali Zhou, Yueqian Lin, Yao Shi, Peng Sun, Ming Li |
Bisinger: Bilingual Singing Voice Synthesis. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Masao Someki, Nicholas Eng, Yosuke Higuchi, Shinji Watanabe 0001 |
Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Hao Zhang, Meng Yu, Dong Yu |
Deep Learning for Joint Acoustic Echo and Acoustic Howling Suppression in Hybrid Meetings. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Mohan Li, Catalin Zorila, Cong-Thanh Do, Rama Doddipatla |
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yuke Li, Xinfa Zhu, Yi Lei, Hai Li, Junhui Liu, Danming Xie, Lei Xie 0001 |
Zero-Shot Emotion Transfer for Cross-Lingual Speech Synthesis. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chuang, Xuankai Chang, Yuxun Tang, Shang-Wen Li 0001, Abdelrahman Mohamed, Hung-Yi Lee, Shinji Watanabe 0001 |
Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Dongji Gao, Hainan Xu, Desh Raj, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur |
Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Kohei Saijo, Wangyou Zhang, Zhong-Qiu Wang, Shinji Watanabe 0001, Tetsunori Kobayashi, Tetsuji Ogawa |
A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, And Extraction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jihyun Lee, Yejin Jeon, Wonjun Lee, Yunsu Kim 0001, Gary Geunbae Lee |
Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Anjali Raj, Shikhar Bharadwaj, Sriram Ganapathy, Min Ma, Shikhar Vashishth |
MASR: Multi-Label Aware Speech Representation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Eric Sun, Jinyu Li 0001, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu 0001, Edward Lin, Yifan Gong 0001 |
Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Shaoxiong Lin, Chao Zhang, Yanmin Qian |
Improving Speech Enhancement Using Audio Tagging Knowledge From Pre-Trained Representations and Multi-Task Learning. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Zhengyang Li, Thomas Graave, Jing Liu, Timo Lohrenz, Siegfried Kunzmann, Tim Fingscheidt |
Parameter-Efficient Cross-Language Transfer Learning for a Language-Modular Audiovisual Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Sibo Tong, Philip Harding, Simon Wiesler |
Hierarchical Attention-Based Contextual Biasing For Personalized Speech Recognition Using Neural Transducers. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yusheng Tian, Wei Liu, Tan Lee |
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yan Huang 0028, Piyush Behre, Guoli Ye, Shawn Chang, Yifan Gong 0001 |
Multi Transcription-Style Speech Transcription Using Attention-Based Encoder-Decoder Model. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang 0001, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu |
Acoustic Model Fusion For End-to-End Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Sangeet Sagar, Mirco Ravanelli, Bernd Kiefer, Ivana Kruijff-Korbayová, Josef van Genabith |
Rescuespeech: A German Corpus for Speech Recognition in Search and Rescue Domain. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yingzhi Wang, Mirco Ravanelli, Alya Yacoubi |
Speech Emotion Diarization: Which Emotion Appears When? |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jun-You Wang, Hung-Yi Lee, Jyh-Shing Roger Jang, Li Su |
Zero-Shot Singing Voice Synthesis from Musical Score. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Veera Raghavendra Elluru, Devang Kulshreshtha, Rohit Paturi, Sravan Bodapati, Srikanth Ronanki |
Generalized Zero-Shot Audio-to-Intent Classification. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Guru Prakash Arumugam, Shuo-Yiin Chang, Tara N. Sainath, Rohit Prabhavalkar, Quan Wang, Shaan Bijwadia |
Improved Long-Form Speech Recognition By Jointly Modeling The Primary And Non-Primary Speakers. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Xinjian Li, Shinnosuke Takamichi, Takaaki Saeki, William Chen, Sayaka Shiota, Shinji Watanabe 0001 |
Yodas: Youtube-Oriented Dataset for Audio and Speech. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Seongjin Park, Rutuja Ubale |
Multitask Learning Model with Text and Speech Representation for Fine-Grained Speech Scoring. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yusuke Shinohara, Shinji Watanabe 0001 |
Domain Adaptation by Data Distribution Matching Via Submodularity For Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yangze Li, Fan Yu, Yuhao Liang, Pengcheng Guo, Mohan Shi, Zhihao Du, Shiliang Zhang, Lei Xie 0001 |
Sa-Paraformer: Non-Autoregressive End-To-End Speaker-Attributed ASR. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Ivan Fung, Lahiru Samarakoon, Samuel J. Broughton |
Robust End-to-End Diarization with Domain Adaptive Training and Multi-Task Learning. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Bahman Mirheidari, Ronan O'Malley, Daniel Blackburn, Heidi Christensen |
Identifying People with Mild Cognitive Impairment at Risk of Developing Dementia using Speech Analysis. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Dominik Wagner, Ilja Baumann, Sebastian P. Bayerl, Korbinian Riedhammer, Tobias Bocklet |
Speaker Adaptation for End-to-End Speech Recognition Systems in Noisy Environments. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | William Ravenscroft, Stefan Goetze, Thomas Hain |
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Xingyu Cai, David Qiu, Shaojin Ding, Dongseong Hwang, Weiran Wang, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He |
Efficient Cascaded Streaming ASR System Via Frame Rate Reduction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jian Xue, Peidong Wang, Jinyu Li 0001, Eric Sun |
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Daniel Galvez, Tim Kaldewey |
GPU-Accelerated Wfst Beam Search Decoder for CTC-Based Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Takuma Okamoto, Haruki Yamashita, Yamato Ohtani, Tomoki Toda, Hisashi Kawai |
WaveNeXt: ConvNeXt-Based Fast Neural Vocoder Without ISTFT layer. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Dongning Yang, Wei Wang, Yanmin Qian |
FAT-HuBERT: Front-End Adaptive Training of Hidden-Unit BERT For Distortion-Invariant Robust Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Elaf Islam, Thomas Hain, Protima Nomo Sudro |
Simulation of Teacher-Learner Interaction in English Language Pronunciation Learning. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Daniel Mann, Tina Raissi, Wilfried Michel, Ralf Schlüter, Hermann Ney |
End-To-End Training of a Neural HMM with Label and Transition Probabilities. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Nick Rossenbach, Benedikt Hilmes, Ralf Schlüter |
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Roshan S. Sharma, William Chen, Takatomo Kano, Ruchira Sharma, Siddhant Arora, Shinji Watanabe 0001, Atsunori Ogawa, Marc Delcroix, Rita Singh, Bhiksha Raj |
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Aleksandr Meister, Matvei Novikov, Nikolay Karpov, Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg |
LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of End-to-End ASR Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung |
Consistency Based Unsupervised Self-Training for ASR Personalisation. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Zexu Pan, Gordon Wichern, Yoshiki Masuyama, François G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux |
Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Weiming Xu, Zhouxuan Chen, Zhili Tan, Shubo Lv, Runduo Han, Wenjiang Zhou, Weifeng Zhao, Lei Xie |
MBTFNET: Multi-Band Temporal-Frequency Neural Network for Singing Voice Enhancement. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Yu-Hsiang Wang, Huang-Yu Chen, Kai-Wei Chang, Winston H. Hsu, Hung-Yi Lee |
Minisuperb: Lightweight Benchmark for Self-Supervised Speech Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Geoffroy Vanderreydt, Amrutha Prasad, Driss Khalil, Srikanth R. Madikeri, Kris Demuynck, Petr Motlícek |
Parameter-Efficient Tuning with Adaptive Bottlenecks for Automatic Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Abinay Reddy Naini, Shruthi Subramanium, Seong-Gyun Leem, Carlos Busso |
Combining Relative and Absolute Learning Formulations to Predict Emotional Attributes From Speech. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Md Asif Jalal, Pablo Peso Parada, George Pavlidis, Vasileios Moschopoulos, Karthikeyan Saravanan, Chrysovalantis-Giorgos Kontoulis, Jisi Zhang, Anastasios Drosou, Gil Ho Lee, Jungin Lee, Seokyeong Jung |
Locality Enhanced Dynamic Biasing and Sampling Strategies For Contextual ASR. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Pasquale D'Alterio, Christian Hensel, Bashar Awwad Shiekh Hasan |
Can Unpaired Textual Data Replace Synthetic Speech in ASR Model Adaptation? |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao 0001 |
LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain |
MUST: A Multilingual Student-Teacher Learning Approach for Low-Resource Speech Recognition. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Mun-Hak Lee, Sang-Eon Lee, Ji-Eun Choi, Joon-Hyuk Chang |
Cross-Modal Learning for CTC-Based ASR: Leveraging CTC-Bertscore and Sequence-Level Training. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Amit Meghanani, Thomas Hain |
Deriving Translational Acoustic Sub-Word Embeddings. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Jiajun He, Zekun Yang, Tomoki Toda |
ED-CEC: Improving Rare word Recognition Using ASR Postprocessing Based on Error Detection and Context-Aware Error Correction. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Lillian Zhou, Yuxin Ding, Mingqing Chen, Harry Zhang, Rohit Prabhavalkar, Dhruv Guliani, Giovanni Motta, Rajiv Mathews |
The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections Through Federated Learning. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Prashanth Gurunath Shivakumar, Jari Kolehmainen, Yile Gu, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko |
Discriminative Speech Recognition Rescoring With Pre-Trained Language Models. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Junteng Jia, Ke Li, Mani Malek 0001, Kshitiz Malik, Jay Mahadeokar, Ozlem Kalinli, Frank Seide |
Joint Federated Learning and Personalization for on-Device ASR. |
ASRU |
2023 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 1123 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|