|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 986 publication records. Showing 986 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | |
IEEE Spoken Language Technology Workshop, SLT 2022, Doha, Qatar, January 9-12, 2023 |
SLT |
2023 |
DBLP DOI BibTeX RDF |
|
1 | Ke-Han Lu, Kuan-Yu Chen |
A Context-Aware Knowledge Transferring Strategy for CTC-Based ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Giovanni Morrone, Samuele Cornell, Desh Raj, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini |
Low-Latency Speech Separation Guided Diarization for Telephone Conversations. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jinhwan Park, Sichen Jin, Junmo Park, Sungsoo Kim, Dhairya Sandhyana, Changheon Lee, Myoungji Han, Jungin Lee, Seokyeong Jung, Changwoo Han, Chanwoo Kim 0001 |
Conformer-Based on-Device Streaming Speech Recognition with KD Compression and Two-Pass Architecture. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang 0033, Zhouyuan Huo, Zhehuai Chen, Bo Li 0028, Weiran Wang, Trevor Strohman |
JOIST: A Joint Speech and Text Streaming Model for ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Samantha Kotey, Rozenn Dahyot, Naomi Harte |
Fine Grained Spoken Document Summarization Through Text Segmentation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aleksandr Laptev, Boris Ginsburg |
Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-to-End Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Cal Peyser, W. Ronny Huang, Tara N. Sainath, Rohit Prabhavalkar, Michael Picheny, Kyunghyun Cho |
Dual Learning for Large Vocabulary On-Device ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tatsuya Komatsu, Yusuke Fujita |
Interdecoder: using Attention Decoders as Intermediate Regularization for CTC-Based Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yoshiki Masuyama, Xuankai Chang, Samuele Cornell, Shinji Watanabe 0001, Nobutaka Ono |
End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda |
Two-Stage Training Method for Japanese Electrolaryngeal Speech Enhancement Based on Sequence-to-Sequence Voice Conversion. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe 0001, Paola García 0001, Hung-yi Lee, Hao Tang 0002 |
On Compressing Sequences for Self-Supervised Speech Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aditya Patil, Vikas Joshi, Purvi Agrawal, Rupesh R. Mehta |
Streaming Bilingual End-to-End ASR Model Using Attention Over Multiple Softmax. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shelly Jain, Aditya Yadavalli, Ganesh Mirishkar, Anil Kumar Vuppala |
How do Phonological Properties Affect Bilingual Automatic Speech Recognition? |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yoshifumi Nakano, Takaaki Saeki, Shinnosuke Takamichi, Katsuhito Sudoh, Hiroshi Saruwatari |
VTTS: Visual-Text To Speech. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sung Hwan Mun, Jee-weon Jung, Min Hyun Han, Nam Soo Kim |
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shubo Lv, Yihui Fu, Yukai Jv, Lei Xie 0001, Weixin Zhu, Wei Rao, Yannan Wang |
Spatial-DCCRN: DCCRN Equipped with Frame-Level Angle Feature and Hybrid Filtering for Multi-Channel Speech Enhancement. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jin Sakuma, Shinya Fujie, Tetsunori Kobayashi |
Response Timing Estimation for Spoken Dialog Systems Based on Syntactic Completeness Prediction. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sandipana Dowerah, Romain Serizel, Denis Jouvet, Mohammad MohammadAmini, Driss Matrouf |
Joint Optimization of Diffusion Probabilistic-Based Multichannel Speech Enhancement with Far-Field Speaker Verification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ismail Rasim Ülgen, Levent M. Arslan |
Unsupervised Domain Adaptation of Neural PLDA Using Segment Pairs for Speaker Verification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qinghua Liu, Yating Huang, Yunzhe Hao, Jiaming Xu 0001, Bo Xu 0002 |
LiMuSE: Lightweight Multi-Modal Speaker Extraction. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jinzi Qi, Hugo Van hamme |
Weak-Supervised Dysarthria-Invariant Features for Spoken Language Understanding Using an Fhvae and Adversarial Training. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang 0033, Bhuvana Ramabhadran, Pedro J. Moreno 0001, Nanxin Chen |
Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Atsushi Ando, Ryo Masumura, Akihiko Takashima, Satoshi Suzuki, Naoki Makishima, Keita Suzuki, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato |
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, Jing Xiao 0006 |
Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Alexander H. Liu, Wei-Ning Hsu, Michael Auli, Alexei Baevski |
Towards End-to-End Unsupervised Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ji Won Yoon, Beom Jun Woo, Sunghwan Ahn, Hyeonseung Lee, Nam Soo Kim |
Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Saket Dingliwal, Monica Sunkara, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff, Sravan Bodapati |
Personalization of CTC Speech Recognition Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno 0001, Quoc V. Le, Daniel S. Park |
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yan Gaol, Javier Fernández-Marqués, Titouan Parcollet, Pedro P. B. de Gusmao, Nicholas D. Lane |
Match to Win: Analysing Sequences Lengths for Efficient Self-Supervised Learning in Speech and Audio. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chandran Savithri Anoop, A. G. Ramakrishnan |
Exploring a Unified ASR for Multiple South Indian Languages Leveraging Multilingual Acoustic and Language Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aghilas Sini, Antoine Perquin, Damien Lolive, Arnaud Delhay |
Phone-Level Pronunciation Scoring for L1 Using Weighted-Dynamic Time Warping. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Dongjune Lee, Minchan Kim, Sung Hwan Mun, Min Hyun Han, Nam Soo Kim |
Fully Unsupervised Training of Few-Shot Keyword Spotting. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Benjamin Kleiner, Jack G. M. Fitzgerald, Haidar Khan, Gohkan Tur |
Mixture of Domain Experts for Language Understanding: an Analysis of Modularity, Task Performance, and Memory Tradeoffs. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Juan Zuluaga-Gomez, Seyyed Saeed Sarfjoo, Amrutha Prasad, Iuliia Nigmatulina, Petr Motlícek, Karel Ondrej, Oliver Ohneiser, Hartmut Helmke |
Bertraffic: Bert-Based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao 0006 |
SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vasista Sai Lodagala, Sreyan Ghosh, Srinivasan Umesh |
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gaëlle Laperrière, Valentin Pelloin, Mickaël Rouvier, Themos Stafylakis, Yannick Estève |
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Soumi Maiti, Yushi Ueda, Shinji Watanabe 0001, Chunlei Zhang, Meng Yu 0003, Shi-Xiong Zhang, Yong Xu 0004 |
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sharman Tan, Piyush Behre, Nick Kibre, Issac Alphonso, Shuangyu Chang |
Four-in-One: a Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chaoyue Ding, Jiakui Li, Martin Zong, Baoxiang Li |
Speed-Robust Keyword Spotting Via Soft Self-Attention on Multi-Scale Features. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe 0001, Abdelrahman Mohamed, Shang-Wen Li 0001, Hung-yi Lee |
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuma Koizumi, Kohei Yatabe, Heiga Zen, Michiel Bacchiani |
Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali 0002, Sanjeev Khudanpur |
Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tina Raissi, Wei Zhou 0043, Simon Berger, Ralf Schlüter, Hermann Ney |
HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ragheb Al-Ghezi, Yaroslav Getman, Ekaterina Voskoboinik, Mittul Singh, Mikko Kurimo |
Automatic Rating of Spontaneous Speech for Low-Resource Languages. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos |
Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hong Liu, Yucheng Cai, Zhijian Ou, Yi Huang, Junlan Feng |
Building Markovian Generative Architectures Over Pretrained LM Backbones for Efficient Task-Oriented Dialog Systems. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chia-Yu Li, Ngoc Thang Vu |
Improving Semi-Supervised End-To-End Automatic Speech Recognition Using Cyclegan and Inter-Domain Losses. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Juan Manuel Coria, Hervé Bredin, Sahar Ghannay, Sophie Rosset |
Continual Self-Supervised Domain Adaptation for End-to-End Speaker Diarization. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Peng Shen, Xugang Lu, Hisashi Kawai |
Pronunciation-Aware Unique Character Encoding for RNN Transducer-Based Mandarin Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Florian Lux, Ching-Yi Chen, Ngoc Thang Vu |
Combining Contrastive and Non-Contrastive Losses for Fine-Tuning Pretrained Models in Speech Analysis. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lu Zeng, Sree Hari Krishnan Parthasarathi, Dilek Hakkani-Tur |
N-Best Hypotheses Reranking for Text-to-SQL Systems. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Brady Houston, Katrin Kirchhoff |
Exploration of Language-Specific Self-Attention Parameters for Multilingual End-to-End Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tsendsuren Munkhdalai, Zelin Wu, Golan Pundak, Khe Chai Sim, Jiayang Li, Pat Rondon, Tara N. Sainath |
NAM+: Towards Scalable End-to-End Contextual Biasing for Adaptive ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee 0001 |
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bhusan Chettri |
The Clever Hans Effect in Voice Spoofing Detection. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohan Li, Rama Doddipatla |
Non-Autoregressive End-to-End Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphanso, Jinyu Li 0001, Yifan Gong 0001 |
Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen |
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger Jang |
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junyi Peng, Oldrich Plchot, Themos Stafylakis, Ladislav Mosner, Lukás Burget, Jan Cernocký |
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tyler Vuong, Nikhil Madaan, Rohan Panda, Richard M. Stern |
Investigating the Important Temporal Modulations for Deep-Learning-Based Speech Activity Detection. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wolfgang Mack, Emanuël A. P. Habets |
A Hybrid Acoustic Echo Reduction Approach Using Kalman Filtering and Informed Source Extraction with Improved Training. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu Jeong Han, Shinji Watanabe 0001 |
E-Branchformer: Branchformer with Enhanced Merging for Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Samuele Cornell, Thomas Balestri, Thibaud Sénéchal |
Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anupama Chingacham, Vera Demberg, Dietrich Klakow |
A Data-Driven Investigation of Noise-Adaptive Utterance Generation with Linguistic Modification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hyung Yong Kim, Byeong-Yeol Kim, Seung Woo Yoo, Youshin Lim, Yunkyu Lim, Hanbin Lee |
ASBERT: ASR-Specific Self-Supervised Learning with Self-Training. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Donghyeon Kim, Jeong-gi Kwak, Hanseok Ko |
Efficient Dynamic Filter For Robust and Low Computational Feature Extraction. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amruta Saraf, Ganesh Sivaraman, Elie Khoury 0001 |
A Zero-Shot Approach to Identifying Children's Speech in Automatic Gender Classification. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chanwoo Kim 0001, Sathish Indurti, Jinhwan Park, Wonyong Sung |
Macro-Block Dropout for Improved Regularization in Training End-to-End Speech Recognition Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aparna Khare, Minhua Wu, Saurabhchand Bhati, Jasha Droppo, Roland Maas |
Guided Contrastive Self-Supervised Pre-Training for Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Abdelhamid Ezzerg, Thomas Merritt, Kayoko Yanagisawa, Piotr Bilinski, Magdalena Proszewska, Kamil Pokora, Renard Korzeniowski, Roberto Barra-Chicote, Daniel Korzekwa |
Remap, Warp and Attend: Non-Parallel Many-to-Many Accent Conversion with Normalizing Flows. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yifan Peng, Siddhant Arora, Yosuke Higuchi, Yushi Ueda, Sujay Kumar, Karthik Ganesan 0003, Siddharth Dalmia, Xuankai Chang, Shinji Watanabe 0001 |
A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Robin Scheibler, Wangyou Zhang, Xuankai Chang, Shinji Watanabe 0001, Yanmin Qian |
End-to-End Multi-Speaker ASR with Independent Vector Analysis. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Joshua Jansen van Vüren, Thomas Niesler |
Code-Switched Language Modelling Using a Code Predictive Lstm in Under-Resourced South African Languages. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohammad Al-Fetyani, Muhammad Al-Barham, Gheith A. Abandah, Adham Alsharkawi, Maha Dawas |
MASC: Massive Arabic Speech Corpus. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lahiru Samarakoon, Ivan Fung |
Untied Positional Encodings for Efficient Transformer-Based Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Muhammad Huzaifah 0001, Ivan Kukanov |
An Analysis of Semantically-Aligned Speech-Text Embeddings. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xin Wang 0037, Junichi Yamagishi |
Investigating Active-Learning-Based Training Data Selection for Speech Spoofing Countermeasure. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sepand Mavandadi, Bo Li 0028, Chao Zhang, Brian Farris, Tara N. Sainath, Trevor Strohman |
A Truly Multilingual First Pass and Monolingual Second Pass Streaming on-Device ASR System. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Arun Narayanan, James Walker, Sankaran Panchapagesan, Nathan Howard, Yuma Koizumi |
Learning Mask Scalars for Improved Robust Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jwala Dhamala, Varun Kumar, Rahul Gupta 0001, Kai-Wei Chang, Aram Galstyan |
An Analysis of The Effects of Decoding Algorithms on Fairness in Open-Ended Language Generation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mikolaj Babianski, Kamil Pokora, Raahil Shah, Rafal Sienkiewicz, Daniel Korzekwa, Viacheslav Klimkov |
On Granularity of Prosodic Representations in Expressive Text-to-Speech. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hiroki Kanagawa, Yusuke Ijima |
SIMD-Size Aware Weight Regularization for Fast Neural Vocoding on CPU. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Antoine Bruguier, David Qiu, Trevor Strohman, Yanzhang He |
Flickering Reduction with Partial Hypothesis Reranking for Streaming ASR. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | David Qiu, Tsendsuren Munkhdalai, Yanzhang He, Khe Chai Sim |
Context-Aware Neural Confidence Estimation for Rare Word Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tian Li, Qingliang Meng, Yujian Sun |
Improved Noisy Iterative Pseudo-Labeling for Semi-Supervised Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Injy Hamed, Amir Hussein, Oumnia Chellah, Shammur Absar Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali 0002 |
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yukai Ju, Shimin Zhang, Wei Rao, Yannan Wang, Tao Yu, Lei Xie 0001, Shidong Shang |
TEA-PSE 2.0: Sub-Band Network for Real-Time Personalized Speech Enhancement. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wen Wu, Chao Zhang 0031, Philip C. Woodland |
Distribution-Based Emotion Recognition in Conversation. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Themos Stafylakis, Ladislav Mosner, Sofoklis Kakouros, Oldrich Plchot, Lukás Burget, Jan Cernocký |
Extracting Speaker and Emotion Information from Self-Supervised Speech Models via Channel-Wise Correlations. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Suliang Bu, Tuo Zhao, Yunxin Zhao |
TDOA Estimation of Speech Source in Noisy Reverberant Environments. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Leanne Nortje, Herman Kamper |
Towards Visually Prompted Keyword Localisation for Zero-Resource Spoken Languages. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chuanbo Zhu 0001, Takuya Kunihara, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi |
Automatic Prediction of Intelligibility of Words and Phonemes Produced Orally by Japanese Learners of English. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Binghuai Lin, Liyuan Wang |
Exploiting Information From Native Data for Non-Native Automatic Pronunciation Assessment. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jia Cui, Heng Lu 0004, Wenjie Wang, Shiyin Kang, Liqiang He, Guangzhi Li, Dong Yu 0001 |
Efficient Text Analysis with Pre-Trained Neural Network Models. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sungjun Han, Deepak Baby, Valentin Mendelev |
Residual Adapters for Targeted Updates in RNN-Transducer Based Speech Recognition System. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xiaoming Zhang, Fan Zhang, Xiaodong Cui, Wei Zhang |
Speech Emotion Recognition with Complementary Acoustic Representations. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fan Yu, Shiliang Zhang, Pengcheng Guo, Yuhao Liang, Zhihao Du, Yuxiao Lin, Lei Xie 0001 |
MFCCA:Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario. |
SLT |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 986 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|