|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 18782 publication records. Showing 18782 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | A. Arunkumar, Vrunda Nileshkumar Sukhadia, Srinivasan Umesh |
Investigation of Ensemble features of Self-Supervised Pretrained Models for Automatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chenfeng Miao, Kun Zou, Ziyang Zhuang, Tao Wei, Jun Ma 0018, Shaojun Wang, Jing Xiao 0006 |
Towards Efficiently Learning Monotonic Alignments for Attention-based End-to-End Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Golan Pundak, Tsendsuren Munkhdalai, Khe Chai Sim |
On-the-fly ASR Corrections with Audio Exemplars. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey |
CycleGAN-based Unpaired Speech Dereverberation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana |
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yixuan Zhang 0005, Heming Wang, DeLiang Wang |
Densely-connected Convolutional Recurrent Network for Fundamental Frequency Estimation in Noisy Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Bronya Roni Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer Cole 0001, Joseph Keshet |
DeepFry: Identifying Vocal Fry Using Deep Neural Networks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shengyuan Xu, Wenxiao Zhao, Jing Guo |
RefineGAN: Universally Generating Waveform Better than Ground Truth with Highly Accurate Pitch and Intensity Responses. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ilya Sklyar, Anna Piunova, Christian Osendorfer |
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mutian He 0001, Jingzhou Yang, Lei He 0005, Frank K. Soong |
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tobias Weise, Philipp Klumpp, Andreas K. Maier, Elmar Nöth, Björn Heismann, Maria Schuster, Seung Hee Yang |
Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sanli Tian, Keqi Deng, Zehan Li, Lingxuan Ye, Gaofeng Cheng, Ta Li, Yonghong Yan 0002 |
Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà |
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chenfeng Miao, Ting Chen, Minchuan Chen, Jun Ma 0018, Shaojun Wang, Jing Xiao 0006 |
A compact transformer-based GAN vocoder. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haoyue Zhan, Xinyuan Yu, Haitong Zhang, Yang Zhang, Yue Lin 0002 |
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zuoyu Tian, Xiao Dong, Feier Gao, Haining Wang, Chien-Jer Charles Lin |
Mandarin Tone Sandhi Realization: Evidence from Large Speech Corpora. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu 0001 |
Cross-Scale Vector Quantization for Scalable Neural Speech Coding. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Changhwan Kim, Se-Yun Um, Hyungchan Yoon, Hong-Goo Kang |
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Peng Zhang, Peng Hu, Xueliang Zhang |
Norm-constrained Score-level Ensemble for Spoofing Aware Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Min-Kyung Kim 0002, Joon-Hyuk Chang |
Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kaiqi Fu, Shaojun Gao, Xiaohai Tian, Wei Li 0012, Zejun Ma |
Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Georgios Karakasidis, Tamás Grósz, Mikko Kurimo |
Comparison and Analysis of New Curriculum Criteria for End-to-End ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rahil Parikh, Gaspar Rochette, Carol Y. Espy-Wilson, Shihab A. Shamma |
An Empirical Analysis on the Vulnerabilities of End-to-End Speech Segregation Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hira Dhamyal, Bhiksha Raj, Rita Singh |
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Seungu Han, Junhyeok Lee |
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zongyang Du, Berrak Sisman, Kun Zhou 0003, Haizhou Li 0001 |
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vera Bernhard, Sandra Schwab, Jean-Philippe Goldman |
Acoustic Stress Detection in Isolated English Words for Computer-Assisted Pronunciation Training. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mohammad MohammadAmini, Driss Matrouf, Jean-François Bonastre, Sandipana Dowerah, Romain Serizel, Denis Jouvet |
Barlow Twins self-supervised learning for robust speaker recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Matthew Perez, Mimansa Jaiswal, Minxue Niu, Cristina Gorrostieta, Matthew Roddy, Kye Taylor, Reza Lotfian, John Kane, Emily Mower Provost |
Mind the gap: On the value of silence representations to lexical-based speech emotion recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhihan Wang, Feng Hou, Yuanhang Qiu, Zhizhong Ma, Satwinder Singh, Ruili Wang |
CyclicAugment: Speech Data Random Augmentation with Cosine Annealing Scheduler for Auotmatic Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Eleftherios Kapelonis, Efthymios Georgiou, Alexandros Potamianos |
A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Janine Rugayan, Torbjørn Svendsen, Giampiero Salvi |
Semantically Meaningful Metrics for Norwegian ASR Systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tasnima Sadekova, Vladimir Gogoryan, Ivan Vovk, Vadim Popov, Mikhail A. Kudinov, Jiansheng Wei |
A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng 0001, Jing Xiao 0006 |
Uncertainty Calibration for Deep Audio Classifiers. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Junyi Peng, Rongzhi Gu, Ladislav Mosner, Oldrich Plchot, Lukás Burget, Jan Cernocký |
Learnable Sparse Filterbank for Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Haoran Yin, Meng Ge, Yanjie Fu, Gaoyan Zhang, Longbiao Wang, Lei Zhang, Lin Qiu, Jianwu Dang 0001 |
MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Siqi Zheng, Hongbin Suo, Qian Chen |
PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Emiru Tsunoo, Yosuke Kashiwagi, Chaitanya Prasad Narisetty, Shinji Watanabe 0001 |
Residual Language Model for End-to-end Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Paula Andrea Pérez-Toro, Philipp Klumpp, Abner Hernandez, Tomas Arias, Patricia Lillo, Andrea Slachevsky, Adolfo Martín García, Maria Schuster, Andreas K. Maier, Elmar Nöth, Juan Rafael Orozco-Arroyave |
Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yifei Xin, Dongchao Yang, Yuexian Zou |
Audio Pyramid Transformer with Domain Adaption for Weakly Supervised Sound Event Detection and Audio Classification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Fan Qian, Hongwei Song, Jiqing Han 0001 |
Word-wise Sparse Attention for Multimodal Sentiment Analysis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anuroop Sriram, Michael Auli, Alexei Baevski |
Wav2Vec-Aug: Improved self-supervised training with limited data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Youxiang Zhu, Xiaohui Liang, John A. Batsis, Robert M. Roth |
Domain-aware Intermediate Pretraining for Dementia Detection with Limited Data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xueshuai Zhang, Jiakun Shen, Jun Zhou 0024, Pengyuan Zhang, Yonghong Yan 0002, Zhihua Huang, Yanfen Tang, Yu Wang, Fujie Zhang, Shaoxing Zhang, Aijun Sun |
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gasper Begus, Alan Zhou |
Modeling speech recognition and synthesis simultaneously: Encoding and decoding lexical and sublexical semantic information into speech with no direct access to speech data. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jiatong Shi, Shuai Guo, Tao Qian, Tomoki Hayashi, Yuning Wu, Fangzheng Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe 0001, Qin Jin |
Muskits: an End-to-end Music Processing Toolkit for Singing Voice Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mao-Kui He, Jun Du, Chin-Hui Lee 0001 |
End-to-End Audio-Visual Neural Speaker Diarization. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ju-ho Kim, Jungwoo Heo, Hye-jin Shim, Ha-Jin Yu |
Extended U-Net for Speaker Verification in Noisy Environments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aaqib Saeed |
Binary Early-Exit Network for Adaptive Inference on Low-Resource Devices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mu Yang, Kevin Hirschi, Stephen Daniel Looney, Okim Kang, John H. L. Hansen |
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuanbo Hou, Zhaoyi Liu, Bo Kang, Yun Wang, Dick Botteldooren |
CT-SAT: Contextual Transformer for Sequential Audio Tagging. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Sondes Abderrazek, Corinne Fredouille, Alain Ghio, Muriel Lalain, Christine Meunier, Virginie Woisard |
Validation of the Neuro-Concept Detector framework for the characterization of speech disorders: A comparative study including Dysarthria and Dysphonia. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik |
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daniel Fernau, Stefan Hillmann, Nils Feldhus, Tim Polzehl |
Towards Automated Dialog Personalization using MBTI Personality Indicators. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xingming Wang, Xiaoyi Qin, Yikang Wang, Yunfei Xu, Ming Li 0026 |
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Francesco Nespoli, Daniel Barreda, Patrick A. Naylor |
Relative Acoustic Features for Distance Estimation in Smart-Homes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kohei Saijo, Tetsuji Ogawa |
Unsupervised Training of Sequential Neural Beamformer Using Coarsely-separated and Non-separated Signals. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zhaoyan Zhang, Jason Zhang, Jody Kreiman |
Effects of laryngeal manipulations on voice gender perception. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yi-Chang Chen, Yu-Chuan Steven, Yen-Cheng Chang, Yi-Ren Yeh |
g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shaohuan Zhou, Shun Lei, Weiya You, Deyi Tuo, Yuren You, Zhiyong Wu 0001, Shiyin Kang, Helen Meng |
Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Jovan M. Dalhouse, Katunobu Itou |
Cross-Lingual Transfer Learning Approach to Phoneme Error Detection via Latent Phonetic Representation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Qibing Bai, Yu Zhang 0006 |
Leveraging Pseudo-labeled Data to Improve Direct Speech-to-Speech Translation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amir Shirian, Krishna Somandepalli, Victor Sanchez, Tanaya Guha |
Visually-aware Acoustic Event Detection using Heterogeneous Graphs. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Rui Wang 0073, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang 0006, Tom Ko, Haizhou Li 0001 |
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ronglai Zuo, Brian Mak |
Local Context-aware Self-attention for Continuous Sign Language Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Vinicius Ribeiro, Yves Laprie |
Autoencoder-Based Tongue Shape Estimation During Continuous Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Wen-Chin Huang, Erica Cooper, Yu Tsao 0001, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi |
The VoiceMOS Challenge 2022. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Séverine Guillaume, Guillaume Wisniewski, Benjamin Galliot, Minh Chau Nguyen, Maxime Fily, Guillaume Jacques, Alexis Michaud |
Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource setting. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hokuto Munakata, Ryu Takeda, Kazunori Komatani |
Training Data Generation with DOA-based Selecting and Remixing for Unsupervised Training of Deep Separation Models. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li 0001, Hung-yi Lee |
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Tuan Vu Ho, Maori Kobayashi, Masato Akagi |
Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Woo Hyun Kang, Md. Jahangir Alam, Abderrahim Fathan |
End-to-end framework for spoof-aware speaker verification. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Daiki Yoshioka, Yusuke Yasuda, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda |
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ruchao Fan, Abeer Alwan |
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey |
UserLibri: A Dataset for ASR Personalization Using Only Text. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuxiang Zhang, Zhuo Li, Wenchao Wang, Pengyuan Zhang |
SASV Based on Pre-trained ASV System and Integrated Scoring Module. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Mark Gibson, Marcel Schlechtweg, Beatriz Blecua Falgueras, Judit Ayala Alcalde |
Language-specific interactions of vowel discrimination in noise. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Amrit Romana, Minxue Niu, Matthew Perez, Angela Roberts, Emily Mower Provost |
Enabling Off-the-Shelf Disfluency Detection and Categorization for Pathological Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Jesse Emond, Yinghui Huang, Pedro J. Moreno 0001 |
Non-Parallel Voice Conversion for ASR Augmentation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Ryan M. Corey, Manan Mittal, Kanad Sarkar, Andrew C. Singer |
Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Léane Salais, Pablo Arias 0003, Clément Le Moine, Victor Rosi, Yann Teytaut, Nicolas Obin, Axel Roebel |
Production Strategies of Vocal Attitudes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shaojin Ding, Rajeev Rikhye, Qiao Liang 0001, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw |
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuansheng Guan, Guochen Yu, Andong Li, Chengshi Zheng, Jie Wang |
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Xinmeng Xu, Yang Wang, Jie Jia, Binbin Chen 0006, Jianjun Hao |
GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hemant Yadav, Akshat Gupta, Sai Krishna Rallabandi, Alan W. Black, Rajiv Ratn Shah |
Intent classification using pre-trained language agnostic embeddings for low resource languages. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kavan Fatehi, Mercedes Torres Torres, Ayse Küçükyilmaz |
ScoutWav: Two-Step Fine-Tuning on Self-Supervised Automatic Speech Recognition for Low-Resource Environments. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | K. V. Vijay Girish, Srikanth Konjeti, Jithendra Vepa |
Interpretabilty of Speech Emotion Recognition modelled using Self-Supervised Speech and Text Pre-Trained Embeddings. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Hyunjae Cho, Wonbin Jung, Junhyeok Lee, Sang Hoon Woo |
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Christin Kirchhübel, Georgina Brown |
Spoofed speech from the perspective of a forensic phonetician. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Feifei Xiong, Pengyu Wang, Zhongfu Ye, Jinwei Feng |
Joint Estimation of Direction-of-Arrival and Distance for Arrays with Directional Sensors based on Sparse Bayesian Learning. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang 0016, Rina Panigrahy, Qiao Liang 0001, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman |
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Linjun Cai, Yuhong Yang 0001, Xufeng Chen, Weiping Tu, Hongyang Chen |
CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yuhang He, Andrew Markham |
SoundDoA: Learn Sound Source Direction of Arrival and Semantics from Sound Raw Waveforms. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anoop Kumar, Pankaj Kumar Sharma, Aravind Illa, Sriram Venkatapathy, Subhrangshu Nandi, Pritam Varma, Anurag Dwarakanath, Aram Galstyan |
Learning Under Label Noise for Robust Spoken Language Understanding systems. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh |
Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Zihan Zhao, Yanfeng Wang, Yu Wang 0027 |
Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Kai Li 0018, Sheng Li 0010, Xugang Lu, Masato Akagi, Meng Liu, Lin Zhang, Chang Zeng, Longbiao Wang, Jianwu Dang 0001, Masashi Unoki |
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yixuan Zhou 0002, Changhe Song, Xiang Li 0105, Luwen Zhang, Zhiyong Wu 0001, Yanyao Bian, Dan Su 0002, Helen Meng |
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Katrina Kechun Li, Julia Schwarz, Jasper Hong Sim, Yixin Zhang, Elizabeth Buchanan-Worster, Brechtje Post, Kirsty McDougall |
Recording and timing vocal responses in online experimentation. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yang Liu 0262, Haoqin Sun, Wenbo Guan, Yuqi Xia, Zhen Zhao 0006 |
Discriminative Feature Representation Based on Cascaded Attention Network with Adversarial Joint Loss for Speech Emotion Recognition. |
INTERSPEECH |
2022 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 18782 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ 4][ 5][ 6][ 7][ 8][ 9][ 10][ >>] |
|