Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
196 | Srinivasan Umesh, Rohit Sinha 0003 |
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|
87 | Chun Keung Chau, Chak Shun Lai, Bertram Emil Shi |
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-Based Interactive Toy. |
Active Media Technology |
2001 |
DBLP DOI BibTeX RDF |
|
65 | Shizhen Wang, Xiaodong Cui, Abeer Alwan |
Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|
65 | Andrej Ljolje, Vincent Goffin, Murat Saraclar |
Low Latency Real-Time Vocal Tract Length Normalization. |
TSD |
2004 |
DBLP DOI BibTeX RDF |
|
65 | Dénes Paczolay, András Kocsor, László Tóth 0001 |
Real-Time Vocal Tract Length Normalization. |
TSD |
2003 |
DBLP DOI BibTeX RDF |
|
58 | Jirí Kopecký, Ondrej Glembek, Martin Karafiát |
Advances in Acoustic Modeling for the Recognition of Czech. |
TSD |
2008 |
DBLP DOI BibTeX RDF |
LVCSR system, HLDA, VTLN, CMLLR, lectures recognition, Automatic Speech Recognition, acoustic modeling |
44 | Wade Shen, Douglas A. Reynolds |
Improved GMM-based language recognition using constrained MLLR transforms. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
44 | Shizhen Wang, Abeer Alwan, Steven M. Lulich |
Speaker normalization based on subglottal resonances. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
44 | Daisuke Saito, Ryo Matsuura, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose |
Directional dependency of cepstrum on vocal tract length. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
44 | Dénes Paczolay, András Bánhalmi, András Kocsor |
Speaker Normalization Via Springy Discriminant Analysis and Pitch Estimation. |
TSD |
2007 |
DBLP DOI BibTeX RDF |
|
28 | Tanvina Patel, Odette Scharenborg |
Using Data Augmentations and VTLN to Reduce Bias in Dutch End-to-End Speech Recognition Systems. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
28 | Shakti P. Rath |
A computationally efficient approach for acoustic class specific VTLN using regression tree. |
Int. J. Speech Technol. |
2018 |
DBLP DOI BibTeX RDF |
|
28 | Maulik C. Madhavi, Hemant A. Patil |
VTLN-warped Gaussian posteriorgram for QbE-STD. |
EUSIPCO |
2017 |
DBLP DOI BibTeX RDF |
|
28 | Shubham Sharma 0001, Hemant A. Patil |
Combining Evidences from Bark Scale and Mel Scale Warped Features for VTLN. |
PerMIn |
2015 |
DBLP DOI BibTeX RDF |
|
28 | Harish Arsikere, Abeer Alwan |
Frequency warping using subglottal resonances: Complementarity with VTLN and robustness to additive noise. |
ICASSP |
2014 |
DBLP DOI BibTeX RDF |
|
28 | D. Rama Sanand, Torbjørn Svendsen |
Synthetic speaker models using VTLN to improve the performance of children in mismatched speaker conditions for ASR. |
INTERSPEECH |
2013 |
DBLP DOI BibTeX RDF |
|
28 | Néstor Becerra Yoma, Claudio Garretón, Fernando Huenupán, Ignacio Catalan, Jorge Wuth |
VTLN based on the linear interpolation of contiguous mel filter-bank energies. |
INTERSPEECH |
2013 |
DBLP DOI BibTeX RDF |
|
28 | Harish Arsikere, Steven M. Lulich, Abeer Alwan |
Non-linear frequency warping for VTLN using subglottal resonances and the third formant frequency. |
ICASSP |
2013 |
DBLP DOI BibTeX RDF |
|
28 | D. Rama Sanand, Srinivasan Umesh |
VTLN Using Analytically Determined Linear-Transformation on Conventional MFCC. |
IEEE Trans. Speech Audio Process. |
2012 |
DBLP DOI BibTeX RDF |
|
28 | Reima Karhila, Doddipatla Rama Sanand, Mikko Kurimo, Peter Smit |
Creating synthetic voices for children by adapting adult average voice using stacked transformations and VTLN. |
ICASSP |
2012 |
DBLP DOI BibTeX RDF |
|
28 | Doddipatla Rama Sanand, Mikko Kurimo |
A Study on Combining VTLN and SAT to Improve the Performance of Automatic Speech Recognition. |
INTERSPEECH |
2011 |
DBLP DOI BibTeX RDF |
|
28 | Ehsan Variani, Thomas Schaaf |
VTLN in the MFCC Domain: Band-Limited versus Local Interpolation. |
INTERSPEECH |
2011 |
DBLP DOI BibTeX RDF |
|
28 | Doddipatla Rama Sanand, Ralf Schlüter, Hermann Ney |
Revisiting VTLN using linear transformation on conventional MFCC. |
INTERSPEECH |
2010 |
DBLP DOI BibTeX RDF |
|
28 | Thomas Schaaf, Florian Metze |
Analysis of gender normalization using MLP and VTLN features. |
INTERSPEECH |
2010 |
DBLP DOI BibTeX RDF |
|
28 | Lakshmi Saheer, Philip N. Garner, John Dines, Hui Liang |
VTLN adaptation for statistical speech synthesis. |
ICASSP |
2010 |
DBLP DOI BibTeX RDF |
|
28 | Lakshmi Saheer, John Dines, Philip N. Garner, Hui Liang |
Implementation of VTLN for statistical speech synthesis. |
SSW |
2010 |
DBLP BibTeX RDF |
|
28 | Sankaran Panchapagesan, Abeer Alwan |
Frequency warping for VTLN and speaker adaptation by linear transformation of standard MFCC. |
Comput. Speech Lang. |
2009 |
DBLP DOI BibTeX RDF |
|
28 | Shakti Prasad Rath, Srinivasan Umesh |
Acoustic class specific VTLN-warping using regression class trees. |
INTERSPEECH |
2009 |
DBLP DOI BibTeX RDF |
|
28 | Shakti Prasad Rath, Srinivasan Umesh, Achintya Kumar Sarkar |
Using VTLN matrices for rapid and computationally-efficient speaker adaptation with robustness to first-pass transcription errors. |
INTERSPEECH |
2009 |
DBLP DOI BibTeX RDF |
|
28 | D. Rama Sanand, Shakti Prasad Rath, Srinivasan Umesh |
Improving the performance of VTLN under mismatched speaker conditions and making it approach that of matched speaker conditions. |
ICASSP |
2009 |
DBLP DOI BibTeX RDF |
|
28 | P. T. Akhil, Shakti Prasad Rath, Srinivasan Umesh, D. Rama Sanand |
A computationally efficient approach to warp factor estimation in VTLN using EM algorithm and sufficient statistics. |
INTERSPEECH |
2008 |
DBLP DOI BibTeX RDF |
|
28 | D. Rama Sanand, Srinivasan Umesh |
Study of jacobian compensation using linear transformation of conventional MFCC for VTLN. |
INTERSPEECH |
2008 |
DBLP DOI BibTeX RDF |
|
28 | D. Rama Sanand, D. Dinesh Kumar, Srinivasan Umesh |
Linear transformation approach to VTLN using dynamic frequency warping. |
INTERSPEECH |
2007 |
DBLP DOI BibTeX RDF |
|
28 | Janne Pylkkönen |
Estimating VTLN warping factors by distribution matching. |
INTERSPEECH |
2007 |
DBLP DOI BibTeX RDF |
|
28 | Sankaran Panchapagesan, Abeer Alwan |
Multi-Parameter Frequency Warping for Vtln by Gradient Search. |
ICASSP (1) |
2006 |
DBLP DOI BibTeX RDF |
|
28 | Jonas Lööf, Hermann Ney, Srinivasan Umesh |
Vtln Warping Factor Estimation Using Accumulation of Sufficient Statistics. |
ICASSP (1) |
2006 |
DBLP DOI BibTeX RDF |
|
28 | Srinivasan Umesh, András Zolnay, Hermann Ney |
Implementing frequency-warping and VTLN through linear transformation of conventional MFCC. |
INTERSPEECH |
2005 |
DBLP DOI BibTeX RDF |
|
28 | David Sündermann, Guntram Strecha, Antonio Bonafonte, Harald Höge, Hermann Ney |
Evaluation of VTLN-based voice conversion for embedded speech synthesis. |
INTERSPEECH |
2005 |
DBLP DOI BibTeX RDF |
|
28 | Arlo Faria, David Gelbart |
Efficient pitch-based estimation of VTLN warp factors. |
INTERSPEECH |
2005 |
DBLP DOI BibTeX RDF |
|
28 | Xiaodong Cui, Abeer Alwan |
MLLR-like speaker adaptation based on linearization of VTLN with MFCC features. |
INTERSPEECH |
2005 |
DBLP DOI BibTeX RDF |
|
28 | Do Yeong Kim, Srinivasan Umesh, Mark J. F. Gales, Thomas Hain, Philip C. Woodland |
Using VTLN for broadcast news transcription. |
INTERSPEECH |
2004 |
DBLP DOI BibTeX RDF |
|
28 | Matthias Eichner, Matthias Wolff, Rüdiger Hoffmann |
Voice characteristics conversion for TTS using reverse VTLN. |
ICASSP (1) |
2004 |
DBLP DOI BibTeX RDF |
|
22 | Giulia Garau, Steve Renals |
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition. |
IEEE Trans. Speech Audio Process. |
2008 |
DBLP DOI BibTeX RDF |
|
22 | Mohamed Afify, Olivier Siohan |
Comments on Vocal Tract Length Normalization Equals Linear Transformation in Cepstral Space. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|