Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
177 | Panikos Heracleous, Satoshi Nakamura 0001, Kiyohiro Shikano |
Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method. |
J. VLSI Signal Process. |
2004 |
DBLP DOI BibTeX RDF |
distant-talking speech, multiple sound sources, speech recognition, microphone array |
151 | Douglas Brungart, Brian D. Simpson |
Optimizing the spatial configuration of a seven-talker speech display. |
ACM Trans. Appl. Percept. |
2005 |
DBLP DOI BibTeX RDF |
informational masking, Cocktail party effect |
52 | Eliana Colunga, Clare E. Sims |
Early Talkers and Late Talkers Know Nouns that License Different Word Learning Biases. |
CogSci |
2011 |
DBLP BibTeX RDF |
|
45 | Satoshi Nakamura 0001, Panikos Heracleous |
3-D N-Best Search for Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers. |
ICMI |
2002 |
DBLP DOI BibTeX RDF |
|
38 | Yoshihiro Sejima, Tomio Watanabe |
A speech-driven embodied entrainment wall picture system for supporting virtual communication. |
IUCS |
2009 |
DBLP DOI BibTeX RDF |
entrainment, virtual communication, human interaction, nonverbal communication, embodied communication |
38 | Hoang Do, Harvey F. Silverman |
A method for locating multiple sources from a frame of a large-aperture microphone array data without tracking. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
38 | Steven M. Schimmel, Les E. Atlas |
Target talker enhancement in hearing devices. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
38 | Ryu Takeda, Shun'ichi Yamamoto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears. |
IROS |
2006 |
DBLP DOI BibTeX RDF |
|
38 | Ce Wang, Michael S. Brandstein |
Head Pose Estimation for Video-Conferencing with Multiple Cameras and Microphones. |
ICMI |
2000 |
DBLP DOI BibTeX RDF |
SVM, Face Detection, Pose Estimation, Video-Conferencing |
33 | Patrick Truong, Fabrice Guillemin |
Estimating Local Cardinalities in a Multidimensional Multiset. (PDF / PS) |
AIMS |
2007 |
DBLP DOI BibTeX RDF |
Local Cardinality, Top-talkers, Real-time data mining, Network monitoring and security, Multiset, Cardinality |
33 | Jason Gait |
A Kernel for High-Performance Multicast Communications. |
IEEE Trans. Computers |
1989 |
DBLP DOI BibTeX RDF |
multicast addresses, high-performance multicast communications, talkers, protocols, multiprocessor interconnection networks, shared-memory multiprocessors, Ethernet, kernel, interprocess communication |
26 | Sheyenne Fishero, Joan A. Sereno, Allard Jongman |
Perception and production of Mandarin-Accented English: The effect of degree of Accentedness on the Interlanguage Speech Intelligibility Benefit for Listeners (ISIB-L) and Talkers (ISIB-T). |
J. Phonetics |
2023 |
DBLP DOI BibTeX RDF |
|
26 | Chenda Li, Yao Qian, Zhuo Chen 0006, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng 0001 |
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
26 | Manuj Yadav, Densil Cabrera |
Two simultaneous talkers distract more than one in simulated multi-talker environments, regardless of overall sound levels typical of open-plan offices. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
26 | Davide Berghi, Marco Volino, Philip J. B. Jackson |
Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
26 | Davide Berghi, Marco Volino, Philip J. B. Jackson |
Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research. |
CVMP |
2022 |
DBLP DOI BibTeX RDF |
|
26 | Shruthi Raghavendra, Sungmin Lee 0003, Fei Chen 0011, Brett A. Martin, Chin-Tuan Tan |
Cortical Entrainment to Speech Produced by Cochlear Implant Users and Normal-Hearing Talkers. |
EMBC |
2022 |
DBLP DOI BibTeX RDF |
|
26 | Vincent W. Neo, Stephan Weiss 0001, Simon W. McKnight, Aidan O. T. Hogg, Patrick A. Naylor |
Polynomial Eigenvalue Decomposition-Based Target Speaker Voice Activity Detection in the Presence of Competing Talkers. |
IWAENC |
2022 |
DBLP DOI BibTeX RDF |
|
26 | Michelle Cohn, Kristin Predeck, Melina Sarian, Georgia Zellou |
Prosodic alignment toward emotionally expressive speech: Comparing human and Alexa model talkers. |
Speech Commun. |
2021 |
DBLP DOI BibTeX RDF |
|
26 | Marvin Borsdorf, Chenglin Xu, Haizhou Li 0001, Tanja Schultz |
Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers. |
Interspeech |
2021 |
DBLP DOI BibTeX RDF |
|
26 | Mohammad Soleymanpour, Michael T. Johnson, Jeffrey Berry 0001 |
Comparison in Suprasegmental Characteristics between Typical and Dysarthric Talkers at Varying Severity Levels. |
SpeD |
2021 |
DBLP DOI BibTeX RDF |
|
26 | Neil Savage |
Code talkers. |
Commun. ACM |
2019 |
DBLP DOI BibTeX RDF |
|
26 | Bruno Ferenc Segedin, Michelle Cohn, Georgia Zellou |
Perceptual Adaptation to Device and Human Voices: Learning and Generalization of a Phonetic Shift Across Real and Voice-AI Talkers. |
INTERSPEECH |
2019 |
DBLP DOI BibTeX RDF |
|
26 | William F. Katz, Patrick Reidy, Divya Prabhakaran |
Sensorimotor Response to Tongue Displacement Imagery by Talkers with Parkinson's Disease. |
INTERSPEECH |
2018 |
DBLP DOI BibTeX RDF |
|
26 | Helen L. Bear |
Visual gesture variability between talkers in continuous visual speech. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
26 | Maja Taseska, Emanuël A. P. Habets |
DOA-informed source extraction in the presence of competing talkers and background noise. |
EURASIP J. Adv. Signal Process. |
2017 |
DBLP DOI BibTeX RDF |
|
26 | Eva Jimenez, Thomas T. Hills |
Network Analysis of a Large Sample of Typical and Late Talkers. |
CogSci |
2017 |
DBLP BibTeX RDF |
|
26 | Shahab Pasha, Christian H. Ritz, Yue Xian Zou |
Detecting multiple, simultaneous talkers through localising speech recorded by ad-hoc microphone arrays. |
APSIPA |
2016 |
DBLP DOI BibTeX RDF |
|
26 | Mariia Gavriushenko, Oleksiy Khriyenko, Iida Porokuokka |
Adaptive Vocabulary Learning Environment for Late Talkers. |
CSEDU (2) |
2016 |
DBLP DOI BibTeX RDF |
|
26 | Sarthak Khanal, Harvey F. Silverman |
Multi-stage rejection sampling (MSRS): A robust SRP-PHAT peak detection algorithm for localization of cocktail-party talkers. |
WASPAA |
2015 |
DBLP DOI BibTeX RDF |
|
26 | Chris Davis 0001, Jeesun Kim, Vincent Aubanel, Gregory Zelic, Yatin Mahajan |
The stability of mouth movements for multiple talkers over multiple sessions. |
AVSP |
2015 |
DBLP BibTeX RDF |
|
26 | Elizabeth A. McCullough |
Open-set identification of non-native talkers' language backgrounds. |
ICPhS |
2015 |
DBLP BibTeX RDF |
|
26 | Erin M. Ingvalson, Trevor L. Stoimenoff |
Greater benefit for familiar talkers under cognitive load. |
ICPhS |
2015 |
DBLP BibTeX RDF |
|
26 | Xionghu Zhong, James R. Hopgood |
Particle filtering for TDOA based acoustic source tracking: Nonconcurrent Multiple Talkers. |
Signal Process. |
2014 |
DBLP DOI BibTeX RDF |
|
26 | Ramine Tinati, Elena Simperl, Markus Luczak-Rösch, Max Van Kleek, Nigel Shadbolt |
Collective Intelligence in Citizen Science - A Study of Performers and Talkers. |
CoRR |
2014 |
DBLP BibTeX RDF |
|
26 | Michael I. Mandel, Sarah E. Yoho, Eric W. Healy |
Generalizing time-frequency importance functions across noises, talkers, and phonemes. |
INTERSPEECH |
2014 |
DBLP DOI BibTeX RDF |
|
26 | Yutaka Ishii, Tomio Watanabe |
Evaluation of Superimposed Self-character Based on the Detection of Talkers' Face Angles in Video Communication. |
HCI (13) |
2013 |
DBLP DOI BibTeX RDF |
|
26 | Eric Osterweil, Danny McPherson, Steve DiBenedetto, Christos Papadopoulos, Daniel Massey |
Behavior of DNS' Top Talkers, a .com/.net View. |
PAM |
2012 |
DBLP DOI BibTeX RDF |
|
26 | Robert DeLine |
Code Talkers. |
Making Software |
2011 |
DBLP BibTeX RDF |
|
26 | Hoang Do, Harvey F. Silverman |
SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data. |
ICASSP |
2010 |
DBLP DOI BibTeX RDF |
|
26 | Harsh Vardhan Sharma, Mark Hasegawa-Johnson |
Universal access: speech recognition for talkers with spastic dysarthria. |
INTERSPEECH |
2009 |
DBLP DOI BibTeX RDF |
|
26 | Kornel Laskowski, Elizabeth Shriberg |
Modeling other talkers for improved dialog act recognition in meetings. |
INTERSPEECH |
2009 |
DBLP DOI BibTeX RDF |
|
26 | Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers. |
IROS |
2008 |
DBLP DOI BibTeX RDF |
|
26 | Ruth Rettie |
Texters not Talkers: Phone Call Aversion among Mobile Phone Users. |
PsychNology J. |
2007 |
DBLP BibTeX RDF |
|
26 | Xihong Wu, Jing Chen 0019, Zhigang Yang, Qiang Huang, Mengyuan Wang, Liang Li |
Effect of number of masking talkers on speech-on-speech masking in Chinese. |
INTERSPEECH |
2007 |
DBLP DOI BibTeX RDF |
|
26 | P. S. Sathidevi 0001, S. Aravindakshan, R. Rajavel |
Separation of dominant speech from simultaneous talkers. |
SIP |
2007 |
DBLP BibTeX RDF |
|
26 | Mark Hasegawa-Johnson, Jon R. Gunderson, Adrienne Perlman, Thomas S. Huang |
Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria. |
ICASSP (3) |
2006 |
DBLP DOI BibTeX RDF |
|
26 | Ameya N. Deoras, Mark Hasegawa-Johnson |
A factorial HMM approach to simultaneous recognition of isolated digits spoken by multiple talkers on one audio channel. |
ICASSP (1) |
2004 |
DBLP DOI BibTeX RDF |
|
26 | Panikos Heracleous, Satoshi Nakamura 0001, Kiyohiro Shikano |
A semi-blind source separation method for hands-free speech recognition of multiple talkers. |
INTERSPEECH |
2003 |
DBLP DOI BibTeX RDF |
|
26 | Carol L. Mackersie |
The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss. |
INTERSPEECH |
2002 |
DBLP DOI BibTeX RDF |
|
26 | Jintao Jiang, Abeer Alwan, Lynne E. Bernstein, Edward T. Auer, Patricia A. Keating |
Similarity structure in perceptual and physical measures for visual Consonants across talkers. |
ICASSP |
2002 |
DBLP DOI BibTeX RDF |
|
26 | Stephen Huffman |
The Navajo Code Talkers: a Cryptologic and Linguistic Perspective. |
Cryptologia |
2000 |
DBLP DOI BibTeX RDF |
|
26 | Matthew Richardson, Mei-Yuh Hwang, Alex Acero, Xuedong Huang 0001 |
Improvements on speech recognition for fast talkers. |
EUROSPEECH |
1999 |
DBLP DOI BibTeX RDF |
|
26 | Douglas E. Sturim, Michael S. Brandstein, Harvey F. Silverman |
Tracking multiple talkers using microphone-array measurements. |
ICASSP |
1997 |
DBLP DOI BibTeX RDF |
|
26 | Ann K. Syrdal |
Acoustic variability in spontaneous conversational speech of american English talkers. |
ICSLP |
1996 |
DBLP DOI BibTeX RDF |
|
26 | Kazuhiko Kakehi, Kazumi Kato |
Perception for VCV speech uttered simultaneously or sequentially by two talkers. |
ICSLP |
1994 |
DBLP DOI BibTeX RDF |
|
26 | Eric I. Chang, Richard Lippmann |
Using Voice Transformations to Create Additional Training Talkers for Word Spotting. |
NIPS |
1994 |
DBLP BibTeX RDF |
|
26 | Patrick M. Peterson |
Using linearly-constrained adaptive beamforming to reduce interference in hearing aids from competing talkers in reverberant rooms. |
ICASSP |
1987 |
DBLP DOI BibTeX RDF |
|
26 | Mitchel Weintraub |
A computational model for separating two simultaneous talkers. |
ICASSP |
1986 |
DBLP DOI BibTeX RDF |
|
19 | Toru Takahashi 0001, Kazuhiro Nakadai, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model. |
IROS |
2009 |
DBLP DOI BibTeX RDF |
|
19 | Nicoleta Roman, DeLiang Wang |
Binaural Tracking of Multiple Moving Sources. |
IEEE Trans. Speech Audio Process. |
2008 |
DBLP DOI BibTeX RDF |
|
19 | Stephen Voran |
Listener detection of talker stress in low-rate coded speech. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
19 | Yuji Kubota, Masatoshi Yoshida, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking. |
ISM |
2008 |
DBLP DOI BibTeX RDF |
|
19 | Yutaka Ishii, Tomio Watanabe |
An Embodied Avatar Mediated Communication System with VirtualActor for Human Interaction Analysis. |
RO-MAN |
2007 |
DBLP DOI BibTeX RDF |
|
19 | Hyun-Don Kim, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno |
Auditory and Visual Integration based Localization and Tracking of Multiple Moving Sounds in Daily-life Environments. |
RO-MAN |
2007 |
DBLP DOI BibTeX RDF |
|
19 | Chen Yu 0001 |
Embodied Active Vision in Language Learning and Grounding. |
WAPCV |
2007 |
DBLP DOI BibTeX RDF |
|
19 | Malay Gupta, Scott C. Douglas |
Performance Evaluation of Convolutive Blind Source Separation of Mixtures of Unequal-Level Speech Signals. |
ISCAS |
2007 |
DBLP DOI BibTeX RDF |
|
19 | Sue Harding, Jon P. Barker, Guy J. Brown |
Mask estimation for missing data speech recognition based on statistics of binaural interaction. |
IEEE Trans. Speech Audio Process. |
2006 |
DBLP DOI BibTeX RDF |
|
19 | Joshua M. Sachar, Harvey F. Silverman, William R. Patterson III |
Microphone position and gain calibration for a large-aperture microphone array. |
IEEE Trans. Speech Audio Process. |
2005 |
DBLP DOI BibTeX RDF |
|
19 | Virginie Attina, Marie-Agnès Cathiard, Denis Beautemps |
Temporal Measures of Hand and Speech Coordination During French Cued Speech Production. |
Gesture Workshop |
2005 |
DBLP DOI BibTeX RDF |
|
19 | Nam Soo Kim, Joon-Hyuk Chang |
Signal modification for robust speech coding. |
IEEE Trans. Speech Audio Process. |
2004 |
DBLP DOI BibTeX RDF |
|
19 | Masamichi Hosoda, Akira Nakayama, Minoru Kobayashi, Satoshi Iwaki |
Conference state estimation by biosignal processing: observation of heart rate resonance. |
CHI Extended Abstracts |
2004 |
DBLP DOI BibTeX RDF |
conference state, resonance correlation matrix, heart rate, biosignal |
19 | Barbara Prazak, Stefan Mina, Gernot Kronreif |
Accessibility through Standard Low Cost Pointing Devices. |
ICCHP |
2004 |
DBLP DOI BibTeX RDF |
|
19 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano |
Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction. |
IEA/AIE |
2003 |
DBLP DOI BibTeX RDF |
|
19 | Hiroshi G. Okuno, Kazuhiro Nakadai, Hiroaki Kitano |
Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors. |
PRICAI |
2002 |
DBLP DOI BibTeX RDF |
|
19 | Ce Wang, Scott M. Griebel, Michael S. Brandstein, Bo-June Paul Hsu |
Real-Time Automated Video and Audio Capture with Multiple Cameras and Microphones. |
J. VLSI Signal Process. |
2001 |
DBLP DOI BibTeX RDF |
talker tracking, acoustic localization, audio enhancement, video conferencing, speech enhancement, person tracking, head pose estimation |