Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
155 | Hector Ouilhet |
Google Sky Map: using your phone as an interface. |
Mobile HCI |
2010 |
DBLP DOI BibTeX RDF |
|
92 | Wei Jiang 0001, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui |
Short-term audio-visual atoms for generic video concept classification. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
audio-visual codebook, joint audio-visual analysis, short-term audio-visual atom, semantic concept detection |
78 | Niall A. Fox, Brian A. O'Mullane, Richard B. Reilly |
Audio-Visual Speaker Identification via Adaptive Fusion Using Reliability Estimates of Both Modalities. |
AVBPA |
2005 |
DBLP DOI BibTeX RDF |
|
68 | Mustafa Nazmi Kaynak, Qi Zhi, Adrian David Cheok, Kuntal Sengupta, Jian Zhang, Chi Chung Ko |
Analysis of lip geometric features for audio-visual speech recognition. |
IEEE Trans. Syst. Man Cybern. Part A |
2004 |
DBLP DOI BibTeX RDF |
|
64 | Timothy J. Hazen, Kate Saenko, Chia-Hao La, James R. Glass |
A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. |
ICMI |
2004 |
DBLP DOI BibTeX RDF |
audio-visual corpora, audio-visual speech recognition |
63 | David Grelaud, Nicolas Bonneel, Michael Wimmer 0001, Manuel Asselot, George Drettakis |
Efficient and practical audio-visual rendering for games using crossmodal perception. |
SI3D |
2009 |
DBLP DOI BibTeX RDF |
audio-visual rendering, crossmodal perception |
60 | Toshiko Isei-Jaakkola |
Cognition and Physio-acoustic Correlates - Audio and Audio-visual Effects of a Short English Emotional Statement: On JL2, FL2 and EL1. |
FinTAL |
2006 |
DBLP DOI BibTeX RDF |
|
58 | Giulia Garau, Sileye O. Ba, Hervé Bourlard, Jean-Marc Odobez |
Investigating the use of visual focus of attention for audio-visual speaker diarisation. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
audio-visual speaker diarisation, visual focus of attention |
58 | Niall A. Fox, Richard B. Reilly |
Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features. |
AVBPA |
2003 |
DBLP DOI BibTeX RDF |
|
57 | Jong-Seok Lee, Cheol Hoon Park |
Temporal filtering of visual speech for audio-visual speech recognition in acoustically and visually challenging environments. |
ICMI |
2007 |
DBLP DOI BibTeX RDF |
late integration, neural network, feature extraction, hidden Markov model, noise-robustness, audio-visual speech recognition, temporal filtering |
56 | Marco Cristani, Manuele Bicego, Vittorio Murino |
Audio-Visual Event Recognition in Surveillance Video Sequences. |
IEEE Trans. Multim. |
2007 |
DBLP DOI BibTeX RDF |
|
52 | Vasil Khalidov, Florence Forbes, Miles E. Hansard, Elise Arnaud, Radu Horaud |
Detection and localization of 3d audio-visual objects using unsupervised clustering. |
ICMI |
2008 |
DBLP DOI BibTeX RDF |
audio-visual clustering, binaural hearing, mixture models, stereo vision |
52 | Jörn Anemüller, Jörg-Hendrik Bach, Barbara Caputo, Michal Havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlícek, Tomás Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, Hynek Hermansky |
The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. |
ICMI |
2008 |
DBLP DOI BibTeX RDF |
sensor platform, multimodal interaction, event detection, augmented cognition, audio-visual |
52 | C. Mario Christoudias, Kate Saenko, Louis-Philippe Morency, Trevor Darrell |
Co-Adaptation of audio-visual speech and gesture classifiers. |
ICMI |
2006 |
DBLP DOI BibTeX RDF |
audio-visual speech and gesture, adaptation, semi-supervised learning, human-computer interfaces, co-training |
52 | Benjamin Senechal, Denis Pellerin, Laurent Besacier, Isabelle Simand, Stéphane Bres |
Audio, video and audio-visual signatures for short video clip detection: experiments on Trecvid2003. |
ICME |
2005 |
DBLP DOI BibTeX RDF |
audio-visual signature, video clip detection, Trecvid2003, spatio-temporal signature, gray level centroid, video database |
52 | Yu Cao 0002, Sung Baang, Shih-Hsi Liu, Ming Li 0007, Sanqing Hu |
Audio-visual event classification via spatial-temporal-audio words. |
ICPR |
2008 |
DBLP DOI BibTeX RDF |
|
52 | Jing Huang 0019, Etienne Marcheret, Karthik Visweswariah |
Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition. |
ICME |
2005 |
DBLP DOI BibTeX RDF |
|
50 | Sofia Tsekeridou, Ioannis Pitas |
Audio-Visual Content Analysis for Content-Based Video Indexing. |
ICMCS, Vol. 1 |
1999 |
DBLP DOI BibTeX RDF |
audio-visual content analysis, multimodal interaction, content-based indexing/retrieval |
50 | Marco Cristani, Manuele Bicego, Vittorio Murino |
Audio-Video Integration for Background Modelling. |
ECCV (2) |
2004 |
DBLP DOI BibTeX RDF |
|
50 | Ye Wang 0007, Bingjun Zhang, Olaf Schleusing |
Educational violin transcription by fusing multimedia streams. |
ACM Multimedia EMME Workshop |
2007 |
DBLP DOI BibTeX RDF |
audio-visual fusion, computer-assisted tutoring, detection function, note segmentation, music transcription, onset detection |
50 | Ming Liu 0009, Yun Fu 0001, Thomas S. Huang |
An audio-visual fusion framework with joint dimensionality reduction. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
49 | Louis H. Terry, Derek J. Shiell, Aggelos K. Katsaggelos |
Feature space video stream consistency estimation for dynamic stream weighting in audio-visual speech recognition. |
ICIP |
2008 |
DBLP DOI BibTeX RDF |
|
49 | Pengyu Hong, Zhen Wen, Thomas S. Huang, Heung-Yeung Shum |
Real-Time Speech-Driven 3D Face Animation. |
3DPVT |
2002 |
DBLP DOI BibTeX RDF |
|
48 | Tsuhan Chen, Hans Peter Graf, Barry G. Haskell, Eric Petajan, Yao Wang 0001, Homer H. Chen, Wu Chou |
Speech-assisted lip synchronization in audio-visual communications. |
ICIP |
1995 |
DBLP DOI BibTeX RDF |
audio-visual systems, audio-visual communications, speech-assisted lip synchronization, speech information, speech-assisted frame-rate conversion, talking head video coding, image processing, video coding, synchronisation, videoconferencing, teleconferencing, speech processing, speech analysis, videotelephony, video telephony |
47 | Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajput, L. Venkata Subramaniam |
Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models. |
ICPR |
2000 |
DBLP DOI BibTeX RDF |
|
46 | Timothy J. Hazen |
Visual model structures and synchrony constraints for audio-visual speech recognition. |
IEEE Trans. Speech Audio Process. |
2006 |
DBLP DOI BibTeX RDF |
|
45 | Girija Chetty, Michael Wagner 0004 |
A Robust Speaking Face Modelling Approach Based on Multilevel Fusion. |
DICTA |
2007 |
DBLP DOI BibTeX RDF |
|
44 | Niall A. Fox, Richard B. Reilly |
Robust multi-modal person identification with tolerance of facial expression. |
SMC (1) |
2004 |
DBLP DOI BibTeX RDF |
|
41 | Lei Xie 0001, Zhi-Qiang Liu |
Speech Animation Using Coupled Hidden Markov Models. |
ICPR (1) |
2006 |
DBLP DOI BibTeX RDF |
|
41 | Ara V. Nefian, Luhong Liang, Tieyan Fu, Xiaoxing Liu |
A Bayesian Approach to Audio-Visual Speaker Identification. |
AVBPA |
2003 |
DBLP DOI BibTeX RDF |
|
40 | Lei Chen 0002, Sule Gündüz, M. Tamer Özsu |
Mixed Type Audio Classification with Support Vector Machine. |
ICME |
2006 |
DBLP DOI BibTeX RDF |
|
39 | Petar S. Aleksic, Aggelos K. Katsaggelos |
Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance. |
ICIP (3) |
2005 |
DBLP DOI BibTeX RDF |
|
39 | Amitava Das |
Audio Visual Person Authentication by Multiple Nearest Neighbor Classifiers. |
ICB |
2007 |
DBLP DOI BibTeX RDF |
audio-visual biometric authentication, feature extraction, face recognition, Multimodal, fusion, Speaker recognition, multiple classifiers, VQ |
39 | Indrani Medhi, Archana Prasad, Kentaro Toyama |
Optimal audio-visual representations for illiterate users of computers. |
WWW |
2007 |
DBLP DOI BibTeX RDF |
audio-visual icons, illiterate users, text-free user interfaces |
39 | Satoshi Nakamura 0001, Eli Yamamoto |
Speech-to-Lip Movement Synthesis by Maximizing Audio-Visual Joint Probability Based on the EM Algorithm. |
J. VLSI Signal Process. |
2001 |
DBLP DOI BibTeX RDF |
lip movement synthesis, audio-visual joint probability, speech recognition, EM algorithm |
39 | Wee Sun Lee, Michael R. Frater, Mark R. Pickering, John F. Arnold |
Robustness of multiplexing protocols for audio-visual services over wireless networks. |
ICIP (3) |
1997 |
DBLP DOI BibTeX RDF |
multiplexing protocols, audio-visual services, MPEG 2 systems, ITU-T H.223, high bit-error rates, packet-oriented multiplexing, packet header, forward error correcting codes, header information, FEC protection, quality of service, wireless networks, packet radio networks, wireless channels |
39 | Mike A. Oren, Chris Harding, Terri L. Bonebright |
Evaluation of spatial abilities within a 2D auditory platform game. |
ASSETS |
2008 |
DBLP DOI BibTeX RDF |
visual impairments, video games, spatial ability, audio games |
39 | Alexánder Ceballos, Juan-Bernardo Gómez, Flavio Prieto, Tanneguy Redarce |
Robot Command Interface Using an Audio-Visual Speech Recognition System. |
CIARP |
2009 |
DBLP DOI BibTeX RDF |
Speech recognition, MPEG-4, manipulator |
39 | Tomoaki Koiwa, Kazuhiro Nakadai, Jun-ichi Imura |
Coarse speech recognition by audio-visual integration based on missing feature theory. |
IROS |
2007 |
DBLP DOI BibTeX RDF |
|
39 | Atulya Velivelli, Chong-Wah Ngo, Thomas S. Huang |
Detection of Documentary Scene Changes by Audio-Visual Fusion. |
CIVR |
2003 |
DBLP DOI BibTeX RDF |
|
39 | Hari Sundaram, Lexing Xie, Shih-Fu Chang |
A utility framework for the automatic generation of audio-visual skims. |
ACM Multimedia |
2002 |
DBLP DOI BibTeX RDF |
|
38 | Emily Mower, Maja J. Mataric, Shrikanth S. Narayanan |
Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays. |
ISM |
2008 |
DBLP DOI BibTeX RDF |
|
38 | Ilse Ravyse, Dongmei Jiang, Xiaoyue Jiang, Guoyun Lv, Yunshu Hou, Hichem Sahli, Rongchun Zhao |
DBN Based Models for Audio-Visual Speech Analysis and Recognition. |
PCM |
2006 |
DBLP DOI BibTeX RDF |
|
38 | Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith 0001, Andrew H. C. Thean, Pavel Zemcík |
Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
38 | Renaud Séguier, David Mercier |
Audio-Visual Speech Recognition One Pass Learning with Spiking Neurons. |
ICANN |
2002 |
DBLP DOI BibTeX RDF |
|
38 | Ferda Ofli, Yasemin Demir, Engin Erzin, Yücel Yemez, A. Murat Tekalp |
Multicamera Audio-Visual Analysis of Dance Figures. |
ICME |
2007 |
DBLP DOI BibTeX RDF |
|
38 | Ziyou Xiong |
Audio-visual sports highlights extraction using Coupled Hidden Markov Models. |
Pattern Anal. Appl. |
2005 |
DBLP DOI BibTeX RDF |
|
38 | Thierry Guiard-Marigny, Nicolas Tsingos, Ali Adjoudani, Christian Benoît, Marie-Paule Gascuel |
3D Models Of The Lips For Realistic Speech Animation. |
CA |
1996 |
DBLP DOI BibTeX RDF |
audio-visual systems, realistic images, lip modelling, realistic speech animation, audio visual articulatory speech synthesizer, border contours, vermilion zone, lip shape, speaker dependent conformations, natural lips, French speaker, reference lip shapes, lip contact, natural languages, computer animation, 3D models, implicit surfaces, speech synthesis, volumetric model, continuous functions, human face, geometrical analysis, algebraic equations |
37 | Lucas D. Terissi, Juan Carlos Gómez |
Audio-to-Visual Conversion Via HMM Inversion for Speech-Driven Facial Animation. |
SBIA |
2008 |
DBLP DOI BibTeX RDF |
Audio-Visual Speech Processing, Hidden Markov Models, Facial Animation |
37 | Jeho Nam, A. Enis Çetin, Ahmed H. Tewfik |
Speaker Identification and Video Analysis for Hierarchical Video Shot Classification. |
ICIP (2) |
1997 |
DBLP DOI BibTeX RDF |
hierarchical video shot classification, visual data tracks, audio data tracks, visual stream analysis, voiced phonemes tracking, speech data, audio-visual objects indexing, video shots matching, browsing, video analysis, content-based retrieval, multimedia databases, video databases, speaker recognition, speaker identification, clustering technique, content-based indexing, video data, audio signal, 3D wavelet transform, speaker changes detection |
37 | Rebecca Lunsford, Sharon L. Oviatt |
Human perception of intended addressee during computer-assisted meetings. |
ICMI |
2006 |
DBLP DOI BibTeX RDF |
acoustic-prosodic cues, dialogue style, human-computer teamwork, intended addressee, open-microphone engagement, gaze, multiparty interaction |
37 | Hendrik Knoche, Hermann de Meer, David Kirsh |
Compensating for low frame rates. |
CHI Extended Abstracts |
2005 |
DBLP DOI BibTeX RDF |
audio-visual integration, skew, frame rates, speech perception |
37 | Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos |
Robust multimodal audio-visual processing for advanced context awareness in smart spaces. |
Pers. Ubiquitous Comput. |
2009 |
DBLP DOI BibTeX RDF |
|
37 | Le Xin, Jianhua Tao 0001, Tieniu Tan |
Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method. |
ICIP (3) |
2007 |
DBLP DOI BibTeX RDF |
|
37 | Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos |
Robust Multimodal Audio-Visual Processing for Advanced Context Awareness in Smart Spaces. |
AIAI |
2006 |
DBLP DOI BibTeX RDF |
|
37 | Shengli Fu, Ricardo Gutierrez-Osuna, Anna Esposito, Praveen K. Kakumanu, Oscar N. Garcia |
Audio/visual mapping with cross-modal hidden Markov models. |
IEEE Trans. Multim. |
2005 |
DBLP DOI BibTeX RDF |
|
37 | Zhihong Zeng, Jilin Tu, Ming Liu 0009, Thomas S. Huang |
Multi-stream Confidence Analysis for Audio-Visual Affect Recognition. |
ACII |
2005 |
DBLP DOI BibTeX RDF |
|
37 | Walid Karam, Chafic Mokbel, Hanna Greige, Guido Aversano, Catherine Pelachaud, Gérard Chollet |
An Audio-Visual Imposture Scenario by Talking Face Animation. |
Summer School on Neural Networks |
2004 |
DBLP DOI BibTeX RDF |
|
37 | Satoshi Nakamura 0001 |
Fusion of Audio-Visual Information for Integrated Speech Processing. |
AVBPA |
2001 |
DBLP DOI BibTeX RDF |
|
36 | Tanveer A. Faruquie, Ashish Kapoor, Rohit J. Kate, Nitendra Rajput, L. Venkata Subramaniam |
Audio Driven Facial Animation For Audio-Visual Reality. |
ICME |
2001 |
DBLP DOI BibTeX RDF |
|
36 | Kate Saenko, Trevor Darrell, James R. Glass |
Articulatory features for robust visual speech recognition. |
ICMI |
2004 |
DBLP DOI BibTeX RDF |
articulatory features, speechreading, visual feature extraction, multimodal interfaces, audio-visual speech recognition |
35 | Wichian Sittiprapaporn, Jun Soo Kwon |
Brain Electric Microstate and Perception of Simultaneously Audiovisual Presentation. |
ICANN (1) |
2009 |
DBLP DOI BibTeX RDF |
Audiovisual perception, Microstate, Cognition, Brain, Event-related potential (ERP) |
35 | Bernt Schiele, Nuria Oliver, Tony Jebara, Alex Pentland |
An Interactive Computer Vision System DyPERS: Dynamic Personal Enhanced Reality System. |
ICVS |
1999 |
DBLP DOI BibTeX RDF |
|
34 | Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan |
Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|
34 | Olivier Martin 0001, Irene Kotsia, Benoît Macq, Ioannis Pitas |
The eNTERFACE'05 Audio-Visual Emotion Database. |
ICDE Workshops |
2006 |
DBLP DOI BibTeX RDF |
|
34 | Mat Felthousen |
Combining audio/visual and computing support. |
SIGUCCS |
2005 |
DBLP DOI BibTeX RDF |
AV equipment, design, reliability, standardization, computer labs, budgeting |
34 | Myung-Won Kim, Joung Woo Ryu, Eun Ju Kim |
Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks. |
ICNC (2) |
2005 |
DBLP DOI BibTeX RDF |
|
34 | Lei Xie 0001, Zhi-Qiang Liu |
Multi-stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition. |
ICMLC |
2005 |
DBLP DOI BibTeX RDF |
|
34 | Raffay Hamid, Aaron F. Bobick, Anthony J. Yezzi |
Audio-visual flow - a variational approach to multi-modal flow estimation. |
ICIP |
2004 |
DBLP DOI BibTeX RDF |
|
34 | Iain A. Matthews, Gerasimos Potamianos, Chalapathy Neti, Juergen Luettin |
A Comparison Of Model And Transform-Based Visual Features For Audio-Visual LVCSR. |
ICME |
2001 |
DBLP DOI BibTeX RDF |
|
33 | Walter Allasia, Fabrizio Falchi, Francesco Gallo, Mouna Kacimi, Aaron Kaplan, Jonathan Mamou, Yosi Mass, Nicola Orio |
Audio-Visual Content Analysis in P2P Networks: The SAPIR Approach. |
DEXA Workshops |
2008 |
DBLP DOI BibTeX RDF |
Audio-visual content analysis, SAPIR, P2P |
33 | Zhihong Zeng, ZhenQiu Zhang, Brian Pianfetti, Jilin Tu, Thomas S. Huang |
Audio-visual affect recognition in activation-evaluation space. |
ICME |
2005 |
DBLP DOI BibTeX RDF |
bimodal affect recognition, audio-visual affect recognition, psychological research, activation-evaluation space, Fisher boosting learning algorithm, human-computer interaction, HCI |
33 | Raphaël Troncy, Jean Carrive |
A reduced yet extensible audio-visual description language. |
ACM Symposium on Document Engineering |
2004 |
DBLP DOI BibTeX RDF |
audio-visual description language, semantic web, semantics, knowledge representation, structure, MPEG-7, descriptor |
33 | Andreas Tirakis, Panagiotis Katalagarianos, Michael Papathomas, Christodoulos Hamilakis |
Distributed Audio-Visual Archives Network (DiVAN). |
ICMCS, Vol. 2 |
1999 |
DBLP DOI BibTeX RDF |
Distributed audio-visual archive, Extensible Data Model, OMG-CORBA infrastructure, Content-based access |
33 | Satoshi Nakamura 0001, Ken'ichi Kumatani, Satoshi Tamura |
Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust. |
ICMI |
2002 |
DBLP DOI BibTeX RDF |
|
33 | Kyoung-Ho Choi, Ying Luo 0011, Jenq-Neng Hwang |
Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System. |
J. VLSI Signal Process. |
2001 |
DBLP DOI BibTeX RDF |
HMMI, audio-to-visual conversion, MPEG-4, facial animation |
32 | Simon Lucey, Tsuhan Chen, Sridha Sridharan, Vinod Chandran |
Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition. |
IEEE Trans. Multim. |
2005 |
DBLP DOI BibTeX RDF |
|
32 | Simon Lucey, Tsuhan Chen |
Improved Audio-Visual Speaker Recognition via the Use of a Hybrid Combination Strategy. |
AVBPA |
2003 |
DBLP DOI BibTeX RDF |
|
32 | Zeljko Obrenovic, Dusan Starcevic, Emil Jovanov |
Toward Optimization of Multimodal User Interfaces for Tactical Audio Applications. |
User Interfaces for All |
2002 |
DBLP DOI BibTeX RDF |
|
32 | Yan Li, Heung-Yeung Shum |
Learning dynamic audio-visual mapping with input-output Hidden Markov models. |
IEEE Trans. Multim. |
2006 |
DBLP DOI BibTeX RDF |
|
32 | Gareth J. F. Jones, Richard J. Edens |
Automated Alignment and Annotation of Audio-Visual Presentations. |
ECDL |
2002 |
DBLP DOI BibTeX RDF |
|
32 | Laurent Girin |
Joint matrix quantization of face parameters and LPC coefficients for low bit rate audiovisual speech coding. |
IEEE Trans. Speech Audio Process. |
2004 |
DBLP DOI BibTeX RDF |
|
32 | Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro |
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Milos Zelezný, Petr Císar |
Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition. |
AVSP |
2003 |
DBLP BibTeX RDF |
|
32 | Laurent Girin, Jean-Luc Schwartz, Gang Feng 0002 |
Can the visual input make the audio signal "pop out" in noise? a first study of the enhancement of noisy VCV acoustic sequences by audio-visual fusion. |
AVSP |
1997 |
DBLP BibTeX RDF |
|
31 | Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano |
Sound and Visual Tracking for Humanoid Robot. |
Appl. Intell. |
2004 |
DBLP DOI BibTeX RDF |
robot audition, audio-visual integration, audio-visual tracking, computational auditory scene analysis |
31 | Kyoung-Ho Choi, Jong-Hoon Lee |
Constrained optimization for a speech driven talking head. |
ISCAS (2) |
2003 |
DBLP DOI BibTeX RDF |
|
31 | Adam Nash |
Real time art engines 3: post-convergent creative practice in MUVEs. |
IE |
2007 |
DBLP BibTeX RDF |
audio-visual composition, post-convergent space, Second Life, multi-user virtual environments |
31 | Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan |
Multimodal multispeaker probabilistic tracking in meetings. |
ICMI |
2005 |
DBLP DOI BibTeX RDF |
audio-visual speaker tracking, particle filters, MCMC |
30 | Brit Susan Jensen, Mikael B. Skov, Nissan Thiruravichandran |
Studying driver attention and behaviour for three configurations of GPS navigation in real traffic driving. |
CHI |
2010 |
DBLP DOI BibTeX RDF |
eye glances, navigation guide, output modalities, gps, driving, field experiment, in-vehicle systems |
30 | Martin J. Hicks, Sarah Nichols 0001, Claire O'Malley |
Comparing the roles of 3D representations in audio and audio-visual collaborations. |
Virtual Real. |
2004 |
DBLP DOI BibTeX RDF |
Collaborative problem solving, Information visualisation, 3D representations |
30 | Louis H. Terry, Aggelos K. Katsaggelos |
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition. |
ICPR |
2008 |
DBLP DOI BibTeX RDF |
|
30 | Paisarn Muneesawang, Tahir Amin, Ling Guan |
Audio Visual Cues for Video Indexing and Retrieval. |
PCM (1) |
2004 |
DBLP DOI BibTeX RDF |
|
30 | Marios Kyperountas, Constantine Kotropoulos, Ioannis Pitas |
Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection. |
IEEE Trans. Multim. |
2007 |
DBLP DOI BibTeX RDF |
|
30 | Kai Nickel, Tobias Gehrig, Hazim Kemal Ekenel, John W. McDonough, Rainer Stiefelhagen |
An Audio-Visual Particle Filter for Speaker Tracking on the CLEAR'06 Evaluation Dataset. |
CLEAR |
2006 |
DBLP DOI BibTeX RDF |
|
29 | Gerasimos Potamianos, Chalapathy Neti |
Improved ROI and within frame discriminant features for lipreading. |
ICIP (3) |
2001 |
DBLP DOI BibTeX RDF |
|
29 | Tian Gan, Wolfgang Menzel, Jianwei Zhang 0001 |
Using the Tandem Approach for AF Classification in an AVSR System. |
ISNN (2) |
2008 |
DBLP DOI BibTeX RDF |
Articulatory Features, MLP, Audio Visual Speech Recognition |
29 | Dong Zhang 0001, Daniel Gatica-Perez, Samy Bengio |
Semi-supervised meeting event recognition with adapted HMMs. |
ICME |
2005 |
DBLP DOI BibTeX RDF |
audio-visual event, semisupervised framework, meeting event recognition, HMM adaptation technique |
28 | Gerald Friedland, Chuohao Yeo, Hayley Hung |
Visual speaker localization aided by acoustic models. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
visual localization, multimodal integration, speaker diarization |
28 | Elise Arnaud, Heidi Christensen, Yan-Chen Lu, Jon Barker, Vasil Khalidov, Miles E. Hansard, Bertrand Holveck, Hervé Mathieu, Ramya Narasimha, Elise Taillant, Florence Forbes, Radu Horaud |
The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements. |
ICMI |
2008 |
DBLP DOI BibTeX RDF |
binaural hearing, database, stereo vision |
Displaying results #1 - #100 of 3474 (100 per page).