The FacetedDBLP logo    Search for: in:

Disable automatic phrases ?     Syntactic query expansion: ?

Searching for phrase audio/visual (changed automatically) with no syntactic query expansion in all metadata.

Publication years (Num. hits)
1974-1993 (15) 1994-1996 (21) 1997 (55) 1998 (29) 1999 (34) 2000 (42) 2001 (73) 2002 (68) 2003 (103) 2004 (113) 2005 (100) 2006 (101) 2007 (144) 2008 (145) 2009 (118) 2010 (84) 2011 (103) 2012 (95) 2013 (102) 2014 (101) 2015 (85) 2016 (90) 2017 (99) 2018 (139) 2019 (160) 2020 (198) 2021 (269) 2022 (292) 2023 (418) 2024 (78)
Publication types (Num. hits)
article(1159) book(1) incollection(17) inproceedings(2241) phdthesis(43) proceedings(13)
Venues (Conferences, Journals, ...)
CoRR(567) ICASSP(158) INTERSPEECH(152) AVSP(136) HAVE(79) ACM Multimedia(75) ICME(69) ICMI(57) AVEC@ACM Multimedia(45) AVEC@MM(41) CVPR(40) IEEE Trans. Multim.(39) EUSIPCO(35) MMSP(32) IEEE Access(28) AAAI(24) More (+10 of total 800)
GrowBag graphs for keyword ? (Num. hits/coverage)

Group by:
The graphs summarize 800 occurrences of 559 keywords

Results
Found 3474 publication records. Showing 3474 according to the selection in the facets
Hits ? Authors Title Venue Year Link Author keywords
155Hector Ouilhet Google Sky Map: using your phone as an interface. Search on Bibsonomy Mobile HCI The full citation details ... 2010 DBLP  DOI  BibTeX  RDF
92Wei Jiang 0001, Courtenay V. Cotton, Shih-Fu Chang, Dan Ellis, Alexander C. Loui Short-term audio-visual atoms for generic video concept classification. Search on Bibsonomy ACM Multimedia The full citation details ... 2009 DBLP  DOI  BibTeX  RDF audio-visual codebook, joint audio-visual analysis, short-term audio-visual atom, semantic concept detection
78Niall A. Fox, Brian A. O'Mullane, Richard B. Reilly Audio-Visual Speaker Identification via Adaptive Fusion Using Reliability Estimates of Both Modalities. Search on Bibsonomy AVBPA The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
68Mustafa Nazmi Kaynak, Qi Zhi, Adrian David Cheok, Kuntal Sengupta, Jian Zhang, Chi Chung Ko Analysis of lip geometric features for audio-visual speech recognition. Search on Bibsonomy IEEE Trans. Syst. Man Cybern. Part A The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
64Timothy J. Hazen, Kate Saenko, Chia-Hao La, James R. Glass A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. Search on Bibsonomy ICMI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF audio-visual corpora, audio-visual speech recognition
63David Grelaud, Nicolas Bonneel, Michael Wimmer 0001, Manuel Asselot, George Drettakis Efficient and practical audio-visual rendering for games using crossmodal perception. Search on Bibsonomy SI3D The full citation details ... 2009 DBLP  DOI  BibTeX  RDF audio-visual rendering, crossmodal perception
60Toshiko Isei-Jaakkola Cognition and Physio-acoustic Correlates - Audio and Audio-visual Effects of a Short English Emotional Statement: On JL2, FL2 and EL1. Search on Bibsonomy FinTAL The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
58Giulia Garau, Sileye O. Ba, Hervé Bourlard, Jean-Marc Odobez Investigating the use of visual focus of attention for audio-visual speaker diarisation. Search on Bibsonomy ACM Multimedia The full citation details ... 2009 DBLP  DOI  BibTeX  RDF audio-visual speaker diarisation, visual focus of attention
58Niall A. Fox, Richard B. Reilly Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features. Search on Bibsonomy AVBPA The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
57Jong-Seok Lee, Cheol Hoon Park Temporal filtering of visual speech for audio-visual speech recognition in acoustically and visually challenging environments. Search on Bibsonomy ICMI The full citation details ... 2007 DBLP  DOI  BibTeX  RDF late integration, neural network, feature extraction, hidden Markov model, noise-robustness, audio-visual speech recognition, temporal filtering
56Marco Cristani, Manuele Bicego, Vittorio Murino Audio-Visual Event Recognition in Surveillance Video Sequences. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
52Vasil Khalidov, Florence Forbes, Miles E. Hansard, Elise Arnaud, Radu Horaud Detection and localization of 3d audio-visual objects using unsupervised clustering. Search on Bibsonomy ICMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF audio-visual clustering, binaural hearing, mixture models, stereo vision
52Jörn Anemüller, Jörg-Hendrik Bach, Barbara Caputo, Michal Havlena, Jie Luo, Hendrik Kayser, Bastian Leibe, Petr Motlícek, Tomás Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, Hynek Hermansky The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. Search on Bibsonomy ICMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF sensor platform, multimodal interaction, event detection, augmented cognition, audio-visual
52C. Mario Christoudias, Kate Saenko, Louis-Philippe Morency, Trevor Darrell Co-Adaptation of audio-visual speech and gesture classifiers. Search on Bibsonomy ICMI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF audio-visual speech and gesture, adaptation, semi-supervised learning, human-computer interfaces, co-training
52Benjamin Senechal, Denis Pellerin, Laurent Besacier, Isabelle Simand, Stéphane Bres Audio, video and audio-visual signatures for short video clip detection: experiments on Trecvid2003. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF audio-visual signature, video clip detection, Trecvid2003, spatio-temporal signature, gray level centroid, video database
52Yu Cao 0002, Sung Baang, Shih-Hsi Liu, Ming Li 0007, Sanqing Hu Audio-visual event classification via spatial-temporal-audio words. Search on Bibsonomy ICPR The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
52Jing Huang 0019, Etienne Marcheret, Karthik Visweswariah Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
50Sofia Tsekeridou, Ioannis Pitas Audio-Visual Content Analysis for Content-Based Video Indexing. Search on Bibsonomy ICMCS, Vol. 1 The full citation details ... 1999 DBLP  DOI  BibTeX  RDF audio-visual content analysis, multimodal interaction, content-based indexing/retrieval
50Marco Cristani, Manuele Bicego, Vittorio Murino Audio-Video Integration for Background Modelling. Search on Bibsonomy ECCV (2) The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
50Ye Wang 0007, Bingjun Zhang, Olaf Schleusing Educational violin transcription by fusing multimedia streams. Search on Bibsonomy ACM Multimedia EMME Workshop The full citation details ... 2007 DBLP  DOI  BibTeX  RDF audio-visual fusion, computer-assisted tutoring, detection function, note segmentation, music transcription, onset detection
50Ming Liu 0009, Yun Fu 0001, Thomas S. Huang An audio-visual fusion framework with joint dimensionality reducton. Search on Bibsonomy ICASSP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
49Louis H. Terry, Derek J. Shiell, Aggelos K. Katsaggelos Feature space video stream consistency estimation for dynamic stream weighting in audio-visual speech recognition. Search on Bibsonomy ICIP The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
49Pengyu Hong, Zhen Wen, Thomas S. Huang, Heung-Yeung Shum Real-Time Speech-Driven 3D Face Animation. Search on Bibsonomy 3DPVT The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
48Tsuhan Chen, Hans Peter Graf, Barry G. Haskell, Eric Petajan, Yao Wang 0001, Homer H. Chen, Wu Chou Speech-assisted lip synchronization in audio-visual communications. Search on Bibsonomy ICIP The full citation details ... 1995 DBLP  DOI  BibTeX  RDF audio-visual systems, audio-visual communications, speech-assisted lip synchronization, speech information, speech-assisted frame-rate conversion, talking head video coding, image processing, image processing, video coding, synchronisation, videoconferencing, teleconferencing, speech processing, speech analysis, videotelephony, video telephony
47Tanveer A. Faruquie, Abhik Majumdar, Nitendra Rajput, L. Venkata Subramaniam Large Vocabulary Audio-Visual Speech Recognition Using Active Shape Models. Search on Bibsonomy ICPR The full citation details ... 2000 DBLP  DOI  BibTeX  RDF
46Timothy J. Hazen Visual model structures and synchrony constraints for audio-visual speech recognition. Search on Bibsonomy IEEE Trans. Speech Audio Process. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
45Girija Chetty, Michael Wagner 0004 A Robust Speaking Face Modelling Approach Based on Multilevel Fusion. Search on Bibsonomy DICTA The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
44Niall A. Fox, Richard B. Reilly Robust multi-modal person identification with tolerance of facial expression. Search on Bibsonomy SMC (1) The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
41Lei Xie 0001, Zhi-Qiang Liu Speech Animation Using Coupled Hidden Markov Models. Search on Bibsonomy ICPR (1) The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
41Ara V. Nefian, Luhong Liang, Tieyan Fu, Xiaoxing Liu A Bayesian Approach to Audio-Visual Speaker Identification. Search on Bibsonomy AVBPA The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
40Lei Chen 0002, Sule Gündüz, M. Tamer Özsu Mixed Type Audio Classification with Support Vector Machine. Search on Bibsonomy ICME The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
39Petar S. Aleksic, Aggelos K. Katsaggelos Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance. Search on Bibsonomy ICIP (3) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
39Amitava Das Audio Visual Person Authentication by Multiple Nearest Neighbor Classifiers. Search on Bibsonomy ICB The full citation details ... 2007 DBLP  DOI  BibTeX  RDF audio-visual biometric authentication, feature extraction, face recognition, Multimodal, fusion, Speaker recognition, multiple classifiers, VQ
39Indrani Medhi, Archana Prasad, Kentaro Toyama Optimal audio-visual representations for illiterate users of computers. Search on Bibsonomy WWW The full citation details ... 2007 DBLP  DOI  BibTeX  RDF audio-visual icons, illiterate users, text-free user interfaces
39Satoshi Nakamura 0001, Eli Yamamoto Speech-to-Lip Movement Synthesis by Maximizing Audio-Visual Joint Probability Based on the EM Algorithm. Search on Bibsonomy J. VLSI Signal Process. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF lip movement synthesis, audio-visual joint probability, speech recognition, EM algorithm
39Wee Sun Lee, Michael R. Frater, Mark R. Pickering, John F. Arnold Robustness of multiplexing protocols for audio-visual services over wireless networks. Search on Bibsonomy ICIP (3) The full citation details ... 1997 DBLP  DOI  BibTeX  RDF multiplexing protocols, audio-visual services, MPEG 2 systems, ITU-T H.223, high bit-error rates, packet-oriented multiplexing, packet header, forward error correcting codes, header information, FEC protection, quality of service, wireless networks, packet radio networks, wireless channels
39Mike A. Oren, Chris Harding, Terri L. Bonebright Evaluation of spatial abilities within a 2D auditory platform game. Search on Bibsonomy ASSETS The full citation details ... 2008 DBLP  DOI  BibTeX  RDF visual impairments, video games, spatial ability, audio games
39Alexánder Ceballos, Juan-Bernardo Gómez, Flavio Prieto, Tanneguy Redarce Robot Command Interface Using an Audio-Visual Speech Recognition System. Search on Bibsonomy CIARP The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Speech recognition, MPEG-4, manipulator
39Tomoaki Koiwa, Kazuhiro Nakadai, Jun-ichi Imura Coarse speech recognition by audio-visual integration based on missing feature theory. Search on Bibsonomy IROS The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
39Atulya Velivelli, Chong-Wah Ngo, Thomas S. Huang Detection of Documentary Scene Changes by Audio-Visual Fusion. Search on Bibsonomy CIVR The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
39Hari Sundaram, Lexing Xie, Shih-Fu Chang A utility framework for the automatic generation of audio-visual skims. Search on Bibsonomy ACM Multimedia The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
38Emily Mower, Maja J. Mataric, Shrikanth S. Narayanan Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays. Search on Bibsonomy ISM The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
38Ilse Ravyse, Dongmei Jiang, Xiaoyue Jiang, Guoyun Lv, Yunshu Hou, Hichem Sahli, Rongchun Zhao DBN Based Models for Audio-Visual Speech Analysis and Recognition. Search on Bibsonomy PCM The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
38Marc A. Al-Hames, Thomas Hain, Jan Cernocký, Sascha Schreiber, Mannes Poel, Ronald Müller, Sébastien Marcel, David A. van Leeuwen, Jean-Marc Odobez, Sileye O. Ba, Hervé Bourlard, Fabien Cardinaux, Daniel Gatica-Perez, Adam Janin, Petr Motlícek, Stephan Reiter, Steve Renals, Jeroen van Rest, Rutger Rienks, Gerhard Rigoll, Kevin Smith 0001, Andrew H. C. Thean, Pavel Zemcík Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. Search on Bibsonomy MLMI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
38Renaud Séguier, David Mercier Audio-Visual Speech Recognition One Pass Learning with Spiking Neurons. Search on Bibsonomy ICANN The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
38Ferda Ofli, Yasemin Demir, Engin Erzin, Yücel Yemez, A. Murat Tekalp Multicamera Audio-Visual Analysis of Dance Figures. Search on Bibsonomy ICME The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
38Ziyou Xiong Audio-visual sports highlights extraction using Coupled Hidden Markov Models. Search on Bibsonomy Pattern Anal. Appl. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
38Thierry Guiard-Marigny, Nicolas Tsingos, Ali Adjoudani, Christian Benoît, Marie-Paule Gascuel 3D Models Of The Lips For Realistic Speech Animation. Search on Bibsonomy CA The full citation details ... 1996 DBLP  DOI  BibTeX  RDF audio-visual systems, realistic images, lip modelling, realistic speech animation, audio visual articulatory speech synthesizer, border contours, vermilion zone, lip shape, speaker dependent conformations, natural lips, French speaker, reference lip shapes, lip contact, natural languages, computer animation, 3D models, implicit surfaces, speech synthesis, volumetric model, continuous functions, human face, geometrical analysis, algebraic equations
37Lucas D. Terissi, Juan Carlos Gómez Audio-to-Visual Conversion Via HMM Inversion for Speech-Driven Facial Animation. Search on Bibsonomy SBIA The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Audio-Visual Speech Processing, Hidden Markov Models, Facial Animation
37Jeho Nam, A. Enis Çetin, Ahmed H. Tewfik Speaker Identification and Video Analysis for Hierarchical Video Shot Classification . Search on Bibsonomy ICIP (2) The full citation details ... 1997 DBLP  DOI  BibTeX  RDF hierarchical video shot classification, visual data tracks, audio data tracks, visual stream analysis, voiced phonemes tracking, speech data, audio-visual objects indexing, video shots matching, browsing, video analysis, content-based retrieval, multimedia databases, video databases, speaker recognition, speaker identification, clustering technique, content-based indexing, video data, audio signal, 3D wavelet transform, speaker changes detection
37Rebecca Lunsford, Sharon L. Oviatt Human perception of intended addressee during computer-assisted meetings. Search on Bibsonomy ICMI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF acoustic-prosodic cues, dialogue style, human-computer teamwork, intended addressee, open-microphone engagement, gaze, multiparty interaction
37Hendrik Knoche, Hermann de Meer, David Kirsh Compensating for low frame rates. Search on Bibsonomy CHI Extended Abstracts The full citation details ... 2005 DBLP  DOI  BibTeX  RDF audio-visual integration, skew, frame rates, speech perception
37Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos Robust multimodal audio-visual processing for advanced context awareness in smart spaces. Search on Bibsonomy Pers. Ubiquitous Comput. The full citation details ... 2009 DBLP  DOI  BibTeX  RDF
37Le Xin, Jianhua Tao 0001, Tieniu Tan Dynamic Audio-Visual Mapping using Fused Hidden Markov Model Inversion Method. Search on Bibsonomy ICIP (3) The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
37Aristodemos Pnevmatikakis, John Soldatos 0001, Fotios Talantzis, Lazaros Polymenakos Robust Multimodal Audio-Visual Processing for Advanced Context Awareness in Smart Spaces. Search on Bibsonomy AIAI The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
37Shengli Fu, Ricardo Gutierrez-Osuna, Anna Esposito, Praveen K. Kakumanu, Oscar N. Garcia Audio/visual mapping with cross-modal hidden Markov models. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
37Zhihong Zeng, Jilin Tu, Ming Liu 0009, Thomas S. Huang Multi-stream Confidence Analysis for Audio-Visual Affect Recognition. Search on Bibsonomy ACII The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
37Walid Karam, Chafic Mokbel, Hanna Greige, Guido Aversano, Catherine Pelachaud, Gérard Chollet An Audio-Visual Imposture Scenario by Talking Face Animation. Search on Bibsonomy Summer School on Neural Networks The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
37Satoshi Nakamura 0001 Fusion of Audio-Visual Information for Integrated Speech Processing. Search on Bibsonomy AVBPA The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
36Tanveer A. Faruquie, Ashish Kapoor, Rohit J. Kate, Nitendra Rajput, L. Venkata Subramaniam Audio Driven Facial Animation For Audio-Visual Reality. Search on Bibsonomy ICME The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
36Kate Saenko, Trevor Darrell, James R. Glass Articulatory features for robust visual speech recognition. Search on Bibsonomy ICMI The full citation details ... 2004 DBLP  DOI  BibTeX  RDF articulatory features, speechreading, visual feature extraction, multimodal interfaces, audio-visual speech recognition
35Wichian Sittiprapaporn, Jun Soo Kwon Brain Electric Microstate and Perception of Simultaneously Audiovisual Presentation. Search on Bibsonomy ICANN (1) The full citation details ... 2009 DBLP  DOI  BibTeX  RDF Audiovisual perception, Microstate, Cognition, Brain, Event-related potential (ERP)
35Bernt Schiele, Nuria Oliver, Tony Jebara, Alex Pentland An Interactive Computer Vision System DyPERS: Dynamic Personal Enhanced Reality System. Search on Bibsonomy ICVS The full citation details ... 1999 DBLP  DOI  BibTeX  RDF
34Hari Krishna Maganti, Daniel Gatica-Perez, Iain McCowan Speech Enhancement and Recognition in Meetings With an Audio-Visual Sensor Array. Search on Bibsonomy IEEE Trans. Speech Audio Process. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
34Olivier Martin 0001, Irene Kotsia, Benoît Macq, Ioannis Pitas The eNTERFACE'05 Audio-Visual Emotion Database. Search on Bibsonomy ICDE Workshops The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
34Mat Felthousen Combining audio/visual and computing support. Search on Bibsonomy SIGUCCS The full citation details ... 2005 DBLP  DOI  BibTeX  RDF AV equipment, design, reliability, standardization, computer labs, budgeting
34Myung-Won Kim, Joung Woo Ryu, Eun Ju Kim Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks. Search on Bibsonomy ICNC (2) The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
34Lei Xie 0001, Zhi-Qiang Liu Multi-stream Articulator Model with Adaptive Reliability Measure for Audio Visual Speech Recognition. Search on Bibsonomy ICMLC The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
34Raffay Hamid, Aaron F. Bobick, Anthony J. Yezzi Audio-visual flow -a variational approach to multi-modal flow estimation. Search on Bibsonomy ICIP The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
34Iain A. Matthews, Gerasimos Potamianos, Chalapathy Neti, Juergen Luettin A Comparison Of Model And Transform-Based Visual Features For Audio-Visual LVCSR. Search on Bibsonomy ICME The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
33Walter Allasia, Fabrizio Falchi, Francesco Gallo, Mouna Kacimi, Aaron Kaplan, Jonathan Mamou, Yosi Mass, Nicola Orio Audio-Visual Content Analysis in P2P Networks: The SAPIR Approach. Search on Bibsonomy DEXA Workshops The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Audio-visual content analysis, SAPIR, P2P
33Zhihong Zeng, ZhenQiu Zhang, Brian Pianfetti, Jilin Tu, Thomas S. Huang Audio-visual affect recognition in activation-evaluation space. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF bimodal affect recognition, audio-visual affect recognition, psychological research, activation-evaluation space, Fisher boosting learning algorithm, human-computer interaction, HCI
33Raphaël Troncy, Jean Carrive A reduced yet extensible audio-visual description language. Search on Bibsonomy ACM Symposium on Document Engineering The full citation details ... 2004 DBLP  DOI  BibTeX  RDF audio-visual description language, semantic web, semantics, knowledge representation, structure, MPEG-7, descriptor
33Andreas Tirakis, Panagiotis Katalagarianos, Michael Papathomas, Christodoulos Hamilakis Distributed Audio-Visual Archives Network (DiVAN). Search on Bibsonomy ICMCS, Vol. 2 The full citation details ... 1999 DBLP  DOI  BibTeX  RDF Distributed audio-visual archive, Extensible Data Model, OMG-CORBA infrastructure, Content-based access
33Satoshi Nakamura 0001, Ken'ichi Kumatani, Satoshi Tamura Multi-Modal Temporal Asynchronicity Modeling by Product HMMs for Robust. Search on Bibsonomy ICMI The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
33Kyoung-Ho Choi, Ying Luo 0011, Jenq-Neng Hwang Hidden Markov Model Inversion for Audio-to-Visual Conversion in an MPEG-4 Facial Animation System. Search on Bibsonomy J. VLSI Signal Process. The full citation details ... 2001 DBLP  DOI  BibTeX  RDF HMMI, audio-to-visual conversion, MPEG-4, facial animation
32Simon Lucey, Tsuhan Chen, Sridha Sridharan, Vinod Chandran Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2005 DBLP  DOI  BibTeX  RDF
32Simon Lucey, Tsuhan Chen Improved Audio-Visual Speaker Recognition via the Use of a Hybrid Combination Strategy. Search on Bibsonomy AVBPA The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
32Zeljko Obrenovic, Dusan Starcevic, Emil Jovanov Toward Optimization of Multimodal User Interfaces for Tactical Audio Applications. Search on Bibsonomy User Interfaces for All The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
32Yan Li, Heung-Yeung Shum Learning dynamic audio-visual mapping with input-output Hidden Markov models. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
32Gareth J. F. Jones, Richard J. Edens Automated Alignment and Annotation of Audio-Visual Presentations. Search on Bibsonomy ECDL The full citation details ... 2002 DBLP  DOI  BibTeX  RDF
32Laurent Girin Joint matrix quantization of face parameters and LPC coefficients for low bit rate audiovisual speech coding. Search on Bibsonomy IEEE Trans. Speech Audio Process. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
32Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. Search on Bibsonomy CoRR The full citation details ... 2023 DBLP  DOI  BibTeX  RDF
32Milos Zelezný, Petr Císar Czech audio-visual speech corpus of a car driver for in-vehicle audio-visual speech recognition. Search on Bibsonomy AVSP The full citation details ... 2003 DBLP  BibTeX  RDF
32Laurent Girin, Jean-Luc Schwartz, Gang Feng 0002 Can the visual input make the audio signal "pop out" in noise ? a first study of the enhancement of noisy VCV acoustic sequences by audio-visual fusion. Search on Bibsonomy AVSP The full citation details ... 1997 DBLP  BibTeX  RDF
31Hiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano Sound and Visual Tracking for Humanoid Robot. Search on Bibsonomy Appl. Intell. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF robot audition, audio-visual integration, audio-visual tracking, computational auditory scene analysis
31Kyoung-Ho Choi, Jong-Hoon Lee Constrained optimization for a speech driven talking head. Search on Bibsonomy ISCAS (2) The full citation details ... 2003 DBLP  DOI  BibTeX  RDF
31Adam Nash Real time art engines 3: post-convergent creative practice in MUVEs. Search on Bibsonomy IE The full citation details ... 2007 DBLP  BibTeX  RDF audio-visual composition, post-convergent space, Second Life, multi-user virtual environments
31Daniel Gatica-Perez, Guillaume Lathoud, Jean-Marc Odobez, Iain McCowan Multimodal multispeaker probabilistic tracking in meetings. Search on Bibsonomy ICMI The full citation details ... 2005 DBLP  DOI  BibTeX  RDF audio-visual speaker tracking, particle filters, MCMC
30Brit Susan Jensen, Mikael B. Skov, Nissan Thiruravichandran Studying driver attention and behaviour for three configurations of GPS navigation in real traffic driving. Search on Bibsonomy CHI The full citation details ... 2010 DBLP  DOI  BibTeX  RDF eye glances, navigation guide, output modalities, gps, driving, field experiment, in-vehicle systems
30Martin J. Hicks, Sarah Nichols 0001, Claire O'Malley Comparing the roles of 3D representations in audio and audio-visual collaborations. Search on Bibsonomy Virtual Real. The full citation details ... 2004 DBLP  DOI  BibTeX  RDF Collaborative problem solving, Information visualisation, 3D representations
30Louis H. Terry, Aggelos K. Katsaggelos A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition. Search on Bibsonomy ICPR The full citation details ... 2008 DBLP  DOI  BibTeX  RDF
30Paisarn Muneesawang, Tahir Amin, Ling Guan Audio Visual Cues for Video Indexing and Retrieval. Search on Bibsonomy PCM (1) The full citation details ... 2004 DBLP  DOI  BibTeX  RDF
30Marios Kyperountas, Constantine Kotropoulos, Ioannis Pitas Enhanced Eigen-Audioframes for Audiovisual Scene Change Detection. Search on Bibsonomy IEEE Trans. Multim. The full citation details ... 2007 DBLP  DOI  BibTeX  RDF
30Kai Nickel, Tobias Gehrig, Hazim Kemal Ekenel, John W. McDonough, Rainer Stiefelhagen An Audio-Visual Particle Filter for Speaker Tracking on the CLEAR'06 Evaluation Dataset. Search on Bibsonomy CLEAR The full citation details ... 2006 DBLP  DOI  BibTeX  RDF
29Gerasimos Potamianos, Chalapathy Neti Improved ROI and within frame discriminant features for lipreading. Search on Bibsonomy ICIP (3) The full citation details ... 2001 DBLP  DOI  BibTeX  RDF
29Tian Gan, Wolfgang Menzel, Jianwei Zhang 0001 Using the Tandem Approach for AF Classification in an AVSR System. Search on Bibsonomy ISNN (2) The full citation details ... 2008 DBLP  DOI  BibTeX  RDF Articulatory Features, MLP, Audio Visual Speech Recognition
29Dong Zhang 0001, Daniel Gatica-Perez, Samy Bengio Semi-supervised meeting event recognition with adapted HMMs. Search on Bibsonomy ICME The full citation details ... 2005 DBLP  DOI  BibTeX  RDF audio-visual event, semisupervised framework, meeting event recognition, HMM adaptation technique
28Gerald Friedland, Chuohao Yeo, Hayley Hung Visual speaker localization aided by acoustic models. Search on Bibsonomy ACM Multimedia The full citation details ... 2009 DBLP  DOI  BibTeX  RDF visual localization, multimodal integration, speaker diarization
28Elise Arnaud, Heidi Christensen, Yan-Chen Lu, Jon Barker, Vasil Khalidov, Miles E. Hansard, Bertrand Holveck, Hervé Mathieu, Ramya Narasimha, Elise Taillant, Florence Forbes, Radu Horaud The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements. Search on Bibsonomy ICMI The full citation details ... 2008 DBLP  DOI  BibTeX  RDF binaural hearing, database, stereo vision
Displaying result #1 - #100 of 3474 (100 per page; Change: )
Pages: [1][2][3][4][5][6][7][8][9][10][>>]
Valid XHTML 1.1! Valid CSS! [Valid RSS]
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
open data data released under the ODC-BY 1.0 license