|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 333 occurrences of 281 keywords
|
|
|
Results
Found 262 publication records. Showing 262 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
57 | Min Xu 0001, Changsheng Xu, Lingyu Duan, Jesse S. Jin, Suhuai Luo |
Audio keywords generation for sports video analysis. |
ACM Trans. Multim. Comput. Commun. Appl. |
2008 |
DBLP DOI BibTeX RDF |
Audio keywords, support vector machines, event detection, semantics analysis, sports video analysis |
46 | Marco Mattavelli, Sylvain Brunetton, Daniel Mlynek |
Computational Graceful Degradation for Video Sequence Decoding. |
ICIP (1) |
1997 |
DBLP DOI BibTeX RDF |
computational graceful degradation, video sequence decoding, software based video decoders, dedicated hardware real time systems, video/audio decoders, video compression standards, video/audio bitstreams processing, multimedia processors, compressed video sequences, H.263 video compression standard, video processor platform, interfaces, video coding, simulation results, real-time performance, main memory, IDCT, hardware platform |
39 | Jin-Hwan Jeong, Chuck Yoo |
Software-Based Video/Audio Processing for Cellular Phones. |
Telecommun. Syst. |
2005 |
DBLP DOI BibTeX RDF |
video/audio processing, adaptation, cellular phone |
36 | Ming Yang 0019, Nikolaos G. Bourbakis, Zizhong Chen, Monica A. Trifas |
An Efficient Audio-Video Synchronization Methodology. |
ICME |
2007 |
DBLP DOI BibTeX RDF |
|
36 | Alexander Haubold, Promiti Dutta, John R. Kender |
Evaluation of video browser features and user interaction with VAST MM. |
ACM Multimedia |
2008 |
DBLP DOI BibTeX RDF |
presentation video, speaker index, structure in videos, text augmentation, transcript analysis, evaluation, measures, user studies, automatic speech recognition, streaming video, speaker segmentation, video library, visual segmentation |
36 | Mohand-Said Hacid, Cyril Decleir, Jacques Kouloumdjian |
A Database Approach for Modeling and Querying Video Data. |
IEEE Trans. Knowl. Data Eng. |
2000 |
DBLP DOI BibTeX RDF |
Content-based access of video, rule-based query language, constraint query language, object-oriented modeling, video indexing, video database, video representation, video query |
36 | Min Xu 0001, Liang-Tien Chia, Jesse S. Jin |
Affective content analysis in comedy and horror videos by audio emotional event detection. |
ICME |
2005 |
DBLP DOI BibTeX RDF |
affective content analysis, comedy video, horror video, audio emotional event detection, AEE, video-audio segmentation, audio processing technique, domain knowledge, visual information |
32 | Yuchi Liu, Ling-Da Wu, Zhen Lei, Yu-Xiang Xie |
Hierarchically multi-modal indexing of soccer video. |
MMM |
2006 |
DBLP DOI BibTeX RDF |
|
32 | Hsiang-An Wang, Guey-Ching Chen, Chih-Yi Chiu, Jan-Ming Ho |
A Digital Video Archive System of NDAP Taiwan. |
ICADL |
2006 |
DBLP DOI BibTeX RDF |
video archive, video index, video management, video content analysis |
27 | Jianyu Dong, Chao He 0001, Yuan F. Zheng |
AVP: A Highly Efficient Real-Time Protocol for Multimedia Communications on Internet. |
ITCC |
2001 |
DBLP DOI BibTeX RDF |
|
27 | HonCheong Ng, HockLye Toh, Susanto Rahardja, Haibin Huang, Xiang Chen, Hui Lan, Xiao Lin 0001 |
A Standalone Video Communication System for Wireless Applications. |
ISCC |
2000 |
DBLP DOI BibTeX RDF |
|
27 | Wensheng Zhou, Son K. Dao |
Combining Hierarchical Classifiers with Video Semantic Indexing Systems. |
IEEE Pacific Rim Conference on Multimedia |
2001 |
DBLP DOI BibTeX RDF |
|
27 | Shitij Mutreja, Stuart Bailey, Robert L. Grossman, David Hanley |
Lightweight video service for multi-media digital libraries. |
CASCON |
1995 |
DBLP BibTeX RDF |
|
25 | Wenjun Zeng, Jiangtao Wen, Mike Severa |
Format-Compliant Selective Scrambling for Multimedia Access Control. |
ITCC |
2002 |
DBLP DOI BibTeX RDF |
selective scrambling, format compliant, access control, digital rights management, shuffling, selective encryption |
25 | Francine Chen 0001, Matthew Cooper 0002, John Adcock |
Video summarization preserving dynamic content. |
TVS |
2007 |
DBLP DOI BibTeX RDF |
clustering, segmentation, video summarization, presentation |
23 | Jeroen Lichtenauer, Michel François Valstar, Jie Shen 0008, Maja Pantic |
Cost-Effective Solution to Synchronized Audio-Visual Capture Using Multiple Sensors. |
AVSS |
2009 |
DBLP DOI BibTeX RDF |
|
23 | Archana Prasad, Indrani Medhi, Kentaro Toyama, Ravin Balakrishnan |
Exploring the feasibility of video mail for illiterate users. |
AVI |
2008 |
DBLP DOI BibTeX RDF |
ICT for development, illiterate users, video mail |
23 | Olivier Martin 0001, Irene Kotsia, Benoît Macq, Ioannis Pitas |
The eNTERFACE'05 Audio-Visual Emotion Database. |
ICDE Workshops |
2006 |
DBLP DOI BibTeX RDF |
|
23 | Wai Yat Wong, Peter Reimann 0001 |
Web Based Educational Video Teaching and Learning Platform with Collaborative Annotation. |
ICALT |
2009 |
DBLP DOI BibTeX RDF |
|
21 | Kyung-Ae Cha, Kyungdeok Kim |
MPEG-4 Scene Description Optimization for Interactive Terrestrial DMB Content. |
ICESS |
2007 |
DBLP DOI BibTeX RDF |
T-DMB, Scene description Optimization, MPEG-4 System, Interactive Content, BIFS |
21 | Subhransu Maji, Ruzena Bajcsy |
Fast unsupervised alignment of video and text for indexing/names and faces. |
MS |
2007 |
DBLP DOI BibTeX RDF |
multimedia alignment, names and faces, indexing |
21 | Hao Pan 0002, Zhi-Pei Liang, Thomas S. Huang |
Exploiting the Dependencies in Information Fusion. |
CVPR |
1999 |
DBLP DOI BibTeX RDF |
maximium entropy, multisensory signals, video/audio, information fusion, Bayesian |
21 | Nils Björkman, Alexander Latour-Henner, Urban Hansson |
The cell loss process in ATM networks and its impact on quality of service. |
LCN |
1995 |
DBLP DOI BibTeX RDF |
cell loss process, performance level, short-term CLR, objective analysis, video/audio codec, quality of service, performance evaluation, asynchronous transfer mode, local area networks, ATM networks, network performance, subjective assessment, Cell Loss Ratio |
20 | Zichen Yuan, Qi Shen, Bingyi Zheng, Yuting Liu, Linying Jiang, Guibing Guo |
Video and Audio are Images: A Cross-Modal Mixer for Original Data on Video-Audio Retrieval. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
19 | Wen-Chung Kao, Tai-Hua Sun, Sheng-Yuan Lin |
A robust embedded software platform for versatile camera systems. |
ISCAS (5) |
2005 |
DBLP DOI BibTeX RDF |
|
19 | Maia Garau, Mel Slater, Simon Bee, Martina Angela Sasse |
The impact of eye gaze on communication using humanoid avatars. |
CHI |
2001 |
DBLP DOI BibTeX RDF |
collaborative virtual enviroments (CVEs), nonverbal behaviours, computer-mediated communication (CMC), avatars, gaze, mediated communication |
19 | Roger Zimmermann, Elaine Chew, Sakire Arslan Ay, Moses Pawar |
Distributed musical performances: Architecture and stream management. |
ACM Trans. Multim. Comput. Commun. Appl. |
2008 |
DBLP DOI BibTeX RDF |
Distributed immersive performance, multimedia storage, multimodal data recorder, networked musical performance |
19 | Jeremiah Scholl, John D. McCarthy, Rikard Harr |
A comparison of chat and audio in media rich environments. |
CSCW |
2006 |
DBLP DOI BibTeX RDF |
collaboration, video conferencing, chat |
19 | Yu-Quan Zhang, Sabu Emmanuel |
A Novel Framework for Multiple Creatorship Protection of Digital Movies. |
DRMTICS |
2005 |
DBLP DOI BibTeX RDF |
|
19 | Giseok Choe, Jeongsoo Yu, Jeonghoon Choi, Jongho Nang |
Design and Implementation of a Real-Time Video Player on Tiled-Display System. |
CIT |
2007 |
DBLP DOI BibTeX RDF |
|
17 | Alexander Haubold, John R. Kender |
VAST MM: multimedia browser for presentation video. |
CIVR |
2007 |
DBLP DOI BibTeX RDF |
presentation video, speaker index, text augmentation, transcript analysis, automatic speech recognition, streaming video, speaker segmentation, video library, visual segmentation |
17 | Aijuan Dong, Honglin Li |
Semantic Segmentation of Documentary Video using Music Breaks. |
ICME |
2006 |
DBLP DOI BibTeX RDF |
|
17 | Kiyoharu Aizawa |
Digitizing Personal Experiences: Capture and Retrieval of Life Log. |
MMM |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Hirotatsu Sakamoto, Yoshihiro Okada, Toshihiko Shimokawa, Kazuo Ushijima |
Component Based Video Communication Tool for Collaborative Virtual Environment. |
ICOIN |
2001 |
DBLP DOI BibTeX RDF |
|
15 | Rick Kazman, Reem Al-Halimi, William Hunt, Marilyn M. Mantei |
Four Paradigms for Indexing Video Conferences. |
IEEE Multim. |
1996 |
DBLP DOI BibTeX RDF |
distributed meeting support, information retrieval, Video conferencing, automatic indexing |
15 | C. P. Ravikumar |
Multiprocessor Architectures for Embedded System-on-chip Applications. |
VLSI Design |
2004 |
DBLP DOI BibTeX RDF |
|
15 | Gernot Goebbels, Vali Lalioti |
Co-presence and co-working in distributed collaborative virtual environments. |
Afrigraph |
2001 |
DBLP DOI BibTeX RDF |
Co-work, Evaluation, Collaborative Virtual Environments, Telepresence, Co-presence |
15 | Tatsuo Sato, Joung-Hoon Lim, Ken-ichi Okada, Yutaka Matsushita |
A Multimedia Synchronization Model Described by Boolean Expressions. |
ACM Conference on Computer Science |
1993 |
DBLP DOI BibTeX RDF |
|
14 | Takayuki Kunieda, Yuki Wakita |
Package-Segment Model for Movie Retrieval System and Adaptable Applications. |
ICMCS, Vol. 2 |
1999 |
DBLP DOI BibTeX RDF |
Contet-based indexing/retrieval, Image video audio content analysis |
14 | Zekang Li, Zongjia Li, Jinchao Zhang, Yang Feng 0004, Cheng Niu, Jie Zhou 0016 |
Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
14 | Zhe Wang, Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Sibo Song, Fang Yuan, Kim Seokhwan, Nancy F. Chen, Luis Fernando D'Haro, Anh Tuan Luu, Hongyuan Zhu, Zeng Zeng, Ngai-Man Cheung, Georgios Piliouras, Jie Lin 0001, Vijay Chandrasekhar 0001 |
Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
13 | Ziad Sakr, Nicolas D. Georganas |
Robust content-based MPEG-4 XMT scene structure authentication and multimedia content location. |
ACM Trans. Multim. Comput. Commun. Appl. |
2007 |
DBLP DOI BibTeX RDF |
XML, Multimedia, VRML, watermarking, steganography, polynomial, MPEG-4, XMT, pseudorandom sequences |
13 | Chia-Hsun Jackie Lee, Chaochi Chang, Hyemin Chung, Connor Dickie, Ted Selker |
Emotionally reactive television. |
IUI |
2007 |
DBLP DOI BibTeX RDF |
visual/audio amplification, HCI, emotion, TV, social responses |
13 | Anthony Solon, Kevin Curran, Paul McKevitt |
Bandwidth determined transmoding through fuzzy logic in mobile intelligent multimedia presentation systems. |
Artif. Intell. Rev. |
2006 |
DBLP DOI BibTeX RDF |
TeleMorph, TeleTuras, Mobile multimedia transmoding, Cross-modality adaptation, Mobile intelligent multimedia presentation systems, Mobile network constraints, Bandwidth awareness, Fuzzy logic, Multimodal output |
13 | Ronald Schroeter, Jane Hunter 0001, Jonathon Guerin, Imran Khan, Michael Henderson |
A Synchronous Multimedia Annotation System for Secure Collaboratories. |
e-Science |
2006 |
DBLP DOI BibTeX RDF |
|
13 | Haohuan Fu, Xiaowen Li, Ji Shen, Weijia Jia 0001 |
Efficient Multiplexing Protocol for Low Bit Rate Multi-point Video Conferencing. |
MSN |
2005 |
DBLP DOI BibTeX RDF |
|
13 | Jens Riegelsberger, Martina Angela Sasse, John D. McCarthy |
Do people trust their eyes more than ears?: media bias in detecting cues of expertise. |
CHI Extended Abstracts |
2005 |
DBLP DOI BibTeX RDF |
interpersonal cues, trust, video, CMC, audio, avatar, photo |
13 | Weijia Jia 0001, Bo Han 0001, Ji Shen, Haohuan Fu, Man-Ching Yuen |
Efficient Implementation of 3G-324M Protocol Stack for Multimedia Communication. |
ICPADS (1) |
2005 |
DBLP DOI BibTeX RDF |
|
13 | R. Travis Rose, Francis K. H. Quek, Yang Shi |
MacVisSTA: a system for multimodal analysis. |
ICMI |
2004 |
DBLP DOI BibTeX RDF |
flexible visualization and annotation, multiple linked representation, gesture, multimodal interaction, embodied communication |
13 | Chunhai Hou, Songmin Jia, Kunikatsu Takase |
Real-Time Multimedia Applications in a Web-Based Robotic Telecare System. |
J. Intell. Robotic Syst. |
2003 |
DBLP DOI BibTeX RDF |
Java media framework, multimedia, multicast, Web, robot control, RTP, telecare |
13 | Gernot Goebbels, Vali Lalioti, Martin Göbel |
Design and evaluation of team work in distributed collaborative virtual environments. |
VRST |
2003 |
DBLP DOI BibTeX RDF |
CVE Design Model, evaluation, awareness, collaborative virtual environments, guidelines, tele-presence |
13 | Shoko Mikawa, Keiko Okawa, Jun Murai |
Establishment of a lecture environment using Internet technology via satellite communication in Asian countries. |
SAINT Workshops |
2003 |
DBLP DOI BibTeX RDF |
|
13 | Nathan Bos, Judith S. Olson, Darren Gergle, Gary M. Olson, Zach Wright |
Effects of four computer-mediated communications channels on trust development. |
CHI |
2002 |
DBLP DOI BibTeX RDF |
communication, trust, media, social dilemmas |
13 | Agustín José González, Hussein M. Abdel-Wahab, J. Christian Wild |
Lightweight Scalable Tool Sharing for the Internet. |
ISCC |
2001 |
DBLP DOI BibTeX RDF |
|
13 | Tingzhou Yang, Zhao Chen, Dimitrios Makrakis, Abdelhakim Hafid |
A Study of AF and EF Services Interaction. |
ICOIN |
2001 |
DBLP DOI BibTeX RDF |
|
13 | Igor Uros, Dusan Starcevic, Tamara Uros |
Multimedia Presentation: Generic and Implementation Model. |
WORDS |
2001 |
DBLP DOI BibTeX RDF |
|
13 | Artur Lugmayr, Seppo Kalli |
Transmission of DVB Service Information via Internet. |
INTERWORKING |
2000 |
DBLP DOI BibTeX RDF |
|
13 | Jana Dittmann, Anirban Mukherjee, Martin Steinebach |
Media-Independent Watermarking Classification and the Need for Combining Digital Video and Audio Watermarking for Media Authentication. |
ITCC |
2000 |
DBLP DOI BibTeX RDF |
|
13 | Markus Mielke, Yuqing Song 0002, Ramazan Savas Aygün, Aidong Zhang |
NetMedia: Giving Control to Distributed Multimedia Presentation Systems. |
SSDBM |
1999 |
DBLP DOI BibTeX RDF |
|
13 | Sridevi Palacharla, Ahmed Karmouch, Samy A. Mahmoud |
Design and Implementation of a Real-time Multimedia Presentation System using RTP. |
COMPSAC |
1997 |
DBLP DOI BibTeX RDF |
RTP, multimedia networking, QoS control, media synchronization, client-server computing |
13 | Sherry Marcus, V. S. Subrahmanian |
Foundations of Multimedia Database Systems. |
J. ACM |
1996 |
DBLP DOI BibTeX RDF |
data structures, query processing, query languages, multimedia databases |
13 | Robert S. Fish, Robert E. Kraut |
Networking for Collaboration: Video Telephony and Media Conferencing (Tutorial). |
CSCW |
1996 |
DBLP DOI BibTeX RDF |
|
13 | P. Venkat Rangan, Srinivas Ramanathan, Srihari Sampath Kumar |
Feedback Techniques for Continuity and Synchronisation in Multimedia Information Retrieval. |
ACM Trans. Inf. Syst. |
1995 |
DBLP DOI BibTeX RDF |
intermedia synchronization, intramedia continuity, multimedia on-demand information services, multimedia, synchronization |
13 | Qingxiong Yang, Xin Chen, Gang Wang 0012 |
Web 2.0 dictionary. |
CIVR |
2008 |
DBLP DOI BibTeX RDF |
bag-of-tags, photo/video sharing, rank, tag, retrieval |
13 | Ganesh Yadav, R. K. Singh, Vipin Chaudhary |
On Implementation of MPEG-2 Like Real-Time Parallel Media Applications on MDSP SoC Cradle Architecture. |
EUC |
2004 |
DBLP DOI BibTeX RDF |
|
13 | Bodo Yann, Nathalie Laurent, Jean-Luc Dugelay |
A scrambling method based on disturbance of motion vector. |
ACM Multimedia |
2002 |
DBLP DOI BibTeX RDF |
waterscrambling, watermarking, motion vector |
11 | John R. Smith, David S. Doermann, Amarnath Gupta, Jonathan Goldstein, Uri Shaft, Nalini K. Ratha |
Multimedia applications: beyond similarity searches. |
CVDB |
2005 |
DBLP DOI BibTeX RDF |
|
11 | Ari Wahyudi, Amos Omondi |
Parallel Multimedia Processor Using Customised Infineon TriCores. |
DSD |
2002 |
DBLP DOI BibTeX RDF |
|
11 | Charles B. Owen, James Ford, Fillia Makedon, Tilmann Steinberg, Christina Metaxaki-Kossionides |
Parallel text alignment. |
Int. J. Digit. Libr. |
2000 |
DBLP DOI BibTeX RDF |
Parallel text alignment, Improved multimedia data access, Cross-Modal Information Retrieval (CMIR), Applications, Retrieval |
11 | Charles B. Owen |
Parallel Text Alignment. |
ECDL |
1998 |
DBLP DOI BibTeX RDF |
|
11 | Yuxiao Hu 0001, Hao Tang 0001, Thomas S. Huang |
Camera and microphone array for 3D audiovisual face data collection. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
11 | Wen-Lin Yang, Cheng-Huang Tung |
Load-oriented and interference-aware channel assignment for multicasting in wireless mesh networks. |
Mobility Conference |
2008 |
DBLP DOI BibTeX RDF |
routing, wireless mesh networks, channel assignment, multicast tree, multi-radio |
11 | Radu Cornea, Alex Nicolau, Nikil D. Dutt |
Video Stream Annotations for Energy Trade-offs in Multimedia Applications. |
ISPDC |
2006 |
DBLP DOI BibTeX RDF |
|
11 | David J. Chatting, Josie S. Galpin, Judith S. Donath |
Presence and portrayal: video for casual home dialogues. |
ACM Multimedia |
2006 |
DBLP DOI BibTeX RDF |
manipulated video, portrayal, presence, video conferencing |
11 | Toufik Ahmed, Guillaume Buridant, Ahmed Mehaoua |
Encapsulation and Marking of MPEG-4 Video Over IP Differentiated Services. |
ISCC |
2001 |
DBLP DOI BibTeX RDF |
|
11 | Giancarlo Fortino, Libero Nigro, Francesco Pupo |
An MBone-Based On-Demand System for Cooperative Off-line Learning. |
EUROMICRO |
2001 |
DBLP DOI BibTeX RDF |
|
11 | Nevenka Dimitrova, Lalitha Agnihotri, Radu S. Jasinschi, John Zimmerman, George Marmaropoulos, Thomas McGee, Serhan Dagtas |
Video scouting demonstration: smart content selection and recording. |
ACM Multimedia |
2000 |
DBLP DOI BibTeX RDF |
content-based retrieval, multimodal integration, video access |
10 | Huiting Fan, Xingnan Zhang, Yingying Xu, Jiangxiong Fang, Shiqing Zhang, Xiaoming Zhao 0002, Jun Yu 0002 |
Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals. |
Inf. Fusion |
2024 |
DBLP DOI BibTeX RDF |
|
10 | Mengge He, Wenjing Du, Zhiquan Wen, Qing Du, Yutong Xie, Qi Wu 0001 |
Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning. |
IEEE Trans. Circuits Syst. Video Technol. |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Ruihan Yang, Hannes Gamper, Sebastian Braun |
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Mustafa Shukor, Corentin Dancette, Alexandre Ramé, Matthieu Cord |
Unified Model for Image, Video, Audio and Language Tasks. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Wenbo Li 0003, Yaodong Cui, Yintao Ma, Xingxin Chen, Guofa Li, Guanzhong Zeng, Gang Guo, Dongpu Cao |
A Spontaneous Driver Emotion Facial Expression (DEFE) Dataset for Intelligent Vehicles: Emotions Triggered by Video-Audio Clips in Driving Scenarios. |
IEEE Trans. Affect. Comput. |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Shengyu Zhang 0001, Xusheng Feng, Wenyan Fan, Wenjing Fang, Fuli Feng, Wei Ji 0008, Shuo Li, Li Wang 0056, Shanshan Zhao, Zhou Zhao, Tat-Seng Chua, Fei Wu |
Video-Audio Domain Generalization via Confounder Disentanglement. |
AAAI |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Doosan Baek, Jiho Kim, Hongchul Lee |
VATMAN : Video-Audio-Text Multimodal Abstractive Summarization with Trimodal Hierarchical Multi-head Attention. |
ICTC |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Elena E. Lyakso, Olga V. Frolova, Aleksandr Nikolaev, Severin Grechanyi, Anton Matveev, Yuri Matveev, Olesia Makhnytkina, Ruban Nersisson |
Emotional State of Children with ASD and Intellectual Disabilities: Perceptual Experiment and Automatic Recognition by Video, Audio and Text Modalities. |
SPECOM (1) |
2023 |
DBLP DOI BibTeX RDF |
|
10 | Chih-Hsueh Lin, Guo-Hsin Hu, Jie-Sheng Chen, Jun-Juh Yan, Kuang-Hui Tang |
Novel design of cryptosystems for video/audio streaming via dynamic synchronized chaos-based random keys. |
Multim. Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Thomas Hayes, Songyang Zhang, Xi Yin 0008, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh |
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Gang-Min Park, Hye-In Hyun, Hyuk-Yoon Kwon |
Multimodal learning model based on video-audio-chat feature fusion for detecting e-sports highlights. |
Appl. Soft Comput. |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Xiaoshuai Hao, Wanqian Zhang, Dayan Wu, Fei Zhu, Bo Li 0063 |
Listen and Look: Multi-Modal Aggregation and Co-Attention Network for Video-Audio Retrieval. |
ICME |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Elena E. Lyakso, Olga V. Frolova, Anton Matveev, Yuri Matveev, Aleksey Grigorev, Olesia Makhnytkina, Nersisson Ruban |
Recognition of the Emotional State of Children with Down Syndrome by Video, Audio and Text Modalities: Human and Automatic. |
SPECOM |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Thomas Hayes, Songyang Zhang, Xi Yin 0008, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh |
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration. |
ECCV (8) |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Pablo E. Balcero-Posada, Alfredo Jose Calderon-Montealegre, Juan Carlos Navarro-Beltran, Luis Andres Marentes, Jerzon Y. Carrillo-Pinzon, Luis Felipe Herrera-Quintero |
A practical approach based on IoT, Video, Audio, and ITS for Micro-Sleep detection in public transportation systems. |
ICVES |
2022 |
DBLP DOI BibTeX RDF |
|
10 | Shikun Zhang, Omid Jafari, Parth Nagarkar |
A Survey on Machine Learning Techniques for Auto Labeling of Video, Audio, and Text Data. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
10 | Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong |
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
10 | Shaobo Min, Qi Dai, Hongtao Xie, Chuang Gan, Yongdong Zhang 0001, Jingdong Wang 0001 |
Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
10 | Rinat Galiautdinov |
Digitally-Signed Video/Audio Streams as Prevention of AI-Based Attacks. |
Int. J. Softw. Sci. Comput. Intell. |
2021 |
DBLP DOI BibTeX RDF |
|
10 | Yanan Song, Yuanyang Cai, Lizhe Tan |
Video-Audio Emotion Recognition Based on Feature Fusion Deep Learning Method. |
MWSCAS |
2021 |
DBLP DOI BibTeX RDF |
|
10 | Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong |
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text. |
NeurIPS |
2021 |
DBLP BibTeX RDF |
|
10 | Runze Huang, Chuanzhi Yang, Xinguo Yu, Rao Peng |
An Algorithm for Correcting Video-Audio Asynchronization Based on Syncnet. |
TALE |
2021 |
DBLP DOI BibTeX RDF |
|
10 | Dario Luzuriaga, Chung-Horng Lung, Margaret Funmilayo |
Software-Based Video-Audio Production Mixer via an IP Network. |
IEEE Access |
2020 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #100 of 262 (100 per page; Change: ) Pages: [ 1][ 2][ 3][ >>] |
|