GrowBag graphs for keyword (Num. hits/coverage)
The graphs summarize 39 occurrences of 24 keywords
Results
Found 910 publication records. Showing 910 according to the selection in the facets.
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
113 | Deepu Vijayasenan, Fabio Valente, Hervé Bourlard |
An Information Theoretic Approach to Speaker Diarization of Meeting Data. |
IEEE Trans. Speech Audio Process. |
2009 |
DBLP DOI BibTeX RDF |
|
106 | José M. Pardo, Xavier Anguera, Chuck Wooters |
Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information. |
IEEE Trans. Computers |
2007 |
DBLP DOI BibTeX RDF |
Speech source separation, meetings recognition, rich transcription, speaker segmentation, speaker diarization |
97 | Jing Huang 0019, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos |
The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
97 | S. E. Tranter, Douglas A. Reynolds |
An overview of automatic speaker diarization systems. |
IEEE Trans. Speech Audio Process. |
2006 |
DBLP DOI BibTeX RDF |
|
97 | Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauvain |
Speaker Diarization: From Broadcast News to Lectures. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
89 | Kentaro Ishizuka, Shoko Araki, Kazuhiro Otsuka, Tomohiro Nakatani, Masakiyo Fujimoto |
A speaker diarization method based on the probabilistic fusion of audio-visual location information. |
ICMI |
2009 |
DBLP DOI BibTeX RDF |
multi-modal systems, multi-party conversation analysis, speaker diarization |
81 | Gerald Friedland, Oriol Vinyals, Yan Huang, Christian A. Müller |
Prosodic and other Long-Term Features for Speaker Diarization. |
IEEE Trans. Speech Audio Process. |
2009 |
DBLP DOI BibTeX RDF |
|
81 | Xavier Anguera, Chuck Wooters, Javier Hernando |
Acoustic Beamforming for Speaker Diarization of Meetings. |
IEEE Trans. Speech Audio Process. |
2007 |
DBLP DOI BibTeX RDF |
|
81 | Jordi Luque, Xavier Anguera, Andrey Temko, Javier Hernando |
Speaker Diarization for Conference Room: The UPC RT07s Evaluation System. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
81 | Corinne Fredouille, Grégory Senay |
Technical Improvements of the E-HMM Based Speaker Diarization System for Meeting Records. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
74 | Gerald Friedland, Chuohao Yeo, Hayley Hung |
Visual speaker localization aided by acoustic models. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
visual localization, multimodal integration, speaker diarization |
73 | Kazuhiro Otsuka, Shoko Araki, Kentaro Ishizuka, Masakiyo Fujimoto, Martin Heinrich, Junji Yamato |
A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization. |
ICMI |
2008 |
DBLP DOI BibTeX RDF |
fisheye lens, face tracking, microphone array, focus of attention, speaker diarization, omnidirectional cameras, meeting analysis, realtime system |
66 | Himanshu Vajaria, Sudeep Sarkar, Rangachar Kasturi |
Exploring Co-Occurence Between Speech and Body Movement for Audio-Guided Video Localization. |
IEEE Trans. Circuits Syst. Video Technol. |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Deepu Vijayasenan, Fabio Valente, Hervé Bourlard |
Combination of agglomerative and sequential clustering for speaker diarization. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Kofi Boakye, B. Trueba-Hornero, Oriol Vinyals, Gerald Friedland |
Overlapped speech detection for improved speaker diarization in multiparty meetings. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Vishwa Gupta, Gilles Boulianne, Patrick Kenny, Pierre Ouellet, Pierre Dumouchel |
Speaker diarization of French broadcast news. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Kyu Jeong Han, Panayiotis G. Georgiou, Shrikanth S. Narayanan |
The SAIL speaker diarization system for analysis of spontaneous meetings. |
MMSP |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Xuan Zhu, Claude Barras, Lori Lamel, Jean-Luc Gauvain |
Multi-stage Speaker Diarization for Conference and Lecture Meetings. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
57 | Athanasios K. Noulas, Ben J. A. Kröse |
On-line multi-modal speaker diarization. |
ICMI |
2007 |
DBLP DOI BibTeX RDF |
multi-modal, speaker diarization, audio-visual, speaker detection |
49 | Jonathan G. Fiscus, Jerome Ajot, Martial Michel, John S. Garofolo |
The Rich Transcription 2006 Spring Meeting Recognition Evaluation. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
49 | Jonathan G. Fiscus, Nicolas Radde, John S. Garofolo, Audrey N. Le, Jerome Ajot, Christophe Laprun |
The Rich Transcription 2005 Spring Meeting Recognition Evaluation. |
MLMI |
2005 |
DBLP DOI BibTeX RDF |
|
48 | Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan |
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization. |
IEEE Trans. Speech Audio Process. |
2008 |
DBLP DOI BibTeX RDF |
|
48 | Hayley Hung, Yan Huang, Gerald Friedland, Daniel Gatica-Perez |
Estimating the dominant person in multi-party conversations using speaker diarization strategies. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
48 | Athanasios K. Noulas, Tim van Kasteren, Ben J. A. Kröse |
A Hybrid Generative-Discriminative Approach to Speaker Diarization. |
MLMI |
2008 |
DBLP DOI BibTeX RDF |
|
48 | David A. van Leeuwen, Matej Konecný |
Progress in the AMIDA Speaker Diarization System for Meeting Data. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
48 | Corinne Fredouille, Nicholas W. D. Evans |
The LIA RT'07 Speaker Diarization System. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
48 | Huazhong Ning, Wei Xu 0007, Yihong Gong, Thomas S. Huang |
Improving Speaker Diarization by Cross EM Refinement. |
ICME |
2006 |
DBLP DOI BibTeX RDF |
|
48 | David A. van Leeuwen, Marijn Huijbregts |
The AMI Speaker Diarization System for NIST RT06s Meeting Data. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
48 | David A. van Leeuwen |
The TNO Speaker Diarization System for NIST RT05s Meeting Data. |
MLMI |
2005 |
DBLP DOI BibTeX RDF |
|
42 | Kazuhiro Otsuka, Shoko Araki, Dan Mikami, Kentaro Ishizuka, Masakiyo Fujimoto, Junji Yamato |
Realtime meeting analysis and 3D meeting viewer based on omnidirectional multimodal sensors. |
ICMI |
2009 |
DBLP DOI BibTeX RDF |
fisheye lens, microphone array, focus of attention, speaker diarization, omnidirectional cameras, meeting analysis, realtime system |
42 | Sarah Favre, Hugues Salamin, John Dines, Alessandro Vinciarelli |
Role recognition in multiparty recordings using social affiliation networks and discrete distributions. |
ICMI |
2008 |
DBLP DOI BibTeX RDF |
broadcast data, role recognition, social network analysis, speaker diarization, meeting recordings |
33 | Panagiotis Sidiropoulos, Vasileios Mezaris, Ioannis Kompatsiaris, Hugo Meinedo, Isabel Trancoso |
Multi-modal scene segmentation using scene transition graphs. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
scene segmentation |
33 | Janez Zibert, France Mihelic |
Fusion of Acoustic and Prosodic Features for Speaker Clustering. |
TSD |
2009 |
DBLP DOI BibTeX RDF |
|
33 | Jonathan G. Fiscus, Jerome Ajot, John S. Garofolo |
The Rich Transcription 2007 Meeting Recognition Evaluation. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
31 | Chuck Wooters, Marijn Huijbregts |
The ICSI RT07s Speaker Diarization System. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
31 | Claude Barras, Xuan Zhu, Sylvain Meignier, Jean-Luc Gauvain |
Multistage speaker diarization of broadcast news. |
IEEE Trans. Speech Audio Process. |
2006 |
DBLP DOI BibTeX RDF |
|
31 | Xavier Anguera, Chuck Wooters, José M. Pardo |
Robust Speaker Diarization for Meetings: ICSI RT06S Meetings Evaluation System. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
31 | Elias Rentzeperis, Andreas Stergiou, Christos Boukis, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos |
The 2006 Athens Information Technology Speech Activity Detection and Speaker Diarization Systems. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
31 | José M. Pardo, Xavier Anguera, Chuck Wooters |
Speaker Diarization for Multi-microphone Meetings Using Only Between-Channel Differences. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
31 | Xavier Anguera, Chuck Wooters, Javier Hernando |
Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
31 | Dan Istrate, Corinne Fredouille, Sylvain Meignier, Laurent Besacier, Jean-François Bonastre |
NIST RT'05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings. |
MLMI |
2005 |
DBLP DOI BibTeX RDF |
|
30 | Naohiro Tawara, Marc Delcroix, Atsushi Ando, Atsunori Ogawa |
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
30 | Myat Aye Aye Aung, Win Pa Pa, Hay Mar Soe Naing |
M-Diarization: A Myanmar Speaker Diarization using Multi-scale dynamic weights. |
O-COCOSDA |
2023 |
DBLP DOI BibTeX RDF |
|
30 | Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang 0084, Qing Wang 0039, Lei Xie 0001 |
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Zhihao Du, Shiliang Zhang, Siqi Zheng, Zhijie Yan |
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Jiahao Wang, Guo Chen, Yin-Dong Zheng, Tong Lu |
Exploring Detection-based Method For Speaker Diarization @ Ego4D Audio-only Diarization Challenge 2022. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Tao Liu, Xu Xiang, Zhengyang Chen, Bing Han, Kai Yu 0004, Yanmin Qian |
The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022. |
ISCSLP |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Bowen Pang, Huan Zhao, Gaosheng Zhang, Xiaoyue Yang, Yang Sun, Li Zhang 0084, Qing Wang 0039, Lei Xie 0001 |
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge. |
ISCSLP |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Yusuke Fujita, Shinji Watanabe 0001, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu |
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
25 | Sadaoki Furui |
40 Years of Progress in Automatic Speaker Recognition. |
ICB |
2009 |
DBLP DOI BibTeX RDF |
text-dependent, text-independent, robust recognition, Speaker recognition, speaker verification, speaker identification, speaker diarization |
25 | Thilo Stadelmann, Bernd Freisleben |
Unfolding speaker clustering potential: a biomimetic approach. |
ACM Multimedia |
2009 |
DBLP DOI BibTeX RDF |
temporal context, GMM, speaker identification, MFCC, speaker diarization, one-class SVM, speaker clustering |
25 | Gerald Friedland, Oriol Vinyals |
Live speaker identification in conversations. |
ACM Multimedia |
2008 |
DBLP DOI BibTeX RDF |
diarization, conversations, online, speaker identification |
16 | Jáchym Kolár, Jan Svec |
The Czech Broadcast Conversation Corpus. |
TSD |
2009 |
DBLP DOI BibTeX RDF |
|
16 | Janez Zibert, Andrej Brodnik, France Mihelic |
An Adaptive BIC Approach for Robust Speaker Change Detection in Continuous Audio Streams. |
TSD |
2009 |
DBLP DOI BibTeX RDF |
|
16 | Shoko Araki, Masakiyo Fujimoto, Kentaro Ishizuka, Hiroshi Sawada, Shoji Makino |
Speaker indexing and speech enhancement in real meetings / conversations. |
ICASSP |
2008 |
DBLP DOI BibTeX RDF |
|
16 | Margarita Kotti, Constantine Kotropoulos |
Gender classification in two Emotional Speech databases. |
ICPR |
2008 |
DBLP DOI BibTeX RDF |
|
16 | Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky |
An HDP-HMM for systems with state persistence. |
ICML |
2008 |
DBLP DOI BibTeX RDF |
|
16 | Jing Huang 0019, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos |
The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
16 | Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Adam Janin, Mathew Magimai-Doss, Chuck Wooters, Jing Zheng |
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System. |
CLEAR |
2007 |
DBLP DOI BibTeX RDF |
|
16 | Luis Javier Rodríguez, Mikel Peñagarikano, Germán Bordel |
A Simple But Effective Approach to Speaker Tracking in Broadcast News. |
IbPRIA (2) |
2007 |
DBLP DOI BibTeX RDF |
|
16 | Etienne Marcheret, Gerasimos Potamianos, Karthik Visweswariah, Jing Huang 0019 |
The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars. |
MLMI |
2006 |
DBLP DOI BibTeX RDF |
|
15 | Thaer Mufeed Taha, Zaineb Ben Messaoud, Mondher Frikha |
Convolutional Neural Network Architectures for Gender, Emotional Detection from Speech and Speaker Diarization. |
Int. J. Interact. Mob. Technol. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Din-Yuen Chan, Jhing-Fa Wang, Hsu-Ting Chin |
A new speaker-diarization technology with denoising spectral-LSTM for online automatic multi-dialogue recording. |
Multim. Tools Appl. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Ke-Ming Lyu, Ren-yuan Lyu, Hsien-Tsung Chang |
Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation. |
PeerJ Comput. Sci. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Runqing Zhang, Huijun Lu, Dunbo Cai, Zhiguo Huang, Yujian Du, Ling Qian, Yijun Zhang |
SiamTDNN: Enhancing Discriminative Embeddings for Speaker Diarization. |
J. Circuits Syst. Comput. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Marc Härkönen, Samuel J. Broughton, Lahiru Samarakoon |
EEND-M2F: Masked-attention mask transformers for speaker diarization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Haobin Tang, Xulong Zhang 0001, Ning Cheng 0001, Jing Xiao 0006, Jianzong Wang |
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | PeiYing Lee, HauYun Guo, Berlin Chen |
Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | He Zhao, Hangting Chen, Jianwei Yu, Yuehai Wang |
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Gérard Chollet, Hugues Sansen, Yannis Tevissen, Jérôme Boudy, Mossaab Hariz, Christophe Lohr, Fathy Yassa |
Privacy Preserving Personal Assistant with On-Device Diarization and Spoken Dialogue System for Home and Beyond. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Elio Gruttadauria, Mathieu Fontaine 0002, Slim Essid |
Online speaker diarization of meetings guided by speech separation. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Yicheng Hsu, Ssuhan Chen, Mingsian R. Bai |
Spatial-Temporal Activity-Informed Diarization and Separation. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao |
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Nikhil Raghav, Md. Sahidullah |
Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Prachi Singh, Sriram Ganapathy |
Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization. |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget |
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? |
CoRR |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Wesley Beccaro, Miguel Arjona Ramírez, William Liaw, Heitor R. Guimarães |
Analysis of Oral Exams With Speaker Diarization and Speech Emotion Recognition: A Case Study. |
IEEE Trans. Educ. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Michele Giuseppe Di Cesare, David Perpetuini, Daniela Cardone, Arcangelo Merla |
Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson's Disease: A Study on Speaker Diarization and Classification Techniques. |
Sensors |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Zhengyang Chen, Bing Han, Shuai Wang 0016, Yanmin Qian |
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Yifan Chen, Gaofeng Cheng, Runyan Yang, Pengyuan Zhang, Yonghong Yan 0002 |
Interrelate Training and Clustering for Online Speaker Diarization. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Christoph Böddeker, Aswin Shanmugam Subramanian, Gordon Wichern, Reinhold Haeb-Umbach, Jonathan Le Roux |
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings. |
IEEE ACM Trans. Audio Speech Lang. Process. |
2024 |
DBLP DOI BibTeX RDF |
|
15 | Illia Zaiets, Vitalii Brydinskyi, Dmytro Sabodashko, Yuriy Khoma, Khrystyna Ruda |
Integrated System for Speaker Diarization and Intruder Detection using Speaker Embeddings. |
CPITS |
2024 |
DBLP BibTeX RDF |
|
15 | Or Haim Anidjar, Yannick Estève, Chen Hajaj, Amit Dvir, Itshak Lapidot |
Speech and multilingual natural language framework for speaker change detection and diarization. |
Expert Syst. Appl. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Pengbin Fu, Yuchen Ma, Huirong Yang |
Speaker diarization with variants of self-attention and joint speaker embedding extractor. |
J. Intell. Fuzzy Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yusuke Fujita, Tetsuji Ogawa, Tetsunori Kobayashi |
Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yi Wei, Haiyan Guo, Zirui Ge, Zhen Yang 0001 |
Graph attention-based deep embedded clustering for speaker diarization. |
Speech Commun. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao 0001, Hsin-Min Wang |
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization. |
IEEE Signal Process. Lett. |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian |
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Hassan Taherian, DeLiang Wang |
Multi-channel Conversational Speaker Separation via Neural Diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Alexis Plaquet, Hervé Bredin |
Powerset multi-class cross entropy loss for neural speaker diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Geoffrey T. Frost, Emily Morris, Joshua Jansen van Vüren, Thomas Niesler |
Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor |
Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang |
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa |
Neural Diarization with Non-autoregressive Intermediate Attractors. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Baihan Lin, Xinxin Zhang |
A Reinforcement Learning Framework for Online Speaker Diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Dominik Klement, Mireia Díez, Federico Landini, Lukás Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara |
Discriminative Training of VBx Diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Yogesh Virkar, Brian Thompson, Rohit Paturi, Sundararajan Srinivasan, Marcello Federico |
Speaker Diarization of Scripted Audiovisual Content. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Prachi Singh, Amrit Kaul, Sriram Ganapathy |
Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Ye-Rin Jeoung, Joon-Young Yang, Jeong-Hwan Choi, Joon-Hyuk Chang |
Improving Transformer-based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
15 | Lahiru Samarakoon, Samuel J. Broughton, Marc Härkönen, Ivan Fung |
Transformer Attractors for Robust and Efficient End-to-End Neural Diarization. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
Displaying results #1 - #100 of 910 (100 per page).