


default search action
Sanjeev Khudanpur
- > Home > Persons > Sanjeev Khudanpur
Publications
- 2026
- [j28]Alexander Polok
, Dominik Klement, Martin Kocour, Jiangyu Han, Federico Landini, Bolaji Yusuf, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
DiCoW: Diarization-conditioned Whisper for target speaker automatic speech recognition. Comput. Speech Lang. 95: 101841 (2026) - 2025
- [c244]Alexander Polok, Dominik Klement, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
Target Speaker ASR with Whisper. ICASSP 2025: 1-5 - [c243]Henry Li Xinyuan, Ashi Garg, Zexin Cai, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
HLTCOE Submission to the VoicePrivacy Attacker Challenge. ICASSP 2025: 1-2 - [i65]Alexander Polok, Dominik Klement, Martin Kocour, Jiangyu Han, Federico Landini, Bolaji Yusuf, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition. CoRR abs/2501.00114 (2025) - [i64]Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews:
GenVC: Self-Supervised Zero-Shot Voice Conversion. CoRR abs/2502.04519 (2025) - [i63]Ashi Garg, Zexin Cai, Henry Li Xinyuan, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Matthew Wiesner, Nicholas Andrews:
Less is More for Synthetic Speech Detection in the Wild. CoRR abs/2502.05674 (2025) - [i61]Amir Hussein, Cihan Xiao, Matthew Wiesner, Dan Povey, Leibny Paola García, Sanjeev Khudanpur:
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation. CoRR abs/2506.02157 (2025) - 2024
- [c239]Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun
, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. ICASSP 2024: 11831-11835 - [c237]Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur:
Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora. ICASSP 2024: 12006-12010 - [c235]Amir Hussein, Desh Raj, Matthew Wiesner, Daniel Povey, Paola García, Sanjeev Khudanpur:
Enhancing Neural Transducer for Multilingual ASR with Synchronized Language Diarization. INTERSPEECH 2024 - [c234]Matthew Maciejewski, Dominik Klement, Ruizhe Huang, Matthew Wiesner, Sanjeev Khudanpur:
Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. INTERSPEECH 2024 - [c231]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. Odyssey 2024: 91-98 - [c229]Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
Privacy Versus Emotion Preservation Trade-Offs in Emotion-Preserving Speaker Anonymization. SLT 2024: 409-414 - [i60]Desh Raj, Matthew Wiesner, Matthew Maciejewski, Leibny Paola García-Perera, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. CoRR abs/2401.15676 (2024) - [i58]Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun
, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. CoRR abs/2406.02560 (2024) - [i55]Zexin Cai, Henry Li Xinyuan, Ashi Garg, Leibny Paola García-Perera, Kevin Duh, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization. CoRR abs/2409.03655 (2024) - [i54]Henry Li Xinyuan, Zexin Cai, Ashi Garg, Kevin Duh, Leibny Paola García-Perera, Sanjeev Khudanpur, Nicholas Andrews, Matthew Wiesner:
HLTCOE JHU Submission to the Voice Privacy Challenge 2024. CoRR abs/2409.08913 (2024) - [i52]Alexander Polok, Dominik Klement, Matthew Wiesner, Sanjeev Khudanpur, Jan Cernocký, Lukás Burget:
Target Speaker ASR with Whisper. CoRR abs/2409.09543 (2024) - 2023
- [c222]Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, Sanjeev Khudanpur:
Building Keyword Search System from End-To-End Asr Systems. ICASSP 2023: 1-5 - [c219]Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. INTERSPEECH 2023: 924-928 - [c217]Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. INTERSPEECH 2023: 4074-4078 - [c214]Amir Hussein, Cihan Xiao, Neha Verma, Thomas Thebaud, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Dialect Speech Translation System Description. IWSLT@ACL 2023: 283-290 - [c213]Henry Li Xinyuan, Neha Verma, Bismarck Bamfo Odoom, Ujvala Pradeep, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2023 Multilingual Speech Translation System Description. IWSLT@ACL 2023: 302-310 - [i50]Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola García, Daniel Povey, Sanjeev Khudanpur:
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts. CoRR abs/2306.01031 (2023) - [i48]Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur:
HK-LegiCoST: Leveraging Non-Verbatim Transcripts for Speech Translation. CoRR abs/2306.11252 (2023) - [i47]Samuele Cornell, Matthew Wiesner, Shinji Watanabe, Desh Raj, Xuankai Chang, Paola García, Yoshiki Masuyama, Zhong-Qiu Wang, Stefano Squartini, Sanjeev Khudanpur:
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios. CoRR abs/2306.13734 (2023) - [i46]Amir Hussein, Dorsa Zeinali, Ondrej Klejch, Matthew Wiesner, Brian Yan, Shammur Absar Chowdhury, Ahmed M. Ali, Shinji Watanabe
, Sanjeev Khudanpur:
Speech collage: code-switched audio generation by collaging monolingual corpora. CoRR abs/2309.15674 (2023) - 2022
- [c211]Matthew Wiesner, Desh Raj
, Sanjeev Khudanpur:
Injecting Text and Cross-Lingual Supervision in Few-Shot Learning from Self-Supervised Models. ICASSP 2022: 8597-8601 - [c207]Jinyi Yang, Amir Hussein, Matthew Wiesner, Sanjeev Khudanpur:
JHU IWSLT 2022 Dialect Speech Translation System Description. IWSLT@ACL 2022: 319-326 - 2021
- [c198]Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj
, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur:
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech 2021: 2906-2910 - [i27]Matthew Wiesner, Desh Raj, Sanjeev Khudanpur:
Injecting Text and Cross-lingual Supervision in Few-shot Learning from Self-Supervised Models. CoRR abs/2110.04863 (2021) - 2019
- [c179]Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur:
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer. ASRU 2019: 1048-1054 - [c166]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe
, Chunxi Liu, Najim Dehak
, Sanjeev Khudanpur:
Pretraining by Backtranslation for End-to-End ASR in Low-Resource Settings. INTERSPEECH 2019: 4375-4379 - 2018
- [c150]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak
, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056 - [c142]Chunxi Liu, Matthew Wiesner, Shinji Watanabe
, Craig Harman, Jan Trmal, Najim Dehak
, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663 - [i12]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018) - [i10]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018) - [i8]Matthew Wiesner, Adithya Renduchintala, Shinji Watanabe, Chunxi Liu, Najim Dehak, Sanjeev Khudanpur:
Low Resource Multi-modal Data Augmentation for End-to-end ASR. CoRR abs/1812.03919 (2018) - 2017
- [c131]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech Without ASR. INTERSPEECH 2017: 2501-2505 - [c129]Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601 - [i6]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech without ASR. CoRR abs/1703.07476 (2017) - 2015
- [c116]Hynek Hermansky, Lukás Burget
, Jordan Cohen, Emmanuel Dupoux, Naomi Feldman
, John Godfrey, Sanjeev Khudanpur, Matthew Maciejewski, Sri Harish Reddy Mallidi, Anjali Menon, Tetsuji Ogawa, Vijayaditya Peddinti, Richard C. Rose, Richard M. Stern, Matthew Wiesner, Karel Veselý:
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop. ICASSP 2015: 5009-5013

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-08-06 23:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint