default search action
Naohiro Tawara
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c34]Naohiro Tawara, Marc Delcroix, Atsushi Ando, Atsunori Ogawa:
NTT Speaker Diarization System for Chime-7: Multi-Domain, Multi-Microphone end-to-end and Vector Clustering Diarization. ICASSP 2024: 11281-11285 - [c33]Dominik Klement, Mireia Díez, Federico Landini, Lukás Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara:
Discriminative Training of VBx Diarization. ICASSP 2024: 11871-11875 - [c32]Carlos Hernandez-Olivan, Marc Delcroix, Tsubasa Ochiai, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
Interaural Time Difference Loss for Binaural Target Sound Extraction. IWAENC 2024: 210-214 - [i15]Atsunori Ogawa, Naoyuki Kamo, Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Takatomo Kano, Naohiro Tawara, Marc Delcroix:
Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over. CoRR abs/2406.18972 (2024) - [i14]Carlos Hernandez-Olivan, Marc Delcroix, Tsubasa Ochiai, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
Interaural time difference loss for binaural target sound extraction. CoRR abs/2408.00344 (2024) - [i13]Shota Horiguchi, Atsushi Ando, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix:
Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings. CoRR abs/2408.17142 (2024) - [i12]Carlos Hernandez-Olivan, Marc Delcroix, Tsubasa Ochiai, Daisuke Niizumi, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model. CoRR abs/2409.12528 (2024) - [i11]Alexis Plaquet, Naohiro Tawara, Marc Delcroix, Shota Horiguchi, Atsushi Ando, Shoko Araki:
Mamba-based Segmentation Model for Speaker Diarization. CoRR abs/2410.06459 (2024) - [i10]Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix:
Guided Speaker Embedding. CoRR abs/2410.12182 (2024) - 2023
- [c31]Yuki Kitagishi, Hosana Kamiyama, Naohiro Tawara, Atsunori Ogawa, Noboru Miyazaki, Taichi Asami:
Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation. APSIPA ASC 2023: 2213-2220 - [c30]Yuta Ide, Naohiro Tawara, Susumu Saito, Teppei Nakano, Tetsuji Ogawa:
Voice or Content? - Exploring Impact of Speech Content on Age Estimation from Voice. EUSIPCO 2023: 221-225 - [c29]Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix:
Iterative Shallow Fusion of Backward Language Model for End-To-End Speech Recognition. ICASSP 2023: 1-5 - [c28]Yuki Kitagishi, Naohiro Tawara, Atsunori Ogawa, Ryo Masumura, Taichi Asami:
What are differences? Comparing DNN and Human by Their Performance and Characteristics in Speaker Age Estimation. INTERSPEECH 2023: 1873-1877 - [c27]Marc Delcroix, Naohiro Tawara, Mireia Díez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukás Burget, Shoko Araki:
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. INTERSPEECH 2023: 3477-3481 - [c26]Hikaru Yanagida, Yusuke Ijima, Naohiro Tawara:
Influence of Personal Traits on Impressions of One's Own Voice. INTERSPEECH 2023: 5212-5216 - [i9]Marc Delcroix, Naohiro Tawara, Mireia Díez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukás Burget, Shoko Araki:
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. CoRR abs/2305.13580 (2023) - [i8]Naohiro Tawara, Marc Delcroix, Atsushi Ando, Atsunori Ogawa:
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization. CoRR abs/2309.12656 (2023) - [i7]Dominik Klement, Mireia Díez, Federico Landini, Lukás Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara:
Discriminative Training of VBx Diarization. CoRR abs/2310.02732 (2023) - [i6]Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix:
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition. CoRR abs/2310.11010 (2023) - [i5]Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki:
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models. CoRR abs/2312.12764 (2023) - [i4]Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix:
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition. CoRR abs/2312.14609 (2023) - 2022
- [j1]Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Hiroto Ashikawa, Tetsunori Kobayashi, Tetsuji Ogawa:
Multi-Source Domain Generalization Using Domain Attributes for Recurrent Neural Network Language Models. IEICE Trans. Inf. Syst. 105-D(1): 150-160 (2022) - [c25]Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki:
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models. ICASSP 2022: 6517-6521 - 2021
- [c24]Naohiro Tawara, Atsunori Ogawa, Yuki Kitagishi, Hosana Kamiyama, Yusuke Ijima:
Robust Speech-Age Estimation Using Local Maximum Mean Discrepancy Under Mismatched Recording Conditions. ASRU 2021: 114-121 - [c23]Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix:
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition. ICASSP 2021: 6383-6387 - [c22]Naohiro Tawara, Atsunori Ogawa, Yuki Kitagishi, Hosana Kamiyama:
Age-VOX-Celeb: Multi-Modal Corpus for Facial and Speech Estimation. ICASSP 2021: 6963-6967 - [c21]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds. ICASSP 2021: 7198-7202 - [c20]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech. Interspeech 2021: 3565-3569 - [i3]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speech. CoRR abs/2105.09040 (2021) - 2020
- [c19]Yuki Kitagishi, Hosana Kamiyama, Atsushi Ando, Naohiro Tawara, Takeshi Mori, Satoshi Kobashikawa:
Speaker Age Estimation Using Age-Dependent Insensitive Loss. APSIPA 2020: 319-324 - [c18]Yosuke Higuchi, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa:
Noise-robust Attention Learning for End-to-End Speech Recognition. EUSIPCO 2020: 311-315 - [c17]Marc Delcroix, Tsubasa Ochiai, Katerina Zmolíková, Keisuke Kinoshita, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. ICASSP 2020: 691-695 - [c16]Naohiro Tawara, Hosana Kamiyama, Satoshi Kobashikawa, Atsunori Ogawa:
Improving Speaker-Attribute Estimation by Voting Based on Speaker Cluster Information. ICASSP 2020: 6594-6598 - [c15]Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Marc Delcroix, Tetsuji Ogawa:
Frame-Level Phoneme-Invariant Speaker Embedding for Text-Independent Speaker Recognition on Extremely Short Utterances. ICASSP 2020: 6799-6803 - [c14]Atsunori Ogawa, Naohiro Tawara, Marc Delcroix:
Language Model Data Augmentation Based on Text Domain Transfer. INTERSPEECH 2020: 4926-4930 - [i2]Marc Delcroix, Tsubasa Ochiai, Katerina Zmolíková, Keisuke Kinoshita, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki:
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam. CoRR abs/2001.08378 (2020) - [i1]Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara:
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds. CoRR abs/2010.13366 (2020)
2010 – 2019
- 2019
- [c13]Naohiro Tawara, Hikari Tanabe, Tetsunori Kobayashi, Masaru Fujieda, Kazuhiro Katagiri, Takashi Yazu, Tetsuji Ogawa:
Postfiltering Using an Adversarial Denoising Autoencoder with Noise-aware Training. ICASSP 2019: 3282-3286 - [c12]Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa:
Multi-Channel Speech Enhancement Using Time-Domain Convolutional Denoising Autoencoder. INTERSPEECH 2019: 86-90 - [c11]Yosuke Higuchi, Naohiro Tawara, Tetsunori Kobayashi, Tetsuji Ogawa:
Speaker Adversarial Training of DPGMM-Based Feature Extractor for Zero-Resource Languages. INTERSPEECH 2019: 266-270 - 2018
- [c10]Naohiro Tawara, Tetsunori Kobayashi, Masaru Fujieda, Kazuhiro Katagiri, Takashi Yazu, Tetsuji Ogawa:
Adversarial autoencoder for reducing nonlinear distortion. APSIPA 2018: 1669-1673 - [c9]Taira Tsuchiya, Naohiro Tawara, Tetsuji Ogawa, Tetsunori Kobayashi:
Speaker Invariant Feature Extraction for Zero-Resource Languages with Adversarial Learning. ICASSP 2018: 2381-2385 - [c8]Tsuyoshi Morioka, Naohiro Tawara, Tetsuji Ogawa, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi:
Language Model Domain Adaptation Via Recurrent Neural Networks with Domain-Shared and Domain-Specific Representations. ICASSP 2018: 6084-6088 - [c7]Yuya Kokaki, Naohiro Tawara, Tetsunori Kobayashi, Kazuo Hashimoto, Tetsuji Ogawa:
Sequential Fish Catch Forecasting Using Bayesian State Space Models. ICPR 2018: 776-781 - 2017
- [c6]Hiroto Ashikawa, Naohiro Tawara, Atsunori Ogawa, Tomoharu Iwata, Tetsunori Kobayashi, Tetsuji Ogawa:
Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations. APSIPA 2017: 1263-1267 - 2015
- [c5]Naohiro Tawara, Tetsuji Ogawa, Tetsunori Kobayashi:
A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions. ICASSP 2015: 2041-2045 - 2013
- [c4]Naohiro Tawara, Tetsuji Ogawa, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi:
Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data. MLSP 2013: 1-6 - 2012
- [c3]Naohiro Tawara, Tetsuji Ogawa, Shinji Watanabe, Tetsunori Kobayashi:
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering. ICASSP 2012: 5253-5256 - [c2]Naohiro Tawara, Tetsuji Ogawa, Shinji Watanabe, Atsushi Nakamura, Tetsunori Kobayashi:
Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model. INTERSPEECH 2012: 2166-2169 - 2011
- [c1]Naohiro Tawara, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi:
Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model. INTERSPEECH 2011: 2905-2908
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-01 00:11 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint