default search action
Odyssey 2018: Les Sables d'Olonne, France
- Anthony Larcher, Jean-François Bonastre:
Odyssey 2018: The Speaker and Language Recognition Workshop, 26-29 June 2018, Les Sables d'Olonne, France. ISCA 2018
Keynote: Els Kindt
- Els Kindt:
Speaker identification and Data protection.
Speaker Recognition I
- Moez Ajili, Solange Rossato, Dan Zhang, Jean-François Bonastre:
Impact of rhythm on forensic voice comparison reliability. 1-8 - Georgina Brown:
Segmental Content Effects on Text-dependent Automatic Accent Recognition. 9-15 - Andreas Nautsch, Sergey Isadskiy, Jascha Kolberg, Marta Gomez-Barrero, Christoph Busch:
Homomorphic Encryption for Speaker Recognition: Protection of Biometric Templates and Vendor Model Parameters. 16-23 - Martin Karu, Tanel Alumäe:
Weakly Supervised Training of Speaker Identification Models. 24-30
Language Recognition
- Bharat Padi, Shreyas Ramoji, Vaishnavi Yeruva, Satish Kumar, Sriram Ganapathy:
The LEAP Language Recognition System for LRE 2017 Challenge - Improvements and Error Analysis. 31-38 - Alicia Lozano-Diez, Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Joaquin Gonzalez-Rodriguez:
Analysis of DNN-based Embeddings for Language Recognition on the NIST LRE 2017. 39-46 - Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Sandro Cumani, Alicia Lozano-Diez, Josef Slavícek, Mireia Díez, Frantisek Grézl, Ondrej Glembek, Mounika Kamsali, Anna Silnova, Lukás Burget, Lucas Ondel, Santosh Kesiraju, Johan Rohdin:
Analysis of BUT-PT Submission for NIST LRE 2017. 47-53 - Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. 54-59 - Trung Ngo Trong, Ville Hautamäki, Kristiina Jokinen:
Staircase Network: structural language identification via hierarchical attentive units. 60-67 - Alan McCree, David Snyder, Gregory Sell, Daniel Garcia-Romero:
Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17. 68-73 - Weicheng Cai, Jinkun Chen, Ming Li:
Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. 74-81 - Seyed Omid Sadjadi, Timothée Kheyrkhah, Audrey Tong, Craig S. Greenberg, Douglas A. Reynolds, Elliot Singer, Lisa P. Mason, Jaime Hernandez-Cordero:
The 2017 NIST Language Recognition Evaluation. 82-89 - Mitchell McLaren, Mahesh Kumar Nandwana, Diego Castán, Luciana Ferrer:
Approaches to Multi-domain Language Recognition. 90-97 - Suwon Shon, Ahmed Ali, James R. Glass:
Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition. 98-104 - David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
Spoken Language Recognition using X-vectors. 105-111 - Jesús Antonio Villalba López, Niko Brummer, Najim Dehak:
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions. 112-119
Speaker diarization
- Ruth Aloni-Lavi, Irit Opher, Itshak Lapidot:
Incremental On-Line Clustering of Speakers' Short Segments. 120-127 - Liang He, Xianhong Chen, Can Xu, Jia Liu:
Latent Class Model for Single Channel Speaker Diarization. 128-133 - Xianhong Chen, Liang He, Can Xu, Yi Liu, Tianyu Liang, Jia Liu:
VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation. 134-139 - Jose Patino, Ruiqing Yin, Héctor Delgado, Hervé Bredin, Alain Komaty, Guillaume Wisniewski, Claude Barras, Nicholas W. D. Evans, Sébastien Marcel:
Low-latency speaker spotting with online diarization and detection. 140-146 - Mireia Díez, Lukás Burget, Pavel Matejka:
Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. 147-154
Noise Robustness
- Md. Hafizur Rahman, Ivan Himawan, David Dean, Clinton Fookes, Sridha Sridharan:
Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification. 155-161 - Wei-Wei Lin, Man-Wai Mak, Longxin Li, Jen-Tzung Chien:
Reducing Domain Mismatch by Maximum Mean Discrepancy Based Autoencoders. 162-167 - Ondrej Novotný, Oldrich Plchot, Pavel Matejka, Ladislav Mosner, Ondrej Glembek:
On the use of X-vectors for Robust Speaker Recognition. 168-175 - Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:
Speaker Verification in Mismatched Conditions with Frustratingly Easy Domain Adaptation. 176-180 - Chunlei Zhang, Shivesh Ranjan, John H. L. Hansen:
An Analysis of Transfer Learning for Domain Mismatched Text-independent Speaker Verification. 181-186
Keynote: Simon King
- Simoin King:
Speaking naturally? It depends who is listening.
Voice conversion
- Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. 187-194 - Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. 195-202 - Kazuhiro Kobayashi, Tomoki Toda:
sprocket: Open-Source Voice Conversion Software. 203-210
Voice conversion and spoofing
- Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018. 211-218 - Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
NU Voice Conversion System for the Voice Conversion Challenge 2018. 219-226 - Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:
Average Modeling Approach to Voice Conversion with Non-Parallel Data. 227-232 - Shihono Mochizuki, Sayaka Shiota, Hitoshi Kiya:
Voice liveness detection using phoneme-based pop-noise detector for speaker verification. 233-239 - Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. 240-247 - Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
The HCCL-CUHK System for the Voice Conversion Challenge 2018. 248-254 - Fahimeh Bahmaninezhad, Chunlei Zhang, John H. L. Hansen:
Convolutional Neural Network Based Speaker De-Identification. 255-260 - Kentaro Sone, Shinji Takaki, Toru Nakashika:
Bidirectional Voice Conversion Based on Joint Training Using Gaussian-Gaussian Deep Relational Model. 261-266 - Berrak Sisman, Grandee Lee, Haizhou Li:
Phonetically Aware Exemplar-Based Prosody Transformation. 267-274 - Akihiro Kato, Tomi Kinnunen:
A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech. 275-282 - Anna Silnova, Pavel Matejka, Ondrej Glembek, Oldrich Plchot, Ondrej Novotný, Frantisek Grézl, Petr Schwarz, Lukás Burget, Jan Cernocký:
BUT/Phonexia Bottleneck Feature Extractor. 283-287
Spoofing
- Giacomo Valenti, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Laurent Pilati:
An end-to-end spoofing countermeasure for automatic speaker verification using evolving recurrent neural networks. 288-295 - Héctor Delgado, Massimiliano Todisco, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Kong-Aik Lee, Junichi Yamagishi:
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. 296-303 - Joaquin Gonzalez-Rodriguez, Álvaro Escudero, Diego de Benito-Gorrón, Beltran Labrador, Javier Franco-Pedroso:
An Audio Fingerprinting Approach to Replay Attack Detection on ASVSPOOF 2017 Challenge Data. 304-311 - Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. 312-319 - Rosa González Hautamäki, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:
Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification. 320-326
Keynote: Pascal Belin
- Pascal Belin:
A Vocal Brain: Cerebral Processing of Voice Information.
Speaker recognition II
- Mitchell McLaren, Diego Castán, Mahesh Kumar Nandwana, Luciana Ferrer, Emre Yilmaz:
How to train your speaker embeddings extractor. 327-334 - Giacomo Valenti, Adrien Daniel, Nicholas W. D. Evans:
End-to-end automatic speaker verification with evolving recurrent neural networks. 335-341 - Jen-Tzung Chien, Kang-Ting Peng:
Adversarial Learning and Augmentation for Speaker Recognition. 342-348 - Niko Brummer, Anna Silnova, Lukás Burget, Themos Stafylakis:
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. 349-356 - Ville Vestman, Tomi Kinnunen:
Supervector Compression Strategies to Speed up I-Vector System Development. 357-364
Text-dependent speaker recognition
- Ziqiang Shi, Mengjiao Wang, Liu Liu, Huibin Lin, Rujie Liu:
A Double Joint Bayesian Approach for J-Vector Based Text-dependent Speaker Verification. 365-371 - Hossein Zeinali, Lukás Burget, Hossein Sameti, Honza Cernocký:
Spoken Pass-Phrase Verification in the i-vector Space. 372-377 - Sergey Novoselov, Andrey Shulipa, Ivan Kremnev, Alexandr Kozlov, Vadim Shchemelinin:
On deep speaker embeddings for text-independent speaker recognition. 378-385 - Hossein Zeinali, Hossein Sameti, Themos Stafylakis:
DeepMine Speech Processing Database: Text-Dependent and Independent Speaker Verification and Speech Recognition in Persian and English. 386-392 - Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:
Boosting the Performance of Spoofing Detection Systems on Replay Attacks Using q-Logarithm Domain Feature Normalization. 393-398
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.