


default search action
Xiaohai Tian
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c46]Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. NeurIPS 2024 - [i16]Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li:
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing. CoRR abs/2401.12264 (2024) - [i15]Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024) - [i14]Siyin Wang, Wenyi Yu, Yudong Yang, Changli Tang, Yixuan Li, Jimin Zhuang, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Guangzhi Sun, Lu Lu, Chao Zhang:
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation. CoRR abs/2409.16644 (2024) - [i13]Wenyi Yu, Siyin Wang, Xiaoyu Yang, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Guangzhi Sun, Lu Lu, Yuxuan Wang, Chao Zhang:
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation. CoRR abs/2411.18138 (2024) - 2023
- [j8]Yi Zhou
, Zhizheng Wu
, Mingyang Zhang
, Xiaohai Tian
, Haizhou Li
:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023) - [j7]Yi Zhou
, Zhizheng Wu
, Xiaohai Tian
, Haizhou Li
:
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023) - [c45]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee
:
Leveraging Phone-Level Linguistic-Acoustic Similarity For Utterance-Level Pronunciation Scoring. ICASSP 2023: 1-5 - [c44]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee
:
An ASR-Free Fluency Scoring Approach with Self-Supervised Learning. ICASSP 2023: 1-5 - [c43]Kaiqi Fu, Shaojun Gao, Shuju Shi, Xiaohai Tian, Wei Li, Zejun Ma:
Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring. INTERSPEECH 2023: 949-953 - [c42]Shuju Shi, Kaiqi Fu, Yiwei Gu, Xiaohai Tian, Shaojun Gao, Wei Li, Zejun Ma:
Disentangling the Contribution of Non-native Speech in Automated Pronunciation Assessment. INTERSPEECH 2023: 954-958 - [i12]Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee
:
Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring. CoRR abs/2302.10444 (2023) - [i11]Kaiqi Fu, Shaojun Gao, Shuju Shi, Xiaohai Tian, Wei Li, Zejun Ma:
Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring. CoRR abs/2305.11438 (2023) - 2022
- [c41]Kaiqi Fu, Shaojun Gao, Xiaohai Tian, Wei Li, Zejun Ma:
Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring. INTERSPEECH 2022: 4337-4341 - [c40]Xiaohai Tian, Kaiqi Fu, Shaojun Gao, Yiwei Gu, Kai Wang, Wei Li, Zejun Ma:
A Transfer and Multi-Task Learning based Approach for MOS Prediction. INTERSPEECH 2022: 5438-5442 - [i10]Kaiqi Fu, Shaojun Gao, Kai Wang, Wei Li, Xiaohai Tian, Zejun Ma:
Improving Non-native Word-level Pronunciation Scoring with Phone-level Mixup Data Augmentation and Multi-source Information. CoRR abs/2203.01826 (2022) - 2021
- [j6]Hongqiang Du
, Xiaohai Tian, Lei Xie, Haizhou Li
:
Factorized WaveNet for voice conversion with limited data. Speech Commun. 130: 45-54 (2021) - [j5]Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li
:
NHSS: A speech and singing parallel database. Speech Commun. 133: 9-22 (2021) - [j4]Yi Zhou
, Xiaohai Tian
, Haizhou Li
:
Language Agnostic Speaker Embedding for Cross-Lingual Personalized Speech Generation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3427-3439 (2021) - [c39]Qicong Xie, Xiaohai Tian, Guanghou Liu, Kun Song, Lei Xie, Zhiyong Wu, Hai Li, Song Shi, Haizhou Li, Fen Hong, Hui Bu, Xin Xu:
The Multi-Speaker Multi-Style Voice Cloning Challenge 2021. ICASSP 2021: 8613-8617 - [c38]Yi Zhou, Xiaohai Tian, Zhizheng Wu, Haizhou Li:
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation. Interspeech 2021: 1374-1378 - [c37]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing Voice Conversion Network with Cycle Consistency Loss of Speaker Identity. SLT 2021: 507-513 - 2020
- [j3]Yi Zhou
, Xiaohai Tian
, Haizhou Li
:
Multi-Task WaveRNN With an Integrated Architecture for Cross-Lingual Voice Conversion. IEEE Signal Process. Lett. 27: 1310-1314 (2020) - [c36]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --. Blizzard Challenge / Voice Conversion Challenge 2020 - [c35]Yi Zhou, Xiaohai Tian, Xuehao Zhou, Mingyang Zhang, Grandee Lee, Riu Liu, Berrak Sisman, Haizhou Li:
NUS-HLT System for Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c34]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. Blizzard Challenge / Voice Conversion Challenge 2020 - [c33]Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li:
The NUS & NWPU system for Voice Conversion Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c32]Xuehao Zhou, Xiaohai Tian, Grandee Lee, Rohan Kumar Das
, Haizhou Li:
End-to-End Code-Switching TTS with Cross-Lingual Language Model. ICASSP 2020: 7614-7618 - [c31]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Effective Wavenet Adaptation for Voice Conversion with Limited Data. ICASSP 2020: 7779-7783 - [c30]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. INTERSPEECH 2020: 4213-4217 - [c29]Xiaohai Tian, Rohan Kumar Das
, Haizhou Li:
Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion. Odyssey 2020: 159-164 - [c28]Xiaoxue Gao, Xiaohai Tian, Yi Zhou, Rohan Kumar Das
, Haizhou Li:
Personalized Singing Voice Generation Using WaveRNN. Odyssey 2020: 252-258 - [e1]Junichi Yamagishi, Zhenhua Ling, Rohan Kumar Das, Simon King, Tomi Kinnunen, Tomoki Toda, Wen-Chin Huang, Xiao Zhou, Xiaohai Tian, Yi Zhao:
Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, Shanghai, China, October 30, 2020. ISCA 2020 [contents] - [i9]Rohan Kumar Das, Xiaohai Tian, Tomi Kinnunen, Haizhou Li:
The Attacker's Perspective on Automatic Speaker Verification: An Overview. CoRR abs/2004.08849 (2020) - [i8]Yi Zhao, Wen-Chin Huang, Xiaohai Tian, Junichi Yamagishi, Rohan Kumar Das, Tomi Kinnunen, Zhen-Hua Ling, Tomoki Toda:
Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion. CoRR abs/2008.12527 (2020) - [i7]Rohan Kumar Das, Tomi Kinnunen, Wen-Chin Huang, Zhen-Hua Ling, Junichi Yamagishi, Yi Zhao, Xiaohai Tian, Tomoki Toda:
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions. CoRR abs/2009.03554 (2020) - [i6]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
Optimizing voice conversion network with cycle consistency loss of speaker identity. CoRR abs/2011.08548 (2020) - [i5]Bidisha Sharma, Xiaoxue Gao, Karthika Vijayan, Xiaohai Tian, Haizhou Li:
NHSS: A Speech and Singing Parallel Database. CoRR abs/2012.00337 (2020)
2010 – 2019
- 2019
- [b1]Xiaohai Tian:
Voice conversion with parallel/non-parallel data and synthetic speech detection. Nanyang Technological University, Singapore, 2019 - [c27]Xiaoxue Gao, Xiaohai Tian, Rohan Kumar Das
, Yi Zhou, Haizhou Li:
Speaker-independent Spectral Mapping for Speech-to-Singing Conversion. APSIPA 2019: 159-164 - [c26]Yi Zhou, Xiaohai Tian, Rohan Kumar Das
, Haizhou Li:
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network. APSIPA 2019: 1282-1287 - [c25]Hongqiang Du, Xiaohai Tian, Lei Xie, Haizhou Li:
WaveNet Factorization with Singular Value Decomposition for Voice Conversion. ASRU 2019: 152-159 - [c24]Yi Zhou, Xiaohai Tian, Emre Yilmaz, Rohan Kumar Das
, Haizhou Li:
A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion. ASRU 2019: 160-167 - [c23]Yi Zhou, Xiaohai Tian, Haihua Xu, Rohan Kumar Das
, Haizhou Li
:
Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling. ICASSP 2019: 6790-6794 - [c22]Xiaohai Tian, Eng Siong Chng
, Haizhou Li
:
A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data. INTERSPEECH 2019: 201-205 - [i4]Xiaohai Tian, Eng Siong Chng, Haizhou Li:
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data. CoRR abs/1902.03705 (2019) - 2018
- [c21]Junchao Wang, Xiaohai Tian, Minghui Dong, Haihua Xu:
The TL-NTU Text-to-speech System for the Blizzard Challenge 2018. Blizzard Challenge 2018 - [c20]Xinjia Yu, Lei Meng, Xiaohai Tian, Simon Fauvel, Bo Huang, Yunqing Guan
, Zhiqi Shen, Chunyan Miao
, Cyril Leung:
Usability Analysis of the Novel Functions to Assist the Senior Customers in Online Shopping. HCI (13) 2018: 173-185 - [c19]Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:
Average Modeling Approach to Voice Conversion with Non-Parallel Data. Odyssey 2018: 227-232 - 2017
- [j2]Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng
, Haizhou Li
:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017) - [c18]Zhi Hao Lim, Xiaohai Tian, Wei Rao, Eng Siong Chng
:
An investigation of spectral feature partitioning for replay attacks detection. APSIPA 2017: 1570-1573 - [c17]Xiaohai Tian, Lei Meng, Siyuan Liu, Zhiqi Shen, Eng Siong Chng
, Cyril Leung, Frank Guan
, Chunyan Miao
:
Novel Functional Technologies for Age-Friendly E-commerce. HCI (28) 2017: 150-158 - [c16]Nana Hou, Xiaohai Tian, Eng Siong Chng
, Bin Ma, Haizhou Li
:
Improving air traffic control speech intelligibility by reducing speaking rate effectively. IALP 2017: 197-200 - [c15]Lei Meng, Nguyen Quy Hy, Xiaohai Tian, Zhiqi Shen, Eng Siong Chng
, Frank Yunqing Guan
, Chunyan Miao
, Cyril Leung:
Towards Age-friendly E-commerce Through Crowd-Improved Speech Recognition, Multimodal Search, and Personalized Speech Feedback. ICCSE 2017: 127-135 - 2016
- [j1]Nguyen Quy Hy, Siu Wa Lee, Xiaohai Tian, Minghui Dong, Eng Siong Chng
:
High quality voice conversion using prosodic and high-resolution spectral features. Multim. Tools Appl. 75(9): 5265-5285 (2016) - [c14]Xiaohai Tian, Xiong Xiao, Eng Siong Chng
, Haizhou Li
:
Spoofing speech detection using temporal convolutional neural network. APSIPA 2016: 1-6 - [c13]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng
, Haizhou Li
:
Spoofing detection from a feature representation perspective. ICASSP 2016: 2119-2123 - [c12]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng
, Haizhou Li
:
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions. INTERSPEECH 2016: 1715-1719 - [c11]Dong-Yan Huang, Lei Xie, Yvonne Siu Wa Lee, Jie Wu, Huaiping Ming, Xiaohai Tian, Shaofei Zhang, Chuang Ding, Mei Li, Nguyen Quy Hy, Minghui Dong, Haizhou Li:
An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity. SSW 2016: 44-51 - [i3]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection under noisy conditions: a preliminary investigation and an initial database. CoRR abs/1602.02950 (2016) - 2015
- [c10]Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie, Minghui Dong:
A waveform representation framework for high-quality statistical parametric speech synthesis. APSIPA 2015: 530-536 - [c9]Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, Engsiong Chng
, Haizhou Li
:
Detecting synthetic speech using long term magnitude and phase information. ChinaSIP 2015: 611-615 - [c8]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Engsiong Chng
, Minghui Dong:
Sparse representation for frequency warping based voice conversion. ICASSP 2015: 4235-4239 - [c7]Daniel Erro, Inma Hernáez, Agustín Alonso, D. García-Lorenzo, Eva Navas, Jianpei Ye, Haritz Arzelus
, Igor Jauk, Nguyen Quy Hy, Carmen Magariños, R. Pérez-Ramón, M. Sulír, Xiaohai Tian, X. Wang:
Personalized synthetic voices for speaking impaired: website and app. INTERSPEECH 2015: 1251-1254 - [c6]Xiong Xiao, Xiaohai Tian, Steven Du, Haihua Xu, Engsiong Chng, Haizhou Li:
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge. INTERSPEECH 2015: 2052-2056 - [c5]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Minghui Dong, Engsiong Chng:
System fusion for high-performance voice conversion. INTERSPEECH 2015: 2759-2763 - [i2]Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie, Minghui Dong:
A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis. CoRR abs/1510.01443 (2015) - [i1]Nguyen Quy Hy, Siu Wa Lee, Xiaohai Tian, Minghui Dong, Engsiong Chng:
High quality voice conversion using prosodic and high-resolution spectral features. CoRR abs/1512.01809 (2015) - 2014
- [c4]Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li:
A comparative study of spectral transformation techniques for singing voice synthesis. INTERSPEECH 2014: 2499-2503 - [c3]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Engsiong Chng
:
Correlation-based frequency warping for voice conversion. ISCSLP 2014: 211-215 - 2013
- [c2]Xiaohai Tian, Zhizheng Wu, Engsiong Chng
:
Local partial least square regression for spectral mapping in voice conversion. APSIPA 2013: 1-6 - 2010
- [c1]Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, Bingfeng Li, Naicai Sun, Yali Zhao, Yanning Zhang:
Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications. UIC/ATC Workshops 2010: 503-505
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-19 00:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint