default search action
Ning Cheng 0001
Person information
- affiliation: Ping An Technology (Shenzhen) Co., Ltd., China
- affiliation (former): Chinese Academy of Sciences, Institute of Automation, Beijing, China
- affiliation (former): Chinese Academy of Sciences, Shenzhen Institute of Advanced Technology, China
- affiliation (PhD 2009): University of the Chinese Academy of Sciences (UCAS), Beijing, China
Other persons with the same name
- Ning Cheng — disambiguation page
- Ning Cheng 0002 — Futurewei Technologies, Bridgewater, NJ, USA (and 3 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c101]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. ACL (1) 2024: 14255-14273 - [c100]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis. APWeb/WAIM (1) 2024: 90-104 - [c99]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CSCWD 2024: 1110-1115 - [c98]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. ICASSP 2024: 7150-7154 - [c97]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. ICASSP 2024: 8276-8280 - [c96]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. ICASSP 2024: 12146-12150 - [c95]Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang:
Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning. ICASSP 2024: 12546-12550 - [c94]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-paired Cross-Modal Retrieval. ICIC (LNAI 5) 2024: 374-385 - [c93]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. ICIC (LNAI 3) 2024: 391-401 - [c92]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. IJCNN 2024: 1-7 - [c91]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. IJCNN 2024: 1-7 - [c90]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. IJCNN 2024: 1-7 - [c89]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. IJCNN 2024: 1-7 - [c88]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
ConTuner: Singing Voice Beautifying with Pitch and Expressiveness Condition. IJCNN 2024: 1-6 - [c87]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. IJCNN 2024: 1-7 - [c86]Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao:
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning. NAACL-HLT 2024: 7602-7635 - [i78]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. CoRR abs/2401.08049 (2024) - [i77]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. CoRR abs/2401.08096 (2024) - [i76]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. CoRR abs/2401.08166 (2024) - [i75]Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang:
Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning. CoRR abs/2401.09783 (2024) - [i74]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. CoRR abs/2402.00530 (2024) - [i73]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CoRR abs/2403.05000 (2024) - [i72]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition. CoRR abs/2404.19187 (2024) - [i71]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. CoRR abs/2404.19212 (2024) - [i70]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. CoRR abs/2404.19214 (2024) - [i69]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. CoRR abs/2404.19316 (2024) - [i68]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. CoRR abs/2405.00603 (2024) - [i67]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. CoRR abs/2405.00930 (2024) - [i66]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis. CoRR abs/2405.17028 (2024) - [i65]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval. CoRR abs/2405.17777 (2024) - [i64]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. CoRR abs/2405.17900 (2024) - [i63]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PFID: Privacy First Inference Delegation Framework for LLMs. CoRR abs/2406.12238 (2024) - 2023
- [c85]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Zhitao Li, Jing Xiao:
On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models. AAAI 2023: 13923-13931 - [c84]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. ADMA (4) 2023: 154-167 - [c83]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic and Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. ADMA (4) 2023: 168-181 - [c82]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology Based on Stochastic Teacher Network. ADMA (5) 2023: 250-261 - [c81]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. ISPA/BDCloud/SocialCom/SustainCom 2023: 752-757 - [c80]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. ISPA/BDCloud/SocialCom/SustainCom 2023: 923-928 - [c79]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. ISPA/BDCloud/SocialCom/SustainCom 2023: 1143-1148 - [c78]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter. EMNLP 2023: 5364-5375 - [c77]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective. ICASSP 2023: 1-5 - [c76]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Learning Speech Representations with Flexible Hidden Feature Dimensions. ICASSP 2023: 1-5 - [c75]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization. ICASSP 2023: 1-5 - [c74]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. ICASSP 2023: 1-5 - [c73]Tong Ye, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval. ICASSP 2023: 1-5 - [c72]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy. ICASSP 2023: 1-5 - [c71]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations. ICASSP 2023: 1-5 - [c70]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. ICTAI 2023: 641-645 - [c69]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. ICTAI 2023: 905-912 - [c68]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. ICTAI 2023: 913-917 - [c67]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. IJCNN 2023: 1-7 - [c66]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. INTERSPEECH 2023: 12-16 - [c65]Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao:
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism. INTERSPEECH 2023: 2173-2177 - [c64]Yong Zhang, Zhitao Li, Jianzong Wang, Yiming Gao, Ning Cheng, Fengying Yu, Jing Xiao:
Prompt Guided Copy Mechanism for Conversational Question Answering. INTERSPEECH 2023: 3422-3426 - [c63]Yifu Sun, Xulong Zhang, Jianzong Wang, Ning Cheng, Kaiyu Hu, Jing Xiao:
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning. INTERSPEECH 2023: 5456-5460 - [c62]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. ACM Multimedia 2023: 184-192 - [i62]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal properties of music and genre correlations Perspective. CoRR abs/2303.07667 (2023) - [i61]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. CoRR abs/2303.07682 (2023) - [i60]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy. CoRR abs/2303.07687 (2023) - [i59]Tong Ye, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval. CoRR abs/2303.08599 (2023) - [i58]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Zhitao Li, Jing Xiao:
On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models. CoRR abs/2303.08606 (2023) - [i57]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations. CoRR abs/2303.11421 (2023) - [i56]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. CoRR abs/2304.11547 (2023) - [i55]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. CoRR abs/2306.00648 (2023) - [i54]Yong Zhang, Zhitao Li, Jianzong Wang, Yiming Gao, Ning Cheng, Fengying Yu, Jing Xiao:
Prompt Guided Copy Mechanism for Conversational Question Answering. CoRR abs/2308.03422 (2023) - [i53]Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao:
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism. CoRR abs/2308.03423 (2023) - [i52]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. CoRR abs/2308.11084 (2023) - [i51]Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao:
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning. CoRR abs/2308.12032 (2023) - [i50]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. CoRR abs/2308.14317 (2023) - [i49]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. CoRR abs/2308.14319 (2023) - [i48]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology base on Stochastic Teacher Network. CoRR abs/2308.14322 (2023) - [i47]Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks. CoRR abs/2309.07509 (2023) - [i46]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. CoRR abs/2309.08837 (2023) - [i45]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. CoRR abs/2309.08838 (2023) - [i44]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. CoRR abs/2309.08839 (2023) - [i43]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter. CoRR abs/2310.18347 (2023) - [i42]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. CoRR abs/2311.07965 (2023) - [i41]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. CoRR abs/2311.08670 (2023) - [i40]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. CoRR abs/2311.08673 (2023) - 2022
- [c61]Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Shallow Diffusion Motion Model for Talking Face Generation from Speech. APWeb/WAIM (2) 2022: 144-157 - [c60]Chuanyao Zhang, Jianzong Wang, Zhangcheng Huang, Lingwei Kong, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Supervised Contrastive Meta-learning for Few-Shot Classification. HPCC/DSS/SmartCity/DependSys 2022: 1736-1742 - [c59]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. ICASSP 2022: 3184-3188 - [c58]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. ICASSP 2022: 4293-4297 - [c57]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning. ICASSP 2022: 4613-4617 - [c56]Tong Ye, Shijing Si, Jianzong Wang, Rui Wang, Ning Cheng, Jing Xiao:
VU-BERT: A Unified Framework for Visual Dialog. ICASSP 2022: 6687-6691 - [c55]Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Self-Attention for Incomplete Utterance Rewriting. ICASSP 2022: 8047-8051 - [c54]Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting StarGANs for Voice Conversion with Contrastive Discriminator. ICONIP (2) 2022: 355-366 - [c53]Denghao Li, Yuqiao Zeng, Jianzong Wang, Lingwei Kong, Zhangcheng Huang, Ning Cheng, Xiaoyang Qu, Jing Xiao:
Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation. ICTAI 2022: 228-232 - [c52]Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. ICTAI 2022: 1002-1006 - [c51]Jian Luo, Jianzong Wang, Ning Cheng, Haobin Tang, Jing Xiao:
Speech Augmentation Based Unsupervised Learning for Keyword Spotting. IJCNN 2022: 1-7 - [c50]Jian Luo, Jianzong Wang, Ning Cheng, Zhenpeng Zheng, Jing Xiao:
Adaptive Activation Network for Low Resource Multilingual Speech Recognition. IJCNN 2022: 1-7 - [c49]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. IJCNN 2022: 1-7 - [c48]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MDCNN-SID: Multi-scale Dilated Convolution Network for Singer Identification. IJCNN 2022: 1-7 - [c47]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. IJCNN 2022: 1-7 - [c46]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. IJCNN 2022: 1-7 - [c45]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. IJCNN 2022: 1-7 - [c44]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Jing Xiao:
Uncertainty Calibration for Deep Audio Classifiers. INTERSPEECH 2022: 1556-1560 - [c43]Sicheng Yang, Methawee Tantrawenith, Haolin Zhuang, Zhiyong Wu, Aolan Sun, Jianzong Wang, Ning Cheng, Huaizhen Tang, Xintao Zhao, Jie Wang, Helen Meng:
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion. INTERSPEECH 2022: 2553-2557 - [c42]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation. INTERSPEECH 2022: 5313-5317 - [c41]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. MSN 2022: 456-460 - [c40]Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. MSN 2022: 485-489 - [c39]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. MSN 2022: 841-846 - [c38]Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. MSN 2022: 915-920 - [c37]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. MSN 2022: 966-971 - [c36]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. MSN 2022: 1031-1036 - [i39]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning. CoRR abs/2202.10020 (2022) - [i38]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech. CoRR abs/2202.10712 (2022) - [i37]Tong Ye, Shijing Si, Jianzong Wang, Rui Wang, Ning Cheng, Jing Xiao:
VU-BERT: A Unified framework for Visual Dialog. CoRR abs/2202.10787 (2022) - [i36]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. CoRR abs/2202.10976 (2022) - [i35]Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Self-Attention for Incomplete Utterance Rewriting. CoRR abs/2202.12160 (2022) - [i34]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. CoRR abs/2205.11817 (2022) - [i33]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. CoRR abs/2205.11821 (2022) - [i32]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. CoRR abs/2205.11824 (2022) - [i31]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. CoRR abs/2205.11841 (2022) - [i30]Jian Luo, Jianzong Wang, Ning Cheng, Zhenpeng Zheng, Jing Xiao:
Adaptive Activation Network For Low Resource Multilingual Speech Recognition. CoRR abs/2205.14326 (2022) - [i29]Jian Luo, Jianzong Wang, Ning Cheng, Haobin Tang, Jing Xiao:
Speech Augmentation Based Unsupervised Learning for Keyword Spotting. CoRR abs/2205.14329 (2022) - [i28]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Jing Xiao:
Uncertainty Calibration for Deep Audio Classifiers. CoRR abs/2206.13071 (2022) - [i27]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation. CoRR abs/2206.13689 (2022) - [i26]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. CoRR abs/2208.04035 (2022) - [i25]Sicheng Yang, Methawee Tantrawenith, Haolin Zhuang, Zhiyong Wu, Aolan Sun, Jianzong Wang, Ning Cheng, Huaizhen Tang, Xintao Zhao, Jie Wang, Helen Meng:
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion. CoRR abs/2208.08757 (2022) - [i24]Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting Star-GANs for Voice Conversion with Contrastive Discriminator. CoRR abs/2209.10088 (2022) - [i23]Denghao Li, Yuqiao Zeng, Jianzong Wang, Lingwei Kong, Zhangcheng Huang, Ning Cheng, Xiaoyang Qu, Jing Xiao:
Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation. CoRR abs/2209.15164 (2022) - [i22]Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. CoRR abs/2210.06877 (2022) - [i21]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. CoRR abs/2210.13803 (2022) - [i20]Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. CoRR abs/2210.13805 (2022) - [i19]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. CoRR abs/2210.13811 (2022) - [i18]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. CoRR abs/2210.14723 (2022) - [i17]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. CoRR abs/2210.14724 (2022) - [i16]Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. CoRR abs/2210.14725 (2022) - 2021
- [c35]Fengying Yu, Jianzong Wang, Dewei Tao, Ning Cheng, Jing Xiao:
Self-supervised Learning for Semantic Sentence Matching with Dense Transformer Inference Network. APWeb/WAIM (1) 2021: 258-272 - [c34]Chao Sun, Jianzong Wang, Fengying Yu, Ning Cheng, Jing Xiao:
A Novel Capsule Aggregation Framework for Natural Language Inference. APWeb/WAIM (1) 2021: 300-315 - [c33]Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Cyclegean: Cycle Generative Enhanced Adversarial Network for Voice Conversion. ASRU 2021: 930-937 - [c32]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, Jing Xiao:
TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training. ASRU 2021: 938-945 - [c31]Aolan Sun, Jianzong Wang, Ning Cheng, Methawee Tantrawenith, Zhiyong Wu, Helen Meng, Edward Xiao, Jing Xiao:
Reconstructing Dual Learning for Neural Voice Conversion Using Relatively Few Samples. ASRU 2021: 946-953 - [c30]Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition. ICASSP 2021: 910-914 - [c29]Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation. ICASSP 2021: 6054-6058 - [c28]Yanfei Hui, Jianzong Wang, Ning Cheng, Fengying Yu, Tianbo Wu, Jing Xiao:
Joint Intent Detection and Slot Filling Based on Continual Learning Model. ICASSP 2021: 7643-7647 - [c27]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao, Georg Kucsko, Patrick K. O'Neill, Jagadeesh Balam, Slyne Deng, Adriana Flores, Boris Ginsburg, Jocelyn Huang, Oleksii Kuchaiev, Vitaly Lavrukhin, Jason Li:
Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition. ICME 2021: 1-6 - [c26]Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Loss Prediction: End-to-End Active Learning Approach For Speech Recognition. IJCNN 2021: 1-7 - [c25]Cheng Yi, Jianzong Wang, Ning Cheng, Shiyu Zhou, Bo Xu:
Transfer Ability of Monolingual Wav2vec2.0 for Low-resource Speech Recognition. IJCNN 2021: 1-6 - [c24]Cheng Yi, Jianzong Wang, Ning Cheng, Shiyu Zhou, Bo Xu:
A Language Model Based Pseudo-Sample Deliberation for Semi-supervised Speech Recognition. IJCNN 2021: 1-6 - [c23]Fengying Yu, Dewei Tao, Jianzong Wang, Yanfei Hui, Ning Cheng, Jing Xiao:
Semantic Extraction for Sentence Representation via Reinforcement Learning. IJCNN 2021: 1-8 - [c22]Nan Zhang, Jianzong Wang, Wenqi Wei, Xiaoyang Qu, Ning Cheng, Jing Xiao:
CACnet: Cube Attentional CNN for Automatic Speech Recognition. IJCNN 2021: 1-7 - [c21]Shijing Si, Jianzong Wang, Huiming Sun, Jianhan Wu, Chuanyao Zhang, Xiaoyang Qu, Ning Cheng, Lei Chen, Jing Xiao:
Variational Information Bottleneck for Effective Low-Resource Audio Classification. Interspeech 2021: 591-595 - [c20]Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation. Interspeech 2021: 1169-1173 - [c19]Shijing Si, Jianzong Wang, Xiaoyang Qu, Ning Cheng, Wenqi Wei, Xinghua Zhu, Jing Xiao:
Speech2Video: Cross-Modal Distillation for Speech to Video Generation. Interspeech 2021: 1629-1633 - [c18]Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Semantic Embedding Graph Convolutional Networks for Multi-label Video Segment Classification. PAAP 2021: 146-151 - [c17]Jian Luo, Jianzong Wang, Ning Cheng, Guilin Jiang, Jing Xiao:
Multi-Quartznet: Multi-Resolution Convolution for Speech Recognition with Multi-Layer Feature Fusion. SLT 2021: 82-88 - [c16]Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Lingwei Kong, Jing Xiao:
GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis. SLT 2021: 438-445 - [c15]Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
MelGlow: Efficient Waveform Generative Network Based On Location-Variable Convolution. SLT 2021: 485-491 - [c14]Jian Luo, Jianzong Wang, Ning Cheng, Guilin Jiang, Jing Xiao:
End-To-End Silent Speech Recognition with Acoustic Sensing. SLT 2021: 606-612 - [i15]Yanfei Hui, Jianzong Wang, Ning Cheng, Fengying Yu, Tianbo Wu, Jing Xiao:
Joint Intent Detection And Slot Filling Based on Continual Learning Model. CoRR abs/2102.10905 (2021) - [i14]Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition. CoRR abs/2102.11594 (2021) - [i13]Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao:
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation. CoRR abs/2107.04227 (2021) - [i12]Shijing Si, Jianzong Wang, Huiming Sun, Jianhan Wu, Chuanyao Zhang, Xiaoyang Qu, Ning Cheng, Lei Chen, Jing Xiao:
Variational Information Bottleneck for Effective Low-resource Audio Classification. CoRR abs/2107.04803 (2021) - [i11]Shijing Si, Jianzong Wang, Xiaoyang Qu, Ning Cheng, Wenqi Wei, Xinghua Zhu, Jing Xiao:
Speech2Video: Cross-Modal Distillation for Speech to Video Generation. CoRR abs/2107.04806 (2021) - 2020
- [c13]Wenqi Wei, Jianzong Wang, Ning Cheng, Yuanxu Chen, Bao Zhou, Jing Xiao:
Epidemic Guard: A COVID-19 Detection System for Elderly People. APWeb/WAIM (2) 2020: 545-550 - [c12]Zelong Yan, Jianzong Wang, Ning Cheng, Tianbo Wu, Jing Xiao:
Chinese Punctuation Prediction with Adaptive Attention and Dependency Tree. CCKS 2020: 3-14 - [c11]Zhen Zeng, Jianzong Wang, Ning Cheng, Tian Xia, Jing Xiao:
Aligntts: Efficient Feed-Forward Text-to-Speech System Without Explicit Alignment. ICASSP 2020: 6714-6718 - [c10]Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao:
GraphTTS: Graph-to-Sequence Modelling in Neural Text-to-Speech. ICASSP 2020: 6719-6723 - [c9]Wenqi Wei, Jianzong Wang, Jiteng Ma, Ning Cheng, Jing Xiao:
A Real-Time Robot-Based Auxiliary System for Risk Evaluation of COVID-19 Infection. INTERSPEECH 2020: 701-705 - [c8]Xueli Jia, Jianzong Wang, Zhiyong Zhang, Ning Cheng, Jing Xiao:
Large-Scale Transfer Learning for Low-Resource Spoken Language Understanding. INTERSPEECH 2020: 1555-1559 - [c7]Zhenpeng Zheng, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
MLNET: An Adaptive Multiple Receptive-Field Attention Neural Network for Voice Activity Detection. INTERSPEECH 2020: 3695-3699 - [c6]Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit. INTERSPEECH 2020: 4422-4426 - [i10]Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao:
GraphTTS: graph-to-sequence modelling in neural text-to-speech. CoRR abs/2003.01924 (2020) - [i9]Zhen Zeng, Jianzong Wang, Ning Cheng, Tian Xia, Jing Xiao:
AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment. CoRR abs/2003.01950 (2020) - [i8]Zhenpeng Zheng, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection. CoRR abs/2008.05650 (2020) - [i7]Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit. CoRR abs/2008.05656 (2020) - [i6]Xueli Jia, Jianzong Wang, Zhiyong Zhang, Ning Cheng, Jing Xiao:
Large-scale Transfer Learning for Low-resource Spoken Language Understanding. CoRR abs/2008.05671 (2020) - [i5]Wenqi Wei, Jianzong Wang, Jiteng Ma, Ning Cheng, Jing Xiao:
A Real-time Robot-based Auxiliary System for Risk Evaluation of COVID-19 Infection. CoRR abs/2008.07695 (2020) - [i4]Jian Luo, Jianzong Wang, Ning Cheng, Guilin Jiang, Jing Xiao:
End-to-end Silent Speech Recognition with Acoustic Sensing. CoRR abs/2011.11315 (2020) - [i3]Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao:
MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution. CoRR abs/2012.01684 (2020) - [i2]Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Lingwei Kong, Jing Xiao:
GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis. CoRR abs/2012.02626 (2020) - [i1]Cheng Yi, Jianzong Wang, Ning Cheng, Shiyu Zhou, Bo Xu:
Applying wav2vec2.0 to Speech Recognition in various low-resource languages. CoRR abs/2012.12121 (2020)
2010 – 2019
- 2011
- [j1]Ning Cheng, Xunying Liu, Lan Wang:
A flexible framework for HMM based noise robust speech recognition using generalized parametric space polynomial regression. Sci. China Inf. Sci. 54(12): 2481-2491 (2011) - [c5]Ning Cheng, Xunying Liu, Lan Wang:
Generalized Variable Parameter HMMs for Noise Robust Speech Recognition. INTERSPEECH 2011: 481-484 - 2010
- [c4]Ning Cheng, Wenju Liu, Lan Wang:
Masking property based microphone array post-filter design. INTERSPEECH 2010: 961-964 - [c3]Wenju Liu, Ning Cheng, Chao Li:
A novel subspace speech enhancement approach based on test of hypothesis and masking properties. ISCSLP 2010: 275-280
2000 – 2009
- 2008
- [c2]Ning Cheng, Wenju Liu, Peng Li, Bo Xu:
An effective microphone array post-filter in arbitrary environments. INTERSPEECH 2008: 439-442 - [c1]Peng Li, Fengchai Liao, Ning Cheng, Bo Xu, Wenju Liu:
Microphone Array Post-Filter Based on Auditory Filtering. ISCSLP 2008: 374-377
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint