Subband fusion of complex spectrogram for fake speech detection
A subband fusion method based on the complex spectrogram is proposed for fake speech detection. Different subbands of the complex spectrogram are modeled separately and then fused.
Feb 1, 2024 · To address this issue, this paper proposes a novel subband fusion of the complex spectrogram method for fake speech detection. The complex ...
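The idea in the snippets above (model subbands of the complex spectrogram separately, then fuse) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the STFT settings, the choice of four equal-width subbands, the energy-based stand-in for a per-subband model, and the mean-score fusion. The helper names `complex_spectrogram`, `split_subbands`, and `fuse` are hypothetical.

```python
import numpy as np

def complex_spectrogram(signal, n_fft=512, hop=128):
    """Short-time Fourier transform; returns a complex (frames, bins) array."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def split_subbands(spec, n_bands=4):
    """Stack real and imaginary parts as two channels, then split along frequency.

    The number of bands is an assumption; the source does not state it.
    """
    feats = np.stack([spec.real, spec.imag], axis=0)  # (2, frames, bins)
    return np.array_split(feats, n_bands, axis=2)

def fuse(scores):
    """Placeholder fusion: average the per-subband scores (assumed, not the paper's rule)."""
    return float(np.mean(scores))

# Synthetic noise stands in for a speech waveform.
rng = np.random.default_rng(0)
signal = rng.standard_normal(16000)

spec = complex_spectrogram(signal)
bands = split_subbands(spec)

# Hypothetical per-subband "model": mean energy, standing in for a trained classifier.
scores = [float(np.mean(band ** 2)) for band in bands]
fused = fuse(scores)
```

In a real system each subband would feed its own trained network and the fusion step would combine learned scores or embeddings; the sketch only shows the data flow from complex spectrogram to subbands to a single fused value.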
Traditional approaches involve manual extraction of features such as first-order spectral features [5,6,7,8], second-order spectral features [9,10,11], and ...
In this paper, we propose the multi-perspective information fusion (MPIF) Res2Net with random Specmix for fake speech detection (FSD). The main purpose of this ...
Subband fusion of complex spectrogram for fake speech detection. C Fan, J Xue ... Spatial reconstructed local attention Res2Net with F0 subband for fake speech ...
Subband fusion of complex spectrogram for fake speech detection. Speech Communication. 2023 | Journal article. DOI: 10.1016/J.SPECOM.2023.102988. WOSUID: WOS ...
It is expected that the F0 feature contains discriminative information for the fake speech detection (FSD) task. In this paper, we propose a novel F0 subband ...
It is found that subband transform captures the artifacts in synthetic speech more effectively than full band transform. In text-to-speech or voice ...
Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection. ... Subband fusion of complex spectrogram for fake speech detection.