User profiles for Sashi Novitasari
Sashi NovitasariIBM Tokyo Research Lab. Verified email at ibm.com Cited by 96 |
Cross-lingual machine speech chain for javanese, sundanese, balinese, and bataks speech recognition and synthesis
Even though over seven hundred ethnic languages are spoken in Indonesia, the available
technology remains limited that could support communication within indigenous communities …
technology remains limited that could support communication within indigenous communities …
A machine speech chain approach for dynamically adaptive lombard tts in static and dynamic noise environments
S Novitasari, S Sakti… - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Recent end-to-end text-to-speech synthesis (TTS) systems have successfully synthesized
high-quality speech. However, TTS speech intelligibility degrades in noisy environments …
high-quality speech. However, TTS speech intelligibility degrades in noisy environments …
Rude-words detection for Indonesian speech using support vector machine
This paper presents an approach to detect rude or swear-words in Indonesian transcribed
speech by using Support Vector Machine and various combinations of text and acoustic …
speech by using Support Vector Machine and various combinations of text and acoustic …
Sequence-to-sequence learning via attention transfer for incremental speech recognition
Attention-based sequence-to-sequence automatic speech recognition (ASR) requires a
significant delay to recognize long utterances because the output is generated after receiving …
significant delay to recognize long utterances because the output is generated after receiving …
[PDF][PDF] Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder.
Although machine speech chains were originally proposed to mimic a closed-loop human
speech chain mechanism with auditory feedback, the existing machine speech chains are …
speech chain mechanism with auditory feedback, the existing machine speech chains are …
Simultaneous speech-to-speech translation system with neural incremental asr, mt, and tts
This paper presents a newly developed, simultaneous neural speech-to-speech translation
system and its evaluation. The system consists of three fully-incremental neural processing …
system and its evaluation. The system consists of three fully-incremental neural processing …
[PDF][PDF] Improving ASR Robustness in Noisy Condition Through VAD Integration.
S Novitasari, T Fukuda, G Kurata - INTERSPEECH, 2022 - isca-archive.org
Automatic speech recognition (ASR) systems are often deployed together with a voice activity
detection (VAD) system to run ASR only on the voiced acoustic signals. Although it can …
detection (VAD) system to run ASR only on the voiced acoustic signals. Although it can …
Neural incremental speech recognition toward real-time machine speech translation
S Novitasari, S Sakti, S Nakamura - IEICE TRANSACTIONS on …, 2021 - search.ieice.org
Real-time machine speech translation systems mimic human interpreters and translate
incoming speech from a source language to the target language in real-time. Such systems can …
incoming speech from a source language to the target language in real-time. Such systems can …
Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition
S Novitasari, S Sakti… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
A common approach for text-to-speech (TTS) in noisy conditions is offline fine-tuning, which
is generally utilized on static noises and predefined conditions. We recently proposed a self-…
is generally utilized on static noises and predefined conditions. We recently proposed a self-…
[PDF][PDF] Neural Incremental Speech Recognition Towards Simultaneous Speech Translation
S Novitasari - 2020 - naist.repo.nii.ac.jp
Simultaneous speech interpretation or translation is a required task to bridge a real-time
multilingual human-to-human communication. Speech-to-speech translation (S2ST) system …
multilingual human-to-human communication. Speech-to-speech translation (S2ST) system …