Sashi Novitasari

IBM Tokyo Research Lab.
Verified email at ibm.com
Cited by 96

Cross-lingual machine speech chain for Javanese, Sundanese, Balinese, and Bataks speech recognition and synthesis

S Novitasari, A Tjandra, S Sakti… - arXiv preprint arXiv …, 2020 - arxiv.org
Even though over seven hundred ethnic languages are spoken in Indonesia, the available
technology remains limited that could support communication within indigenous communities …

A machine speech chain approach for dynamically adaptive Lombard TTS in static and dynamic noise environments

S Novitasari, S Sakti… - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
Recent end-to-end text-to-speech synthesis (TTS) systems have successfully synthesized
high-quality speech. However, TTS speech intelligibility degrades in noisy environments …

Rude-words detection for Indonesian speech using support vector machine

S Novitasari, DP Lestari, S Sakti… - … Conference on Asian …, 2018 - ieeexplore.ieee.org
This paper presents an approach to detect rude or swear-words in Indonesian transcribed
speech by using Support Vector Machine and various combinations of text and acoustic …

Sequence-to-sequence learning via attention transfer for incremental speech recognition

S Novitasari, A Tjandra, S Sakti… - arXiv preprint arXiv …, 2020 - arxiv.org
Attention-based sequence-to-sequence automatic speech recognition (ASR) requires a
significant delay to recognize long utterances because the output is generated after receiving …

Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder

S Novitasari, S Sakti, S Nakamura - Interspeech, 2021 - isca-archive.org
Although machine speech chains were originally proposed to mimic a closed-loop human
speech chain mechanism with auditory feedback, the existing machine speech chains are …

Simultaneous speech-to-speech translation system with neural incremental ASR, MT, and TTS

K Sudoh, T Kano, S Novitasari, T Yanagita… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper presents a newly developed, simultaneous neural speech-to-speech translation
system and its evaluation. The system consists of three fully-incremental neural processing …

Improving ASR Robustness in Noisy Condition Through VAD Integration

S Novitasari, T Fukuda, G Kurata - INTERSPEECH, 2022 - isca-archive.org
Automatic speech recognition (ASR) systems are often deployed together with a voice activity
detection (VAD) system to run ASR only on the voiced acoustic signals. Although it can …

Neural incremental speech recognition toward real-time machine speech translation

S Novitasari, S Sakti, S Nakamura - IEICE TRANSACTIONS on …, 2021 - search.ieice.org
Real-time machine speech translation systems mimic human interpreters and translate
incoming speech from a source language to the target language in real-time. Such systems can …

Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition

S Novitasari, S Sakti… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
A common approach for text-to-speech (TTS) in noisy conditions is offline fine-tuning, which
is generally utilized on static noises and predefined conditions. We recently proposed a self-…

Neural Incremental Speech Recognition Towards Simultaneous Speech Translation

S Novitasari - 2020 - naist.repo.nii.ac.jp
Simultaneous speech interpretation or translation is a required task to bridge a real-time
multilingual human-to-human communication. Speech-to-speech translation (S2ST) system …