Google Scholar

User profiles for Sashi Novitasari

Sashi Novitasari

IBM Tokyo Research Lab.

Verified email at ibm.com

Cited by 96

Cross-lingual machine speech chain for Javanese, Sundanese, Balinese, and Bataks speech recognition and synthesis

S Novitasari, A Tjandra, S Sakti… - arXiv preprint arXiv …, 2020 - arxiv.org

Even though over seven hundred ethnic languages are spoken in Indonesia, the available
technology remains limited that could support communication within indigenous communities …

Cited by 17

A machine speech chain approach for dynamically adaptive Lombard TTS in static and dynamic noise environments

S Novitasari, S Sakti… - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org

Recent end-to-end text-to-speech synthesis (TTS) systems have successfully synthesized
high-quality speech. However, TTS speech intelligibility degrades in noisy environments …

Cited by 10

Rude-words detection for Indonesian speech using support vector machine

S Novitasari, DP Lestari, S Sakti… - … Conference on Asian …, 2018 - ieeexplore.ieee.org

This paper presents an approach to detect rude or swear-words in Indonesian transcribed
speech by using Support Vector Machine and various combinations of text and acoustic …

Cited by 7

Sequence-to-sequence learning via attention transfer for incremental speech recognition

S Novitasari, A Tjandra, S Sakti… - arXiv preprint arXiv …, 2020 - arxiv.org

Attention-based sequence-to-sequence automatic speech recognition (ASR) requires a
significant delay to recognize long utterances because the output is generated after receiving …

Cited by 16

Dynamically Adaptive Machine Speech Chain Inference for TTS in Noisy Environment: Listen and Speak Louder.

S Novitasari, S Sakti, S Nakamura - Interspeech, 2021 - isca-archive.org

Although machine speech chains were originally proposed to mimic a closed-loop human
speech chain mechanism with auditory feedback, the existing machine speech chains are …

Cited by 6

Simultaneous speech-to-speech translation system with neural incremental ASR, MT, and TTS

K Sudoh, T Kano, S Novitasari, T Yanagita… - arXiv preprint arXiv …, 2020 - arxiv.org

This paper presents a newly developed, simultaneous neural speech-to-speech translation
system and its evaluation. The system consists of three fully-incremental neural processing …

Cited by 17

Improving ASR Robustness in Noisy Condition Through VAD Integration.

S Novitasari, T Fukuda, G Kurata - INTERSPEECH, 2022 - isca-archive.org

Automatic speech recognition (ASR) systems are often deployed together with a voice activity
detection (VAD) system to run ASR only on the voiced acoustic signals. Although it can …

Cited by 5

Neural incremental speech recognition toward real-time machine speech translation

S Novitasari, S Sakti, S Nakamura - IEICE TRANSACTIONS on …, 2021 - search.ieice.org

Real-time machine speech translation systems mimic human interpreters and translate
incoming speech from a source language to the target language in real-time. Such systems can …

Cited by 5

Self-Adaptive Incremental Machine Speech Chain for Lombard TTS with High-Granularity ASR Feedback in Dynamic Noise Condition

S Novitasari, S Sakti… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

A common approach for text-to-speech (TTS) in noisy conditions is offline fine-tuning, which
is generally utilized on static noises and predefined conditions. We recently proposed a self-…

Neural Incremental Speech Recognition Towards Simultaneous Speech Translation

S Novitasari - 2020 - naist.repo.nii.ac.jp

Simultaneous speech interpretation or translation is a required task to bridge a real-time
multilingual human-to-human communication. Speech-to-speech translation (S2ST) system …
