


Akinori Ito
2020 – today
- 2024
- [j49]Xuecheng Niu, Akinori Ito, Takashi Nose:
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. IEEE Access 12: 46940-46952 (2024) - [j48]Xuecheng Niu, Akinori Ito, Takashi Nose:
A Replaceable Curiosity-Driven Candidate Agent Exploration Approach for Task-Oriented Dialog Policy Learning. IEEE Access 12: 142640-142650 (2024) - [j47]Rui Zhou, Takaki Koshikawa, Akinori Ito, Takashi Nose, Chia-Ping Chen:
Multilingual Meta-Transfer Learning for Low-Resource Speech Recognition. IEEE Access 12: 158493-158504 (2024) - [j46]Yuya Chiba, Akinori Ito:
Speaker Intimacy Estimation in Chat-Talks Based on Verbal and Non-Verbal Information. IEEE Access 12: 184592-184606 (2024) - [j45]Hironobu Wakabayashi, Yutaka Hiroi, Kenzaburo Miyawaki, Akinori Ito:
Development of a Personal Guide Robot That Leads a Guest Hand-in-Hand While Keeping a Distance. Sensors 24(7): 2345 (2024) - [c146]Zikai Shu, Takashi Nose, Akinori Ito:
Toward Photo-Realistic Facial Animation Generation Based on Keypoint Features. ICMLC 2024: 334-339 - [c145]Rui Zhou, Akinori Ito, Takashi Nose:
Character Expressions in Meta-Learning for Extremely Low Resource Language Speech Recognition. ICMLC 2024: 525-529 - [c144]Changlong Wang, Akinori Ito, Takashi Nose, Chia-Ping Chen:
Evaluation of Environmental Sound Classification using Vision Transformer. ICMLC 2024: 665-669 - [i3]Xuecheng Niu, Akinori Ito, Takashi Nose:
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning. CoRR abs/2402.00085 (2024) - [i2]Akinori Ito:
Embedding Digital Signature into CSV Files Using Data Hiding. CoRR abs/2407.04959 (2024) - [i1]Rui Zhou, Akinori Ito, Takashi Nose:
Preserving Speaker Information in Direct Speech-to-Speech Translation with Non-Autoregressive Generation and Pretraining. CoRR abs/2412.07316 (2024) - 2023
- [c143]Simon Jolibois, Akinori Ito, Takashi Nose:
Multimodal Expressive Embodied Conversational Agent Design. HCI (43) 2023: 244-249 - [c142]Akinori Ito:
Confidence-based Utterance Selection for a Recognizer-free Spoken Dialogue System. ICMLC 2023: 481-484 - 2022
- [c141]Hironobu Wakabayashi, Yutaka Hiroi, Kenzaburo Miyawaki, Akinori Ito:
Path Following Algorithm with Small Error for Guide Robot. RiTA 2022: 56-67 - [c140]Yoshitaka Kasai, Yutaka Hiroi, Kenzaburo Miyawaki, Akinori Ito:
Development of a Teleoperated Play Tag Robot with Semi-Automatic Play. SII 2022: 165-170 - 2021
- [c139]Daisuke Horii, Akinori Ito, Takashi Nose:
Analysis of Feature Extraction by Convolutional Neural Network for Speech Emotion Recognition. GCCE 2021: 425-426 - [c138]Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito:
Neural Spoken-Response Generation Using Prosodic and Linguistic Context for Conversational Systems. Interspeech 2021: 246-250 - [c137]Satsuki Naijo, Akinori Ito, Takashi Nose:
Improvement of Automatic English Pronunciation Assessment with Small Number of Utterances Using Sentence Speakability. Interspeech 2021: 4473-4477 - [c136]Ryota Yahagi, Yuya Chiba, Takashi Nose, Akinori Ito:
Multimodal Dialogue Response Timing Estimation Using Dialogue Context Encoder. IWSDS 2021: 133-141 - [c135]Yuki Misaki, Yutaka Hiroi, Akinori Ito:
A Light-weight Hand-waving Gesture Recognition Method Using Kinect V2 and Frequency Analysis. SII 2021: 750-755 - 2020
- [j44]Kosuke Nakamura, Takashi Nose, Yuya Chiba, Akinori Ito:
A Symbol-level Melody Completion Based on a Convolutional Neural Network with Generative Adversarial Learning. J. Inf. Process. 28: 248-257 (2020) - [j43]Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito:
Automatic assessment of English proficiency for Japanese learners without reference sentences based on deep neural network acoustic models. Speech Commun. 116: 86-97 (2020) - [c134]Rikiya Takahashi, Takashi Nose, Yuya Chiba, Akinori Ito:
Successive Japanese Lyrics Generation Based on Encoder-Decoder Model. GCCE 2020: 126-127 - [c133]Ryota Yahagi, Yuya Chiba, Takashi Nose, Akinori Ito:
Incremental Response Generation Using Prefix-to-Prefix Model for Dialogue System. GCCE 2020: 349-350 - [c132]Satoru Mizuochi, Yuya Chiba, Takashi Nose, Akinori Ito:
Spoken Term Detection Based on Acoustic Models Trained in Multiple Languages for Zero-Resource Language. GCCE 2020: 351-352 - [c131]Satsuki Naijo, Yuya Chiba, Takashi Nose, Akinori Ito:
Analysis and Estimation of Sentence Speakability for English Pronunciation Evaluation. GCCE 2020: 353-355 - [c130]Aoi Kanagaki, Masaya Tanaka, Takashi Nose, Ryohei Shimizu, Akira Ito, Akinori Ito:
CycleGAN-Based High-Quality Non-Parallel Voice Conversion with Spectrogram and WaveRNN. GCCE 2020: 356-357 - [c129]Daisuke Fujimaki, Takashi Nose, Akinori Ito:
Integration of Accent Sandhi and Prosodic Features Estimation for Japanese Text-to-Speech Synthesis. GCCE 2020: 358-359 - [c128]Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito:
Filler Prediction Based on Bidirectional LSTM for Generation of Natural Response of Spoken Dialog. GCCE 2020: 360-361 - [c127]Takuma Hayasaka, Takashi Nose, Akinori Ito:
A Study on Minimum Spectral Error Analysis of Speech. GCCE 2020: 362-363 - [c126]Takuto Fujimura, Takashi Nose, Akinori Ito:
LJSing: Large-Scale Singing Voice Corpus of Single Japanese Singer. GCCE 2020: 364-365 - [c125]Shuhei Imai, Takashi Nose, Aoi Kanagaki, Satoshi Watanabe, Akinori Ito:
Improving Pronunciation Clarity of Dysarthric Speech Using CycleGAN with Multiple Speakers. GCCE 2020: 366-367 - [c124]Yuya Chiba, Takashi Nose, Akinori Ito:
Multi-Stream Attention-Based BLSTM with Feature Segmentation for Speech Emotion Recognition. INTERSPEECH 2020: 3301-3305 - [c123]Yoshihiro Yamazaki, Yuya Chiba, Takashi Nose, Akinori Ito:
Construction and Analysis of a Multimodal Chat-talk Corpus for Dialog Systems Considering Interpersonal Closeness. LREC 2020: 443-448 - [c122]Koyuki Ikemoto, Yutaka Hiroi, Akinori Ito:
Evaluation of Person Tracking Methods for Human-Robot Physical Play. SII 2020: 416-421
2010 – 2019
- 2019
- [j42]Ryo Masumura, Taichi Asami, Takanobu Oba, Sumitaka Sakauchi, Akinori Ito:
Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 102-D(12): 2557-2567 (2019) - [j41]Yutaka Hiroi, Akinori Ito:
Realization of a Robot System That Plays "Darumasan-Ga-Koronda" Game with Humans. Robotics 8(3): 55 (2019) - [j40]Yutaka Hiroi, Akinori Ito:
A Pedestrian Avoidance Method Considering Personal Space for a Guide Robot. Robotics 8(4): 97 (2019) - [j39]Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito:
Improving human scoring of prosody using parametric speech synthesis. Speech Commun. 111: 14-21 (2019) - [e3]Jeng-Shyang Pan, Akinori Ito, Pei-Wei Tsai, Lakhmi C. Jain:
Recent Advances in Intelligent Information Hiding and Multimedia Signal Processing - Proceeding of the Fourteenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, November, 26-28, 2018, Sendai, Japan, Volume 1. Smart Innovation, Systems and Technologies 109, Springer 2019, ISBN 978-3-030-03744-4 - [e2]Jeng-Shyang Pan, Akinori Ito, Pei-Wei Tsai, Lakhmi C. Jain:
Recent Advances in Intelligent Information Hiding and Multimedia Signal Processing - Proceeding of the Fourteenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, November, 26-28, 2018, Sendai, Japan, Volume 2. Smart Innovation, Systems and Technologies 110, Springer 2019, ISBN 978-3-030-03747-5 - 2018
- [j38]Akinori Ito:
Foreword. IEICE Trans. Inf. Syst. 101-D(1): 1 (2018) - [j37]Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 101-D(6): 1581-1590 (2018) - [c121]Shunsuke Tada, Yuya Chiba, Takashi Nose, Akinori Ito:
Effect of Mutual Self-Disclosure in Spoken Dialog System on User Impression. APSIPA 2018: 806-810 - [c120]Akinori Ito, Masatoshi Koizumi:
Spoken Term Detection of Zero-Resource Language using Machine Learning. ICIIT 2018: 45-49 - [c119]Akinori Ito:
Muting Machine Speech Using Audio Watermarking. IIH-MSP (2) 2018: 74-81 - [c118]Akinori Ito:
Leveraging a Small Corpus by Different Frame Shifts for Training of a Speech Recognizer. IIH-MSP (2) 2018: 82-89 - [c117]Jiang Fu, Yuya Chiba, Takashi Nose, Akinori Ito:
Evaluation of English Speech Recognition for Japanese Learners Using DNN-Based Acoustic Models. IIH-MSP (2) 2018: 93-100 - [c116]Mai Yamanaka, Yuya Chiba, Takashi Nose, Akinori Ito:
A Study on a Spoken Dialogue System with Cooperative Emotional Speech Synthesis Using Acoustic and Linguistic Information. IIH-MSP (2) 2018: 101-108 - [c115]Takashi Kimura, Takashi Nose, Shinji Hirooka, Yuya Chiba, Akinori Ito:
Comparison of Speech Recognition Performance Between Kaldi and Google Cloud Speech API. IIH-MSP (2) 2018: 109-115 - [c114]Kosuke Nakamura, Takashi Nose, Yuya Chiba, Akinori Ito:
Melody Completion Based on Convolutional Neural Networks and Generative Adversarial Learning. IIH-MSP (2) 2018: 116-123 - [c113]Shinya Hanabusa, Takashi Nose, Akinori Ito:
Segmental Pitch Control Using Speech Input Based on Differential Contexts and Features for Customizable Neural Speech Synthesis. IIH-MSP (2) 2018: 124-131 - [c112]Sou Miyamoto, Takashi Nose, Kazuyuki Hiroshiba, Yuri Odagiri, Akinori Ito:
Two-Stage Sequence-to-Sequence Neural Voice Conversion with Low-to-High Definition Spectrogram Mapping. IIH-MSP (2) 2018: 132-139 - [c111]Hiroto Aoyama, Takashi Nose, Yuya Chiba, Akinori Ito:
Improvement of Accent Sandhi Rules Based on Japanese Accent Dictionaries. IIH-MSP (2) 2018: 140-148 - [c110]Takahiro Furuya, Yuya Chiba, Takashi Nose, Akinori Ito:
Data Collection and Analysis for Automatically Generating Record of Human Behaviors by Environmental Sound Recognition. IIH-MSP (2) 2018: 149-156 - [c109]Toru Ishikawa, Takashi Nose, Akinori Ito:
DNN-Based Talking Movie Generation with Face Direction Consideration. IIH-MSP (2) 2018: 157-164 - [c108]Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito:
Analyzing Effect of Physical Expression on English Proficiency for Multimodal Computer-Assisted Language Learning. INTERSPEECH 2018: 1746-1750 - [c107]Yukiko Kageyama, Yuya Chiba, Takashi Nose, Akinori Ito:
Improving User Impression in Spoken Dialog System with Gradual Speech Form Control. SIGDIAL Conference 2018: 235-240 - [c106]Yuya Chiba, Takashi Nose, Taketo Kase, Mai Yamanaka, Akinori Ito:
An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System. SIGDIAL Conference 2018: 371-375 - 2017
- [j36]Akinori Ito:
Foreword. IEICE Trans. Inf. Syst. 100-D(1): 1 (2017) - [j35]Yôiti Suzuki, Akinori Ito, Kazuhiro Kondo:
Guest Editorial: Introduction to the Special Issue on the Enrichment of Sound, Speech and Music Media. J. Inf. Hiding Multim. Signal Process. 8(6): 1323-1324 (2017) - [j34]Akinori Ito:
Enrichment of Audio Signal using Side Information. J. Inf. Hiding Multim. Signal Process. 8(6): 1325-1334 (2017) - [j33]Akinori Ito, Yuto Sasaki:
Manipulating Vocal Signal in Mixed Music Sounds using Side Information based on the Fundamental Frequency. J. Inf. Hiding Multim. Signal Process. 8(6): 1372-1381 (2017) - [j32]Yuya Chiba, Takashi Nose, Akinori Ito:
Cluster-based approach to discriminate the user's state whether a user is embarrassed or thinking to an answer to a prompt. J. Multimodal User Interfaces 11(2): 185-196 (2017) - [j31]Kohei Morishita, Yutaka Hiroi, Akinori Ito:
A Crowd Avoidance Method Using Circular Avoidance Path for Robust Person Following. J. Robotics 2017: 3148202:1-3148202:10 (2017) - [c105]Yuya Chiba, Takashi Nose, Akinori Ito:
Analysis of efficient multimodal features for estimating user's willingness to talk: Comparison of human-machine and human-human dialog. APSIPA 2017: 428-431 - [c104]Yukiko Kageyama, Yuya Chiba, Takashi Nose, Akinori Ito:
Collection of Example Sentences for Non-task-Oriented Dialog Using a Spoken Dialog System and Comparison with Hand-Crafted DB. HCI (29) 2017: 458-464 - [c103]Hayato Mori, Yuya Chiba, Takashi Nose, Akinori Ito:
Dialog-Based Interactive Movie Recommendation: Comparison of Dialog Strategies. IIH-MSP (2) 2017: 77-83 - [c102]Shunsuke Tada, Yuya Chiba, Takashi Nose, Akinori Ito:
Response Selection of Interview-Based Dialog System Using User Focus and Semantic Orientation. IIH-MSP (2) 2017: 84-90 - [c101]Yusuke Yamada, Takashi Nose, Yuya Chiba, Akinori Ito, Takahiro Shinozaki:
Development and Evaluation of Julius-Compatible Interface for Kaldi ASR. IIH-MSP (2) 2017: 91-96 - [c100]Sou Miyamoto, Takashi Nose, Suzunosuke Ito, Harunori Koike, Yuya Chiba, Akinori Ito, Takahiro Shinozaki:
Voice Conversion from Arbitrary Speakers Based on Deep Neural Networks with Adversarial Learning. IIH-MSP (2) 2017: 97-103 - [c99]Kosuke Nakamura, Yuya Chiba, Takashi Nose, Akinori Ito:
Evaluation of Nonlinear Tempo Modification Methods Based on Sinusoidal Modeling. IIH-MSP (2) 2017: 104-111 - [c98]Kazuki Sato, Takashi Nose, Akira Ito, Yuya Chiba, Akinori Ito, Takahiro Shinozaki:
A Study on 2D Photo-Realistic Facial Animation Generation Using 3D Facial Feature Points and Deep Neural Networks. IIH-MSP (2) 2017: 112-118 - [c97]Isao Miyagawa, Yuya Chiba, Takashi Nose, Akinori Ito:
Detection of Singing Mistakes from Singing Voice. IIH-MSP (2) 2017: 130-136 - [c96]Yuko Nakamori, Yutaka Hiroi, Akinori Ito:
Enhancement of person detection and tracking for a robot that plays with human. SII 2017: 494-499 - 2016
- [j30]Koji Mikami, Yosuke Nakamura, Akinori Ito, Motonobu Kawashima, Taichi Watanabe, Yoshihiro Kishimoto, Kunio Kondo:
Effectiveness of Game Jam-based iterative program for game production in Japan. Comput. Graph. 61: 1-10 (2016) - [j29]Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Investigation of Combining Various Major Language Model Technologies including Data Expansion and Adaptation. IEICE Trans. Inf. Syst. 99-D(10): 2452-2461 (2016) - [c95]Akinori Ito:
Multiple description vector quantizer design based on redundant representation of central code. EUSIPCO 2016: 106-109 - [c94]Akinori Ito, Kengo Watanabe, Genki Kuroda, Ken'ichiro Ito:
Improvements of iSuperColliderKit and its Applications. ICMC 2016 - [c93]Yuya Chiba, Akinori Ito:
Estimation of User's Willingness to Talk About the Topic: Analysis of Interviews Between Humans. IWSDS 2016: 411-419 - 2015
- [c92]Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Hierarchical Latent Words Language Models for Robust Modeling to Out-Of Domain Tasks. EMNLP 2015: 1896-1901 - [c91]Taketo Kase, Takashi Nose, Akinori Ito:
On Appropriateness and Estimation of the Emotion of Synthesized Response Speech in a Spoken Dialogue System. HCI (27) 2015: 747-752 - [c90]Akinori Ito, Kengo Watanabe, Genki Kuroda, Ken'ichiro Ito:
iSuperColliderKit: A Toolkit for iOS Using an Internal SuperCollider Server as a Sound Engine. ICMC 2015 - [c89]Tsukasa Nishino, Takashi Nose, Akinori Ito:
Tempo Modification of Mixed Music Signal by Nonlinear Time Scaling and Sinusoidal Modeling. IIH-MSP 2015: 146-149 - [c88]Yuki Saito, Takashi Nose, Takahiro Shinozaki, Akinori Ito:
Conversion of Speaker's Face Image Using PCA and Animation Unit for Video Chatting. IIH-MSP 2015: 433-436 - [c87]Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition. INTERSPEECH 2015: 463-467 - [c86]Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Latent words recurrent neural network language models. INTERSPEECH 2015: 2380-2384 - [c85]Takashi Nose, Yusuke Arao, Takao Kobayashi, Komei Sugiura, Yoshinori Shiga, Akinori Ito:
Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts. INTERSPEECH 2015: 3491-3495 - [c84]Yuma Fujiwara, Yutaka Hiroi, Yuki Tanaka, Akinori Ito:
Development of a mobile robot moving on a handrail - Control for preceding a person keeping a distance. RO-MAN 2015: 413-418 - [c83]Keisuke Sakai, Yutaka Hiroi, Akinori Ito:
Playing with a Robot: Realization of "Red Light, Green Light" Using a Laser Range Finder. RVSP 2015: 1-4 - [c82]Koji Mikami, Yosuke Nakamura, Akinori Ito, Motonobu Kawashima, Taichi Watanabe, Yoshihiro Kishimoto, Kunio Kondo:
Game jam based iterative curriculum for game production in Japan. SIGGRAPH Asia Symposium on Education 2015: 11:1-11:6 - 2014
- [j28]Ryunosuke Daido, Masashi Ito, Shozo Makino, Akinori Ito:
Automatic evaluation of singing enthusiasm for karaoke. Comput. Speech Lang. 28(2): 501-517 (2014) - [j27]Takeshi Nagano, Akinori Ito:
Packet Loss Concealment of Voice-over IP Packet using Redundant Parameter Transmission Under Severe Loss Conditions. J. Inf. Hiding Multim. Signal Process. 5(2): 286-295 (2014) - [c81]Kohei Machida, Takashi Nose, Akinori Ito:
Speech recognition in a home environment using parallel decoding with GMM-based noise modeling. APSIPA 2014: 1-4 - [c80]Naoto Suzuki, Takashi Nose, Yutaka Hiroi, Akinori Ito:
Controlling Switching Pause Using an AR Agent for Interactive CALL System. HCI (27) 2014: 588-593 - [c79]Hafiyan Prafianto, Takashi Nose, Yuya Chiba, Akinori Ito, Kazuyuki Sato:
A study on the effect of speech rate on perception of spoken easy Japanese using speech synthesis. ICAILP 2014: 476-479 - [c78]Masahito Okamoto, Takashi Nose, Akinori Ito, Takeshi Nagano:
Subjective evaluation of packet loss recovery techniques for voice over IP. ICAILP 2014: 711-714 - [c77]Noriko Totsuka, Yuya Chiba, Takashi Nose, Akinori Ito:
Robot: Have I done something wrong? - Analysis of prosodic features of speech commands under the robot's unintended behavior. ICAILP 2014: 887-890 - [c76]Kazumichi Yoshida, Takashi Nose, Akinori Ito:
Analysis of English Pronunciation of Singing Voices Sung by Japanese Speakers. IIH-MSP 2014: 554-557 - [c75]Akinori Ito:
Assessing the Intended Enthusiasm of Singing Voice Using Energy Variance. IIH-MSP 2014: 558-561 - [c74]Takashi Nose, Akinori Ito:
Analysis of spectral enhancement using global variance in HMM-based speech synthesis. INTERSPEECH 2014: 2917-2921 - [c73]Yuya Chiba, Masashi Ito, Takashi Nose, Akinori Ito:
User Modeling by Using Bag-of-Behaviors for Building a Dialog System Sensitive to the Interlocutor's Internal State. SIGDIAL Conference 2014: 74-78 - [e1]Junzo Watada, Akinori Ito, Jeng-Shyang Pan, Han-Chieh Chao, Chien-Ming Chen:
2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2014, Kitakyushu, Japan, August 27-29, 2014. IEEE 2014, ISBN 978-1-4799-5390-5 - 2013
- [c72]Kohei Machida, Akinori Ito:
Speech recognition under noisy environments using multiple microphones based on asynchronous and intermittent measurements. APSIPA 2013: 1-4 - [c71]Yuya Chiba, Masashi Ito, Akinori Ito:
Estimation of User's State during a Dialog Turn with Sequential Multi-modal Features. HCI (29) 2013: 572-576 - [c70]Yutaka Hiroi, Akinori Ito:
ASAHI: OK for failure: a robot for supporting daily life, equipped with a robot avatar. HRI 2013: 141-142 - [c69]Takeshi Nagano, Akinori Ito:
A Packet Loss Recovery of G.729 Speech Using Discriminative Model and N-Gram. IIH-MSP 2013: 267-270 - [c68]Yohei Abe, Akinori Ito:
Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal. IIH-MSP 2013: 271-274 - [c67]Keizo Kato, Akinori Ito:
Acoustic Features and Auditory Impressions of Death Growl and Screaming Voice. IIH-MSP 2013: 460-463 - [c66]Yuki Igarashi, Masashi Ito, Akinori Ito:
Evaluation of Sinusoidal Modeling for Polyphonic Music Signal. IIH-MSP 2013: 464-467 - 2012
- [j26]Yuya Chiba, Akinori Ito:
Estimating a User's Internal State before the First Input Utterance. Adv. Hum. Comput. Interact. 2012: 865362:1-865362:10 (2012) - [j25]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Model Shrinkage for Discriminative Language Models. IEICE Trans. Inf. Syst. 95-D(5): 1465-1474 (2012) - [j24]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Round-Robin Duel Discriminative Language Models. IEEE Trans. Speech Audio Process. 20(4): 1244-1255 (2012) - [c65]Chihiro Abe, Akinori Ito:
A Japanese lyrics writing support system for amateur songwriters. APSIPA 2012: 1-4 - [c64]Takuya Anzai, Akinori Ito:
Recognition of utterances with grammatical mistakes based on optimization of language model towards interactive CALL systems. APSIPA 2012: 1-4 - [c63]Shinji Miyake, Akinori Ito:
A spoken dialogue system using virtual conversational agent with augmented reality. APSIPA 2012: 1-4 - [c62]Takeshi Nagano, Akinori Ito:
A packet loss recovery of G.729 speech under severe packet loss condition. APSIPA 2012: 1-4 - [c61]Yuya Chiba, Masashi Ito, Akinori Ito:
Estimation of User's Internal State before the User's First Utterance Using Acoustic Features and Face Orientation. HSI 2012: 23-28 - [c60]Yutaka Hiroi, Takayuki Nakayama, Hisanori Kuroda, Shinji Miyake, Akinori Ito:
Effect of Robot Height on Comfortableness of Spoken Dialog. HSI 2012: 29-34 - [c59]Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Spoken document retrieval by discriminative modeling in a high dimensional feature space. ICASSP 2012: 5153-5156 - [c58]Akinori Ito, Takeshi Nagano:
Packet loss concealment of VoIP under severe loss conditions. WPMC 2012: 489-490 - 2011
- [c57]Sigit Kusumanugraha, Akinori Ito, Koji Mikami, Kunio Kondo:
An Analysis of Indonesian Traditional "Wayang Kulit" Puppet 3D Shapes Based on Their Roles in the Story. Culture and Computing 2011: 147-148 - [c56]Minoru Kohata, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Bit rate reduction of the MELP coder using Lempel-Ziv segment quantization. ICASSP 2011: 5240-5243 - [c55]Takanobu Oba, Takaaki Hori, Akinori Ito, Atsushi Nakamura:
Round-robin duel discriminative language models in one-pass decoding with on-the-fly error correction. ICASSP 2011: 5588-5591 - [c54]Yuto Sasaki, Seongjun Hahm, Akinori Ito:
Manipulating Vocal Signal in Mixed Music Sounds Using Small Amount of Side Information. IIH-MSP 2011: 298-301 - [c53]Akinori Ito, Akihito Aiba, Masashi Ito, Shozo Makino:
Evaluation of Abnormal Sound Detection using Multi-Stage GMM in Various Environments. INTERSPEECH 2011: 301-304 - [c52]Ryo Masumura, Seongjun Hahm, Akinori Ito:
Training a Language Model Using Webdata for Large Vocabulary Japanese Spontaneous Speech Recognition. INTERSPEECH 2011: 1465-1468 - [c51]Ryo Masumura, Seongjun Hahm, Akinori Ito:
Language Model Expansion Using Webdata for Spoken Document Retrieval. INTERSPEECH 2011: 2133-2136 - [c50]Ryunosuke Daido, Seongjun Hahm, Masashi Ito, Shozo Makino, Akinori Ito:
A System for Evaluating Singing Enthusiasm for Karaoke. ISMIR 2011: 31-36 - 2010
- [j23]Koji Mikami, Taichi Watanabe, Katsunori Yamaji, Kenji Ozawa, Akinori Ito, Motonobu Kawashima, Ryota Takeuchi, Kunio Kondo, Mitsuru Kaneko:
Construction trial of a practical education curriculum for game development by industry-university collaboration in Japan. Comput. Graph. 34(6): 791-799 (2010) - [j22]Akinori Ito, Shun'ichiro Abe, Yôiti Suzuki:
Information Hiding for G.711 Speech Based on Substitution of Least Significant Bits and Estimation of Tolerable Distortion. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 93-A(7): 1279-1286 (2010) - [j21]Seongjun Hahm, Yuichi Ohkawa, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Improved Reference Speaker Weighting Using Aspect Model. IEICE Trans. Inf. Syst. 93-D(7): 1927-1935 (2010) - [j20]Seongjun Hahm, Yuichi Ohkawa, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Speech Recognition under Multiple Noise Environment Based on Multi-Mixture HMM and Weight Optimization by the Aspect Model. IEICE Trans. Inf. Syst. 93-D(9): 2407-2416 (2010) - [j19]Akinori Ito, Shozo Makino:
Designing Side Information of Multiple Description Coding. J. Inf. Hiding Multim. Signal Process. 1(1): 10-19 (2010) - [j18]Hoseok Wey, Akinori Ito, Takuma Okamoto, Yôiti Suzuki:
Multiple Description Coding Using Time Domain Division for MP3 coded Sound Signal. J. Inf. Hiding Multim. Signal Process. 1(4): 269-285 (2010) - [c49]Seongjun Hahm, Yuichi Ohkawa, Masashi Ito, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Aspect-model-based reference speaker weighting. ICASSP 2010: 4302-4305 - [c48]Akinori Ito, Kiyoshi Konno, Masashi Ito, Shozo Makino:
Improvement of Packet Loss Concealment for MP3 Audio Based on Switching of Concealment Method and Estimation of MDCT Signs. IIH-MSP 2010: 518-521 - [c47]Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano:
An effect of formant amplitude in vowel perception. INTERSPEECH 2010: 2490-2493 - [c46]Ryo Masumura, Akinori Ito, Yu Uno, Masashi Ito, Shozo Makino:
Document expansion using relevant web documents for spoken document retrieval. NLPKE 2010: 1-8
2000 – 2009
- 2009
- [j17]Akinori Ito, Yasutomo Kajiura, Motoyuki Suzuki, Shozo Makino:
Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition. EURASIP J. Audio Speech Music. Process. 2009 (2009) - [j16]Akinori Ito, Hiroaki Kinno, Masaharu Katoh, Tetsuo Kosaka, Masaki Kohda:
Dictation of Japanese Speech Based on Kana and Kanji Character String. Int. J. Comput. Process. Orient. Lang. 22(1): 75-98 (2009) - [j15]Motoyuki Suzuki, Takuto Ichikawa, Akinori Ito, Shozo Makino:
Novel Tonal Feature and Statistical User Modeling for Query-by-Humming. Inf. Media Technol. 4(2): 498-508 (2009) - [j14]Motoyuki Suzuki, Takuto Ichikawa, Akinori Ito, Shozo Makino:
Novel Tonal Feature and Statistical User Modeling for Query-by-Humming. J. Inf. Process. 17: 95-105 (2009) - [j13]Yuichi Ohkawa, Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Shozo Makino:
A speaker adaptation method for non-native speech using learners' native utterances for computer-assisted language learning systems. Speech Commun. 51(10): 875-882 (2009) - [c45]Akinori Ito, Akihito Aiba, Masashi Ito, Shozo Makino:
Detection of Abnormal Sound Using Multi-stage GMM for Surveillance Microphone. IAS 2009: 733-736 - [c44]Akinori Ito, Shun'ichiro Abe, Yôiti Suzuki:
Information hiding for G.711 speech based on substitution of least significant bits and estimation of tolerable distortion. ICASSP 2009: 1409-1412 - [c43]Akinori Ito, Hironori Handa, Yôiti Suzuki:
A Band Extension of G.711 Speech with Low Computational Cost for Data Hiding Application. IIH-MSP 2009: 491-494 - [c42]Akinori Ito, Shozo Makino:
Data Hiding is a Better Way for Transmitting Side Information for MP3 Bitstream. IIH-MSP 2009: 495-498 - [c41]Masashi Ito, Keiji Ohara, Akinori Ito, Masafumi Yano:
Relative importance of formant and whole-spectral cues for vowel perception. INTERSPEECH 2009: 124-127 - [c40]Akinori Ito, Tomoaki Konno, Masashi Ito, Shozo Makino:
Evaluation of English intonation based on combination of multiple evaluation scores. INTERSPEECH 2009: 596-599 - [c39]Motoyuki Suzuki, Daisuke Honma, Akinori Ito, Shozo Makino:
Detailed description of triphone model using SSS-free algorithm. INTERSPEECH 2009: 1399-1402 - [c38]Koji Mikami, Taichi Watanabe, Katsunori Yamaji, Kenji Ozawa, Akinori Ito, Motonobu Kawashima, Ryota Takeuchi, Kunio Kondo, Mitsuru Kaneko:
Construction trial of a practical education curriculum for game development by industry/university collaboration. SIGGRAPH ASIA Educators Program 2009 - 2008
- [j12]Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino:
Selection of Optimum Vocabulary and Dialog Strategy for Noise-Robust Spoken Dialog Systems. IEICE Trans. Inf. Syst. 91-D(3): 538-548 (2008) - [j11]Akinori Ito, Shozo Makino:
Multiple description coding of an audio stream by optimum recovery transforms. J. Digit. Inf. Manag. 6(2): 189-195 (2008) - [c37]Akinori Ito, Kiyoshi Konno, Shozo Makino, Motoyuki Suzuki:
Packet Loss Concealment for MDCT-Based Audio Codec Using Correlation-Based Side Information. IIH-MSP 2008: 612-615 - [c36]Akinori Ito, Toyomi Meguro, Shozo Makino, Motoyuki Suzuki:
Discrimination of task-related words for vocabulary design of spoken dialog systems. INTERSPEECH 2008: 207-210 - [c35]Seongjun Hahm, Akinori Ito, Shozo Makino, Motoyuki Suzuki:
A fast speaker adaptation method using aspect model. INTERSPEECH 2008: 1221-1224 - [c34]Akinori Ito, Ryohei Tsutsui, Shozo Makino, Motoyuki Suzuki:
Recognition of English utterances with grammatical and lexical mistakes for dialogue-based CALL system. INTERSPEECH 2008: 2819-2822 - [c33]Tomoaki Konno, Akinori Ito, Masashi Ito, Shozo Makino, Motoyuki Suzuki:
Intonation evaluation of English utterances using synthesized speech for Computer-Assisted Language Learning. NLPKE 2008: 1-7 - [c32]Motoyuki Suzuki, Naoto Kuriyama, Akinori Ito, Shozo Makino:
Automatic clustering of part-of-speech for vocabulary divided PLSA language model. NLPKE 2008: 1-7 - 2007
- [j10]Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino:
Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information. EURASIP J. Adv. Signal Process. 2007 (2007) - [j9]Minoru Kohata, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
A New Segment Quantization Using Lempel-Ziv Algorithm and Its Application to Quantization of Line Spectral Frequencies. IEEE Trans. Commun. 55(4): 661-664 (2007) - [c31]Akinori Ito, Shozo Makino:
Increasing Correlation using a Few Bits for Multiple Description Coding. IIH-MSP 2007: 259-262 - 2006
- [j8]Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki, Shozo Makino:
A grammatical error detection method for dialogue-based CALL system. Inf. Media Technol. 1(1): 391-410 (2006) - [j7]S.-P. Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
An effective music information retrieval method using three-dimensional continuous DP. IEEE Trans. Multim. 8(3): 633-639 (2006) - [c30]Akinori Ito, Shozo Makino:
Multiple Description Coding of an Audio Stream by Optimum Recovery Transform. IIH-MSP 2006: 19-22 - [c29]Akinori Ito, Keisuke Shimada, Motoyuki Suzuki, Shozo Makino:
A user simulator based on voiceXML for evaluation of spoken dialog systems. INTERSPEECH 2006 - [c28]Motoyuki Suzuki, Yasutomo Kajiura, Akinori Ito, Shozo Makino:
Unsupervised language model adaptation based on automatic text collection from WWW. INTERSPEECH 2006 - [c27]Motoyuki Suzuki, Toru Hosoya, Akinori Ito, Shozo Makino:
Music Information Retrieval from a Singing Voice Based on Verification of Recognized Hypotheses. ISMIR 2006: 168-171 - 2005
- [c26]Akinori Ito, Xinyue Wang, Motoyuki Suzuki, Shozo Makino:
Smile and Laughter Recognition using Speech Processing and Face Recognition from Conversation Video. CW 2005: 437-444 - [c25]Akinori Ito, Yen-Ling Lim, Motoyuki Suzuki, Shozo Makino:
Pronunciation error detection method based on error rule clustering using a decision tree. INTERSPEECH 2005: 173-176 - [c24]Motoyuki Suzuki, Yusuke Kato, Akinori Ito, Shozo Makino:
Construction method of acoustic models dealing with various background noises based on combination of HMMs. INTERSPEECH 2005: 973-976 - [c23]Akinori Ito, Takashi Kanayama, Motoyuki Suzuki, Shozo Makino:
Internal noise suppression for speech recognition by small robots. INTERSPEECH 2005: 2685-2688 - [c22]Toru Hosoya, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Lyrics Recognition from a Singing Voice Based on Finite State Automaton for Music Information Retrieval. ISMIR 2005: 532-535 - 2004
- [c21]Takashi Konashi, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
A spoken dialog system based on automatic grammar generation and template-based weighting for autonomous mobile robots. INTERSPEECH 2004: 189-192 - [c20]Akinori Ito, Takanobu Oba, Takashi Konashi, Motoyuki Suzuki, Shozo Makino:
Noise adaptive spoken dialog system based on selection of multiple dialog strategies. INTERSPEECH 2004: 193-196 - [c19]Oh Pyo Kweon, Akinori Ito, Motoyuki Suzuki, Shozo Makino:
A Japanese dialogue-based CALL system with mispronunciation and grammar error detection. INTERSPEECH 2004: 1833-1836 - [c18]Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Yuichi Ohkawa, Shozo Makino:
Speaker adaptation method for CALL system using bilingual speakers' utterances. INTERSPEECH 2004: 2929-2932 - [c17]Akinori Ito, Sung-Phil Heo, Motoyuki Suzuki, Shozo Makino:
Comparison Of Features For DP-Matching Based Query-by-Humming System. ISMIR 2004 - 2003
- [c16]Sung-Phil Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino, Hyun-Yeol Chung:
Error Tolerant Melody Matching Method in Music Information Retrieval. Adaptive Multimedia Retrieval 2003: 212-227 - [c15]Yuichi Ohkawa, Akihiro Yoshida, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
An optimized multi-duration HMM for spontaneous speech recognition. INTERSPEECH 2003: 485-488 - [c14]Sung-Phil Heo, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
Three-dimensional continuous DP algorithm for multiple pitch candidates in a music information retrieval system. ISMIR 2003 - 2002
- [j6]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Erratum: Language modeling by stochastic dependency grammar for Japanese speech recognition. Syst. Comput. Jpn. 33(3): 74 (2002) - [j5]Chiori Hori, Masaharu Katoh, Akinori Ito, Masaki Kohda:
Construction and evaluation of language models based on stochastic context-free grammar for speech recognition. Syst. Comput. Jpn. 33(13): 48-59 (2002) - [c13]Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano:
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models. LREC 2002 - 2001
- [j4]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Language modeling by stochastic dependency grammar for Japanese speech recognition. Syst. Comput. Jpn. 32(12): 10-15 (2001) - [c12]Se-Jin Oh, Hyun-Yeol Chung, Cheol-Jun Hwang, Bum-Koog Kim, Akinori Ito:
New state clustering of hidden Markov network with Korean phonological rules for speech recognition. MMSP 2001: 39-44 - 2000
- [c11]Akinori Ito, Chiori Hori, Masaharu Katoh, Masaki Kohda:
Language modeling by stochastic dependency grammar for Japanese speech recognition. INTERSPEECH 2000: 246-249 - [c10]Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479 - [c9]Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project. LREC 2000
1990 – 1999
- 1999
- [c8]Akinori Ito, Masaki Kohda, Mari Ostendorf:
A new metric for stochastic language model evaluation. EUROSPEECH 1999 - 1997
- [c7]Akinori Ito, Hideyuki Saitoh, Masaharu Katoh, Masaki Kohda:
N-gram language model adaptation using small corpus for spoken dialog recognition. EUROSPEECH 1997: 2735-2738 - 1996
- [c6]Akinori Ito, Masaki Kohda:
Language modeling by string pattern n-gram for Japanese speech recognition. ICSLP 1996: 490-493 - 1995
- [j3]Motoyuki Suzuki, Shozo Makino, Akinori Ito, Hirotomo Aso, Hiroshi Shimodaira:
A New HMnet Construction Algorithm Requiring No Contextual Factors. IEICE Trans. Inf. Syst. 78-D(6): 662-668 (1995) - 1994
- [j2]Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido:
A Continuous Speech Recognition System Using A Modified LVQ2 Method and A Dependency Grammar with Semantic Constraints. Int. J. Pattern Recognit. Artif. Intell. 8(1): 197-213 (1994) - [j1]Takashi Otsuki, Shozo Makino, Akinori Ito, Toshio Sone:
Performance prediction of word recognition using the transition information between phonemes or between characters. Syst. Comput. Jpn. 25(7): 72-81 (1994) - [c5]Takashi Otsuki, Akinori Ito, Shozo Makino, Teruhiko Otomo:
The performance prediction method on sentence recognition system using a finite state automaton. ICASSP (1) 1994: 397-400 - 1993
- [c4]Akinori Ito, Shozo Makino:
A new word pre-selection method based on an extended redundant hash addressing for continuous speech recognition. ICASSP (2) 1993: 299-302 - 1992
- [c3]Akinori Ito, Shozo Makino:
Word pre-selection using a redundant hash addressing method for continuous speech recognition. ICSLP 1992: 309-312 - 1991
- [c2]Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido:
A Japanese text dictation system based on phoneme recognition and a dependency grammar. ICASSP 1991: 273-276 - 1990
- [c1]Shozo Makino, Akinori Ito, Mitsuru Endo, Ken'iti Kido:
A Japanese text dictation system based on phoneme recognition using a modified LVQ2 method. ICSLP 1990: 241-244
last updated on 2025-01-28 22:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license