Gakuto Kurata  Gakuto Kurata photo         

contact information

Academy of Technology LogoSpeech and Language Processing
IBM Research - Tokyo
  

links


profile


Gakuto Kurata is a senior technical staff member and the manager of the speech technology group at IBM Research - Tokyo. He has more than 10 years of research and development experiences in speech technology, natural language processing, and their combinations. Since 2016, he has been managing the speech technology group focusing on (1) pushing the envelope on speech recognition performance in collaboration with IBM's global speech team, (2) delivering advanced technologies to IBM Watson Group, and (3) developing novel speech solutions.

He joined IBM in April 2004, after obtaining M.S. in Information Science and Technology from the University of Tokyo. He received a Ph.D. in Information Science and Technology from the University of Tokyo in 2013. He has been the Technical Assistant to the Director of IBM Research - Tokyo in 2014. He is an IBM Master Inventor and a member of IBM Academy of Technology.

 

Internship opportunities are available in the field of speech and language processing. Please apply from this page.

 

Conference Papers

  • Gakuto Kurata, Kartik Audhkhasi, "Improved Knowledge Distillation from Bi-directional to Uni-directional LSTM CTC for End-to-end Speech Recognition", in Proceedings of SLT 2018, Athens, Greece, December 2018 (to appear)
  • Takashi Fukuda, Raul Fernandez, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Alexander Sorin, Gakuto Kurata, "Data Augmentation Improves Recognition of Foreign Accented Speech", in Proceedings of INTERSPEECH 2018, September 2018
  • Masayuki Suzuki, Tohru Nagano, Gakuto Kurata, Samuel Thomas, "Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models", in Proceedings of INTERSPEECH 2018, September 2018
  • Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy, "Language Modeling with Highway LSTM", in Proceedings of ASRU 2017, Okinawa, Japan, December 2017
  • Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, George Saon, "Empirical Exploration of Novel Architectures and Objectives for Language Models", in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall, "English Conversational Telephone Speech Recognition by Humans and Machines", in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata, Samuel Thomas, Jia Cui, Bhuvana Ramabhadran, “Efficient knowledge distillation from an ensemble of teachers”, in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • Michael Heck, Masayuki Suzuki, Takashi Fukuda, Gakuto Kurata, Satoshi Nakamura, “Ensemble of multi-scale VGG acoustic models”, in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • Masayuki Suzuki, Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ken Church, Mark Drake, “Symbol sequence search from telephone conversation", in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • Osamu Ichikawa, Takashi Fukuda, Gakuto Kurata, Steven J. Rennie, “Factorial modeling for effective suppression of directional noise”, in Proceedings of INTERSPEECH 2017, Stockholm, Sweden, August 2017
  • Takashi FUKUDA, Osamu ICHIKAWA, Gakuto KURATA, Ryuki TACHIBANA, Samuel Thomas, Bhuvana Ramabhadran, "Effective Joint Training of Denoising Feature Space Transforms and Nueral Network Based Acoustic Models", in Proceedings of ICASSP 2017, March 2017
  • Osamu ICHIKAWA, Takashi FUKUDA, Masayuki SUZUKI, Gakuto KURATA, Bhuvana Ramabhadran, "Harmonic Feature Fusion for Robust Neural Network-based Acoustic Modeling", in Proceedings of ICASSP 2017, March 2017
  • Gakuto KURATA, Bing Xiang, Bowen Zhou, Mo Yu,"Leveraging Sentence-level Information with Encoder LSTM for Semantic Slot Filling", in Proceedings of EMNLP 2016, Austin, U.S.A., November 2016
  • Gakuto KURATA, Brian Kingsbury, "Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling", in Proceedings of INTERSPEECH 2016, San Francisco, U.S.A., September 2016
  • Gakuto KURATA, Bing Xiang, Bowen Zhou, "Labeled Data Generation with Encoder-Decoder LSTM for Semantic Slot Filling", in Proceedings of INTERSPEECH 2016, San Francisco, U.S.A., September 2016
  • Gakuto KURATA, Bing Xiang, Bowen Zhou, "Improved Neural Network-based Multi-label Classification with Better Initialization Leveraging Label Co-occurrence", in Proceedings of NAACL/HLT 2016, San Diego, U.S.A., June 2016
  • Gakuto KURATA, Daniel Willett, "Deep Neural Network Training Emphasizing Central Frames", in Proceedings of INTERSPEECH 2015, Dresden, Germany, September 2015
  • Masayuki SUZUKI, Gakuto KURATA, Tohru NAGANO, Ryuki TACHIBANA, "Speech Recognition Robust Against Speech Overlapping in Monaural Recordings of Telephone Conversations", in Proceedings of ICASSP 2016, March 2016
  • Nobuyasu ITOH, Gakuto KURATA, Ryuki TACHIBANA, Masafumi NISHIMURA,  "A Metric for Evaluating Speech Recognizer Output Based on Human-perception Model", in Proceedings of INTERSPEECH 2015, September 2015
  • Masayuki SUZUKI, Gakuto KURATA, Masafumi NISHIMURA, Nobuaki MINEMATSU, "Discriminative Reranking for LVCSR Leveraging Invariant Structure", in Proceedings of INTERSPEECH 2012, September 2011
  • Masayuki SUZUKI, Gakuto KURATA, Masafumi NISHIMURA, Nobuaki MINEMATSU, "Continuous Digits Recognition Leveraging Invariant Structure", in Proceedings of INTERSPEECH 2011, pp.993-996, Florence, Italy, August 2011
  • Gakuto KURATA, Nobuyasu ITOH, Masafumi NISHIMURA, "Acoustic Model Training with Detecting Transcription Errors in the Training Data", in Proceedings of INTERSPEECH 2011, pp.1689-1692, Florence, Italy, August 2011
  • Gakuto KURATA, Nobuyasu ITOH, Masafumi NISHIMURA, Abhinav Sethy, Bhuvana Ramabhadran, "Named Entity Recognition from Conversational Telephone Speech Leveraging Word Confusion Networks for Training and Recognition", in Proceedings of ICASSP 2011, pp.5576-5579, Prague, Czech Republic, May 2011
  • Gakuto KURATA, Nobuyasu ITOH, Masafumi NISHIMURA, "Training of Error-corrective Model for ASR without Using Audio Data", in Proceedings of ICASSP 2011, pp.5572-5575, Prague, Czech Republic, May 2011
  • Gakuto KURATA, Osamu ICHIKAWA, Masafumi NISHIMURA,  "Speech Input Method in Automobiles Reflecting Analysis on How Users Speak", The IEICE transactions on information and systems, Vol.J93-D, No.10, pp.2107-2117, October 2010
  • Gakuto KURATA, Nobuyasu ITOH, Masafumi NISHIMURA, "Acoustically Discriminative Training for Language Models", in Proceedings of ICASSP 2009, pp.4717-4720, Taipei, Taiwan, April 2009
  • Ryuki TACHIBANA, Tohru NAGANO, Gakuto KURATA, Masafumi NISHIMURA, Noboru BABAGUCHI,  "Preliminary Experiments toward Automatic Generation of New TTS Voices from Recorded Speech Alone", in Proceedings of INTERSPEECH 2007, Antwerp, Belgium, August 2007
  • Gakuto KURATA, Shinsuke MORI, Nobuyasu ITOH, Masafumi NISHIMURA, "Unsupervised Lexicon Acquisition from Speech and Text", in Proceedings of ICASSP 2007, Vol.4, pp.421-424, Honolulu, U.S.A, April 2007
  • Shinsuke MORI, Daisuke TAKUMA, Gakuto KURATA, "Phoneme-to-Text Transcription System with an Infinite Vocabulary", in Proceedings of COLING-ACL 2006, Sydney, Australia, July 2006
  • Gakuto KURATA, Shinsuke MORI, Masafumi NISHIMURA, "Unsupervised Adaptation of a Stochastic Language Model Using a Japanese Raw Corpus", in Proceedings of ICASSP 2006, Vol.1, pp.1037-1040, Toulouse, France, May 2006
  • Shinsuke MORI, Gakuto KURATA, "Class-based Variable Memory Length Markov Model", in Proceedings of INTERSPEECH 2005, pp.13-16, Lisbon, Portugal, July 2005
  • Gakuto KURATA, Naoaki OKAZAKI, Mitsuru ISHIZUKA, "GDQA: Graph Driven Question Answering System - NTCIR-4 QAC2 Experiments -", in Working Notes of NTCIR-4, Tokyo, Japan, June 2004
  • Nobuaki MINEMATSU, Gakuto KURATA, Keikichi HIROSE, "Corpus-based analysis of production and perception of Japanese English in view of the entire phonemic system of English," in Proceedings of ICPhS, pp.1569-1572, August 2003
  • Nobuaki MINEMATSU, Gakuto KURATA, Keikichi HIROSE, "Integration of MLLR Adaptation with Pronunciation Proficiency Adaptation for Non-Native Speech Recognition", in Proceedings of ICSLP 2002, Denver, U.S.A., September 2002
  • Nobuaki MINEMATSU, Gakuto KURATA, Keikichi HIROSE, "Corpus-Based Analysis of English Spoken by Japanese Students in View of the Entire Phonemic System of English", in Proceedings of ICSLP 2002, Denver, U.S.A., September 2002

Journal Papers

Chapter in Book

Domestic Conference Paper