Brian Kingsbury

Overview

Brian Kingsbury

Pronouns

He/Him/His

Title

Distinguished research scientist and manager, speech technologies

Location

IBM Research - Yorktown Heights Yorktown Heights, NY USA

Bio

My Background

I've been a researcher at IBM since 1999. Before that, I was working on my PhD in computer science at the International Computer Science Institute and the University of California, Berkeley. I earned my undergraduate degree in electrical engineering at Michigan State University.

Things I Do

I do research on machine learning and large vocabulary speech recognition, with a focus on acoustic modeling and robustness. I currently serve as a senior area editor for IEEE/ACM Transactions on Audio, Speech, and Language Processing, an action editor for Transactions of Machine Learning Research, and an associate editor for IEEE Transactions on Pattern Analysis and Machine Intelligence.

Publications

Semi-Autoregressive Streaming ASR With Label Context
- - Siddanth Arora
  - George Saon
  - et al.
- 2024
- ICASSP 2024
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization
- - A F M Saif
  - Xiaodong Cui
  - et al.
- 2024
- ICASSP 2024
High-Dimensional Smoothed Entropy Estimation via Dimensionality Reduction
- - Yuancheng Yu
  - Kristjan Greenewald
  - et al.
- 2023
- ISIT 2023
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding Systems
- - Samuel Thomas
  - Jeff Kuo
  - et al.
- 2022
- ICASSP 2022
Towards End-to-end Integration of Dialog History For Improved Spoken Language Understanding
- - Vishal Sunder
  - Samuel Thomas
  - et al.
- 2022
- ICASSP 2022
Improving End-to-End Models for Set Prediction in Spoken Language Understanding
- - Jeff Kuo
  - Zoltan Tuske
  - et al.
- 2022
- ICASSP 2022
Understanding Unequal Gender Classification Accuracy from Face Images
- - Vidya Muthukumar
  - Tejaswini Pedapati
  - et al.
- 2018
- arXiv

Visit Google Scholar

Patents

- 17 Apr 2024
- DE
- 11 2020 003 449
Soft-forgetting For Connectionist Temporal Classification Based Automatic Speech Recognition
- 11 Mar 2024
- US
- 11929062
End-to-end Spoken Language Understanding Without Full Transcripts
- 26 Feb 2024
- US
- 11914678
Input Encoding For Classifier Generalization
- 19 Feb 2024
- US
- 11908454
Integrating Text Inputs For Training And Adapting Neural Network Transducer Asr Models
- 19 Feb 2024
- US
- 11908458
Customization Of Recurrent Neural Network Transducers For Speech Recognition
- 10 Jan 2024
- TW
- I829312
Integrating Text Inputs For Training And Adapting Neural Network Transducer Asr Models
- 05 Sep 2023
- GB
- 2602227
Fast - Soft-forgetting For Connectionist Temporal Classification Based Automatic Speech Recognition
- 30 Jan 2023
- US
- 11568858
Transliteration Based Data Augmentation For Training Multilingual Asr Acoustic Models In Low Resource Settings
- 25 Oct 2021
- US
- 11158303
Soft-forgetting For Connectionist Temporal Classification Based Automatic Speech Recognition
- 20 Aug 2018
- US
- 10056075
Systems And Methods For Accelerating Hessian-free Optimization For Deep Neural Networks By Implicit Preconditioning And Sampling

Top collaborators

ST

Samuel Thomas

Samuel Thomas

Senior Research Scientist - Speech Recognition and Spoken Language Understanding

GS

George Saon

George Saon

Speech strategy lead, distinguished research scientist

KV

Kush Varshney

Kush Varshney

IBM Fellow

XC

Xiaodong Cui

Xiaodong Cui

Principal Research Scientist