Stephen Chu  Stephen Chu photo       

contact information

Mandarin STT
Thomas J. Watson Research Center, Yorktown Heights, NY USA
  +1dash914dash320dash9168

links



2015

Multi-view point registration via alternating optimization
Junchi Yan, Jun Wang, Hongyuan Zha, Xiaokang Yang, Stephen M Chu
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

A matrix decomposition perspective to multiple graph matching
Junchi Yan, Hongteng Xu, Hongyuan Zha, Xiaokang Yang, Huanxi Liu, Stephen Chu
Proceedings of the IEEE International Conference on Computer Vision, pp. 199--207, 2015

On machine learning towards predictive sales pipeline analytics
Junchi Yan, Chao Zhang, Hongyuan Zha, Min Gong, Changhua Sun, Jin Huang, Stephen Chu, Xiaokang Yang
Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Sales pipeline win propensity prediction: a regression approach
Junchi Yan, Min Gong, Changhua Sun, Jin Huang, Stephen M Chu
Integrated Network Management (IM), 2015 IFIP/IEEE International Symposium on, pp. 854--857

Discrete hyper-graph matching
Junchi Yan, Chao Zhang, Hongyuan Zha, Wei Liu, Xiaokang Yang, Stephen M Chu
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1520--1528, 2015


2014

Towards elimination of mother-to-child transmission of HIV: performance of different models of care for initiating lifelong antiretroviral therapy for pregnant women in Malawi (Option B+)
Monique van Lettow, Richard Bedell, Isabell Mayuni, Gabriel Mateyu, L, Megan es, Adrienne K Chan, Vanessa van Schoor, Teferi Beyene, Anthony D Harries, Stephen Chu, others
Journal of the International AIDS Society 17(1), 2014

Graduated consistency-regularized optimization for multi-graph matching
Junchi Yan, Yin Li, Wei Liu, Hongyuan Zha, Xiaokang Yang, Stephen Mingyu Chu
Computer Vision--ECCV 2014, pp. 407--422, Springer


2013

Joint optimization for consistent multiple graph matching
Junchi Yan, Yu Tian, Hongyuan Zha, Xiaokang Yang, Ya Zhang, Stephen Chu
Proceedings of the IEEE International Conference on Computer Vision, pp. 1649--1656, 2013


2012

Improving arabic broadcast transcription using automatic topic clustering
Stephen M Chu, Lidia Mangu
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, pp. 4449--4452


2011

The IBM 2009 GALE Arabic speech transcription system
Brian Kingsbury, Hagen Soltau, George Saon, Stephen Chu, Hong-Kwang Kuo, Lidia Mangu, Suman Ravuri, Nelson Morgan, Adam Janin
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 4672--4675

The IBM 2011 GALE Arabic speech transcription system
Lidia Mangu, Hong-Kwang Kuo, Stephen Chu, Brian Kingsbury, George Saon, Hagen Soltau, Fadi Biadsy
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on, pp. 272--277


2010

The 2009 IBM GALE Mandarin broadcast transcription system
Stephen M Chu, Daniel Povey, Hong-Kwang Kuo, Lidia Mangu, Shilei Zhang, Qin Shi, Yong Qin
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp. 4374--4377

Enhanced word classing for model M.
Stanley F Chen, Stephen M Chu
INTERSPEECH, pp. 1037--1040, 2010

The IBM 2008 GALE Arabic speech transcription system
George Saon, Hagen Soltau, Upendra Chaudhari, Stephen Chu, Brian Kingsbury, Hong-Kwang Kuo, Lidia Mangu, Daniel Povey
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp. 4378--4381

Spoken English Assessment System for Non-Native Speakers Using Acoustic and Prosodic Features
Q Shi, K Li, S L Zhang, S M Chu, J Xiao, Z J Ou
Eleventh Annual Conference of the International Speech Communication Association, 2010

Speaking rate adaptation using continuous frame rate normalization
S M Chu, D Povey
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp. 4306--4309

Enhanced word classing for model m
S F Chen, S M Chu
Eleventh Annual Conference of the International Speech Communication Association, 2010


2009

Locality preserving speaker clustering
S M Chu, H Tang, T S Huang
Multimedia and Expo, 2009, pp. 494--497

Sensitive Talking Heads [Applications Corner]
T S Huang, M A Hasegawa-Johnson, S M Chu, Z Zeng, H Tang
Signal Processing Magazine, IEEE 26(4), 67--72, IEEE, 2009

Approaches to Speech Recognition based on Speaker Recognition Techniques
D. Povey, S. Chu, J. Pelecanos and H. Soltau
Handbook of Natural Language Processing and Machine Translation, 2009

Generative model-based speaker clustering via mixture of von Mises-Fisher distributions
H Tang, S M Chu, T S Huang
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4101--4104

Fishervoice and semi-supervised speaker clustering
S M Chu, H Tang, T S Huang
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4089--4092

Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR (PDF)
S Zhang, Q Shi, S M Chu, Y Qin
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4561--4564

Spherical discriminant analysis in semi-supervised speaker clustering
H Tang, S M Chu, T S Huang
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pp. 57--60

EMOTION RECOGNITION FROM SPEECH VIA BOOSTED GAUSSIAN MIXTURE MODELS
H Tang, STEPHEN CHU, M Hasegawa
Johnson, TS … - isle.uiuc.edu, 2009

Spherical Discriminant Analysis in Semi-supervised Speaker Clustering}
H Tang, S Chu, T Huang
Proceedings of Human Language Technologies: The …, 2009 - aclweb.org


2008

Recent advances in the IBM GALE mandarin transcription system
Stephen M Chu, Hong-Kwang Kuo, Lidia Mangu, Yi Liu, Yong Qin, Qin Shi, Shi Lei Zhang, Hagai Aronowitz
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4329--4332

Search and classification based language model adaptation
Q Shi, S M Chu, W Liu, H K J Kuo, Y Liu, Y Qin
Ninth Annual Conference of the International Speech Communication Association, 2008

UNIVERSAL BACKGROUND MODEL BASED SPEECH RECOGNITION1
D Povey, S M Chu, B Varadarajan
IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, pp. 4561--4564


Universal background model based speech recognition
D Povey, SM Chu, B Varadarajan
IEEE International Conference on Acoustics, Speech …, 2008 - ieeexplore.ieee.org

Quick fmllr for speaker adaptation in speech recognition
B Varadarajan, D Povey, SM Chu
IEEE International Conference on Acoustics, Speech …, 2008 - ieeexplore.ieee.org

Recent advances in the IBM GALE mandarin transcription system
S M Chu, H K Kuo, L Mangu, Y Liu, Y Qin, Q Shi, S L Zhang, H Aronowitz
Acoustics, Speech and Signal Processing, 2008, pp. 4329--4332


2007

The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms
K Choukri, G Potamianos, SM Chu, A Tyagi, JR …
Language Resources and Evaluation, 2007 - Springer

Audio-Visual Speech Fusion Using Coupled Hidden Markov Models
S M Chu, T S Huang
IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1--2

Fusion of multiple camera views for kernel-based 3D tracking
A Tyagi, G Potamianos, JW Davis, SM Chu
IEEE Workshop on Motion and Video Computing, …, 2007 - ieeexplore.ieee.org

The IBM Mandarin Broadcast Speech Transcription System
SM Chu, H Kuo, YY Liu, Y Qin, Q Shi, G Zweig
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. …, 2007 - ieeexplore.ieee.org


2006

Person tracking in smart rooms using dynamic programming and adaptive subspace learning
Z Zhang, G Potamianos, S M Chu, J Tu, T S Huang
Proc, pp. 2061--2064, 2006

A comparison of multicamera person-tracking algorithms
Senior, G Potamianos, S Chu, Z Zhang, A …
… IEEE International Workshop Visual Surveillance (VS …, 2006 - andrewsenior.com

Person tracking in smart rooms using dynamic programming and adaptive subspace …
Z Zhangy, G Potamianos, SM Chu, J Tu, TS …
2006 IEEE International Conference on Multimedia …, 2006 - ieeexplore.ieee.org

Automatic speech recognition and speech activity detection in the CHIL smart room
SM Chu, E Marcheret, G Potamianos
Lecture Notes in Computer Science, 2006 - Springer

Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program
Y Qin, Q Shi, Y Liu, H Aronowitz, S Chu, H K Kuo, G Zweig
Chinese Spoken Language Processing, 410--421, Springer, 2006


2005

Automatic speech activity detection, source localization, and speech recognition on the CHIL seminar corpus
D Macho, J Padrell, A Abad, C Nadeu, J Hernando, J McDonough, M Wolfel, U Klee, M Omologo, A Brutti, others
2005 IEEE International Conference on Multimedia and Expo, pp. 876--879

Automatic speech activity detection, source localization, and speech recognition on the …
A Brutti, P Svaizer, G Potamianos, SM Chu
IEEE International Conference on Multimedia and …, 2005 - ieeexplore.ieee.org

A joint system for person tracking and face detection
Z Zhang, G Potamianos, A Senior, S Chu, TS …
Lecture notes in computer science, 2005 - Springer

Automatic speech recognition and speech activity detection in the CHIL seminar room
SM Chu, E Marcheret, G Potamianos
Proc. Joint Works. on Multimodal Interaction and Related …, 2005


2004

Mutual information based visual feature selection for lipreading.
Patricia Scanlon, Gerasimos Potamianos, Vit Libal, Stephen M Chu
INTERSPEECH, 2004

System and method for likelihood computation in multi-stream HMM based speech recognition
S M Chu, V Goel, E Marcheret, G Potamianos
US Patent App. 10/ ..., 2004 - Google Patents, Google Patents
US Patent App. 10/946,381

Audio visual word spotting
M Liu, Z Xiong, SM Chu, Z Zhang, TS Huang
IEEE International Conference on Acoustics, Speech, …, 2004 - ieeexplore.ieee.org

Mutual information based visual feature selection for lipreading
P Scanlon, G Potamianos, V Libal, SM Chu
Eighth International Conference on Spoken Language …, 2004 - ISCA

Multistage information fusion for audio-visual speech recognition
SM Chu, V Libal, E Marcheret, C Neti, G Potamianos
Multimedia and Expo, 2004, pp. 1651--1654

Efficient Likelihood Computation in Multi-Stream HMM Based Audio-Visual Speech Recognition
E Marcheret, S M Chu, V Goel, G Potamianos
Eighth International Conference on Spoken Language Processing, 2004

Towards practical deployment of audio-visual speech recognition
G Potamianos, C Neti, J Huang, JH Connell, S Chu, V Libal, E Marcheret, N Haas, J Jiang
Acoustics, Speech, and Signal Processing, 2004, pp. iii--777


2003

Environment-adaptive multi-channel biometrics
SM Chu, M Yeung, L Liang, X Liu
2003 IEEE International Conference on Acoustics, …, 2003 - ieeexplore.ieee.org

Multimodal fusion with applications to audio-visual speech recognition
STEPHEN CHU
… d. dissertation, University of Illinois at Urbana-Champaign …, 2003

Music summarization system and method
B T Logan, S M Chu
US Patent 6,633,845, 2003 - Google Patents, Google Patents
US Patent 6,633,845


2002

An experimental study of coupled hidden Markov models
SM Chu, TS Huang
IEEE International Conference on Acoustics, Speech, …, 2002 - ieeexplore.ieee.org

Audio-visual speech modeling using coupled hidden Markov models
SM Chu, TS Huang
IEEE International Conference on Acoustics, Speech, …, 2002 - ieeexplore.ieee.org

Multimodal Dialog Systems Research at Illinois
K Chen, STEPHEN CHU, A Gaig, Z Jing, D Li, J Lin, M …
isle.uiuc.edu, 2002


2001



2000

Music summarization using key phrases
B Logan, S Chu
2000 IEEE International Conference on Acoustics, …, 2000 - ieeexplore.ieee.org

Automatic Head Gesture Learning and Synthesis from Prosodic Cues
SM Chu, TS Huang
Sixth International Conference on Spoken Language …, 2000 - ISCA

Bimodal speech recognition using coupled hidden Markov models
SM Chu, TS Huang
Sixth International Conference on Spoken Language …, 2000 - ISCA

Speech/gesture interface to a visual-computing environment
Pavlovic, TS Huang, Z Lo, S Chu, Y Zhao, JC …
IEEE Computer Graphics and Applications, 2000 - doi.ieeecomputersociety.org


1999

Model compensation methods for robust speech recognition
SM Chu
1999 - University of Illinois at Urbana-Champaign


1998

Robust speech recognition using discriminative stream weighting and parameter …
SM Chu, Y Zhao
Fifth International Conference on Spoken Language …, 1998 - ISCA


1997

A visual computing environment for very large scale biomolecular modeling
TS Huang, VI Pavlovic, Y Zhao, Z Lo, S Chu …
Proceedings of the IEEE International Conference on …, 1997 - www-s.ks.uiuc.edu


1996

Speech/gesture interface to a visual computing environment for molecular biologists
Pavlovic, Y Zhao, Z Lo, STEPHEN CHU, K Schulten, A …
Proceedings of 13th ICPR, 1996


1973

… -Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE …
YSQ Qin, YYAH Liu, SMKHK Chu, G Zweig
Lecture Notes in Computer Science, 2006 - Berlin: Springer-Verlag, 1973-


Year Unknown

The 2009 IBM GALE Mandarin broadcast transcription system
S M Chu, D Povey, H K Kuo, L Mangu, S Zhang, Q Shi, Y Qin
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp. 4374--4377

Sensitive Talking Heads
T S Huang, M A Hasegawa-Johnson, S M Chu, Z Zeng, H Tang
Johnson, SM Chu, ... - isle.illinois.edu, 0

Thomas S. Huang, University of Illinois at Urbana-Champaign Zhigang Zhu, City College, City University of New York
Y Tian, TJ IBM, T Boult, K W Bowyer, R Chellappa, S M Chu, L S Davis, R Duraiswami, J Houser, R Jain, others
csdl.computer.org, 0