User Interface Technologies (discontinued) Publications



2014



Automatic Speech Recognition
Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen Chu, Fadi Biadsy
Natural Language Processing of Semitic Languages, pp. 409--459, Springer Berlin Heidelberg, 2014


2013

Fast Face Detector Training Using Tailored Views
Kristina Scherbaum, Rogerio Feris, James Petterson, Volker Blanz
IEEE International Conference on Computer Vision (ICCV), 2013

Spatio-Temporal Fisher Vector Coding for Surveillance Event Detection
Qiang Chen, Yang Cai, Lisa Brown, Ankur Datta, Quanfu Fan, Rogerio Feris, Shuicheng Yan, Alex Hauptmann, Sharathchandra Pankanti
ACM Multimedia, 2013



2012

Unsupervised model selection for view-invariant object detection in surveillance environments
Behjat Siddiquie, Rogerio S Feris, Ankur Datta, Larry S Davis
Pattern Recognition (ICPR), 2012 21st International Conference on, pp. 3252--3255

Appearance Modeling for Person Re-identification using Weighted Brightness Transfer Functions
Ankur Datta, Lisa M Brown, Rogerio Feris, Sharathchandra Pankanti
21st International Conference on Pattern Recognition (ICPR), November, pp. 2367--2370, 2012


2011



2010

Cardiac disease detection from echocardiogram using edge filtered scale-invariant motion features
Ritwik Kumar, Fei Wang, David Beymer, Tanveer Syeda-Mahmood
Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pp. 162--169

Shape-based similarity retrieval of Doppler images for clinical decision support
T Syeda-Mahmood, P Turaga, D Beymer, F Wang, A Amir, H Greenspan, K Pohl
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pp. 855--862

Tracking Motion, Deformation, and Texture using Conditionally Gaussian Processes
T K Marks, J R Hershey, J R Movellan
IEEE Transactions on Pattern Analysis and Machine Intelligence 32(2), 348--363, IEEE Computer Society, 2010

Restructuring Acoustic Models for Client and Server Based Automatic Speech Recognition
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
Spoken Query, to appear, 2010

The monaural speech separation and recognition challenge
M Cooke, J R Hershey, S J Rennie
Computer Speech & Language 24(1), 1--15, Elsevier, 2010

Super-human multi-talker speech recognition: A graphical modeling approach
J R Hershey, S J Rennie, P A Olsen, T T Kristjansson
Computer Speech & Language 24(1), 45--66, Elsevier, 2010


2009

Automatic estimation of left ventricular dysfunction from echocardiogram videos
D Beymer, T Syeda-Mahmood, A Amir, Fei Wang, S Adelman
Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on, pp. 164--171, IEEE

Closed-form Jensen-Renyi divergence for mixture of Gaussians and applications to group-wise shape registration
Fei Wang, Tanveer Syeda-Mahmood, Baba C Vemuri, David Beymer, Anand Rangarajan
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2009, pp. 648--655, Springer Berlin Heidelberg

Refactoring acoustic models using variational density approximation
P L Dognin, J R Hershey, V Goel, P A Olsen
ICASSP, 2009

A fast, accurate approximation to log likelihood of Gaussian mixture models
P L Dognin, V Goel, J R Hershey, P A Olsen
2009 - computer.org

Isometry-enforcing data transformations for improving sparse model learning
Avishy Carmi, Irina Rish, Guillermo Cecchi, Dimitri Kanevsky, Bhuvana Ramabhadran
IBM Tech Report RC24801, Tech. Rep. RC 24801, Human Language Technologies, IBM, 2009

Natural language system and method based on unisolated performance metric
S Deligne, Y Gao, V Goel, H K Kuo, C Wu
US Patent 7,574,358

Designing interactive voice response (IVR) interfaces: localisation for low literacy users
A Sharma Grover, O Stewart, D Lubensky
Proceedings of the Conference, pp. 22--24, 2009

Variational Loopy Belief Propagation for Efficient Multi-talker Speech Recognition
Steven J. Rennie, John R. Hershey and Peder A. Olsen
Interspeech, pp. 1331-1334, 2009

Speech Transcription in AAL Solutions
A. Sorin and R. Hoory
Workshop on Designing Ambient Interactions for Older Users, 2009

Two-Wire Nuisance Attribute Projection
Y. Solewicz, H. Aronowitz
Interspeech, 2009

Optimal quantization and bit allocation for compressing large discriminative feature space transforms
E. Marcheret, V. Goel, P.A. Olsen
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on, pp. 64--69

RTTS: Towards Enterprise-level Real-Time Speech Transcription and Translation Services
J M Huerta, C Wu, A Sakrajda, S Caskey, E E Jan, A Faisman, S Ben-David, W Liu, A Lee, O Stewart, others
Tenth Annual Conference of the International Speech Communication Association, 2009

Designing crowdsourcing community for the enterprise
O Stewart, J M Huerta, M Sader
Proceedings of the ACM SIGKDD Workshop on Human Computation, pp. 50--53, 2009

Web derived pronunciations for spoken term detection
D Can, E Cooper, A Ghoshal, M Jansche, S Khudanpur, B Ramabhadran, M Riley, M Saraclar, A Sethy, M Ulinski, others
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp. 83--90, 2009

Unsupervised pronunciation validation
C M White, A Sethy, B Ramabhadran, P Wolfe, E Cooper, M Saraclar, J K Baker
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4301--4304

Multimodal Classification of Activities of Daily Living Inside Smart Homes
V Libal, B Ramabhadran, N Mana, F Pianesi, P Chippendale, O Lanz, G Potamianos
Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living, pp. 694, 2009

A generalized family of parameter estimation techniques
D Kanevsky, T N Sainath, B Ramabhadran
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 1725--1728

Iterative Sentence--Pair Extraction from Quasi--Parallel Corpora for Machine Translation
R Sarikaya, S Maskey, R Zhang, E Jan, D Wang, B Ramabhadran, S Roukos
Interspeech, 2009

Cultural voice markers in speech-to-speech machine translation systems
O Stewart, M Picheny, D Lubensky, B Ramabhadran
Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 313--316, ACM New York, NY, USA


Combined Discriminative Training for Multi-Stream HMM-based Audio-Visual Speech Recognition
Jing Huang, Karthik Visweswariah
Interspeech, 2009

Long-Time Span Acoustic Activity Analysis From Far-Field Sensors In Smart Homes
Jing Huang, V Zhuang
ICASSP, 2009

Acoustic fall detection using Gaussian mixture models and GMM supervectors
X Zhuang, Jing Huang, G Potamianos, M Hasegawa
… - isle.uiuc.edu, 2009

Automatic Speech Recognition
G Potamianos, L Lamel, M Wolfel, J Huang, E …
Computers in the Human Interaction Loop, 2009 - books.google.com

Effects of real-time transcription on non-native speaker's comprehension in computer-mediated communications
Y.X. Pan, D.N. Jiang, M Picheny, Y Qin
ACM CHI, Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009

Single-channel speech separation and recognition using loopy belief propagation
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3845--3848


Refactoring acoustic models using variational density approximation
P.L. Dognin, J.R. Hershey, V. Goel, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 4473--4476

A fast, accurate approximation to log likelihood of Gaussian mixture models
P.L. Dognin, V. Goel, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3817--3820

Acoustic modeling using exponential families
V. Goel, P.A. Olsen
Tenth Annual Conference of the International Speech Communication Association, 2009

RTTS: Towards Enterprise-level Real-Time Speech Transcription and Translation Services
Juan M Huerta, Cheng Wu, Andrzej Sakrajda, Sasha P Caskey, Ea-Ee Jan, Antonio R Lee, Osamuyimen T Stewart, Alexander Faisman, David M Lubensky
Interspeech, 2009

A new method for OOV detection using hybrid word/fragment system
A Rastrow, A Sethy, B Ramabhadran
Proceedings of the 2009 IEEE International …, 2009 - doi.ieeecomputersociety.org

An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation
A Sethy, PG Georgiou, B Ramabhadran, S Narayanan
IEEE Transactions on Audio, Speech, and Language Processing, 2009 - sail.usc.edu

Fast decoding for open vocabulary spoken term detection
B Ramabhadran, A Sethy, J Mamou, B Kingsbury, U …
Proceedings of Human Language Technologies: The 2009 Annual …, 2009 - aclweb.org

Effect of pronunciations on oov queries in spoken term detection
D Can, E Cooper, A Sethy, B Ramabhadran, M …
ICASSP, April, 2009 - jhu.edu



Automatic Speech Recognition
Lamel, M Wolfel, J Huang, E Marcheret, C Barras, X …
Computers in the Human Interaction Loop, 2009 - books.google.com


Refactoring Acoustic Models using Variational Expectation-Maximization
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
ICASSP, 2009


2008

Spatio-temporal motion estimation for disease discrimination in cardiac echo videos
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2008, pp. 121--124

There are many ways to watch people as they use web
D Beymer, D M Russell
2008 - en.scientificcommons.org

Inferring transducer viewpoints in cardiac echo videos
D Beymer, T Syeda-Mahmood, F Wang
Computers in Cardiology, 2008, pp. 117--120

Vector based Approaches to Semantic Similarity Measures
J M Huerta
Advances in Natural Language Processing and Applications, 163, Citeseer, 2008

Enhancing Speaker Recognition with Virtual Examples
Y Solewicz, H Aronowitz
2008 - eprints.pascal-network.org

Algorithm Optimizations: Low Computational Complexity
M Novak
Automatic speech recognition on mobile devices and over communication networks, 213, Springer, 2008

Using robust audio and video processing technologies to alleviate the elderly cognitive decline
V Mylonakis, J Soldatos, A Pnevmatikakis, L Polymenakos, A Sorin, H Aronowitz
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments, pp. 28, 2008

Time-compressing speech: ASR transcripts are an effective way to support gist extraction
S Tucker, N Kyprianou, S Whittaker
Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction, pp. 235, 2008


A paralinguistic template for creating persona in interactive voice response (IVR) systems
Osamuyimen Stewart
Emotions in the Human Voice: Volume III Culture and Perception, Plural, 2008

Improving Large Scale Alphanumeric String Recognition using Redundant Information
E E Jan, O Stewart, R Co, D Lubensky
ICSLP 2008

Research and Commercial Spoken Dialogue Systems
R Pieraccini, J M Huerta
Recent Trends in Discourse and Dialogue, 1, Springer, 2008

Relative rank statistics for dialog analysis
J M Huerta
Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 965--972, 2008

System and method for a device sound interface manager
A Aaron, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,153, 2008

Method and system for capabilities learning
S H Basson, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,709, 2008

The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge
R Fernandez, Z Kons, S Shechtman, Z W Shuang, R Hoory, B Ramabhadran, Y Qin
Blizzard Workshop, 2008

Speaker Recognition in Two-Wire Test Sessions
H Aronowitz, Y A Solewicz
Ninth Annual Conference of the International Speech Communication Association, 2008

Online vocabulary adaptation using contextual information and information retrieval
H Aronowitz
Ninth Annual Conference of the International Speech Communication Association, pp. 1805--1808, 2008

Recent advances in the IBM GALE mandarin transcription system
S M Chu, H K Kuo, L Mangu, Y Liu, Y Qin, Q Shi, S L Zhang, H Aronowitz
Acoustics, Speech and Signal Processing, 2008, pp. 4329--4332

Audio-based unsupervised segmentation of multiparty dialogue
PY Hsueh
IEEE International Conference on Acoustics, Speech …, 2008 - ieeexplore.ieee.org

Automatic decision detection in meeting speech
PY Hsueh, JD Moore
Lecture Notes in Computer Science, 2008 - Springer

A Hardware-Software Framework for High-Reliability People Fall Detection
A Grassi, Jing Huang, G. Potamianos
IEEE SENSORS, 2008

A multi-sensor approach for people fall detection in the home environment
Jing Huang, G A Leone
ECCV, 2008

Effective Acoustic Adaptation for A Distant-talking Interactive TV System
Jing Huang, Marco Matassoni, Mark E Epstein
Interspeech, 2008


Dynamic language model mixtures with history-based buckets
M. Franz, P.A. Olsen, others
US Patent 7,395,205



Variational Bhattacharyya divergence for hidden Markov models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4557--4560

Accelerated Monte Carlo for Kullback-Leibler divergence between Gaussian mixture models
J.Y. Chen, J.R. Hershey, P.A. Olsen, E. Yashchin
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4553--4556

Efficient model-based speech separation and denoising using non-negative subspace analysis
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 1833--1836

Speech transformation solutions
D Kanevsky, S Basson, A Faisman, L Rachevsky, A …
Cognition Distributed: How Cognitive Technology Extends Our …, 2008 - books.google.com


Optimizing speech recognition grammars using a measure of similarity between hidden Markov models
B Mohanty, J Hershey, P Olsen, S Kozat, V Goel
Acoustics, Speech and Signal Processing, 2008, pp. 4953--4956

Bag-Of-Word normalized n-gram models
A Sethy, B Ramabhadran
2008

A study of unsupervised clustering techniques for language modeling
S Hahn, A Sethy, H K J Kuo, B Ramabhadran
2008

Phonetic query expansion for spoken document retrieval
J Mamou, B Ramabhadran
Interspeech 2008

Generalization of Extended Baum-Welch Parameter Estimation for Discriminative Training and Decoding
D Kanevsky, T N Sainath, B Ramabhadran, D Nahamoo
Proceedings of the 9th annual conference of the international speech communication association, Brisbane, Australia, pp. 277--80, 2008

A New Family of Extended Baum-Welch Update Rules
Dimitri Kanevsky, Daniel Povey, Bhuvana Ramabhadran, Irina Rish, Tara Sainath
2008

Gradient steepness metrics using extended Baum-Welch transformations for universal pattern …
TN Sainath, D Kanevsky, B Ramabhadran
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. …, 2008 - ieeexplore.ieee.org

Boosted MMI for model and feature-space discriminative training
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4057--4060

Far-field multimodal speech processing and conversational interaction in smart spaces
G Potamianos, J Huang, E Marcheret, V Libal, R Balchandran, M Epstein, L Seredi, M Labsky, L Ures, M Black, others
Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, 2008

Automatic Speech Recognition in CHIL
G Potamianos, L Lamel, M Wölfel, J Huang, E Marcheret, C Barras, X Zhu, J McDonough, J Hernando, D Macho, others
iit.demokritos.gr, 2008

The IBM RT07 evaluation systems for speaker diarization on lecture meetings
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 497--508, Springer, 2008

The IBM rich transcription 2007 speech-to-text systems for lecture meetings
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 429--441, Springer, 2008



2007

Approximating the Kullback Leibler divergence between Gaussian mixture models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--317

Finding disease similarity by combining ECG with heart auscultation sound
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2007, pp. 261--264

AALIM: Multimodal mining for cardiac decision support
T Syeda-Mahmood, F Wang, D Beymer, A Amir, M Richmond, S Hashmi
Computers in Cardiology, 2007, pp. 209--212

Characterizing spatio-temporal patterns for disease discrimination in cardiac echo videos
T Syeda-Mahmood, F Wang, D Beymer, M London, R Reddy
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2007, pp. 261--269, Springer Berlin Heidelberg

How Predictable is ASR Confidence in Dialog Applications?
X Li, J M Huerta
Eighth Annual Conference of the International Speech Communication Association, 2007

Single channel speech separation using factorial dynamics
J.R. Hershey, T. Kristjansson, S. Rennie, P.A. Olsen
Advances in Neural Information Processing Systems 19, 593, MIT; 1998, 2007

Word confusability-measuring hidden Markov model similarity
J.Y. Chen, P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007

Discriminative training of subspace constrained GMMs for speech recognition
S Axelrod, V Goel, R Gopinath, P Olsen, K Visweswariah
IEEE Transactions on Speech and Audio Processing 15(1), 172-189, 2007

Bhattacharyya error and divergence using variational importance sampling
P Olsen, J Hershey
Interspeech, Antwerp, Belgium (August 2007)

Efficient speaker recognition using approximated cross entropy (ACE)
H Aronowitz, D Burshtein
Audio, Speech, and Language Processing, IEEE Transactions on 15(7), 2033--2043, IEEE, 2007

Discriminative estimation of subspace constrained gaussian mixture models for speech recognition
Scott Axelrod, Vaibhava Goel, Ramesh Gopinath, Peder Olsen, Karthik Visweswariah
Audio, Speech, and Language Processing, IEEE Transactions on 15(1), 172--189, IEEE, 2007

Variational probabilistic speech separation using microphone arrays
S. Rennie, P. Aarabi, B. Frey
IEEE Transactions on Speech and Audio Processing, 2007

Bhattacharyya error and divergence using variational importance sampling
P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007


2006

The IBM expressive text-to-speech synthesis system for American English
J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, M.A. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(4), 1099--1108, IEEE, 2006

Spoken document retrieval from call-center conversations
J Mamou, D Carmel, R Hoory
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 51--58, ACM, 2006

Super-human multi-talker speech recognition: The IBM 2006 speech separation challenge system
T. Kristjansson, J. Hershey, P. Olsen, S. Rennie, R. Gopinath
Ninth International Conference on Spoken Language Processing, pp. 97--100, 2006

Single channel speech separation using layered hidden Markov models
J Hershey, T Kristjansson, S Rennie, P Olsen
Advances in Neural Information Processing Systems (NIPS), 2006

Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation
L. Gu, Y. Gao, F.H. Liu, M. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(2), 377--392, IEEE, 2006


2005

Constructing ensembles of ASR systems using randomized decision trees
O Siohan, B Ramabhadran, B Kingsbury
Acoustics, Speech, and Signal Processing, 2005, pp. 197--200

A distance measure between GMMs based on the unscented transform and its application to speaker recognition
J Goldberger, H Aronowitz
Ninth European Conference on Speech Communication and Technology, 2005

fMPE: Discriminatively trained features for speech recognition
Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 961--964, Philadelphia, 2005

Reusable dialog component framework for rapid voice application development
R P Akolkar, T Faruquie, J Huerta, P Kankar, N Rajput, TV Raman, R U Udupa, A Verma
Component-Based Software Engineering, 306--321, Springer, 2005

Efficient speaker identification and retrieval
H Aronowitz, D Burshtein
Ninth European Conference on Speech Communication and Technology, pp. 2, 2005

Speaker indexing in audio archives using Gaussian mixture scoring simulation
H Aronowitz, D Burshtein, A Amir
Machine Learning for Multimodal Interaction, 243--252, Springer, 2005

Voicing features for robust speech detection
T. Kristjansson, S. Deligne, P. Olsen
Ninth European Conference on Speech Communication and Technology, pp. 3, 2005

Using semantic analysis to improve speech recognition performance
H. Erdogan, R. Sarikaya, S.F. Chen, Y. Gao, M. Picheny
Computer Speech & Language 19(3), 321--343, Elsevier, 2005

Subspace constrained Gaussian mixture models for speech recognition
Scott Axelrod, Vaibhava Goel, Ramesh A Gopinath, Peder A Olsen, Karthik Visweswariah
Speech and Audio Processing, IEEE Transactions on 13(6), 1144--1160, IEEE, 2005


2004

TV personalization system: design of a TV show recommender engine and interface
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, J …
Personalized Digital Television: Targeting Programs to …, 2004 - citeseerx.ist.psu.edu

Displaying search results
JA Martino, L Nikolovska, J Debont, J Zimmerman



TV Personalization System
D Schaffer, S Gutta, J Martino
Personalized Digital Television: Targeting Programs to Individual Viewers, 27, Kluwer Academic Pub, 2004

Personalized news retrieval system
J H Elenbaas, N Dimitrova, T McGee, M Simpson, J A Martino, M Abdel-Mottaleb, M Garrett, C Ramsey, H L Wu, R Desai
US Patent App. 10/932,460, 2004

Personalized Digital Television: Targeting Programs to Individual Viewers, volume 6 of Human-Computer Interaction Series, chapter 5
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, J Martino, S Gutta
Kluwer, 2004

Automatic recognition of spontaneous speech for access to multilingual oral history archives
M Picheny, J Psutka, B Ramabhadran, D Soergel, T …
Speech and Audio ..., 2004 - ieeexplore.ieee.org

Single microphone source separation using high resolution signal reconstruction
T Kristjansson, H Attias, J Hershey
Proc, pp. 817--820, 2004



The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation
A Sorin, T Ramabadran, D Chazan, R Hoory, M McLaughlin, D Pearce, F C R Wang, Y Zhang
Proc. ICASSP04, 2004

Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition
J Hershey, T Kristjansson, Z Zhang
Transfer10, 15, Citeseer, 2004

Text independent speaker recognition using speaker dependent word spotting
H Aronowitz, D Burshtein, A Amir
Eighth International Conference on Spoken Language Processing, 2004

The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction
T Ramabadran, A Sorin, M McLaughlin, D Chazan, D Pearce, R Hoory
Proc. ICASSP, Montreal, Quebec, Canada, 2004

3D tracking of morphable objects using conditionally Gaussian nonlinear filters
T K Marks, J Hershey, J C Roddey, J R Movellan
Conference on Computer Vision and Pattern Recognition Workshop, 2004, pp. 190--190, Citeseer


Look or listen: Discovering effective techniques for accessing speech data
S Whittaker, J Hirschberg
Proceedings of Human Computer Interaction, 207--222, Citeseer, 2004

Segmental minimum Bayes-risk decoding for automatic speech recognition
V Goel, S Kumar, W Byrne
IEEE Transactions on Speech and Audio Processing 12(3), 234--249, Citeseer, 2004

A corpus-based approach to <ahem/> expressive speech synthesis
E Eide, A Aaron, R Bakis, W Hamza, M Picheny, J Pitrelli
Proceedings of 5th ISSW, 79--84, Citeseer, 2004

Modeling inverse covariance matrices by basis expansion
P.A. Olsen, R.A. Gopinath
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. 37--46, IEEE, 2004


2003



Touch-screen image scrolling system and method
J Zimmerman, J A Martino
US Patent App. 10/736,938, 2003

Search user interface with enhanced accessibility and ease-of-use features based on visual metaphors
L Nikolovska, J A Martino, A F Camplin
US Patent 6,505,194, 2003

Towards automatic transcription of large spoken archives-english ASR for the MALACH project
B. Ramabhadran, J. Huang, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP'03). 2003 IEEE International Conference on, pp. I--216

An architecture for rapid decoding of large vocabulary conversational speech
G Saon, G Zweig, B Kingsbury, L Mangu, U Chaudhari
Eighth European Conference on Speech Communication and Technology, 2003

Recent improvements to the IBM trainable speech synthesis system
E. Eide, A. Aaron, R. Bakis, R. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, others
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP'03). 2003 IEEE International Conference on, pp. I--708

A hand-held speech-to-speech translation system
B. Zhou, Y. Gao, J. Sorensen, D. Déchelotte, M. Picheny
Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669

Minimum Bayes risk methods in automatic speech recognition
V Goel, W Byrne
Pattern Recognition in Speech and Language Processing, 51--80, 2003

Automatic hierarchical color image classification
J Huang, SR Kumar, R Zabih
EURASIP Journal on Applied Signal Processing, 2003 - hindawi.com



2002


Remote control program selection by genre
KI Trovato, P Rankin, D Pelletier, JA Martino, CC …




Histogram method for characterizing video content
JA Martino, N Dimitrova, JH Elenbaas, J Rutgers


User interface for reviewing and controlling use of data objects
J Zimmerman, J A Martino, G Roberts
US Patent App. 10/055,338, 2002

VCR-style transport for navigating electronic program guide (EPG) and other textual information
M A Jacquelyn, J Zimmerman
US Patent App. 10/071,392, 2002

Context and time sensitive profile builder
J Martino, J Zimmerman, G Roberts
US Patent App. 10/185,405, 2002

Method and system for displaying search results
J A Martino, L Nikolovska, J De Bont, J Zimmerman
US Patent App. 10/086,008, 2002

User interface providing automatic organization and filtering of search criteria
K Kurapati, L Nikolovska, J A Martino, A F Camplin
US Patent 6,499,029, 2002

Large-Vocabulary Speech Recognition Algorithms
M Padmanabhan, M Picheny
COMPUTER, 2002 - doi.ieeecomputersociety.org

Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, Ruhi Sarikaya
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--53

Supporting access to large digital oral history archives
S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, D. Greenberg
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pp. 18--27, 2002

Maximum entropy model for punctuation annotation from speech
J Huang, G Zweig
Seventh International Conference on Spoken Language …, 2002 - ISCA

Modeling with a subspace constraint on inverse covariance matrices
S. Axelrod, R. Gopinath, P. Olsen
Seventh International Conference on Spoken Language Processing, 2002

Maximum likelihood training of bases for rapid adaptation
K Visweswariah, V Goel, R Gopinath
Proc. ICSLP, 2002

Alignment-based codeword-dependent cepstral normalization
J M Huerta
Speech and Audio Processing, IEEE Transactions on 10(7), 451--459, IEEE, 2002

Audio-Visual Speech Separation Using Hidden Markov Models
J Hershey, M Casey
Proc. Advances in Neural Information Processing Systems 14, 2002

Audio-visual sound separation via hidden Markov models
J Hershey, M Casey
Advances in Neural Information Processing Systems 2, 1173--1180, MIT; 1998, 2002

MARS: A statistical semantic parsing and generation-based multilingual automatic translation system
Y. Gao, B. Zhou, Z. Diao, J. Sorensen, M. Picheny
Machine Translation 17(3), 185--212, Springer, 2002

Theory and practice of acoustic confusability
H. Printz, P.A. Olsen
Computer Speech & Language 16(1), 131--164, Elsevier, 2002

A robust high accuracy speech recognition system for mobile applications
S. Deligne, S. Dharanipragada, R. Gopinath, B. Maison, P. Olsen, H. Printz
Speech and Audio Processing, IEEE Transactions on 10(8), 551--561, IEEE, 2002

Automatic transcription of broadcast news
SS Chen, E. Eide, MJF Gales, RA Gopinath, D. Kanevsky, P. Olsen
Speech Communication 37(1-2), 69--87, Elsevier, 2002

Exploring features from natural language generation for prosody modeling
S Pan, K McKeown, J Hirschberg
Computer Speech & Language, 2002


2001

A multi-agent TV recommender
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Proceedings of the UM 2001 workshop Personalization in …, 2001 - www-2.cs.cmu.edu

A Multi-Agent TV Recorder
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Adaptive Systems Department, Philips Research Briarcliff, 2001




Graphical database browsing interface
J Martino, L Nikolovska
wipo.int, 2001


Method and apparatus for access and display of content allowing users to apply multiple profiles
J Martino, J Zimmerman
US Patent App. 10/037,464, 2001

Sort slider with context intuitive sort keys
J Martino, J Zimmerman, H Lamers, G Roberts, J Bont de
US Patent App. 10/037,445, 2001

Method of populating an explicit profile
J Zimmerman, J Martino
US Patent App. 10/040,245, 2001

Visualization of entertainment content
G Roberts, J Martino, J Debont, L Nikolovska, J Zimmerman
US Patent App. 10/038,874, 2001

FUSION OF MEDIA FOR INFORMATION SOURCES
J Martino, L Nikolovska, K Kurapati, A Camplin, H …

A Multi-Agent TV Recorder, Adaptive Systems Department
K Kurapati, S Gutta, D Schaffer, J Martino, J Zimmerman
Philips Research Briarcliff, 2001

A Multi-Agent TV Recommender
K K Srinivas, S Gutta, D Schaffer, J Martino, J Zimmerman
In Proceedings of the UM 2001 workshop “Personalization in Future TV”

REMOTE CONTROL FOR PROGRAM SELECTION BY GENRE
K I Trovato, P Rankin, D Pelletier, J A Martino, C Ramsey
EP Patent 1,084,571

Minimum Bayes error feature selection for continuous speech recognition
G Saon, M Padmanabhan
Advances in Neural Information Processing Systems, 2001

Data-driven approach to designing compound words for continuous speech recognition
G Saon, M Padmanabhan
IEEE Transactions on Speech and Audio Processing, 2001

Use of non-negative matrix factorization for language model adaptation in a lecture transcription …
M Novak, R Mammone
2001 IEEE International Conference on Acoustics, Speech, and …, 2001

Innovative approaches for large vocabulary name recognition
Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP'01). 2001 IEEE International Conference on, pp. 53--56

Linear feature space projections for speaker adaptation
G Saon, G Zweig, M Padmanabhan
2001 IEEE International Conference on Acoustics, Speech, and …, 2001

Current status of the IBM trainable speech synthesis system
R. Donovan, A. Ittycheriah, M. Franz, B. Ramabhadran, E. Eide, M. Viswanathan, R. Bakis, W. Hamza, M. Picheny, P. Gleason, et al.
4th ISCA Tutorial and Research Workshop (ITRW) on Speech Synthesis, 2001

Power exponential densities for the training and classification of acoustic feature vectors in speech recognition
S. Basu, C.A. Micchelli, P. Olsen
Journal of Computational and Graphical Statistics 10(1), 158--184, ASA, 2001

Recent advances in speech recognition system for IBM DARPA Communicator
Y. Gao, H. Erdogan, Y. Li, V. Goel, M. Picheny
SMALL 20(17.0), 16--2, Citeseer, 2001

Automatic analysis of spontaneous facial behavior: A final project report
M S Bartlett, B Braathen, G Littlewort-Ford, J Hershey, I Fasel, T Marks, E Smith, T J Sejnowski, J R Movellan
University of California at San Diego8, 2001


2000

Tv content recommender system
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, …
Proceedings of the National Conference on Artificial …, 2000

TV Content Recommender System, 17th AAAI
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, D Schaffer, J Zimmerman
July-August, 2000

TV content recommender system
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman, et al.
Proceedings of the National Conference on Artificial Intelligence, pp. 1121--1122, 2000

TV Content Recommender System
S G Kaushal, S Gutta, K Kurapati, K Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman
In Proceedings of the 17th National Conference on Artificial Intelligence, 2000

Lattice-based unsupervised MLLR for speaker adaptation
M Padmanabhan, G Saon, G Zweig
ASR2000-Automatic Speech Recognition: Challenges for the new …, 2000 - ISCA


Modeling local context for pitch accent prediction
S Pan, J Hirschberg
Proceedings of ACL, 2000

Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
D Chazan, R Hoory, G Cohen, M Zibulski
IEEE International Conference on Acoustics, Speech and Signal Processing, 2000

Multistage coarticulation model combining articulatory, formant and cepstral features
Y Gao, R Bakis, J Huang, B Xiang
Sixth International Conference on Spoken Language Processing, 2000

Towards language independent acoustic modeling
W Byrne, P Beyerlein, J Huerta, S Khudanpur, B Marthi, J Morgan, N Peterek, J Picone, D Vergyri, W Wang
IEEE International Conference on Acoustics Speech and Signal Processing, pp. 1029--1032, 2000

Penalized Maximum Likelihood Estimators and the Baum Welch Algorithm for the Classification of Acoustic Vectors in Speech Recognition
CA Micchelli, P Olsen
Journal of Computational and Applied Mathematics 119, 301--331, 2000

Audio-vision: Using audio-visual synchrony to locate sounds
J Hershey, J Movellan
Advances in Neural Information Processing Systems 12, 813--819, Citeseer, 2000

Penalized maximum-likelihood estimation, the Baum--Welch algorithm, diagonal balancing of symmetric matrices and applications to training acoustic data
C.A. Micchelli, P. Olsen
Journal of computational and applied mathematics 119(1), 301--331, Elsevier, 2000


1999

Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
Gopinath, D Kanevsky, P Olsen
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. …, 1999

Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
S.S. Chen, EM Eide, MJF Gales, R.A. Gopinath, D. Kanevsky, P. Olsen
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. Proceedings., 1999 IEEE International Conference on, pp. 37--40

Fluctuations of Brownian motion with drift
J G Conlon, P Olsen
Publicacions Matemàtiques 43(1), 85--125, 1999

Distortion-class weighted acoustic modeling for Robust Speech Recognition Under GSM RPE-LTP coding
J M Huerta, R M Stern
Proceedings of the Robust Methods for Speech Recognition in Adverse Conditions, Tampere Finland, Citeseer, 1999

Cursive word recognition using a random field based hidden Markov model
G Saon
Int. Journal of Pattern Recognition and Artificial Intelligence, 1999

Cursive word recognition using a random field based hidden Markov model
G Saon
International Journal on Document Analysis and Recognition, 1999 - Springer

Spatial color indexing and applications
J Huang, S Ravi Kumar, M Mitra, WJ Zhu, R Zabih
International Journal of Computer Vision, 1999 - Springer


1998

Video content management in consumer devices
N Dimitrova, T McGee, H Elenbaas, J Martino
IEEE Transactions on Knowledge and Data Engineering, 988--995, IEEE Computer Society, 1998

Project Reports
L Nikolovska, J Martino
IEEE MultiMedia Magazine 5(2), 78--83, Los Alamitos, CA: IEEE Computer Society, 1998

Spatial browsing to retrieve multimedia information
L Nikolovska, J Martino
Multimedia, IEEE 5(2), 78--83, IEEE, 1998

Speech recognition performance on a voicemail transcription task
M Padmanabhan, E Eide, B Ramabhadran, G Ramaswamy, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998

Acoustics-only based automatic phonetic baseform generation
B Ramabhadran, LR Bahl, PV deSouza, M Padmanabhan, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998

Color-spatial image indexing and applications
J Huang
1998 - cs.cornell.edu

A real-time computer vision system for vehicle tracking and traffic surveillance
B Coifman, D Beymer, P McLauchlan, J Malik
Transportation Research Part C, 1998 - Elsevier

Factor Analysis Invariant to Linear Transformations of Data
RA Gopinath, B Ramabhadran, S Dharanipragada
Fifth International Conference on Spoken Language Processing, 1998 - ISCA

LVCSR rescoring with modified loss functions: a decision theoretic perspective
V Goel, W Byrne, S Khudanpur
Acoustics, Speech and Signal Processing, 1998. Proceedings …, 1998

Speech recognition performance on a new voicemail transcription task
M Padmanabhan, B Ramabhadran, S Basu
Fifth International Conference on Spoken Language …, 1998 - ISCA

Transcription of broadcast news-some recent improvements to IBM's LVCSR system
L. Polymenakos, P. Olsen, D. Kanevsky, RA Gopinath, PS Gopalakrishnan, S. Chen
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on, pp. 901--904

IBM's LVCSR System for Transcription of Broadcast News used in the 1997 HUB4 English Evaluation
S. Chen, MJF Gales, PS Gopalakrishnan, RA Gopinath, H. Printz, D. Kanevsky, P. Olsen, L. Polymenakos
Proceedings of the Speech Recognition Workshop, 1998

Speech recognition from GSM codec parameters
J M Huerta, R M Stern
Fifth International Conference on Spoken Language Processing, 1998

An automatic hierarchical image classification scheme
J Huang, SR Kumar, R Zabih
Proceedings of the sixth ACM international conference on …, 1998 - portal.acm.org

Micro-events in two serial verb constructions
C Y T Pi, O T Stewart
Proceedings from Semantics and Linguistic Theory VIII, 202, Cornell University, 1998

Robust Speech Recognition in GSM Codec Environments
J M Huerta, H Van Hamme
Citeseer, 1998


1997

CONIVAS: CONtent-based image and video access system
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
Proceedings of the fourth ACM international conference on …, 1997

CONIVAS: CONtent-based Image and Video Access System
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
ACM/Multimedia Conference Proceedings 1996, pp. 427, 1997

CONIVAS: CONtent-based image and video access system
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
Proceedings of the fourth ACM international conference on Multimedia, pp. 427--428, 1997

In and Out of the Box: Interaction Paradigms in Electronic Environments
Jacquelyn Martino, Lira Nikolovska
INTERACT '97: Proceedings of the IFIP TC13 International Conference on Human-Computer Interaction, pp. 697--698, Chapman & Hall, Ltd., 1997

Combining supervised learning with color correlograms for content-based image retrieval
J Huang, SR Kumar, M Mitra
Proceedings of the fifth ACM international conference on …, 1997 - portal.acm.org

A Real-Time Computer Vision System for Measuring Traffic Parameters
D Beymer, P McLauchlan, B Coifman, J Malik
IEEE Computer Society Conference on Computer Vision and …, 1997

High performance unconstrained word recognition system combining HMMs and Markov random fields
G Saon, A Belaid
Automatic Bankcheck Processing, 1997 - books.google.com


1996

CONIVAS: CONtent-based Image and Video Access System
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
Proceedings, 1996 - Association for Computing Machinery

CONIVAS: CONtent-based Image and Video Access System
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
System 28(9), 23--32, 1996

Image Representations for Visual Learning
D Beymer, T Poggio
Science, 1996 - sciencemag.org

ISSUES IN PRACTICAL LARGE VOCABULARY ISOLATED WORD RECOGNITION: THE IBM
SK Das, MA Picheny
Automatic Speech and Speaker Recognition: Advanced Topics 355, 457, Springer, 1996


Diffusion of Directed Polymers in a Strong Random Environment
PA Olsen, R Song
To appear in J. Stat. Phys., 1996

A Brownian motion version of the directed polymer problem
J G Conlon, P A Olsen
Journal of statistical physics 84(3), 415--454, Springer, 1996

Diffusion of directed polymers in a strong random environment
P Olsen, R Song
Journal of statistical physics 83(3), 727--738, Springer, 1996

An efficient algorithm for parallel integer multiplication
B Singer, G Saon
Journal of Network and Computer Applications, 1996 - Elsevier


1995

Face Recognition From One Example View
D Beymer, T Poggio
1995 - doi.ieeecs.org

Face recognition from one model view
D Beymer, T Poggio
Proc. Fifth Intl Conf. Computer Vision, 1995

Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
LR Bahl, S. Balakrishnan-Aiyer, JR Bellegarda, M. Franz, PS Gopalakrishnan, D. Nahamoo, M. Novak, M. Padmanabhan, MA Picheny, S. Roukos
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, pp. 41--44

Fractional integration, Morrey spaces and a Schrödinger equation
P A Olsen
Communications in Partial Differential Equations 20(11-12), 2005--2055, Taylor & Francis, 1995

Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
M Franz, PS Gopalakrishnan, D Nahamoo, M Novak, …
Proc. ICASSP, 1995


1993

A method for the construction of acoustic Markov models for words
LR Bahl, PF Brown, PV De Souza, RL Mercer, MA Picheny
Speech and Audio Processing, IEEE Transactions on 1(4), 443--452, IEEE, 1993

Example Based Image Analysis and Synthesis
D Beymer, A Shashua, T Poggio
1993 - cognitrn.psych.indiana.edu


1992

Note on irregular discrete wavelet transform
P Olsen, K Seip
Information Theory, IEEE Transactions on 38(2), 861--863, IEEE, 1992

An estimate of an upper bound for the entropy of English
Peter F Brown, Vincent J Della Pietra, Robert L Mercer, Stephen A Della Pietra, Jennifer C Lai
Computational Linguistics 18(1), 31--40, MIT Press, 1992

Class-based n-gram models of natural language
Peter F Brown, Peter V Desouza, Robert L Mercer, Vincent J Della Pietra, Jenifer C Lai
Computational Linguistics 18(4), 467--479, MIT Press, 1992


1991

Finding Junctions Using the Image Gradient
DJ Beymer
1991 - Massachusetts Institute of Technology, Artificial Intelligence …

The MIT vision machine
W Yang, A Hurlbert, D Beymer, P O'Donnell, W …
1991 - portal.acm.org

Automatic Phonetic Baseform Determination
LR Bahl, S. Das, PV Desouza, M. Epstein, RL Mercer, B. Merialdo, D. Nahamoo, MA Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 173--176

Decision trees for phonological rules in continuous speech
L.R. Bahl, PV deSouza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 185--188

An inequality for rational functions with applications to some statistical estimation problems
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
IEEE Transactions on Information Theory, 1991

Context dependent modeling of phones in continuous speech using decision trees
LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Proceedings DARPA Speech and Natural Language Processing Workshop, pp. 264--270, 1991


1989

Matrix fast match: a fast method for identifying a short list of candidate words for decoding
L Bahl, PS Gopalakrishnan, D Kanevsky, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989

A generalization of the Baum algorithm to rational objective functions
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989

Large vocabulary natural language continuous speech recognition
LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, et al.
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467

Speech recognition using noise-adaptive prototypes
A. Nádas, D. Nahamoo, M.A. Picheny
Acoustics, Speech and Signal Processing, IEEE Transactions on 37(10), 1495--1503, IEEE, 1989

When natural language is better than menus: A field study
M Walker, S Whittaker
Hewlett Packard Laboratories Technical Report HPL-BRC-TR-89-020, 1989



1988

Decoder selection based on cross-entropies
PS Gopalakrishnan, D. Kanevsky, A. Nadas, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 20--23

Acoustic Markov models used in the Tangora speech recognition system
LR Bahl, PF Brown, PV De Souza, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 497--500


1986

Rapid prototyping and system development: examination of an interface toolkit for voice and telephony applications
J T Richards, S J Boies, J D Gould
SIGCHI Bull. 17(4), 216--220, ACM, 1986

Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 29(4), 434, ASHA, 1986


1985

Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 28(1), 96, ASHA, 1985


1983

Recognition of isolated-word sentences from a 5000-word vocabulary office correspondence task
L. Bahl, A. Cole, F. Jelinek, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83., pp. 1065--1067, 1983


1982

User Interface for Audio Communication System
SJ Boies, JD Gould, WA Notz, JT Richards, JW Schoonard
IBM Technical Disclosure Bulletin 25(7A), 3371--3377, 1982


Year Unknown

TV personalization system
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, S …
this volume - Springer

TV Personalization System: Design of a TV Show Recommender Engine and Interface
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, S Gutta, J Martino
In: Liliana Ardissono, Alfred Kobsa, Mark Maybury (eds.), Personalized Digital Television: Targeting Programs to Individual Viewers, Kluwer


An Eye Tracking Study of How Pictures Influence Online Reading
D Beymer, P Orton, D Russell
Human-Computer Interaction--INTERACT 2007, 456--460, Springer

Echocardiogram View Classification using Edge Filtered Scale-invariant Motion
R Kumar, F Wang, D Beymer, T Syeda-Mahmood
Citeseer


Audio-Visual Sound Separation Via
J Hershey, M Casey
books.nips.cc

Audio-Visual Graphical Models for Speech Processing (2008)
J Hershey
en.scientificcommons.org

Maximum Likelihood Training of Bases for Rapid Adaptation (2008)
K Visweswariah, V Goel, R Gopinath
en.scientificcommons.org


Real-Time Speech Transcription Service to Improve Non-Native Speakers’ Listening Comprehension
D Jiang, Y Pan, W Liu, Y Qin, M Picheny, P Luther
www-304.ibm.com

Audio-Visual Speech Synchrony Detection by a Family of Bimodal Linear Prediction Models
K Kumar, G Potamianos, J Navratil, E Marcheret, V Libal
ece.cmu.edu


Cross-Language Access to Recorded Speech in the MALACH Project
D W Oard, D Demner-Fushman, J Hajic, B Ramabhadran, S Gustman, W J Byrne, D Soergel, B Dorr, P Resnik, M Picheny
terpconnect.umd.edu

Lossy Speech Compression Via Compressed Sensing-Based Kalman Filtering
A Carmi, D Kanevsky, B Ramabhadran
domino.research.ibm.com


Robust Audio-Visual Speech Synchrony Detection by Generalized Bimodal Linear Prediction
K Kumar, J Navratil, E Marcheret, V Libal, G Potamianos
ece.cmu.edu

A Data Visualization and Analysis Method for Natural Language Call Routing System Design
H K J Kuo, V Goel

New Adaptation Techniques for Large Vocabulary Continuous Speech Recognition
Y Gao, B Ramabhadran, M Picheny


Audio-visual speech synchronization detection using a bimodal linear prediction model
K Kumar, J Navratil, E Marcheret, V Libal, G Ramaswamy, G Potamianos
(To appear) Proc. CVPR Biometrics Works., 2009

Single Channel Speech Separation Using Factorial Dynamics
J R Hershey, T Kristjansson, S Rennie, P A Olsen
books.nips.cc