User Interface Technologies (discontinued) Publications
2014
Method and system for smart mark-up of natural language business rules
Jacquelyn A. Martino, Paul M. Matchen, Rosario A. Uceda-Sosa
US Patent 8,862,457
Jacquelyn A. Martino, Paul M. Matchen, Rosario A. Uceda-Sosa
US Patent 8,862,457
Modular authoring and visualization of rules using trees
Jacquelyn A. Martino, Paul M. Matchen, Rosario Uceda-Sosa
US Patent 8,713,012
Jacquelyn A. Martino, Paul M. Matchen, Rosario Uceda-Sosa
US Patent 8,713,012
Automatic Speech Recognition
Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen Chu, Fadi Biadsy
Natural Language Processing of Semitic Languages, pp. 409--459, Springer Berlin Heidelberg, 2014
Hagen Soltau, George Saon, Lidia Mangu, Hong-Kwang Kuo, Brian Kingsbury, Stephen Chu, Fadi Biadsy
Natural Language Processing of Semitic Languages, pp. 409--459, Springer Berlin Heidelberg, 2014
2013
Fast Face Detector Training Using Tailored Views
Kristina Scherbaum, Rogerio Feris, James Petterson, Volker Blanz
IEEE International Conference on Computer Vision (ICCV), 2013
Kristina Scherbaum, Rogerio Feris, James Petterson, Volker Blanz
IEEE International Conference on Computer Vision (ICCV), 2013
Spatio-Temporal Fisher Vector Coding for Surveillance Event Detection
Qiang Chen, Yang Cai, Lisa Brown, Ankur Datta, Quanfu Fan, Rogerio Feris, Shuicheng Yan, Alex Hauptmann, Sharathchandra Pankanti
ACM Multimedia, 2013
Qiang Chen, Yang Cai, Lisa Brown, Ankur Datta, Quanfu Fan, Rogerio Feris, Shuicheng Yan, Alex Hauptmann, Sharathchandra Pankanti
ACM Multimedia, 2013
Intuitive visualization of Boolean expressions using flows
J A Martino, P M Matchen, R A Uceda-Sosa
US Patent 8,381,178
J A Martino, P M Matchen, R A Uceda-Sosa
US Patent 8,381,178
2012
Unsupervised model selection for view-invariant object detection in surveillance environments
Behjat Siddiquie, Rogerio S Feris, Ankur Datta, Larry S Davis
Pattern Recognition (ICPR), 2012 21st International Conference on, pp. 3252--3255
Behjat Siddiquie, Rogerio S Feris, Ankur Datta, Larry S Davis
Pattern Recognition (ICPR), 2012 21st International Conference on, pp. 3252--3255
Appearance Modeling for Person Re-identification using Weighted Brightness Transfer Functions
Ankur Datta, Lisa M Brown, Rogerio Feris, Sharathchandra Pankanti
21st International Conference on Pattern Recognition (ICPR), November, pp. 2367--2370, 2012
Ankur Datta, Lisa M Brown, Rogerio Feris, Sharathchandra Pankanti
21st International Conference on Pattern Recognition (ICPR), November, pp. 2367--2370, 2012
2011
ECHOCARDIOGRAM VIEW CLASSIFICATION USING EDGE FILTERED SCALE-INVARIANT MOTION FEATURES
Fei WANG, David BEYMER, Tanveer SYEDA-MAHMOOD, Ritwik KUMAR, others
WO Patent 2,011,157,579
Fei WANG, David BEYMER, Tanveer SYEDA-MAHMOOD, Ritwik KUMAR, others
WO Patent 2,011,157,579
2010
Cardiac disease detection from echocardiogram using edge filtered scale-invariant motion features
Ritwik Kumar, Fei Wang, David Beymer, Tanveer Syeda-Mahmood
Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pp. 162--169
Ritwik Kumar, Fei Wang, David Beymer, Tanveer Syeda-Mahmood
Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pp. 162--169
Shape-based similarity retrieval of Doppler images for clinical decision support
T Syeda-Mahmood, P Turaga, D Beymer, F Wang, A Amir, H Greenspan, K Pohl
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pp. 855--862
T Syeda-Mahmood, P Turaga, D Beymer, F Wang, A Amir, H Greenspan, K Pohl
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pp. 855--862
Tracking Motion, Deformation, and Texture using Conditionally Gaussian Processes
T K Marks, J R Hershey, J R Movellan
IEEE transactions on pattern analysis and machine intelligence 32(2), 348--363, IEEE Computer Society, 2010
T K Marks, J R Hershey, J R Movellan
IEEE transactions on pattern analysis and machine intelligence 32(2), 348--363, IEEE Computer Society, 2010
Restructuring Acoustic Models for Client and Server Based Automatic Speech Recognition
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
Spoken Query, to appear, 2010
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
Spoken Query, to appear, 2010
The monaural speech separation and recognition challenge
M Cooke, J R Hershey, S J Rennie
Computer Speech & Language 24(1), 1--15, Elsevier, 2010
M Cooke, J R Hershey, S J Rennie
Computer Speech & Language 24(1), 1--15, Elsevier, 2010
Super-human multi-talker speech recognition: A graphical modeling approach
J R Hershey, S J Rennie, P A Olsen, T T Kristjansson
Computer Speech & Language 24(1), 45--66, Elsevier, 2010
J R Hershey, S J Rennie, P A Olsen, T T Kristjansson
Computer Speech & Language 24(1), 45--66, Elsevier, 2010
2009
Automatic estimation of left ventricular dysfunction from echocardiogram videos
D Beymer, T Syeda-Mahmood, A Amir, Fei Wang, S Adelman
Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on, pp. 164--171, IEEE
D Beymer, T Syeda-Mahmood, A Amir, Fei Wang, S Adelman
Computer Vision and Pattern Recognition Workshops, 2009. CVPR Workshops 2009. IEEE Computer Society Conference on, pp. 164--171, IEEE
Closed-form jensen-renyi divergence for mixture of gaussians and applications to group-wise shape registration
Fei Wang, Tanveer Syeda-Mahmood, Baba C Vemuri, David Beymer, Anand Rangarajan
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2009, pp. 648--655, Springer Berlin Heidelberg
Fei Wang, Tanveer Syeda-Mahmood, Baba C Vemuri, David Beymer, Anand Rangarajan
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2009, pp. 648--655, Springer Berlin Heidelberg
Refactoring acoustic models using variational density approximation
P L Dognin, J R Hershey, V Goel, P A Olsen
ICASSP, 2009
P L Dognin, J R Hershey, V Goel, P A Olsen
ICASSP, 2009
A fast, accurate approximation to log likelihood of Gaussian mixture models (PDF)
P L Dognin, V Goel, J R Hershey, P A Olsen
2009 - computer.org
P L Dognin, V Goel, J R Hershey, P A Olsen
2009 - computer.org
Isometry-enforcing data transformations for improving sparse model learning
Avishy Carmi, Irina Rish, Guillermo Cecchi, Dimitri Kanevsky, Bhuvana Ramabhadran
IBM Tech Report RC24801, Tech. Rep. RC 24801, Human Language Technologies, IBM, 2009
Avishy Carmi, Irina Rish, Guillermo Cecchi, Dimitri Kanevsky, Bhuvana Ramabhadran
IBM Tech Report RC24801, Tech. Rep. RC 24801, Human Language Technologies, IBM, 2009
Natural language system and method based on unisolated performance metric
S Deligne, Y Gao, V Goel, H K Kuo, C Wu
US Patent 7,574,358
S Deligne, Y Gao, V Goel, H K Kuo, C Wu
US Patent 7,574,358
Designing interactive voice response (IVR) interfaces: localisation for low literacy users
A Sharma Grover, O Stewart, D Lubensky
Proceedings of the Conference, pp. 22--24, 2009
Abstract
A Sharma Grover, O Stewart, D Lubensky
Proceedings of the Conference, pp. 22--24, 2009
Abstract
Variational Loopy Belief Propagation for Efficient Multi-talker Speech Recognition
Steven J. Rennie, John R. Hershey and Peder A. Olsen
Interspeech, pp. 1331-1334, 2009
Steven J. Rennie, John R. Hershey and Peder A. Olsen
Interspeech, pp. 1331-1334, 2009
Speech Transcription in AAL Solutions
A. Sorin and R. Hoory
Workshop on Designing Ambient Interactions for Older Users , 2009
A. Sorin and R. Hoory
Workshop on Designing Ambient Interactions for Older Users , 2009
Optimal quantization and bit allocation for compressing large discriminative feature space transforms
E. Marcheret, V. Goel, P.A. Olsen
Automatic Speech Recognition \& Understanding, 2009. ASRU 2009. IEEE Workshop on, pp. 64--69
E. Marcheret, V. Goel, P.A. Olsen
Automatic Speech Recognition \& Understanding, 2009. ASRU 2009. IEEE Workshop on, pp. 64--69
RTTS: Towards Enterprise-level Real-Time Speech Transcription and Translation Services
J M Huerta, C Wu, A Sakrajda, S Caskey, E E Jan, A Faisman, S Ben-David, W Liu, A Lee, O Stewart, others
Tenth Annual Conference of the International Speech Communication Association, 2009
Abstract
J M Huerta, C Wu, A Sakrajda, S Caskey, E E Jan, A Faisman, S Ben-David, W Liu, A Lee, O Stewart, others
Tenth Annual Conference of the International Speech Communication Association, 2009
Abstract
Designing crowdsourcing community for the enterprise
O Stewart, J M Huerta, M Sader
Proceedings of the ACM SIGKDD Workshop on Human Computation, pp. 50--53, 2009
Abstract
O Stewart, J M Huerta, M Sader
Proceedings of the ACM SIGKDD Workshop on Human Computation, pp. 50--53, 2009
Abstract
Web derived pronunciations for spoken term detection
D Can, E Cooper, A Ghoshal, M Jansche, S Khudanpur, B Ramabhadran, M Riley, M Saraclar, A Sethy, M Ulinski, others
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp. 83--90, 2009
D Can, E Cooper, A Ghoshal, M Jansche, S Khudanpur, B Ramabhadran, M Riley, M Saraclar, A Sethy, M Ulinski, others
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp. 83--90, 2009
Unsupervised pronunciation validation
C M White, A Sethy, B Ramabhadran, P Wolfe, E Cooper, M Saraclar, J K Baker
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4301--4304
C M White, A Sethy, B Ramabhadran, P Wolfe, E Cooper, M Saraclar, J K Baker
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 4301--4304
Multimodal Classification of Activities of Daily Living Inside Smart Homes
V Libal, B Ramabhadran, N Mana, F Pianesi, P Chippendale, O Lanz, G Potamianos
Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living, pp. 694, 2009
V Libal, B Ramabhadran, N Mana, F Pianesi, P Chippendale, O Lanz, G Potamianos
Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living, pp. 694, 2009
A generalized family of parameter estimation techniques
D Kanevsky, T N Sainath, B Ramabhadran
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 1725--1728
D Kanevsky, T N Sainath, B Ramabhadran
Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing-Volume 00, pp. 1725--1728
Iterative Sentence--Pair Extraction from Quasi--Parallel Corpora for Machine Translation
R Sarikaya, S Maskey, R Zhang, E Jan, D Wang, B Ramabhadran, S Roukos
Interspeech, 2009
R Sarikaya, S Maskey, R Zhang, E Jan, D Wang, B Ramabhadran, S Roukos
Interspeech, 2009
Cultural voice markers in speech-to-speech machine translation systems
O Stewart, M Picheny, D Lubensky, B Ramabhadran
Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 313--316, ACM New York, NY, USA
O Stewart, M Picheny, D Lubensky, B Ramabhadran
Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 313--316, ACM New York, NY, USA
A Lower Bound on the Euclidean Distance for Fast Nearest Neighbor Retrieval in High-dimensional Spaces
G Saon, P Olsen
IBM Technical Report RC24859, 2009
G Saon, P Olsen
IBM Technical Report RC24859, 2009
Combined Discriminative Training for Multi-Stream HMM-based Audio-Visual Speech Recognition
Jing Huang, Karthik Visweswariah
Interspeech, 2009
Jing Huang, Karthik Visweswariah
Interspeech, 2009
Long-Time Span Acoustic Activity Analysis From Far-Field Sensors In Smart Homes
Jing Huang, V Zhuang
ICASSP, 2009
Jing Huang, V Zhuang
ICASSP, 2009
ACOUSTIC FALL DETECTION USING GAUSSIAN MIXTURE MODELS AND GMM SUPERVECTORS
X Zhuang, Jing Huang, G Potamianos, M Hasegawa
… - isle.uiuc.edu, 2009
X Zhuang, Jing Huang, G Potamianos, M Hasegawa
… - isle.uiuc.edu, 2009
Automatic Speech Recognition
G Potamianos, L Lamel, M Wolfel, J Huang, E …
Computers in the Human Interaction Loop, 2009 - books.google.com
G Potamianos, L Lamel, M Wolfel, J Huang, E …
Computers in the Human Interaction Loop, 2009 - books.google.com
Effects of real-time transcription on non-native speaker's comprehension in computer-mediated communications
Y.X. Pan, D.N. Jiang, M Picheny, Y Qin
ACM CHI, Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009
Y.X. Pan, D.N. Jiang, M Picheny, Y Qin
ACM CHI, Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009
Single-channel speech separation and recognition using loopy belief propagation
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3845--3848
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3845--3848
Method and apparatus for scene learning and three-dimensional tracking using stereo video cameras
T Kristjansson, H Attias, JR Hershey
T Kristjansson, H Attias, JR Hershey
Refactoring acoustic models using variational density approximation
P.L. Dognin, J.R. Hershey, V. Goel, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 4473--4476
P.L. Dognin, J.R. Hershey, V. Goel, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 4473--4476
A fast, accurate approximation to log likelihood of Gaussian mixture models
P.L. Dognin, V. Goel, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3817--3820
P.L. Dognin, V. Goel, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, pp. 3817--3820
Acoustic modeling using exponential families
V. Goel, P.A. Olsen
Tenth Annual Conference of the International Speech Communication Association, 2009
V. Goel, P.A. Olsen
Tenth Annual Conference of the International Speech Communication Association, 2009
RTTS: Towards Enterprise-level Real-Time Speech Transcription and Translation Services
Juan M Huerta, Cheng Wu, Andrzej Sakrajda (Andy), SASHA P CASKEY, Ea
Ee Jan, Antonio R Lee, OSAMUYIMEN T STEWART, ALEXANDER FAISMAN (ALEX), David M Lubensky - INTERSPEECH 2009
Juan M Huerta, Cheng Wu, Andrzej Sakrajda (Andy), SASHA P CASKEY, Ea
Ee Jan, Antonio R Lee, OSAMUYIMEN T STEWART, ALEXANDER FAISMAN (ALEX), David M Lubensky - INTERSPEECH 2009
A new method for OOV detection using hybrid word/fragment system
A Rastrow, A Sethy, B Ramabhadran
Proceedings of the 2009 IEEE International …, 2009 - doi.ieeecomputersociety.org
A Rastrow, A Sethy, B Ramabhadran
Proceedings of the 2009 IEEE International …, 2009 - doi.ieeecomputersociety.org
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation
A Sethy, PG Georgiou, B Ramabhadran, S Narayanan
IEEE Transactions on Audio, Speech, and Language Processing, 2009 - sail.usc.edu
A Sethy, PG Georgiou, B Ramabhadran, S Narayanan
IEEE Transactions on Audio, Speech, and Language Processing, 2009 - sail.usc.edu
UNSUPERVISED PRONUNCIATION VALIDATION
CM White, ABHINAV SETHY, Bhuvana Ramabhadran, P Wolfe, E …
sisl.seas.harvard.edu, 2009
CM White, ABHINAV SETHY, Bhuvana Ramabhadran, P Wolfe, E …
sisl.seas.harvard.edu, 2009
Fast decoding for open vocabulary spoken term detection}
B Ramabhadran, A Sethy, J Mamou, B Kingsbury, U …
Proceedings of Human Language Technologies: The 2009 Annual …, 2009 - aclweb.org
B Ramabhadran, A Sethy, J Mamou, B Kingsbury, U …
Proceedings of Human Language Technologies: The 2009 Annual …, 2009 - aclweb.org
Effect of pronunciations on oov queries in spoken term detection
D Can, E Cooper, A Sethy, B Ramabhadran, M …
ICASSP, April, 2009 - jhu.edu
D Can, E Cooper, A Sethy, B Ramabhadran, M …
ICASSP, April, 2009 - jhu.edu
Method for likelihood computation in multi-stream HMM based speech recognition
SM Chu, V Goel, E Marcheret, G Potamianos
SM Chu, V Goel, E Marcheret, G Potamianos
Method for likelihood computation in multi-stream HMM based speech recognition
SM Chu, V Goel, E Marcheret, G Potamianos
SM Chu, V Goel, E Marcheret, G Potamianos
Automatic Speech Recognition
Lamel, M Wolfel, J Huang, E Marcheret, C Barras, X …
Computers in the Human Interaction Loop, 2009 - books.google.com
Lamel, M Wolfel, J Huang, E Marcheret, C Barras, X …
Computers in the Human Interaction Loop, 2009 - books.google.com
Method for likelihood computation in multi-stream HMM based speech recognition
SM Chu, V Goel, E Marcheret, G Potamianos
SM Chu, V Goel, E Marcheret, G Potamianos
Automatic Speech Recognition
Lamel, M Wolfel, J Huang, E Marcheret, C Barras, X …
Computers in the Human Interaction Loop, 2009 - books.google.com
Lamel, M Wolfel, J Huang, E Marcheret, C Barras, X …
Computers in the Human Interaction Loop, 2009 - books.google.com
A Fast, Accurate Approximation to Log Likelihood of Gaussian Mixture Models
Pierre L Dognin, Vaibhava Goel, John R Hershey, Peder A Olsen
ICASSP, 2009
Pierre L Dognin, Vaibhava Goel, John R Hershey, Peder A Olsen
ICASSP, 2009
Refactoring Acoustic Models using Variational Expectation-Maximization
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
ICASSP, 2009
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
ICASSP, 2009
Refactoring Acoustic Models using Variational Density Approximation
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
ICASSP, 2009
Pierre L Dognin, John R Hershey, Vaibhava Goel, Peder A Olsen
ICASSP, 2009
2008
Spatio-temporal motion estimation for disease discrimination in cardiac echo videos
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2008, pp. 121--124
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2008, pp. 121--124
There are many ways to watch people as they use web
D Beymer, D M Russell
2008 - en.scientificcommons.org
D Beymer, D M Russell
2008 - en.scientificcommons.org
Inferring transducer viewpoints in cardiac echo videos
D Beymer, T Syeda-Mahmood, F Wang
Computers in Cardiology, 2008, pp. 117--120
D Beymer, T Syeda-Mahmood, F Wang
Computers in Cardiology, 2008, pp. 117--120
Vector based Approaches to Semantic Similarity Measures
J M Huerta
Advances in Natural Language Processing and Applications, 163, Citeseer, 2008
J M Huerta
Advances in Natural Language Processing and Applications, 163, Citeseer, 2008
Enhancing Speaker Recognition with Virtual Examples
Y Solewicz, H Aronowitz
2008 - eprints.pascal-network.org
Y Solewicz, H Aronowitz
2008 - eprints.pascal-network.org
Algorithm Optimizations: Low Computational Complexity
M Novak
Automatic speech recognition on mobile devices and over communication networks, 213, Springer, 2008
M Novak
Automatic speech recognition on mobile devices and over communication networks, 213, Springer, 2008
Using robust audio and video processing technologies to alleviate the elderly cognitive decline
V Mylonakis, J Soldatos, A Pnevmatikakis, L Polymenakos, A Sorin, H Aronowitz
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments, pp. 28, 2008
V Mylonakis, J Soldatos, A Pnevmatikakis, L Polymenakos, A Sorin, H Aronowitz
Proceedings of the 1st international conference on PErvasive Technologies Related to Assistive Environments, pp. 28, 2008
Time-compressing speech: ASR transcripts are an effective way to support gist extraction
S Tucker, N Kyprianou, S Whittaker
Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction, pp. 235, 2008
S Tucker, N Kyprianou, S Whittaker
Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction, pp. 235, 2008
METHOD AND SYSTEM FOR PROMPT CONSTRUCTION FOR SELECTION FROM A LIST OF ACOUSTICALLY CONFUSABLE ITEMS IN SPOKEN DIALOG SYSTEMS
E M Eide, V Goel, R Gopinath, O T Stewart
2008 - freepatentsonline.com
E M Eide, V Goel, R Gopinath, O T Stewart
2008 - freepatentsonline.com
A paralinguistic template for creating persona in interactive voice response (IVR) systems
Osamuyimen Stewart
Emotions in the Human Voice: Volume III Culture and Perception, Plural, 2008
Abstract
Osamuyimen Stewart
Emotions in the Human Voice: Volume III Culture and Perception, Plural, 2008
Abstract
Improving Large Scale Alphanumeric String Recognition using Redundant Information
E E Jan, O Stewart, R Co, D Lubensky
ICSLP 2008
Abstract
E E Jan, O Stewart, R Co, D Lubensky
ICSLP 2008
Abstract
Research and Commercial Spoken Dialogue Systems
R Pieraccini, J M Huerta
Recent Trends in Discourse and Dialogue, 1, Springer, 2008
R Pieraccini, J M Huerta
Recent Trends in Discourse and Dialogue, 1, Springer, 2008
Relative rank statistics for dialog analysis
J M Huerta
Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 965--972, 2008
J M Huerta
Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 965--972, 2008
SYSTEM AND METHOD FOR A DEVICE SOUND INTERFACE MANAGER
A Aaron, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,153, 2008 - Google Patents, Google Patents
US Patent App. 12/019,153
A Aaron, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,153, 2008 - Google Patents, Google Patents
US Patent App. 12/019,153
METHOD AND SYSTEM FOR CAPABILITIES LEARNING
S H Basson, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,709, 2008 - Google Patents, Google Patents
US Patent App. 12/019,709
S H Basson, D Kanevsky, E E Kelley, B Ramabhadran
US Patent App. 12/019,709, 2008 - Google Patents, Google Patents
US Patent App. 12/019,709
The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge
R Fernandez, Z Kons, S Shechtman, Z W Shuang, R Hoory, B Ramabhadran, Y Qin
Blizzard Workshop, 2008
R Fernandez, Z Kons, S Shechtman, Z W Shuang, R Hoory, B Ramabhadran, Y Qin
Blizzard Workshop, 2008
Speaker Recognition in Two-Wire Test Sessions
H Aronowitz, Y A Solewicz
Ninth Annual Conference of the International Speech Communication Association, 2008
H Aronowitz, Y A Solewicz
Ninth Annual Conference of the International Speech Communication Association, 2008
Online vocabulary adaptation using contextual information and information retrieval
H Aronowitz
Ninth Annual Conference of the International Speech Communication Association, pp. 1805--1808, 2008
H Aronowitz
Ninth Annual Conference of the International Speech Communication Association, pp. 1805--1808, 2008
Recent advances in the IBM GALE mandarin transcription system
S M Chu, H K Kuo, L Mangu, Y Liu, Y Qin, Q Shi, S L Zhang, H Aronowitz
Acoustics, Speech and Signal Processing, 2008, pp. 4329--4332
S M Chu, H K Kuo, L Mangu, Y Liu, Y Qin, Q Shi, S L Zhang, H Aronowitz
Acoustics, Speech and Signal Processing, 2008, pp. 4329--4332
Audio -based unsupervised segmentation of multiparty dialogue
PY Hsueh
IEEE International Conference on Acoustics, Speech …, 2008 - ieeexplore.ieee.org
PY Hsueh
IEEE International Conference on Acoustics, Speech …, 2008 - ieeexplore.ieee.org
Automatic decision detection in meeting speech
PY Hsueh, JD Moore
Lecture Notes in Computer Science, 2008 - Springer
PY Hsueh, JD Moore
Lecture Notes in Computer Science, 2008 - Springer
A Hardware-Software Framework for High-Reliability People Fall Detection
A Grassi, Jing Huang, G. Potamianos
IEEE SENSORS, 2008
A Grassi, Jing Huang, G. Potamianos
IEEE SENSORS, 2008
A multi-sensor approach for people fall detecion in the home environment
Jing Huang, G A Leone
ECCV, 2008
Jing Huang, G A Leone
ECCV, 2008
Effective Acoustic Adaptation for A Distant-talking Interactive TV System
Jing Huang, Marco Matassoni, Mark E Epstein
Interspeech, 2008
Jing Huang, Marco Matassoni, Mark E Epstein
Interspeech, 2008
Penalty function maximization for large margin hmm training
G Saon, D Povey
2008 - wiki.inf.ed.ac.uk
G Saon, D Povey
2008 - wiki.inf.ed.ac.uk
Dynamic language model mixtures with history-based buckets
M. Franz, P.A. Olsen, others
US Patent 7,395,205
M. Franz, P.A. Olsen, others
US Patent 7,395,205
Graphical models for robust speech recognition in adverse environments
S J Rennie
Ph.D. Thesis, 2008
S J Rennie
Ph.D. Thesis, 2008
Weak hypothesis generation apparatus and method, learning apparatus and method, detection apparatus …
JR Movellan, MS Bartlett, GC Littlewort, J Hershey …
JR Movellan, MS Bartlett, GC Littlewort, J Hershey …
Variational bhattacharyya divergence for hidden markov models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4557--4560
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4557--4560
Accelerated monte carlo for kullback-leibler divergence between gaussian mixture models
J.Y. Chen, J.R. Hershey, P.A. Olsen, E. Yashchin
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4553--4556
J.Y. Chen, J.R. Hershey, P.A. Olsen, E. Yashchin
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4553--4556
Efficient model-based speech separation and denoising using non-negative subspace analysis
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 1833--1836
S.J. Rennie, J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 1833--1836
Speech transformation solutions
D Kanevsky, S Basson, A Faisman, L Rachevsky, A …
Cognition Distributed: How Cognitive Technology Extends Our …, 2008 - books.google.com
D Kanevsky, S Basson, A Faisman, L Rachevsky, A …
Cognition Distributed: How Cognitive Technology Extends Our …, 2008 - books.google.com
System and method for management of call data using a vector based model and relational data …
C Wu, A Sakrajda, HJ Kuo, V Goel, D Lubensky
C Wu, A Sakrajda, HJ Kuo, V Goel, D Lubensky
Optimizing speech recognition grammars using a measure of similarity between hidden Markov models
B Mohanty, J Hershey, P Olsen, S Kozat, V Goel
Acoustics, Speech and Signal Processing, 2008, pp. 4953--4956
B Mohanty, J Hershey, P Olsen, S Kozat, V Goel
Acoustics, Speech and Signal Processing, 2008, pp. 4953--4956
Bag-Of-Word normalized n-gram models
A Sethy, B Ramabhadran
A Sethy, B Ramabhadran, 2008
A Sethy, B Ramabhadran
A Sethy, B Ramabhadran, 2008
A study of unsupervised clustering techniques for language modeling
S Hahn, A Sethy, H K J Kuo, B Ramabhadran
S Hahn, A Sethy, HKJ Kuo, B Ramabhadran, 2008
S Hahn, A Sethy, H K J Kuo, B Ramabhadran
S Hahn, A Sethy, HKJ Kuo, B Ramabhadran, 2008
Generalization of Extended Baum-Welch Parameter Estimation for Discriminative Training and Decoding
D Kanevsky, T N Sainath, B Ramabhadran, D Nahamoo
Proceedings of the 9th annual conference of the international speech communication association, Brisbane, Australia, pp. 277--80, 2008
D Kanevsky, T N Sainath, B Ramabhadran, D Nahamoo
Proceedings of the 9th annual conference of the international speech communication association, Brisbane, Australia, pp. 277--80, 2008
A New Family of Extended Baum-Welch Update Rules
Dimitri Kanevsky, Daniel Povey, Bhuvana Ramabhadran, Irina Rish, Tara Sainath
2008
Dimitri Kanevsky, Daniel Povey, Bhuvana Ramabhadran, Irina Rish, Tara Sainath
2008
Gradient steepness metrics using extended Baum-Welch transformations for universal pattern …
TN Sainath, D Kanevsky, B Ramabhadran
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. …, 2008 - ieeexplore.ieee.org
TN Sainath, D Kanevsky, B Ramabhadran
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. …, 2008 - ieeexplore.ieee.org
Boosted MMI for model and feature-space discriminative training
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4057--4060
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4057--4060
Far-field multimodal speech processing and conversational interaction in smart spaces
G Potamianos, J Huang, E Marcheret, V Libal, R Balchandran, M Epstein, L Seredi, M Labsky, L Ures, M Black, others
Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, 2008
G Potamianos, J Huang, E Marcheret, V Libal, R Balchandran, M Epstein, L Seredi, M Labsky, L Ures, M Black, others
Proceedings of the Joint Workshop on Hands-Free Speech Communication and Microphone Arrays, 2008
Automatic Speech Recognition in CHIL
G Potamianos, L Lamel, M W{\"o}lfel, J Huang, E Marcheret, C Barras, X Zhu, J McDonough, J Hernando, D Macho, others
iit.demokritos.gr, 2008
G Potamianos, L Lamel, M W{\"o}lfel, J Huang, E Marcheret, C Barras, X Zhu, J McDonough, J Hernando, D Macho, others
iit.demokritos.gr, 2008
The IBM RT07 evaluation systems for speaker diarization on lecture meetings
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 497--508, Springer, 2008
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 497--508, Springer, 2008
The IBM rich transcription 2007 speech-to-text systems for lecture meetings
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 429--441, Springer, 2008
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 429--441, Springer, 2008
Beyond Linear Transforms: Efficient Non-linear Dynamic Adaptation for Noise Robust Speech Recognition
Steven J Rennie, Pierre L Dognin
Interspeech, 2008
Steven J Rennie, Pierre L Dognin
Interspeech, 2008
2007
Approximating the Kullback Leibler divergence between Gaussian mixture models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--317
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--317
Finding disease similarity by combining ECG with heart auscultation sound
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2007, pp. 261--264
F Wang, T Syeda-Mahmood, D Beymer
Computers in Cardiology, 2007, pp. 261--264
AALIM: Multimodal mining for cardiac decision support
T Syeda-Mahmood, F Wang, D Beymer, A Amir, M Richmond, S Hashmi
Computers in Cardiology, 2007, pp. 209--212
T Syeda-Mahmood, F Wang, D Beymer, A Amir, M Richmond, S Hashmi
Computers in Cardiology, 2007, pp. 209--212
Characterizing spatio-temporal patterns for disease discrimination in cardiac echo videos
T Syeda-Mahmood, F Wang, D Beymer, M London, R Reddy
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2007, pp. 261--269, Springer Berlin Heidelberg
T Syeda-Mahmood, F Wang, D Beymer, M London, R Reddy
Medical Image Computing and Computer-Assisted Intervention--MICCAI 2007, pp. 261--269, Springer Berlin Heidelberg
How Predictable is ASR Confidence in Dialog Applications?
X Li, J M Huerta
Eighth Annual Conference of the International Speech Communication Association, 2007
X Li, J M Huerta
Eighth Annual Conference of the International Speech Communication Association, 2007
Single channel speech separation using factorial dynamics
J.R. Hershey, T. Kristjansson, S. Rennie, P.A. Olsen
Advances in Neural Information Processing Systems19, 593, MIT; 1998, 2007
J.R. Hershey, T. Kristjansson, S. Rennie, P.A. Olsen
Advances in Neural Information Processing Systems19, 593, MIT; 1998, 2007
Word confusability-measuring hidden Markov model similarity
J.Y. Chen, P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007
J.Y. Chen, P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007
Discriminative training of subspace constrained GMMs for speech recognition
S Axelrod, V Goel, R Gopinath, P Olsen, K Visweswariah
IEEE Transactions on Speech and Audio Processing 15(1), 172-189, 2007
S Axelrod, V Goel, R Gopinath, P Olsen, K Visweswariah
IEEE Transactions on Speech and Audio Processing 15(1), 172-189, 2007
Bhattacharyya error and divergence using variational importance sampling
P Olsen, J Hershey
Interspeech, Antwerp, Belgium (August 2007)
P Olsen, J Hershey
Interspeech, Antwerp, Belgium (August 2007)
Efficient speaker recognition using approximated cross entropy (ACE)
H Aronowitz, D Burshtein
Audio, Speech, and Language Processing, IEEE Transactions on 15(7), 2033--2043, IEEE, 2007
H Aronowitz, D Burshtein
Audio, Speech, and Language Processing, IEEE Transactions on 15(7), 2033--2043, IEEE, 2007
Discriminative estimation of subspace constrained gaussian mixture models for speech recognition
Scott Axelrod, Vaibhava Goel, Ramesh Gopinath, Peder Olsen, Karthik Visweswariah
Audio, Speech, and Language Processing, IEEE Transactions on 15(1), 172--189, IEEE, 2007
Scott Axelrod, Vaibhava Goel, Ramesh Gopinath, Peder Olsen, Karthik Visweswariah
Audio, Speech, and Language Processing, IEEE Transactions on 15(1), 172--189, IEEE, 2007
Variational probabilistic speech separation using microphone arrays
S. Rennie, P. Aarabi, B. Frey
IEEE Transactions on Speech and Audio Processing, 2007
S. Rennie, P. Aarabi, B. Frey
IEEE Transactions on Speech and Audio Processing, 2007
Bhattacharyya error and divergence using variational importance sampling
P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007
P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007
2006
The IBM expressive text-to-speech synthesis system for American English
J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, M.A. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(4), 1099--1108, IEEE, 2006
J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, M.A. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(4), 1099--1108, IEEE, 2006
Spoken document retrieval from call-center conversations
J Mamou, D Carmel, R Hoory
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 51--58, ACM, 2006
J Mamou, D Carmel, R Hoory
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 51--58, ACM, 2006
Super-human multi-talker speech recognition: The IBM 2006 speech separation challenge system
T. Kristjansson, J. Hershey, P. Olsen, S. Rennie, R. Gopinath
Ninth International Conference on Spoken Language Processing, pp. 97--100, 2006
T. Kristjansson, J. Hershey, P. Olsen, S. Rennie, R. Gopinath
Ninth International Conference on Spoken Language Processing, pp. 97--100, 2006
Single channel speech separation using layered hidden Markov models
J Hershey, T Kristjansson, S Rennie, P Olsen
Advances in Neural Information Processing Systems (NIPS), 2006
J Hershey, T Kristjansson, S Rennie, P Olsen
Advances in Neural Information Processing Systems (NIPS), 2006
Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation
L. Gu, Y. Gao, F.H. Liu, M. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(2), 377--392, IEEE, 2006
L. Gu, Y. Gao, F.H. Liu, M. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(2), 377--392, IEEE, 2006
2005
Contructing ensembles of ASR systems using randomized decision trees
O Siohan, B Ramabhadran, B Kingsbury
Acoustics, Speech, and Signal Processing, 2005, pp. 197--200
O Siohan, B Ramabhadran, B Kingsbury
Acoustics, Speech, and Signal Processing, 2005, pp. 197--200
A distance measure between gmms based on the unscented transform and its application to speaker recognition
J Goldberger, H Aronowitz
Ninth European Conference on Speech Communication and Technology, 2005
J Goldberger, H Aronowitz
Ninth European Conference on Speech Communication and Technology, 2005
fMPE: Discriminatively trained features for speech recognition
Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 961--964, Philadelphia, 2005
Daniel Povey, Brian Kingsbury, Lidia Mangu, George Saon, Hagen Soltau, Geoffrey Zweig
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 961--964, Philadelphia, 2005
Reusable dialog component framework for rapid voice application development
R P Akolkar, T Faruquie, J Huerta, P Kankar, N Rajput, TV Raman, R U Udupa, A Verma
Component-Based Software Engineering, 306--321, Springer, 2005
R P Akolkar, T Faruquie, J Huerta, P Kankar, N Rajput, TV Raman, R U Udupa, A Verma
Component-Based Software Engineering, 306--321, Springer, 2005
Efficient speaker identification and retrieval
H Aronowitz, D Burshtein
Ninth European Conference on Speech Communication and Technology, pp. 2, 2005
H Aronowitz, D Burshtein
Ninth European Conference on Speech Communication and Technology, pp. 2, 2005
Speaker indexing in audio archives using Gaussian mixture scoring simulation
H Aronowitz, D Burshtein, A Amir
Machine Learning for Multimodal Interaction, 243--252, Springer, 2005
H Aronowitz, D Burshtein, A Amir
Machine Learning for Multimodal Interaction, 243--252, Springer, 2005
Voicing features for robust speech detection
T. Kristjansson, S. Deligne, P. Olsen
Ninth European Conference on Speech Communication and Technology, pp. 3, 2005
T. Kristjansson, S. Deligne, P. Olsen
Ninth European Conference on Speech Communication and Technology, pp. 3, 2005
Using semantic analysis to improve speech recognition performance
H. Erdogan, R. Sarikaya, S.F. Chen, Y. Gao, M. Picheny
Computer Speech \& Language 19(3), 321--343, Elsevier, 2005
H. Erdogan, R. Sarikaya, S.F. Chen, Y. Gao, M. Picheny
Computer Speech \& Language 19(3), 321--343, Elsevier, 2005
Subspace constrained Gaussian mixture models for speech recognition
Scott Axelrod, Vaibhava Goel, Ramesh A Gopinath, Peder A Olsen, Karthik Visweswariah
Speech and Audio Processing, IEEE Transactions on 13(6), 1144--1160, IEEE, 2005
Scott Axelrod, Vaibhava Goel, Ramesh A Gopinath, Peder A Olsen, Karthik Visweswariah
Speech and Audio Processing, IEEE Transactions on 13(6), 1144--1160, IEEE, 2005
2004
TV personalization system: design of a TV show recommender engine and interface
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, J …
Personalized Digital Television: Targeting Programs to …, 2004 - citeseerx.ist.psu.edu
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, J …
Personalized Digital Television: Targeting Programs to …, 2004 - citeseerx.ist.psu.edu
METHOD AND APPARATUS FOR ACCESS AND DISPLAY OF CONTENT ALLOWING USERS TO COMBINE MULTIPLE PROFILES
J Martino, J Zimmerman
J Martino, J Zimmerman
METHOD OF POPULATING AN EXPLICIT PROFILE CROSS-REFERENCE TO RELATED APPLICATIONS
J Zimmerman, JA Martino
J Zimmerman, JA Martino
TV Personalization System
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to Individual Viewers, 27, Kluwer Academic Pub, 2004
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to Individual Viewers, 27, Kluwer Academic Pub, 2004
Personalized news retrieval system
J H Elenbaas, N Dimitrova, T McGee, M Simpson, J A Martino, M Abdel-Mottaleb, M Garrett, C Ramsey, H L Wu, R Desai
US Patent App. 10/932,460, 2004 - Google Patents, Google Patents
US Patent App. 10/932,460
J H Elenbaas, N Dimitrova, T McGee, M Simpson, J A Martino, M Abdel-Mottaleb, M Garrett, C Ramsey, H L Wu, R Desai
US Patent App. 10/932,460, 2004 - Google Patents, Google Patents
US Patent App. 10/932,460
TV Personalization System
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to …, 2004 - Kluwer Academic Pub
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to …, 2004 - Kluwer Academic Pub
TV Personalization System
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to …, 2004 - Kluwer Academic Pub
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to …, 2004 - Kluwer Academic Pub
Personalized Digital Television: Targeting Programs to Individual Viewers, volume 6 of Human-Computer Interaction Series, chapter 5
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, J Martino, S Gutta
Kluwer, Kluwer, 2004
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, J Martino, S Gutta
Kluwer, Kluwer, 2004
TV Personalization System
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to Individual Viewers, 27, Kluwer Academic Pub, 2004
D SCHAFFER, S GUTTA, J MARTINO
Personalized Digital Television: Targeting Programs to Individual Viewers, 27, Kluwer Academic Pub, 2004
Automatic recognition of spontaneous speech for access to multilingual oral history archives
M Picheny, J Psutka, B Ramabhadran, D Soergel, T …
Speech and Audio ..., 2004 - ieeexplore.ieee.org
M Picheny, J Psutka, B Ramabhadran, D Soergel, T …
Speech and Audio ..., 2004 - ieeexplore.ieee.org
Single microphone source separation using high resolution signal reconstruction
T Kristjansson, H Attias, J Hershey
Proc, pp. 817--820, 2004
T Kristjansson, H Attias, J Hershey
Proc, pp. 817--820, 2004
Asking the Right Questions: Task Hierarchy Predictive Traversal Mechanisms for Mixed Initiative Dialog Management
J M Huerta
NLUCS 2004, 106, Citeseer
J M Huerta
NLUCS 2004, 106, Citeseer
A session-GMM generative model using test utterance Gaussian mixture modeling for speaker verification
H Aronowitz, D Burshtein, A Amir
dim 1(2), 2, 2004
H Aronowitz, D Burshtein, A Amir
dim 1(2), 2, 2004
The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation
A Sorin, T Ramabadran, D Chazan, R Hoory, M McLaughlin, D Pearce, F C R Wang, Y Zhang
Proc. ICASSP04, 2004
A Sorin, T Ramabadran, D Chazan, R Hoory, M McLaughlin, D Pearce, F C R Wang, Y Zhang
Proc. ICASSP04, 2004
Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition
J Hershey, T Kristjansson, Z Zhang
Transfer10, 15, Citeseer, 2004
J Hershey, T Kristjansson, Z Zhang
Transfer10, 15, Citeseer, 2004
Text independent speaker recognition using speaker dependent word spotting
H Aronowitz, D Burshtein, A Amir
Eighth International Conference on Spoken Language Processing, 2004
H Aronowitz, D Burshtein, A Amir
Eighth International Conference on Spoken Language Processing, 2004
The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction
T Ramabadran, A Sorin, M McLaughlin, D Chazan, D Pearce, R Hoory
Proc. ICASSP, Montreal, Quebec, Canada, 2004
T Ramabadran, A Sorin, M McLaughlin, D Chazan, D Pearce, R Hoory
Proc. ICASSP, Montreal, Quebec, Canada, 2004
3d tracking of morphable objects using conditionally gaussian nonlinear filters
T K Marks, J Hershey, J C Roddey, J R Movellan
Conference on Computer Vision and Pattern Recognition Workshop, 2004, pp. 190--190, Citeseer
T K Marks, J Hershey, J C Roddey, J R Movellan
Conference on Computer Vision and Pattern Recognition Workshop, 2004, pp. 190--190, Citeseer
Speaker indexing in audio archives using test utterance Gaussian mixture modeling
H Aronowitz, D Burshtein, A Amir
2004 - eprints.pascal-network.org
H Aronowitz, D Burshtein, A Amir
2004 - eprints.pascal-network.org
Look or listen: Discovering effective techniques for accessing speech data
S Whittaker, J Hirschberg
Proceedings of Human Computer Interaction, 207--222, Citeseer, 2004
S Whittaker, J Hirschberg
Proceedings of Human Computer Interaction, 207--222, Citeseer, 2004
Segmental minimum Bayes-risk decoding for automatic speech recognition
V Goel, S Kumar, W Byrne
IEEE transactions on Speech and Audio Processing 12(3), 234--249, Citeseer, 2004
V Goel, S Kumar, W Byrne
IEEE transactions on Speech and Audio Processing 12(3), 234--249, Citeseer, 2004
A corpus-based approach to< ahem/> expressive speech synthesis
E Eide, A Aaron, R Bakis, W Hamza, M Picheny, J Pitrelli
Proccedings of 5th ISSW, 79--84, Citeseer, 2004
E Eide, A Aaron, R Bakis, W Hamza, M Picheny, J Pitrelli
Proccedings of 5th ISSW, 79--84, Citeseer, 2004
Modeling inverse covariance matrices by basis expansion
P.A. Olsen, R.A. Gopinath
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. 37--46, IEEE, 2004
P.A. Olsen, R.A. Gopinath
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. 37--46, IEEE, 2004
2003
Search user interface providing mechanism for manipulation of explicit and implicit criteria
JA Martino, L Nikolovska, AF Camplin
JA Martino, L Nikolovska, AF Camplin
METHOD AND APPARATUS FOR SCROLLING ELECTRONIC PROGRAM GUIDE (EPG) AND OTHER TEXTUAL INFORMATION
J MARTINO, J ZIMMERMAN
2003 - wipo.int
J MARTINO, J ZIMMERMAN
2003 - wipo.int
Touch-screen image scrolling system and method
J Zimmerman, J A Martino
US Patent App. 10/736,938, 2003 - Google Patents, Google Patents
US Patent App. 10/736,938
J Zimmerman, J A Martino
US Patent App. 10/736,938, 2003 - Google Patents, Google Patents
US Patent App. 10/736,938
Search user interface with enhanced accessibility and ease-of-use features based on visual metaphors
L Nikolovska, J A Martino, A F Camplin
US Patent 6,505,194, 2003 - Google Patents, Google Patents
US Patent 6,505,194
L Nikolovska, J A Martino, A F Camplin
US Patent 6,505,194, 2003 - Google Patents, Google Patents
US Patent 6,505,194
Towards automatic transcription of large spoken archives-english ASR for the MALACH project
B. Ramabhadran, J. Huang, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--216
B. Ramabhadran, J. Huang, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--216
An architecture for rapid decoding of large vocabulary conversational speech
G Saon, G Zweig, B Kingsbury, L Mangu, U Chaudhari
Eighth European Conference on Speech Communication and Technology, 2003
G Saon, G Zweig, B Kingsbury, L Mangu, U Chaudhari
Eighth European Conference on Speech Communication and Technology, 2003
Recent improvements to the IBM trainable speech synthesis system
E. Eide, A. Aaron, R. Bakis, R. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, others
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--708
E. Eide, A. Aaron, R. Bakis, R. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, others
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--708
A hand-held speech-to-speech translation system
B. Zhou, Y. Gao, J. Sorensen, D. D\'echelotte, M. Picheny
Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669
B. Zhou, Y. Gao, J. Sorensen, D. D\'echelotte, M. Picheny
Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669
Minimum Bayes risk methods in automatic speech recognition
V Goel, W Byrne
Pattern Recognition in Speech and Language Processing, 51--80, 2003
V Goel, W Byrne
Pattern Recognition in Speech and Language Processing, 51--80, 2003
Automatic hierarchical color image classification
J Huang, SR Kumar, R Zabih
EURASIP Journal on Applied Signal Processing, 2003 - hindawi.com
J Huang, SR Kumar, R Zabih
EURASIP Journal on Applied Signal Processing, 2003 - hindawi.com
To mix or not to mix synthetic speech and human speech? Contrasting impact on judge-rated task performance versus self-rated performance and attitudinal responses
Li Gong, Jennifer Lai
International Journal of Speech Technology 6(2), 123--131, Springer, 2003
Li Gong, Jennifer Lai
International Journal of Speech Technology 6(2), 123--131, Springer, 2003
2002
Method and apparatus for defining search queries and user profiles and viewing search results
L Nikolovska, JA Martino, A Camplin
L Nikolovska, JA Martino, A Camplin
Data search user interface with ergonomic mechanism for user profile definition and manipulation
L Nikolovska, JA Martino, AF Camplin
L Nikolovska, JA Martino, AF Camplin
User interface providing automatic generation and ergonomic presentation of keyword search criteria
KP Lee, JA Martino, L Nikolovska, AF Camplin
KP Lee, JA Martino, L Nikolovska, AF Camplin
User interface for reviewing and controlling use of data objects
J Zimmerman, J A Martino, G Roberts
US Patent App. 10/055,338, 2002 - Google Patents, Google Patents
US Patent App. 10/055,338
J Zimmerman, J A Martino, G Roberts
US Patent App. 10/055,338, 2002 - Google Patents, Google Patents
US Patent App. 10/055,338
VCR-style transport for navigating electronic program guide (EPG) and other textual information
M A Jacquelyn, J Zimmerman
US Patent App. 10/071,392, 2002 - Google Patents, Google Patents
US Patent App. 10/071,392
M A Jacquelyn, J Zimmerman
US Patent App. 10/071,392, 2002 - Google Patents, Google Patents
US Patent App. 10/071,392
Context and time sensitive profile builder
J Martino, J Zimmerman, G Roberts
US Patent App. 10/185,405, 2002 - Google Patents, Google Patents
US Patent App. 10/185,405
J Martino, J Zimmerman, G Roberts
US Patent App. 10/185,405, 2002 - Google Patents, Google Patents
US Patent App. 10/185,405
Method and system for displaying search results
J A Martino, L Nikolovska, J De Bont, J Zimmerman
US Patent App. 10/086,008, 2002 - Google Patents, Google Patents
US Patent App. 10/086,008
J A Martino, L Nikolovska, J De Bont, J Zimmerman
US Patent App. 10/086,008, 2002 - Google Patents, Google Patents
US Patent App. 10/086,008
User interface providing automatic organization and filtering of search criteria
K Kurapati, L Nikolovska, J A Martino, A F Camplin
US Patent 6,499,029, 2002 - Google Patents, Google Patents
US Patent 6,499,029
K Kurapati, L Nikolovska, J A Martino, A F Camplin
US Patent 6,499,029, 2002 - Google Patents, Google Patents
US Patent 6,499,029
Large-Vocabulary Speech Recognition Algorithms
M Padmanabhan, M Picheny
COMPUTER, 2002 - doi.ieeecomputersociety.org
M Padmanabhan, M Picheny
COMPUTER, 2002 - doi.ieeecomputersociety.org
Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, Ruhi Sarikaya
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--53
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, Ruhi Sarikaya
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--53
Supporting access to large digital oral history archives
S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, D. Greenberg
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pp. 18--27, 2002
S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, D. Greenberg
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pp. 18--27, 2002
Maximum entropy model for punctuation annotation from speech
J Huang, G Zweig
Seventh International Conference on Spoken Language …, 2002 - ISCA
J Huang, G Zweig
Seventh International Conference on Spoken Language …, 2002 - ISCA
Modeling with a subspace constraint on inverse covariance matrices
S. Axelrod, R. Gopinath, P. Olsen
Seventh International Conference on Spoken Language Processing, 2002
S. Axelrod, R. Gopinath, P. Olsen
Seventh International Conference on Spoken Language Processing, 2002
Maximum likelihood training of bases for rapid adaptation
K Visweswariah, V Goel, R Gopinath
Proc. ICSLP, 2002
K Visweswariah, V Goel, R Gopinath
Proc. ICSLP, 2002
Alignment-based codeword-dependent cepstral normalization
J M Huerta
Speech and Audio Processing, IEEE Transactions on 10(7), 451--459, IEEE, 2002
J M Huerta
Speech and Audio Processing, IEEE Transactions on 10(7), 451--459, IEEE, 2002
Audio-Visual Speech Separation Using Hidden Markov Models
J Hershey, M Case
Proc. Advances in Neural Information Processing Systems14, 2002
J Hershey, M Case
Proc. Advances in Neural Information Processing Systems14, 2002
Audio-visual sound separation via hidden markov models
J Hershey, M Casey
Advances in Neural Information Processing Systems2, 1173--1180, MIT; 1998, 2002
J Hershey, M Casey
Advances in Neural Information Processing Systems2, 1173--1180, MIT; 1998, 2002
MARS: A statistical semantic parsing and generation-based multilingual automatic translation system
Y. Gao, B. Zhou, Z. Diao, J. Sorensen, M. Picheny
Machine Translation 17(3), 185--212, Springer, 2002
Y. Gao, B. Zhou, Z. Diao, J. Sorensen, M. Picheny
Machine Translation 17(3), 185--212, Springer, 2002
Theory and practice of acoustic confusability
H. Printz, P.A. Olsen
Computer Speech \& Language 16(1), 131--164, Elsevier, 2002
H. Printz, P.A. Olsen
Computer Speech \& Language 16(1), 131--164, Elsevier, 2002
A robust high accuracy speech recognition system for mobile applications
S. Deligne, S. Dharanipragada, R. Gopinath, B. Maison, P. Olsen, H. Printz
Speech and Audio Processing, IEEE Transactions on 10(8), 551--561, IEEE, 2002
S. Deligne, S. Dharanipragada, R. Gopinath, B. Maison, P. Olsen, H. Printz
Speech and Audio Processing, IEEE Transactions on 10(8), 551--561, IEEE, 2002
Automatic transcription of broadcast news
SS Chen, E. Eide, MJF Gales, RA Gopinath, D. Kanvesky, P. Olsen
Speech Communication 37(1-2), 69--87, Elsevier, 2002
SS Chen, E. Eide, MJF Gales, RA Gopinath, D. Kanvesky, P. Olsen
Speech Communication 37(1-2), 69--87, Elsevier, 2002
Exploring features from natural language generation for prosody modeling
S Pan, K McKeown, J Hirschberg
Computer speech & language , 2002
S Pan, K McKeown, J Hirschberg
Computer speech & language , 2002
2001
A multi-agent TV recommender
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Proceedings of the UM 2001 workshop Personalization in …, 2001 - www-2.cs.cmu.edu
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Proceedings of the UM 2001 workshop Personalization in …, 2001 - www-2.cs.cmu.edu
A Multi-Agent TV Recorder
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Adaptive Systems Department, Philips Research Briarcliff, 2001
K Kurapati, S Gutta, D Schaffer, J Martino, J …
Adaptive Systems Department, Philips Research Briarcliff, 2001
USER INTERFACE PROVIDING AUTOMATIC GENERATION AND ERGONOMIC PRESENTATION OF KEYWORD SEARCH CRITERIA
L Nikolovska, A CAMPLIN, K LEE, J MARTINO
freepatentsonline.com, 2001
L Nikolovska, A CAMPLIN, K LEE, J MARTINO
freepatentsonline.com, 2001
SEARCH USER INTERFACE FOR CONSTRUCTING AND MANAGING USER PROFILES AND SEARCH CRITERIA
J MARTINO, L Nikolovska, A CAMPLIN
wipo.int, 2001
J MARTINO, L Nikolovska, A CAMPLIN
wipo.int, 2001
METHOD AND APPARATUS FOR REALIZING PERSONALIZED INFORMATION FROM MULTIPLE INFORMATION SOURCES
K Kurapati, J MARTINO
wipo.int, 2001
K Kurapati, J MARTINO
wipo.int, 2001
Method and apparatus for access and display of content allowing users to apply multiple profiles
J Martino, J Zimmerman
US Patent App. 10/037,464, 2001 - Google Patents, Google Patents
US Patent App. 10/037,464
J Martino, J Zimmerman
US Patent App. 10/037,464, 2001 - Google Patents, Google Patents
US Patent App. 10/037,464
Sort slider with context intuitive sort keys
J Martino, J Zimmerman, H Lamers, G Roberts, J Bont de
US Patent App. 10/037,445, 2001 - Google Patents, Google Patents
US Patent App. 10/037,445
J Martino, J Zimmerman, H Lamers, G Roberts, J Bont de
US Patent App. 10/037,445, 2001 - Google Patents, Google Patents
US Patent App. 10/037,445
Method of populating an explicit profile
J Zimmerman, J Martino
US Patent App. 10/040,245, 2001 - Google Patents, Google Patents
US Patent App. 10/040,245
J Zimmerman, J Martino
US Patent App. 10/040,245, 2001 - Google Patents, Google Patents
US Patent App. 10/040,245
Visualization of entertainment content
G Roberts, J Martino, J Debont, L Nikolovska, J Zimmerman
US Patent App. 10/038,874, 2001 - Google Patents, Google Patents
US Patent App. 10/038,874
G Roberts, J Martino, J Debont, L Nikolovska, J Zimmerman
US Patent App. 10/038,874, 2001 - Google Patents, Google Patents
US Patent App. 10/038,874
A Multi-Agent TV Recorder, Adaptive Systems Department
K Kurapati, S Gutta, D Schaffer, J Martino, J Zimmerman
Philips Research Briarcliff, 2001
K Kurapati, S Gutta, D Schaffer, J Martino, J Zimmerman
Philips Research Briarcliff, 2001
A Multi-Agent TV Recommender
K K Srinivas, S Gutta, D Schaffer, J Martino, J Zimmerman
In Proceedings of the UM 2001 workshop “Personalization in Future TV
K K Srinivas, S Gutta, D Schaffer, J Martino, J Zimmerman
In Proceedings of the UM 2001 workshop “Personalization in Future TV
REMOTE CONTROL FOR PROGRAM SELECTION BY GENRE
K I Trovato, P Rankin, D Pelletier, J A Martino, C Ramsey
EP Patent 1,084,571
K I Trovato, P Rankin, D Pelletier, J A Martino, C Ramsey
EP Patent 1,084,571
Minimum Bayes error feature selection for continuous speech recognition
G Saon, M Padmanabhan
Advances in Neural Information Processing Systems, 2001 - reference.kfupm.edu.sa
G Saon, M Padmanabhan
Advances in Neural Information Processing Systems, 2001 - reference.kfupm.edu.sa
Data-driven approach to designing compound words for continuousspeech recognition
G Saon, M Padmanabhan, IBMTJWR Center, Y Heights
IEEE Transactions on Speech and Audio Processing, 2001 - ieeexplore.ieee.org
G Saon, M Padmanabhan, IBMTJWR Center, Y Heights
IEEE Transactions on Speech and Audio Processing, 2001 - ieeexplore.ieee.org
Use of non-negative matrix factorization for language modeladaptation in a lecture transcription …
M Novak, R Mammone, IBMTJWR Center, Y Heights
2001 IEEE International Conference on Acoustics, Speech, and …, 2001 - ieeexplore.ieee.org
M Novak, R Mammone, IBMTJWR Center, Y Heights
2001 IEEE International Conference on Acoustics, Speech, and …, 2001 - ieeexplore.ieee.org
Innovative approaches for large vocabulary name recognition
Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 53--56
Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 53--56
Linear feature space projections for speaker adaptation
G Saon, G Zweig, M Padmanabhan, IBMTJWR Center, Y …
2001 IEEE International Conference on Acoustics, Speech, and …, 2001 - ieeexplore.ieee.org
G Saon, G Zweig, M Padmanabhan, IBMTJWR Center, Y …
2001 IEEE International Conference on Acoustics, Speech, and …, 2001 - ieeexplore.ieee.org
Current status of the IBM trainable speech synthesis system
R. Donovan, A. Ittycheriah, M. Franz, B. Ramabhadran, E. Eide, M. Viswanathan, R. Bakis, W. Hamza, M. Picheny, P. Gleason, others
4th ISCA Tutorial and Research Workshop (ITRW) on Speech Synthesis, 2001
R. Donovan, A. Ittycheriah, M. Franz, B. Ramabhadran, E. Eide, M. Viswanathan, R. Bakis, W. Hamza, M. Picheny, P. Gleason, others
4th ISCA Tutorial and Research Workshop (ITRW) on Speech Synthesis, 2001
Power exponential densities for the training and classification of acoustic feature vectors in speech recognition
S. Basu, C.A. Micchelli, P. Olsen
Journal of Computational and Graphical Statistics 10(1), 158--184, ASA, 2001
S. Basu, C.A. Micchelli, P. Olsen
Journal of Computational and Graphical Statistics 10(1), 158--184, ASA, 2001
Recent advances in speech recognition system for ibm darpa communicator
Y. Gao, H. Erdogan, Y. Li, V. Goel, M. Picheny
SMALL 20(17.0), 16--2, Citeseer, 2001
Y. Gao, H. Erdogan, Y. Li, V. Goel, M. Picheny
SMALL 20(17.0), 16--2, Citeseer, 2001
Automatic analysis of spontaneous facial behavior: A final project report
M S Bartlett, B Braathen, G Littlewort-Ford, J Hershey, I Fasel, T Marks, E Smith, T J Sejnowski, J R Movellan
University of California at San Diego8, 2001
M S Bartlett, B Braathen, G Littlewort-Ford, J Hershey, I Fasel, T Marks, E Smith, T J Sejnowski, J R Movellan
University of California at San Diego8, 2001
2000
Tv content recommender system
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, …
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL …, 2000 - aaai.org
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, …
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL …, 2000 - aaai.org
TV Content Recommender System, 17 th AAAI
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, D Schaffer, J Zimmerman
July-August, July-August, 2000
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, D Schaffer, J Zimmerman
July-August, July-August, 2000
Tv content recommender system
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman, others
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, pp. 1121--1122, 2000
S Gutta, K Kurapati, KP Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman, others
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, pp. 1121--1122, 2000
TV Content Recommender System
S G Kaushal, S Gutta, K Kurapati, K Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman
In Proceedings of the 17th National Conference on Artificial Intelligence, 2000
S G Kaushal, S Gutta, K Kurapati, K Lee, J Martino, J Milanski, J D Schaffer, J Zimmerman
In Proceedings of the 17th National Conference on Artificial Intelligence, 2000
Lattice-based unsupervised MLLR for speaker adaptation
M Padmanabhan, G Saon, G Zweig
ASR2000-Automatic Speech Recognition: Challenges for the new …, 2000 - ISCA
M Padmanabhan, G Saon, G Zweig
ASR2000-Automatic Speech Recognition: Challenges for the new …, 2000 - ISCA
Maximum likelihood discriminant feature spaces
George A Saon, M Padmanabhan, R Gopinath, S Chen
ICASSP 2000
George A Saon, M Padmanabhan, R Gopinath, S Chen
ICASSP 2000
Speech reconstruction from mel frequency cepstral coefficients and pitch frequency
D Chazan, R Hoory, G Cohen, M Zibulski
IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2000
D Chazan, R Hoory, G Cohen, M Zibulski
IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2000
Multistage coarticulation model combining articulatory, formant and cepstral features
Y Gao, R Bakis, J Huang, B Xiang
Sixth International Conference on Spoken Language Processing, 2000
Y Gao, R Bakis, J Huang, B Xiang
Sixth International Conference on Spoken Language Processing, 2000
Towards language independent acoustic modeling
W Byrne, P Beyerlein, J Huerta, S Khudanpur, B Marthi, J Morgan, N Peterek, J Picone, D Vergyri, W Wang
IEEE International Conference on Acoustics Speech and Signal Processing, pp. 1029--1032, 2000
W Byrne, P Beyerlein, J Huerta, S Khudanpur, B Marthi, J Morgan, N Peterek, J Picone, D Vergyri, W Wang
IEEE International Conference on Acoustics Speech and Signal Processing, pp. 1029--1032, 2000
Penalized Maximum LikelihoodEstimators and the Baum Welch Algorithm for the Classifi cation of Acoustic Vectors in Speech Recognition
CA Micchelli, P Olsen
Journal of Computational and Applied Mathematics119, 301--331, 2000
CA Micchelli, P Olsen
Journal of Computational and Applied Mathematics119, 301--331, 2000
Audio-vision: Using audio-visual synchrony to locate sounds
J Hershey, J Movellan
Advances in Neural Information Processing Systems12, 813--819, Citeseer, 2000
J Hershey, J Movellan
Advances in Neural Information Processing Systems12, 813--819, Citeseer, 2000
Penalized maximum-likelihood estimation, the Baum--Welch algorithm, diagonal balancing of symmetric matrices and applications to training acoustic data
C.A. Micchelli, P. Olsen
Journal of computational and applied mathematics 119(1), 301--331, Elsevier, 2000
C.A. Micchelli, P. Olsen
Journal of computational and applied mathematics 119(1), 301--331, Elsevier, 2000
1999
Recent improvements to IBM's speech recognition system forautomatic transcription of broadcast news
Gopinath, D Kanevsky, P Olsen, IBMTJWR Center, Y …
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. …, 1999 - ieeexplore.ieee.org
Gopinath, D Kanevsky, P Olsen, IBMTJWR Center, Y …
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. …, 1999 - ieeexplore.ieee.org
Recent improvements to IBM's speech recognition system for automatic transcription of broadcast news
S.S. Chen, EM Eide, MJF Gales, R.A. Gopinath, D. Kanevsky, P. Olsen
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. Proceedings., 1999 IEEE International Conference on, pp. 37--40
S.S. Chen, EM Eide, MJF Gales, R.A. Gopinath, D. Kanevsky, P. Olsen
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. Proceedings., 1999 IEEE International Conference on, pp. 37--40
Fluctuations of Brownian motion with drift
J G Conlon, P Olsen
Publicacions Matem{\`a}tiques 43(1), 85--125, 1999
J G Conlon, P Olsen
Publicacions Matem{\`a}tiques 43(1), 85--125, 1999
Distortion-class weighted acoustic modeling for Robust Speech Recognition Under GSM RPE-LTP coding
J M Huerta, R M Stern
Proceedings of the Robust Methods for Speech Recognition in Adverse Conditions, Tampere Finland, Citeseer, 1999
J M Huerta, R M Stern
Proceedings of the Robust Methods for Speech Recognition in Adverse Conditions, Tampere Finland, Citeseer, 1999
Cursive word recognition using a random field based hidden Markov model. Int
G Saon
Journal of Pattern Recognition and Artificial Intelligence, 1999
G Saon
Journal of Pattern Recognition and Artificial Intelligence, 1999
Cursive word recognition using a random field based hidden Markov model
G Saon
International Journal on Document Analysis and Recognition, 1999 - Springer
G Saon
International Journal on Document Analysis and Recognition, 1999 - Springer
Spatial color indexing and applications
J Huang, S Ravi Kumar, M Mitra, WJ Zhu, R Zabih
International Journal of Computer Vision, 1999 - Springer
J Huang, S Ravi Kumar, M Mitra, WJ Zhu, R Zabih
International Journal of Computer Vision, 1999 - Springer
1998
Video content management in consumer devices
N Dimitrova, T McGee, H Elenbaas, J Martino
IEEE Transactions on Knowledge and Data Engineering, 988--995, Published by the IEEE Computer Society, 1998
N Dimitrova, T McGee, H Elenbaas, J Martino
IEEE Transactions on Knowledge and Data Engineering, 988--995, Published by the IEEE Computer Society, 1998
Project Reports
L Nikolovska, J Martino
IEEE MultiMedia Magazine 5(2), 78--83, Los Alamitos, CA: IEEE Computer Society, c1994-, 1998
L Nikolovska, J Martino
IEEE MultiMedia Magazine 5(2), 78--83, Los Alamitos, CA: IEEE Computer Society, c1994-, 1998
Spatial browsing to retrieve multimedia information
L Nikolovska, J Martino
Multimedia, IEEE 5(2), 78--83, IEEE, 1998
L Nikolovska, J Martino
Multimedia, IEEE 5(2), 78--83, IEEE, 1998
Speech recognition performance on a voicemail transcription task
M Padmanabhan, E Eide, B Ramabhadran, G Ramaswamy, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998 - ieeexplore.ieee.org
M Padmanabhan, E Eide, B Ramabhadran, G Ramaswamy, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998 - ieeexplore.ieee.org
Acoustics-only based automatic phonetic baseform generation
B Ramabhadran, LR Bahl, PV deSouza, M Padmanabhan, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998 - ieeexplore.ieee.org
B Ramabhadran, LR Bahl, PV deSouza, M Padmanabhan, …
Acoustics, Speech, and Signal Processing, 1998. ICASSP'98. …, 1998 - ieeexplore.ieee.org
A real-time computer vision system for vehicle tracking and traffic surveillance
B Coifman, D Beymer, P McLauchlan, J Malik
Transportation Research Part C, 1998 - Elsevier
B Coifman, D Beymer, P McLauchlan, J Malik
Transportation Research Part C, 1998 - Elsevier
Factor Analysis Invariant to Linear Transformations of Data
RA Gopinath, B Ramabhadran, S Dharanipragada
Fifth International Conference on Spoken Language Processing, 1998 - ISCA
RA Gopinath, B Ramabhadran, S Dharanipragada
Fifth International Conference on Spoken Language Processing, 1998 - ISCA
LVCSR rescoring with modified loss functions: a decision theoreticperspective
V Goel, W Byrne, S Khudanpur
Acoustics, Speech and Signal Processing, 1998. Proceedings …, 1998 - ieeexplore.ieee.org
V Goel, W Byrne, S Khudanpur
Acoustics, Speech and Signal Processing, 1998. Proceedings …, 1998 - ieeexplore.ieee.org
Speech recognition performance on a new voicemail transcription task
M Padmanabhan, B Ramabhadran, S Basu
Fifth International Conference on Spoken Language …, 1998 - ISCA
M Padmanabhan, B Ramabhadran, S Basu
Fifth International Conference on Spoken Language …, 1998 - ISCA
Transcription of broadcast news-some recent improvements to IBM's LVCSR system
L. Polymenakos, P. Olsen, D. Kanvesky, RA Gopinath, PS Gopalakrishnan, S. Chen
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on, pp. 901--904
L. Polymenakos, P. Olsen, D. Kanvesky, RA Gopinath, PS Gopalakrishnan, S. Chen
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on, pp. 901--904
IBM's LVCSR System for Transcription of Broadcast News used in the 1997 HUB4 English Evaluation
S. Chen, MJF Gales, PS Gopalakrishnan, RA Gopinath, H. Printz, D. Kanevsky, P. Olsen, L. Polymenakos
Proceedings of the Speech Recognition Workshop, 1998
S. Chen, MJF Gales, PS Gopalakrishnan, RA Gopinath, H. Printz, D. Kanevsky, P. Olsen, L. Polymenakos
Proceedings of the Speech Recognition Workshop, 1998
Speech recognition from GSM codec parameters
J M Huerta, R M Stern
Fifth International Conference on Spoken Language Processing, 1998
J M Huerta, R M Stern
Fifth International Conference on Spoken Language Processing, 1998
An automatic hierarchical image classification scheme
J Huang, SR Kumar, R Zabih
Proceedings of the sixth ACM international conference on …, 1998 - portal.acm.org
J Huang, SR Kumar, R Zabih
Proceedings of the sixth ACM international conference on …, 1998 - portal.acm.org
Micro-events in two serial verb constructions
C Y T Pi, O T Stewart
Proceedings from Semantics and Linguistic Theory VIII, 202, Cornell University, 1998
C Y T Pi, O T Stewart
Proceedings from Semantics and Linguistic Theory VIII, 202, Cornell University, 1998
Robust Speech Recognition in GSM Codec Environments
J M Huerta, H Van Hamme
1998 - Citeseer, Citeseer
J M Huerta, H Van Hamme
1998 - Citeseer, Citeseer
1997
CONIVAS: CONtent-based image and video access system
M Abdel
Mottaleb, N Dimitrova, R Desai, J Martino - Proceedings of the fourth ACM international conference on …, 1997 - portal.acm.org
M Abdel
Mottaleb, N Dimitrova, R Desai, J Martino - Proceedings of the fourth ACM international conference on …, 1997 - portal.acm.org
CONIVAS: CONtent-based Image and Video Access System
M A M N Dimitrova, R D J Martino
ACM/Multimedia Conference Proceedings 1996, pp. 427, 1997
M A M N Dimitrova, R D J Martino
ACM/Multimedia Conference Proceedings 1996, pp. 427, 1997
CONIVAS: CONtent-based image and video access system
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
Proceedings of the fourth ACM international conference on Multimedia, pp. 427--428, 1997
M Abdel-Mottaleb, N Dimitrova, R Desai, J Martino
Proceedings of the fourth ACM international conference on Multimedia, pp. 427--428, 1997
In and Out of the Box: Interaction Paradigms in Electronic Environments
Jacquelyn Martino, Lira Nikolovska
INTERACT '97: Proceedings of the IFIP TC13 Interantional Conference on Human-Computer Interaction, pp. 697--698, Chapman \& Hall, Ltd., 1997
Jacquelyn Martino, Lira Nikolovska
INTERACT '97: Proceedings of the IFIP TC13 Interantional Conference on Human-Computer Interaction, pp. 697--698, Chapman \& Hall, Ltd., 1997
Combining supervised learning with color correlograms for content-based image retrieval
J Huang, SR Kumar, M Mitra
Proceedings of the fifth ACM international conference on …, 1997 - portal.acm.org
J Huang, SR Kumar, M Mitra
Proceedings of the fifth ACM international conference on …, 1997 - portal.acm.org
A Real-Time Computer Vision System for Measuring Traffic Parameters
D Beymer, P McLauchlan, B Coifman, J Malik
IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND …, 1997 - doi.ieeecomputersociety.org
D Beymer, P McLauchlan, B Coifman, J Malik
IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND …, 1997 - doi.ieeecomputersociety.org
High performance unconstrained word recognition system combining hmms and markov random fields
G Saon, A Belaid
Automatic Bankcheck Processing, 1997 - books.google.com
G Saon, A Belaid
Automatic Bankcheck Processing, 1997 - books.google.com
1996
CONIVAS: CONtent-based Image and Video Access System
MAMN Dimitrova, RDJ Martino
Proceedings, 1996 - Association for Computing Machinery
MAMN Dimitrova, RDJ Martino
Proceedings, 1996 - Association for Computing Machinery
CONIVAS: CONtent-based Image and Video Access System
MAMN Dimitrova, RDJ Martino
Proceedings, 1996 - Association for Computing Machinery
MAMN Dimitrova, RDJ Martino
Proceedings, 1996 - Association for Computing Machinery
CONIVAS: CONtent-based Image and Video Access System
M A M N Dimitrova, R D J Martino
System 28(9), 23--32, 1996
M A M N Dimitrova, R D J Martino
System 28(9), 23--32, 1996
ISSUES IN PRACTICAL LARGE VOCABULARY ISOLATED WORD RECOGNITION: THE IBM
SK Das, MA Picheny
Automatic Speech and Speaker Recognition: Advanced Topics355, 457, Springer, 1996
SK Das, MA Picheny
Automatic Speech and Speaker Recognition: Advanced Topics355, 457, Springer, 1996
Negative Eigenvalues of the Schroedinger Equation: AN Approach Through Fractional Integration and Morrey Spaces.
P A Olsen
1996 - adsabs.harvard.edu
P A Olsen
1996 - adsabs.harvard.edu
Diffusion of Directed Polymers in a Strong Random Environment, to appear in J
PA Olsen, R Song
Stat. Phys, 1996
PA Olsen, R Song
Stat. Phys, 1996
A Brownian motion version of the directed polymer problem
J G Conlon, P A Olsen
Journal of statistical physics 84(3), 415--454, Springer, 1996
J G Conlon, P A Olsen
Journal of statistical physics 84(3), 415--454, Springer, 1996
Diffusion of directed polymers in a strong random environment
P Olsen, R Song
Journal of statistical physics 83(3), 727--738, Springer, 1996
P Olsen, R Song
Journal of statistical physics 83(3), 727--738, Springer, 1996
An efficient algorithm for parallel integer multiplication
B Singer, G Saon
Journal of Network and Computer Applications, 1996 - Elsevier
B Singer, G Saon
Journal of Network and Computer Applications, 1996 - Elsevier
1995
Face Recognition From One Example View.
D Beymer, T Poggio, MASSACHUSETTS INST OF TECH …
1995 - doi.ieeecs.org
D Beymer, T Poggio, MASSACHUSETTS INST OF TECH …
1995 - doi.ieeecs.org
Face recognition from one model view
D Beymer, T Poggio
Proc. Fifth Intl Conf. Computer Vision, 1995
D Beymer, T Poggio
Proc. Fifth Intl Conf. Computer Vision, 1995
Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
LR Bahl, S. Balakrishnan-Aiyer, JR Bellgarda, M. Franz, PS Gopalakrishnan, D. Nahamoo, M. Novak, M. Padmanabhan, MA Picheny, S. Roukos
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, pp. 41--44
LR Bahl, S. Balakrishnan-Aiyer, JR Bellgarda, M. Franz, PS Gopalakrishnan, D. Nahamoo, M. Novak, M. Padmanabhan, MA Picheny, S. Roukos
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, pp. 41--44
Fractional integration, Morrey spaces and a Schr{\\"o}dinger equation
P A Olsen
Communications in Partial Differential Equations 20(11-12), 2005--2055, Taylor \& Francis, 1995
P A Olsen
Communications in Partial Differential Equations 20(11-12), 2005--2055, Taylor \& Francis, 1995
Performance of the IBM large vocabulary continuous speech recognition system
M Franz, PS Gopalakrishnan, D Nahamoo, M Novak, …
on the ARPA Wall Street Journal task, in Proc. ICASSP, 1995 - Citeseer
M Franz, PS Gopalakrishnan, D Nahamoo, M Novak, …
on the ARPA Wall Street Journal task, in Proc. ICASSP, 1995 - Citeseer
1993
A method for the construction of acoustic Markov models for words
LR Bahl, PF Brown, PV De Souza, RL Mercer, MA Picheny
Speech and Audio Processing, IEEE Transactions on 1(4), 443--452, IEEE, 1993
LR Bahl, PF Brown, PV De Souza, RL Mercer, MA Picheny
Speech and Audio Processing, IEEE Transactions on 1(4), 443--452, IEEE, 1993
Example Based Image Analysis and Synthesis
D Beymer, A Shashua, T Poggio, MASSACHUSETTS INST …
1993 - cognitrn.psych.indiana.edu
D Beymer, A Shashua, T Poggio, MASSACHUSETTS INST …
1993 - cognitrn.psych.indiana.edu
1992
note on irregular discrete wavelet transform IEEE Trans inform theory
P Olsen, K Seip
Information Theory, IEEE Transactions on 38(2), 861--863, IEEE, 1992
P Olsen, K Seip
Information Theory, IEEE Transactions on 38(2), 861--863, IEEE, 1992
An estimate of an upper bound for the entropy of English
Peter F Brown, Vincent J Della Pietra, Robert L Mercer, Stephen A Della Pietra, Jennifer C Lai
Computational Linguistics 18(1), 31--40, MIT Press, 1992
Peter F Brown, Vincent J Della Pietra, Robert L Mercer, Stephen A Della Pietra, Jennifer C Lai
Computational Linguistics 18(1), 31--40, MIT Press, 1992
Class-based n-gram models of natural language
Peter F Brown, Peter V Desouza, Robert L Mercer, Vincent J Della Pietra, Jenifer C Lai
Computational linguistics 18(4), 467--479, MIT Press, 1992
Peter F Brown, Peter V Desouza, Robert L Mercer, Vincent J Della Pietra, Jenifer C Lai
Computational linguistics 18(4), 467--479, MIT Press, 1992
1991
Finding Junctions Using the Image Gradient - .
DJ Beymer, Massachusetts Institute of Technology, …
1991 - Massachusetts Institute of Technology, Artificial Intelligence …
DJ Beymer, Massachusetts Institute of Technology, …
1991 - Massachusetts Institute of Technology, Artificial Intelligence …
Automatic Phonetic Baseform Determination
LR Bahl, S. Das, PV Desouza, M. Epstein, RL Mercer, B. Merialdo, D. Nahamoo, MA Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 173--176
LR Bahl, S. Das, PV Desouza, M. Epstein, RL Mercer, B. Merialdo, D. Nahamoo, MA Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 173--176
Decision trees for phonological rules in continuous speech
L.R. Bahl, PV deSouza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 185--188
L.R. Bahl, PV deSouza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 185--188
An inequality for rational functions with applications to somestatistical estimation problems
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
IEEE Transactions on Information Theory, 1991 - ieeexplore.ieee.org
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
IEEE Transactions on Information Theory, 1991 - ieeexplore.ieee.org
Context dependent modeling of phones in continuous speech using decision trees
LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Proceedings DARPA Speech and Natural Language Processing Workshop, pp. 264--270, 1991
LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Proceedings DARPA Speech and Natural Language Processing Workshop, pp. 264--270, 1991
1989
Matrix fast match: a fast method for identifying a short list ofcandidate words for decoding
L Bahl, PS Gopalakrishnan, D Kanevsky, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989 - ieeexplore.ieee.org
L Bahl, PS Gopalakrishnan, D Kanevsky, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989 - ieeexplore.ieee.org
A generalization of the Baum algorithm to rational objectivefunctions
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989 - ieeexplore.ieee.org
PS Gopalakrishnan, D Kanevsky, A Nadas, D Nahamoo, …
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., …, 1989 - ieeexplore.ieee.org
Large vocabulary natural language continuous speech recognition
LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, others
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467
LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, others
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467
Speech recognition using noise-adaptive prototypes
A. N\'adas, D. Nahamoo, M.A. Picheny
Acoustics, Speech and Signal Processing, IEEE Transactions on 37(10), 1495--1503, IEEE, 1989
A. N\'adas, D. Nahamoo, M.A. Picheny
Acoustics, Speech and Signal Processing, IEEE Transactions on 37(10), 1495--1503, IEEE, 1989
When natural language is better than menus: A field study
M Walker, S Whittaker
Hewlett Packard Laboratories Technical Report HPL-BRC-TR-89-020, 1989
M Walker, S Whittaker
Hewlett Packard Laboratories Technical Report HPL-BRC-TR-89-020, 1989
Speaking clearly for the hard of hearing III: An attempt to determine the contribution of speaking rate to differences in intelligibility between clear and conversational speech
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 32(3), 600, ASHA, 1989
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 32(3), 600, ASHA, 1989
1988
Decoder selection based on cross-entropies
PS Gopalakrishnan, D. Kanevsky, A. Nadas, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 20--23
PS Gopalakrishnan, D. Kanevsky, A. Nadas, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 20--23
Acoustic Markov models used in the Tangora speech recognition system
LR Bahl, PF Brown, PV De Souza, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 497--500
LR Bahl, PF Brown, PV De Souza, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 497--500
1986
Rapid prototyping and system development: examination of an interface toolkit for voice and telephony applications
J T Richards, S J Boies, J D Gould
SIGCHI Bull. 17(4), 216--220, ACM, 1986
Abstract
J T Richards, S J Boies, J D Gould
SIGCHI Bull. 17(4), 216--220, ACM, 1986
Abstract
Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 29(4), 434, ASHA, 1986
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 29(4), 434, ASHA, 1986
1985
Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 28(1), 96, ASHA, 1985
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 28(1), 96, ASHA, 1985
1983
Recognition of isolated-word sentences from a 5000-word vocabulary office correspondence task
L. Bahl, A. Cole, F. Jelinek, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83., pp. 1065--1067, 1983
L. Bahl, A. Cole, F. Jelinek, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83., pp. 1065--1067, 1983
1982
User Interface for Audio Communication System
SJ Boies, JD Gould, WA Notz, JT Richards, JW Schoonard
IBM Technical Disclosure Bulletin 25(7A), 3371--3377, 1982
SJ Boies, JD Gould, WA Notz, JT Richards, JW Schoonard
IBM Technical Disclosure Bulletin 25(7A), 3371--3377, 1982
Year Unknown
TV personalization system
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, S …
this volume - Springer, 0
J Zimmerman, K Kurapati, AL Buczak, D Schaffer, S …
this volume - Springer, 0
Srinivas Gutta. TV Personalization System: Design of a TV Show Recommender Engine and Interface in: Liliana Ardissono, Alfred Kobsa, Mark Maybury (ed). Personalized Digital Television: Targeting Programs to Individual Viewers
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, J Martino
Kluwer, Kluwer, 0
J Zimmerman, K Kurapati, A L Buczak, D Schaffer, J Martino
Kluwer, Kluwer, 0
An Eye Tracking Study of How Pictures Influence Online Reading
D Beymer, P Orton, D Russell
Human-Computer Interaction--INTERACT 2007, 456--460, Springer
D Beymer, P Orton, D Russell
Human-Computer Interaction--INTERACT 2007, 456--460, Springer
Echocardiogram View Classification using Edge Filtered Scale-invariant Motion
R Kumar, F Wang, D Beymer, T Syeda-Mahmood
Mahmood - Citeseer, Citeseer, 0
R Kumar, F Wang, D Beymer, T Syeda-Mahmood
Mahmood - Citeseer, Citeseer, 0
MAXIMUM LIKELIHOOD TRAINING OF BASES FOR RAPID ADAPTATION (2008)
K Visweswariah, V Goel, R Gopinath
en.scientificcommons.org, 0
K Visweswariah, V Goel, R Gopinath
en.scientificcommons.org, 0
ADAPTATION EXPERIMENTS ON THE SPINE DATABASE WITH THE EXTENDED MAXIMUM LIKELIHOOD LINEAR TRANSFORMATION (EMLLT) MODEL (2008)
R Gopinath, V Goel, K Visweswariah, P Olsen
en.scientificcommons.org, 0
R Gopinath, V Goel, K Visweswariah, P Olsen
en.scientificcommons.org, 0
REAL-TIME SPEECH TRANSCRIPTION SERVICE TO IMPROVE NON-NATIVE SPEAKERS’LISTENING COMPREHENSION
D Jiang, Y Pan, W Liu, Y Qin, M Picheny, P Luther
www-304. ibm .com
D Jiang, Y Pan, W Liu, Y Qin, M Picheny, P Luther
www-304. ibm .com
Audio-Visual Speech Synchrony Detection by a Family of Bimodal Linear Prediction Models
K Kumar, G Potamianos, J Navratil, E Marcheret, V Libal
ece.cmu.edu, 0
K Kumar, G Potamianos, J Navratil, E Marcheret, V Libal
ece.cmu.edu, 0
A Conditional Random Field Approach to Classroom Discourse Analysis using Multilevel Features
J M Huerta
domino.research.ibm.com, 0
J M Huerta
domino.research.ibm.com, 0
Cross-Language Access to Recorded Speech 9n the M ALACE ProIect
D W Oardq, D Demner-Fushmanq, J Hajic, B Ramabhadran, S Gustman, W J Byrne, D Soergelq, B Dorrq, P Resnikq, M Picheny
Fushmanq, J Hajic, B Ramabhadran ... - terpconnect.umd.edu, 0
D W Oardq, D Demner-Fushmanq, J Hajic, B Ramabhadran, S Gustman, W J Byrne, D Soergelq, B Dorrq, P Resnikq, M Picheny
Fushmanq, J Hajic, B Ramabhadran ... - terpconnect.umd.edu, 0
Lossy Speech Compression Via Compressed Sensing-Based Kalman Filtering
A Carmi, D Kanevsky, B Ramabhadran
domino.research. ibm .com, 0
A Carmi, D Kanevsky, B Ramabhadran
domino.research. ibm .com, 0
TY-CONF JO-Multimedia and Expo, IEEE International Conference on TI-Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition SN-SP338
E Marcheret, K Visweswariah
doi.ieeecomputersociety.org, 0
E Marcheret, K Visweswariah
doi.ieeecomputersociety.org, 0
Robust Audio-Visual Speech Synchrony Detection by Generalized Bimodal Linear Prediction
K Kumar, J Navratil, E Marcheret, V Libal, G Potamianos
ece.cmu.edu, 0
K Kumar, J Navratil, E Marcheret, V Libal, G Potamianos
ece.cmu.edu, 0
A Data Visualization and Analysis Method for Natural Language Call Routing System Design
H K J Kuo, V Goel
HKJ Kuo, V Goel, 0
H K J Kuo, V Goel
HKJ Kuo, V Goel, 0
New Adaptation Techniques for Large Vocabulary Continuous
S Recognition, Y Gao, B Ramabhadran, M Picheny
S Recognition, Y Gao , B Ramabhadran, M Picheny, 0
S Recognition, Y Gao, B Ramabhadran, M Picheny
S Recognition, Y Gao , B Ramabhadran, M Picheny, 0
Single-channel speech separation and recognition using loopy belief propagation (PDF)
S J Rennie, J R Hershey, P A Olsen
computer.org, 0
S J Rennie, J R Hershey, P A Olsen
computer.org, 0
Audio-visual speech synchronization detection using a bimodal linear prediction model
K Kumar, J Navratil, E Marcheret, V Libal, G Ramaswamy, G Potamianos
To appear:) Proc. CVPR Biometrics Works., 2009
K Kumar, J Navratil, E Marcheret, V Libal, G Ramaswamy, G Potamianos
To appear:) Proc. CVPR Biometrics Works., 2009