- Computer Science
- Algorithms and Theory
- Artificial Intelligence
- Human Computer Interaction
- Knowledge Discovery and Data Mining
- Natural Language Processing
- Operations Research
- Performance Modeling and Analysis
- Signal Processing
- User Interface Technologies
My current work is in the Natural Language Processing area, and focuses on machine-learning algorithms for information extraction from text.
I belong to the Statistical Multilingual Information Extraction group of the Multilingual NLP Technologies department.
I am the technical lead for the DELPHI consortium team that participates to the BOLT IR task in the DARPA BOLT program. The team includes IBM as the primary and Columbia, UMASS, UMD, and Stanford as partners. I am also the architect of the DELPHI IR system.
I worked on algorithms for the DARPA GALE Distillation task (precursor to the BOLT IR task), and in the last two years of the program I was the principal architect of the distillation system for the Rosetta consortium, lead by IBM.
I served as the Watson chair of the Natural Language Processing Professional Interest Community.
My previous work at IBM has been in areas including intelligent user interfaces, autonomic computing, memory compression, statistical pattern recognition, image digital libraries, data mining, and multidimensional indexing structures.
In my spare time, I have taught Information Theory, as well as Statistical Pattern Recognition at Columbia University, through the EE department.