Christoph Tillmann

Overview

Pronouns

He/Him/His

Title

Senior Research Staff Member

Location

IBM Research - Yorktown Heights Yorktown Heights, NY USA

Bio

I am a Research Staff Member in the Neural Machine Translation (NMT) Group at the IBM T.J. Watson Research Center, and I have been working on algorithms for statistical or neural machine translation for more than 25 years.

During my time as a Ph.D. student and in the following years, my research focused on search algorithms for SMT (in particular dynamic-programming (DP) based algorithms) for word-based and phrase-based machine translation models. Here, I proposed a simple, sequential, block-based segmentation model for phrase-based SMT. That work has resulted in a first successful experiment with a linear model which relies entirely on binary feature functions. In addition, I have been working on beam search algorithms for extracting parallel sentence pairs from comparable data.

More recently, I have been working on the training and the evaluation of multi-lingual large language models as well as on HAP (hate/abuse/profanity) filtering to remove harmful content from text.

Personal

Native speaker of German. Excellent reading proficiency in Russian, English, and French. Excellent speaking proficiency in English and Russian. Close to native speaker proficiency in Russian (some days).
In 1986, I won a third place in a nationwide Russian language studies competition for high-school students, Stuttgart, Germany.
In 1991-1992, I have been an exchange student at Leningradskii Gosudarstvennyi Universitet (LGU) , St.Petersburg, Russian Federation with a grant of the German Academic Exchange Service (DAAD) .
During my studies in Computer Science from 1991 to 1995, I received a grant from the Cusanuswerk .
I like reading, in particular French-language and Russian-language works of fiction (mostly). I also love to read non-fiction about (military) history, philosophy and people's biographies. Occasionally I do photography and more recently, I have been playing (too much) chess online.

Publications

Automatic dialect classification for statistical machine translation
- - Saab Mansour
  - Yaser Al-Onaizan
  - et al.
- 2014
- AMTA 2014
Improved Sentence-Level Arabic Dialect Classification
- - Christoph Tillmann
  - Yaser Al-Onaizan
  - et al.
- 2014
- COLING 2014
An efficient unified extraction algorithm for bilingual data
- - Christoph Tillmann
  - Sanjika Hewavitharana
- 2011
- INTERSPEECH 2011
A unified alignment algorithm for bilingual data
- - C. Tillmann
  - S. Hewavitharana
- 2011
- Natural Language Engineering
A beam-search extraction algorithm for comparable data
- - Christoph Tillmann
- 2009
- ACL-IJCNLP 2009
A simple sentence-level extraction algorithm for comparable data
- - Christoph Tillmann
  - Jian-Ming Xu
- 2009
- NAACL-HLT 2009
An online relevant set algorithm for statistical machine translation
- - Christoph Tillmann
  - Tong Zhang
- 2008
- IEEE Transactions on Audio, Speech and Language Processing
TIPS: A translingual information processing system
- - Y. Al-Onaizan
  - R. Florian
  - et al.
- 2003
- NAACL-HLT 2003

Visit Google Scholar