SystemT - News
2018
- Research paper "DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm Based on Learned High Dimensional Encoding" is accepted at CONLL 2018. (IBM Research Blog Post)
- Research paper "Exploiting Structure in Representation of Named Entities using Active Learning" is accepted at COLING 2018.
- Officially joined NSF Center for Big Learning as an Industry Partner.
- Hosted Stanford professor Mark Musen's visit to IBM Research - Almaden
- Industry track paper on the design and implementation of SystemT is accepted at NAACL-HLT 2018 Industry Track
2017
- Research paper on a Rectangle Mining Method for Understanding the Semantics of Financial Tables is accepted at ICDAR 2017
- Research paper on Hardware Compilation Framework for Text Analytics Queries accepted to Journal of Parallel and Distributed Computing (JPDC)
- Research paper on Active Learning for Black-box Semantic Role Labeling is accepted at IJCAI 2017
- Demo paper on creating and interacting with large-scale knowledge bases is accepted in VLDB 2017 [video][poster]
- Workshop paper on understanding relationships in the financial domain appears in the Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM) Workshop, collocated with SIGMOD 2017 [paper]
- We are giving a talk on Crosslingual Text Analytics at the Natural Language and Dialog Systems Lab, UC Santa Cruz, in May 2017
- Demo paper on learning extractors from examples accepted at SIGMOD 2017 [video] [paper]
- We are teaching a lecture on SystemT at NYU Abu Dhabi in February 2017
- Research paper on learning rules from a small number of examples accepted at CHI 2017 [video] [paper] [slides in pdf]
- We are giving a talk on Declarative Information Extraction and Multilingual SRL at the Stanford Logic Seminar in January 2017.
2016
- We are giving a talk on Declarative Information Extraction and Multilingual SRL at at the International Computer Science Institute (ICSI) in Berkeley [abstract]
- Two COLING 2016 papers accepted! K-SRL: Instance-based Learning for Semantic Role Labeling [paper] and Multilingual Aliasing for Auto-Generating Proposition Banks [paper]
- Multilingual Information Extraction demo acepted at COLING 2016 [Video]
- SystemT MOOC is now online! [Text Analytics with SystemT]
- We will present our semi-automatic approach for generating propositional banks for low-resource languages at EMNLP 2016 [paper]
- We demonstrated our multilingual Semantic Role Labeler at ACL 2016 [video] [paper]
- We gave a talk on SystemT in MIT CSAIL on March 8, 2016 [link]
- We gave a talk on SystemT in University of Maryland, Baltimore County on March 7, 2016
- Shimei Pan taught SystemT in University of Maryland, Baltimore County, in Spring 2016. Shimei said: "The Information Extraction with SystemT class, which was offered for the first time at UMBC, has been a wonderful experience for me and my students. I really enjoyed teaching this course. Students were also very enthusiastic. Based on the feedback from my students, they have learned a lot. Some of my students even want me to offer this class every semester."
2015
- Alon Halevy used SystemT as a hands-on component of a Ph.D course on Data on the Web at the University of Aalborg in Denmark. Alon said: "The tutorial did a great job giving the students the feeling for the challenges involved in extracting structured data from text." - Nov 2015 [link to class]
- We are giving a tutorial on Transparent Machine Learning for Information Extraction at EMNLP 2015 on Sept. 17 [link] [slides in pdf]
- We are demoing VINERy, the latest SystemT Web Tooling in VLDB 2015 on Sept. 2 -3 [video] [link]
- Our ACL 2015 paper on Generating High Quality Proposition Banks for Multilingual Semantic Role Labeling is presented on July 27, 2015
- SystemT is taught in University of Washington, in Spring 2015 [link to class website]
- SystemT is taught in University of Oregon, in Spring 2015 [link to class website]
Before 2015
- We are teaching a class on Information Extraction and SystemT in UC Santa Cruz, in Spring 2014 [link to class website]
- We are teaching 3 lectures on SystemT as part of the Large-Scale Data Integration class in UC Santa Cruz, in Winter 2013 [link to class website]
Awards
2013 - IBM Research Outstanding Technical Accomplishment Award
2008, 2010, 2013 - IBM Research A-Level Accomplishment Award
Recent News
12/13/18
Yunyao gave a talk on Building Domain-Specific Knowledge with Human in the Loop at Robust Machine Learning Algorithms and Systems: Detection & Mitigation of Adversarial Attacks and Anomalies Workshop, National Academies
11/7/18
Yunyao gave a talk on Building Domain-Specific Knowledge with Human in the Loop at University of Michigan AI Lab
07/27/18
Research paper "DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm based on Learned High Dimensional Encoding" is accepted at CONLL 2018 (IBM Research Blog Post).
05/16/18
Research paper "Exploiting Structure in Representation of Named Entities using Active Learning" is accepted at COLING 2018.
05/05/18
Officially joined NSF Center for Big Learning as an Industry Partner.
04/16/18
Demoed LUSTRE an interactive system for entity understanding and standardization at ICDE 2018
04/05/18
Hosted Stanford professor Mark Musen's visit to IBM Research - Almaden
03/26/18
Industry track paper on the design and implementation of SystemT is accepted at NAACL-HLT 2018 Industry Track (the very first industry track at a major NLP conference).
10/04/17
Hosted Univ. of Washington professor Luke Zettlemoyer's visit to IBM Research - Almaden
10/02/17
Yunyao is co-chairing the very first NAACL-HLT Industry Track
08/29/17
Demo paper on Creating and Interacting with Large-Scale Domain-Specific Knowledge Bases is presented at VLDB 2017 [video] [poster]
08/06/17
Research paper on Distant Meta-Path Similarities for Text-Based Heterogeneous Information Networks is accepted at CIKM 2017
06/30/17
Research paper on Crowd-in-the-loop: A Hybrid Approach for Annotating Semantic Roles is accepted at EMNLP 2017
06/08/17
Hosted Stanford professor Dan Jurafsky's visit to IBM Research - Almaden
05/31/17
Research paper on Hardware Compilation Framework for Text Analytics Queries is accepted to Journal of Parallel and Distributed Computing (JPDC)
05/16/17
SEER, a system on learning extractors from examples, presented at CHI and SIGMOD 2017 [video] [paper]
05/16/17
Workshop paper on understanding relationships in the financial domain presented at DSMM 2017 [paper]