SystemT - News


2018

  • Research paper "DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm Based on Learned High Dimensional Encoding" is accepted at CONLL 2018. (IBM Research Blog Post)
  • Research paper "Exploiting Structure in Representation of Named Entities using Active Learning" is accepted at COLING 2018.
  • Officially joined NSF Center for Big Learning as an Industry Partner.
  • Hosted Stanford professor Mark Musen's visit to IBM Research - Almaden
  • Industry track paper on the design and implementation of SystemT is accepted at NAACL-HLT 2018 Industry Track

2017

  • Research paper on a Rectangle Mining Method for Understanding the Semantics of Financial Tables is accepted at ICDAR 2017
  • Research paper on Hardware Compilation Framework for Text Analytics Queries accepted to Journal of Parallel and Distributed Computing (JPDC)
  • Research paper on Active Learning for Black-box Semantic Role Labeling is accepted at IJCAI 2017
  • Demo paper on creating and interacting with large-scale knowledge bases is accepted in VLDB 2017 [video][poster]
  • Workshop paper on understanding relationships in the financial domain appears in the Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM) Workshop, collocated with SIGMOD 2017 [paper]
  • We are giving a talk on Crosslingual Text Analytics at the Natural Language and Dialog Systems Lab, UC Santa Cruz, in May 2017
  • Demo paper on learning extractors from examples accepted at SIGMOD 2017 [video] [paper]
  • We are teaching a lecture on SystemT at NYU Abu Dhabi in February 2017
  • Research paper on learning rules from a small number of examples accepted at CHI 2017 [video] [paper] [slides in pdf]
  • We are giving a talk on Declarative Information Extraction and Multilingual SRL at the Stanford Logic Seminar in January 2017.

2016

  • We are giving a talk on Declarative Information Extraction and Multilingual SRL at at the International Computer Science Institute (ICSI) in Berkeley [abstract]
  • Two COLING 2016 papers accepted! K-SRL: Instance-based Learning for Semantic Role Labeling [paper] and Multilingual Aliasing for Auto-Generating Proposition Banks [paper]
  • Multilingual Information Extraction demo acepted at COLING 2016 [Video]
  • SystemT MOOC is now online! [Text Analytics with SystemT]
  • We will present our semi-automatic approach for generating propositional banks for low-resource languages at EMNLP 2016 [paper]
  • We demonstrated our multilingual Semantic Role Labeler at ACL 2016 [video] [paper]
  • We gave a talk on SystemT in MIT CSAIL on March 8, 2016 [link]
  • We gave a talk on SystemT in University of Maryland, Baltimore County on March 7, 2016
  • Shimei Pan taught SystemT in University of Maryland, Baltimore County, in Spring 2016. Shimei said: "The Information Extraction with SystemT class, which was offered for the first time at UMBC, has been a wonderful experience for me and my students.  I really enjoyed teaching this course. Students were also very enthusiastic. Based on the feedback from my students, they have learned a lot. Some of my students even want me to offer this class every semester."

2015

  • Alon Halevy used SystemT as a hands-on component of a Ph.D course on Data on the Web at the University of Aalborg in Denmark. Alon said: "The tutorial did a great job giving the students the feeling for the challenges involved in extracting structured data from text." - Nov 2015 [link to class]
  • We are giving a tutorial on Transparent Machine Learning for Information Extraction at EMNLP 2015 on Sept. 17 [link] [slides in pdf]
  • We are demoing VINERy, the latest SystemT Web Tooling in VLDB 2015 on Sept. 2 -3  [video] [link]
  • Our ACL 2015 paper on Generating High Quality Proposition Banks for Multilingual Semantic Role Labeling is presented on July 27, 2015 
  • SystemT is taught in University of Washington, in Spring 2015 [link to class website]
  • SystemT is taught in University of Oregon, in Spring 2015 [link to class website]

Before 2015

  • We are teaching a class on Information Extraction and SystemT in UC Santa Cruz, in Spring 2014 [link to class website]
  • We are teaching 3 lectures on SystemT as part of the Large-Scale Data Integration class in UC Santa Cruz, in Winter 2013 [link to class website]

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 




Awards

2013 - IBM Research Outstanding Technical Accomplishment Award

2008, 2010, 2013 - IBM Research A-Level Accomplishment Award


Recent News

11/7/18

Yunyao gave a talk on Building Domain-Specific Knowledge with Human in the Loop at University of Michigan AI Lab

07/27/18

Research paper "DIMSIM: An Accurate Chinese Phonetic Similarity Algorithm based on Learned High Dimensional Encoding" is accepted at CONLL 2018 (IBM Research Blog Post).

05/16/18

Research paper "Exploiting Structure in Representation of Named Entities using Active Learning" is accepted at COLING 2018.

05/05/18

Officially joined NSF Center for Big Learning as an Industry Partner.

04/16/18

Demoed LUSTRE an interactive system for entity understanding and standardization at ICDE 2018

04/05/18

Hosted Stanford professor Mark Musen's visit to IBM Research - Almaden

03/26/18

Industry track paper on the design and implementation of SystemT is accepted at NAACL-HLT 2018 Industry Track (the very first industry track at a major NLP conference).

10/04/17

Hosted Univ. of Washington professor Luke Zettlemoyer's visit to IBM Research - Almaden

10/02/17

Yunyao is co-chairing the very first NAACL-HLT Industry Track

08/29/17

Demo paper on Creating and Interacting with Large-Scale Domain-Specific Knowledge Bases is presented at VLDB 2017 [video] [poster]

08/06/17

Research paper on Distant Meta-Path Similarities for Text-Based Heterogeneous Information Networks is accepted at CIKM 2017

06/30/17

Research paper on Crowd-in-the-loop: A Hybrid Approach for Annotating Semantic Roles is accepted at EMNLP 2017

06/08/17

Hosted Stanford professor Dan Jurafsky's visit to IBM Research - Almaden

05/31/17

Research paper on Hardware Compilation Framework for Text Analytics Queries is accepted to Journal of Parallel and Distributed Computing (JPDC)

05/16/17

SEER, a system on learning extractors from examples, presented at CHI and SIGMOD 2017 [video] [paper]

05/16/17

Workshop paper on understanding relationships in the financial domain presented at DSMM 2017 [paper]