- Computer Science
- Artificial Intelligence
- Computer Architecture
- Data Management
- Distributed and Fault-Tolerant Computing
- Knowledge Discovery and Data Mining
- Operating Systems
- Services Computing
I currently work on big data and distributed computing systems. Specifically, I am interested in accelerating machine learning using scale-out (e.g., Spark) and scale-up (e.g., GPU) approaches. I also work on NoSQL and services computing.
My work and code have been incorporated into IBM patent portfolio and software products such as BigInsights and Cognos. I am an adjunct professor at Department of Automation, Tsinghua University, China, and an associate editor of IEEE Transactions on Automation Science and Engineering.
Best Paper Award at CCGrid 2015. "Deferred Lightweight Indexing for Log-Structured Key-Value Stores", with Yuzhe Tang, Arun Iyengar, Liana Fong, Ling Liu, Balaji Palanisamy, May, 2015. (paper, ppt, code)
From 2008 to 2010 I worked at Computation Institute, University of Chicago and Argonne National Laboratory, on caGrid Workflow Toolkit, a web-service-based scientific workflow platform for cancer Biomedical Informatics Grid (caBIG), funded by US National Cancer Institute. This system has been adopted by many major US bioinformatics projects.