- Computer Science
- Artificial Intelligence
- Computer Architecture
- Data Management
- Distributed and Fault-Tolerant Computing
- Knowledge Discovery and Data Mining
- Operating Systems
- Services Computing
My current research is in big data and distributed computing systems. Specifically, I am interested in accelerating machine learning using scale-out (e.g., Hadoop/Spark) and scale-up (e.g., GPU). I worked on NoSQL and services computing before.
My work and code have been incorporated into IBM patent portfolio and software products such as BigInsights and Cognos. I am an adjunct professor at Department of Automation, Tsinghua University, China, and an associate editor of IEEE Transactions on Automation Science and Engineering.
From 2008 to 2010 I worked at Computation Institute, University of Chicago and Argonne National Laboratory, on caGrid Workflow Toolkit, a web-service-based scientific workflow platform for cancer Biomedical Informatics Grid (caBIG), funded by US National Cancer Institute. This system has been adopted by many major US bioinformatics projects.