Contact Information

Wei Tan
Research Staff Member - distributed computing and big data (NoSQL, Spark, GPU), services computing.
Thomas J. Watson Research Center, Yorktown Heights, NY USA

I currently work on big data and distributed computing systems. Specifically, I am interested in accelerating machine learning using scale-out (e.g., Spark) and scale-up (e.g., GPU) approaches. I also work on NoSQL and services computing.

My work and code have been incorporated into IBM patent portfolio and software products such as BigInsights and Cognos. I am an adjunct professor at Department of Automation, Tsinghua University, China, and an associate editor of IEEE Transactions on Automation Science and Engineering.

What's New.

Talk at IBM Research @ Insight 2015. Accelerating Machine Learning applications on Spark using GPUs (ppt).
Best Paper Award at CCGrid 2015. "Deferred Lightweight Indexing for Log-Structured Key-Value Stores", with Yuzhe Tang, Arun Iyengar, Liana Fong, Ling Liu, Balaji Palanisamy, May, 2015. (paper, ppt, code)

Invited talk at Tsinghua University and BUPT. Services computing: status, reflection and suggestions. Beijing, China, May 2015. (ppt in Chinese)
General co-chair of HotWeb 2015, Washington DC, USA, November 12-13, 2015.
Available from IEEE Xplore (free download with Xplore subsciption) and Amazon.

Brief Bio.

From 2008 to 2010 I worked at Computation Institute, University of Chicago and Argonne National Laboratory, on caGrid Workflow Toolkit, a web-service-based scientific workflow platform for cancer Biomedical Informatics Grid (caBIG), funded by US National Cancer Institute. This system has been adopted by many major US bioinformatics projects.

My awards include the Outstanding Technology Accomplishment Award from IBM (2014), Best Student Paper Award from IEEE ICWS (2014), Best Paper Award from IEEE SCC (2011), Pacesetter Award from Argonne National Laboratory (2010), and caBIG Teamwork Award from the NIH (2008). I got my Ph.D in Control Science and Engineering from Tsinghua University, China.

Research Streams.

Big Data: HBase Index (EDBT 14, ccGrid 15), NoSQL (ICWS 2014 tutorial).

Web service recommendation: IEEE T-ASE 2014, IEEE Computer 2009.

Distributed and cloud computing: IEEE T-ASE 2013, IEEE T-ASE 2012.