Contact Information

Wei Tan
Research Staff Member - distributed computing and big data (GPU, Spark, NoSQL), services computing.
Thomas J. Watson Research Center, Yorktown Heights, NY USA


I currently work on big data and distributed systems. Specifically, I accelerate machine learning algorithms using scale-out (e.g., Spark) and scale-up (e.g., GPU) approaches. I also work on NoSQL and services computing.

My work and code have been incorporated into IBM's patent portfolio and software products such as BigInsights and Cognos. I am an adjunct professor at the Department of Automation, Tsinghua University, China, and an associate editor of IEEE Transactions on Automation Science and Engineering.

What's New.

Paper "Faster and Cheaper: Parallelizing Large-Scale Matrix Factorization on GPUs" (with Liangliang Cao, Liana Fong) accepted by HPDC 2016 (preprint)! By optimizing memory access and parallelization on GPUs, cuMF is much faster and more cost-efficient than the best CPU solutions. See this GTC 2016 talk (video) and code.
 
"Riding and Thriving on the API Hype Cycle: Guidelines for the Enterprise." Maja Vukovic, Jim Laredo, Vinod Muthusamy, Aleksander Slominski, Roman Vaculin, Wei Tan, Vijay Naik, Ignacio Silva-Lepe, Arun Kumar, Biplav Srivastava, Joel W. Branch. Communications of the ACM, March 2016.

Best Paper Award at CCGrid 2015. "Deferred Lightweight Indexing for Log-Structured Key-Value Stores", with Yuzhe Tang, Arun Iyengar, Liana Fong, Ling Liu, Balaji Palanisamy, May 2015. (paper, ppt, code)
 
Available from IEEE Xplore (free download with Xplore subscription) and Amazon.

Brief Bio.

From 2008 to 2010 I worked at the Computation Institute, University of Chicago and Argonne National Laboratory, on the caGrid Workflow Toolkit, a web-service-based scientific workflow platform for the cancer Biomedical Informatics Grid (caBIG). It was funded by the US National Cancer Institute and adopted by many major US bioinformatics projects.

My awards include the Outstanding Technology Accomplishment Award from IBM (2014), Best Student Paper Award at ccGrid (2015), Best Student Paper Award at IEEE ICWS (2014), Best Paper Award at IEEE SCC (2011), Pacesetter Award from Argonne National Laboratory (2010), and caBIG Teamwork Award from the NIH (2008). I received my Ph.D. in Control Science and Engineering from Tsinghua University, China.

Research Streams.

GPU: cuMF (HPDC 16, NIPS 15 WS).

Big Data: HBase Index (EDBT 14, ccGrid 15), NoSQL (ICWS 14 tutorial).

Web service: CACM 16, IEEE T-ASE 14, IEEE Computer 09.

Distributed and cloud computing: IEEE T-ASE 13, IEEE T-ASE 12.