Knowledge Discovery and Data Mining Publications



2014

Provable Deterministic Leverage Score Sampling
Dimitris Papailiopoulos, Anastasios Kyrillidis, Christos Boutsidis
Technical Report, 2014

Random Laplace Feature Maps for Semigroup Kernels on Histograms
Jiyan Yang, Vikas Sindhwani, Quanfu Fan, Haim Avron, Michael Mahoney
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014

Efficient Dimensionality Reduction for Canonical Correlation Analysis
Haim Avron, Christos Boutsidis, Sivan Toledo, Anastasios Zouzias
SIAM Journal on Scientific Computing 36(5), S111-S131, 2014
Preliminary version appeared in the Proceedings of the 30th International Conference on Machine Learning (ICML), 2013

Quasi-Monte Carlo Feature Maps for Shift-Invariant Kernels
Jiyan Yang*, Vikas Sindhwani*, Haim Avron*, Michael Mahoney
Proceedings of the 31th International Conference on Machine Learning (ICML), 2014
(*) Equal contributors.

Kernel Methods Match Deep Neural Networks on TIMIT
Po-Sen Huang, Haim Avron, Tara Sainath, Vikas Sindhwani, Bhuvana Ramabhadran
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2014
Best Student Paper Award

Consumer Segmentation and Knowledge Extraction from Smart Meter and Survey Data
Tri Kurniawan Wijaya, Tanuja Ganu, Dipanjan Chakraborty, Karl Aberer, Deva P. Seetharam
Accepted for publication at SIAM Data Mining (SDM 2014)

Efficient Dimensionality Reduction for Canonical Correlation Analysis
Haim Avron, Christos Boutsidis, Sivan Toledo, Anastasios Zouzias
SIAM Journal on Scientific Computing, to appear 36(5), S111--S131, SIAM, 2014

Random Projections for Linear Support Vector Machines
S. Paul, C. Boutsidis, M. Magdon-Ismail, P. Drineas
ACM Transactions on Knowledge Discovery from Data, to appear, 2014

Loss Localisation in Smart Distribution Networks
Vijay Arya, Balakrishnan Narayanaswamy
Sixth International Conference on Communication Systems and Networks (COMSNETS), IEEE, 2014
(Best Paper Runner up)

A Note on Sparse Least-squares Regression
C. Boutsidis and M. Magdon-Ismail
Information Processing Letters, to appear, 2014

Approximate Spectral Clustering via Randomized Sketching
A. Gittens, A. Kambadur, C. Boutsidis.
Technical Report, updated Feb 15, 2014

Mining generalized spatial association rule
W.S. Dong, A. Hampapur, Z.B. Jiang, H. Li, X. Liu, W. Sun
US Patent 8,819,065


2013

Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data
Mani Abedini, Michael Kirley, Raymond Chiong
The Australasian medical journal 6(5), 272, Australasian Medical Journal, 2013

IBM Research at ImageCLEF 2013 Medical Tasks
Mani Abedini, Liangliang Cao, Noel Codella, Jonathan H. Connell, Rahil Garnavi, Amir Geva, Michele Merler, Quoc- Bao Nguyen, Sharathchandra U. Pankanti, John R. Smith, Xingzhi Sun, and Asaf Tzadok
American Medical Informatics Association, 2013

Learning Hash Codes with Listwise Supervision
J Wang, W Liu, AX Sun, YG Jiang
IEEE International Conference on Computer Vision, 2013

Single Network Relational Transductive Learning
Amit Dhurandhar and Jun Wang
Journal of Artificial Intelligence Research48, 813-839, 2013

Exploring patient risk groups with incomplete knowledge
Xiang Wang, Fei Wang, Jun Wang, Buyue Qian, Jianying Hu
The IEEE International Conference on Data Mining series, pp. 1223-1228, 2013


Semi-Supervised Learning with Manifold Fitted Graphs
Tongtao Zhang, Rongrong Ji, Wei Liu, Dacheng Tao, and Gang Hua
International Joint Conference on Artificial Intelligence (IJCAI), AAAI, 2013
PIC top-quality conference (AI)

Semi-Supervised Learning Using Greedy Max-Cut
J Wang, T Jebara, SF Chang
Journal of Machine Learning Research 14( 771-800), 2013
Abstract

Multiple task learning using iteratively reweighted least square
J Pu, YG Jiang, J Wang, X Xue
Proceedings of the Twenty-Third international joint conference on Artificial Intelligence, pp. 1607-1613, 2013

Query-Adaptive Image Search with Hash Codes
YG Jiang, J Wang, X Xue, SF Chang
IEEE Transactions on Multimedia 15(2), 442-453, 2013

Comparing apples to oranges: a scalable solution with heterogeneous hashing
Mingdong Ou, Peng Cui, Fei Wang, Jun Wang, Wenwu Zhu, Shiqiang Yang
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 230-238, 2013

Inferring connectivity model from meter measurements in distribution networks
Vijay Arya, TS Jayram, Soumitra Pal, Shivkumar Kalyanaraman
Proceedings of the fourth international conference on Future energy systems, pp. 173--182, 2013

Sketching Structured Matrices for Faster Nonlinear Regression
Haim Avron, Vikas Sindhwani, David Woodruff
Advances in Neural Information Processing Systems (NIPS), 2013

Towards Effective Prioritizing Water Pipe Replacement and Rehabilitation
Junchi Yan, Yu Wang, Ke Zhou, Jin Huang, Chunhua Tian, Hongyuan Zha, Weishan Dong
The 23rd International Joint Conference on Artificial Intelligence (IJCAI), pp. 2931--2937, 2013
Abstract   Acceptance rate: 28% (413/1473)

Near-Optimal Column-Based Matrix Reconstruction
C. Boutsidis, P. Drineas, and M. Magdon-Ismail
SIAM Journal on Computing, special issue of FOCS 2011, 2013

Sparse Max-Margin Multiclass and Multi-label Classifier Design for Fast Inference
Tanuja Ganu, Shirish Shevade, S. Sundararajan
Proceedings of the SIAM International Conference on Data Mining (SDM'13), SIAM, 2013. (25% acceptance rate), pp. 1-9

Pipe Failure Prediction: A Data Mining Method
Rui Wang, Weishan Dong, Yu Wang, Ke Tang, and Xin Yao
The 29th IEEE International Conference on Data Engineering (ICDE), pp. 1208--1218, 2013
Abstract   Industry track, full paper. Acceptance rate ~20%

Constrained Text Co-Clustering with Supervised and Unsupervised Constraints
Yangqiu Song, Shimei Pan, Shixia Liu, Furu Wei, Michelle Zhou, Weihong Qian
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2013


Efficient Dimensionality Reduction for Canonical Correlation Analysis
H. Avron, C. Boutsidis, S. Toledo, A. Zouzias
Proceedings of the 30th International Conference on Machine Learning (ICML), 2013

Deterministic Feature Selection for K-means Clustering
C. Boutsidis, M. Magdon-Ismail
IEEE Transactions on Information Theory, 59(9), 6099 - 6110, 2013

Randomized Dimensionality Reduction for K-means Clustering
C. Boutsidis, A. Zouzias, M.W. Mahoney, and P. Drineas
arXiv preprint arXiv:1110.2897, 2013

Improved matrix algorithms via the Subsampled Randomized Hadamard Transform
C. Boutsidis, A. Gittens
SIAM Journal on Matrix Analysis and Applications, 34(3), 1301-1340, 2013

Near-optimal Coresets For Least-Squares Regression
C. Boutsidis, P. Drineas, M. Magdon-Ismail
IEEE Transactions on Information Theory, 59(10), 6880 - 6892, 2013

Faster Subset Selection for Matrices and Applications
H. Avron, C. Boutsidis
SIAM Journal on Matrix Analysis and Applications, 34(4), 1464-1499, 2013

Large-Scale Video Hashing via Structure Learning
G Ye, D Liu, J Wang, SF Chang
IEEE International Conference on Computer Vision, 2013


2012

An enhanced XCS rule discovery module using feature ranking
M Abedini, M Kirley
International Journal of Machine Learning and Cybernetics, 1--15, Springer, 2012

FS-XCS vs. GRD-XCS: An analysis using high-dimensional DNA microarray gene expression data sets
Mani Abedini, Michael Kirley, Raymond Chiong
The Second Australian Workshop on Artificial Intelligence in Health AIH 2012, pp. 21

GPU-accelerated eXtended Classifier System
Mani Abedini, Michael Kirley, Raymond Chiong and T. Weise
IEEE Symposium on Computational Intelligence and Data Mining (CIDM 2013), 2012

Compact Hyperplane Hashing with Bilinear Functions
Wei Liu, Jun Wang, Yadong Mu, Sanjiv Kumar, and Shih-Fu Chang
International Conference on Machine Learning (ICML), ACM, 2012
PIC top-quality conference (AI, Knowledge Discovery & Data Mining)

Robust and Scalable Graph-Based Semisupervised Learning
Wei Liu, Jun Wang, and Shih-Fu Chang
Proceedings of the IEEE 100(9), 2624-2638, IEEE, 2012

Legislative Prediction via Random Walks over a Heterogeneous Graph
J. Wang and K. R. Varshney and A. Mojsilovic
SIAM International Conference on Data Mining, 2012

Supervised Hashing with Kernels
W. Liu and J. Wang and R. Ji and Y.-G. Jiang and S.-F. Chang
IEEE Int. Conf. on Computer Vision and Pattern Recognition, 2012

Compact Hyperplane Hashing with Bilinear Functions
W. Liu and J. Wang and Y. Mu and S. Kumar and S.-F. Chang
The 29th International Conference on Machine Learning, 2012

Fast Graph Construction Using Auction Algorithm
J. Wang and Y. Xiang
The Conference on Uncertainty in Artificial Intelligence, 2012

Scalable Graph-Based Semi-Supervised Learning,
W. Liu, and J. Wang and S.-F. Chang
Proceedings of the IEEE, 2012

Semi-Supervised Hashing for Large Scale Search
J. Wang and S. Kumar and S.-F. Chang
IEEE Trans. on Pattern Analysis and Machine Intelligence, 2012
Abstract


Mass and social media corpus analysis after the 2011 great east Japan earthquake
S. Sato, M. Tatsubori, F. Imamura
Proceedings of the 21st international conference companion on World Wide Web, pp. 711--712, 2012

Location inference using microblog messages
Yohei Ikawa, Miki Enoki, Michiaki Tatsubori
Proceedings of the 21st international conference companion on World Wide Web, pp. 687--690, 2012

On Nested Palindromes in Clickstream Data
Michel Speiser, Gianluca Antonini, Abderrahim Labbi, Juliana Sutanto
The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2012), pp. 1460--1468

Finding needles in the haystack: Search and candidate generation
J. Chu-Carroll, J. Fan, BK Boguraev, D. Carmel, D. Sheinwald, C. Welty
IBM Journal of Research and Development 56(3/4), 2012

Special Questions and Techniques
J. Prager, E. Brown, J. Chu-Carroll
IBM Journal of Research and Development 56(3-4), 11--1, IBM, 2012

Identifying implicit relationships
J Chu-Carroll, E Brown, A Lally, JW Murdock
IBM Journal of Research and Development 56(3-4), 2012

Textual resource acquisition and engineering
J Chu-Carroll, J Fan, N Schlaefer, W. Zadrozny
IBM Journal of Research and Development 56(3-4), 2012

Utility-guided Clustering-based Transaction Data Anonymization
A. Gkoulalas-Divanis, G. Loukides
Transactions on Data Privacy5, 2012

I'd never get out of this!? $%# office: redesigning time management for the enterprise
C. Dugan, W. Geyer, M. Muller, A.N. Valente, K. James, S. Levy, L.T. Cheng, E. Daly, B. Brownholtz
Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems, pp. 1755--1764

SaNDVis: visual social network analytics for the enterprise
Adam Perer, Ido Guy
Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work Companion, pp. 275--276

Bon voyage: social travel planning in the enterprise
Netta Aizenbud-Reshef, Artem Barger, Ido Guy, Yael Dubinsky, Shiri Kremer-Davidson
Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work, pp. 819--828

Impression formation in corporate people tagging
D.R. Raban, A. Danan, I. Ronen, I. Guy
Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems, pp. 569--578

Simulating application resilience at exascale
Rolf Riesen, Kurt Ferreira, Maria Ruiz Varela, Michela Taufer, Arun Rodrigues
Euro-Par 2011 Workshops, Part II, pp. 221--230, Springer, Heidelberg, 2012
4th Workshop on Resiliency in High Performance Computing (Resilience) in Clusters, Clouds, and Grids in conjunction with the 17th International European Conference on Parallel and Distributed Computing (Euro-Par 2011), Bordeaux, France, August


Dynamic Matrix Factorization: A State Space Approach
John Z. Sun, Kush R. Varshney, Karthik Subbian
IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012

Decision Trees for Heterogeneous Dose-Response Signal Analysis
Kush R. Varshney, Moninder Singh, Jun Wang
IEEE Statistical Signal Processing Workshop, 2012

You've got video: increasing clickthrough when sharing enterprise video with email
Mercan Topkara, Shimei Pan, Jennifer Lai, Ahmet Dirik, Steve Wood and Jeff Boston
Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems (CHI)

EWNI: Efficient Anonymization of Vulnerable Individuals in Social Networks
F. Nagle, L. Singh, A. Gkoulalas-Divanis
Advances in Knowledge Discovery and Data Mining, 359--370, Springer, 2012

Action Detection by Fusing Hierarchically Filtered Motion With Spatiotemporal Interest Point Features
Y. Tian, L. Cao, Z. Liu, and Z. Zhang
IEEE Transactions on Systems, Man, and Cybernetic Part C 43(3), 2012

Latent Community Topic Analysis: Integration of Community Discovery with Topic Modeling
Zhijun Yin, Liangliang Cao, Quanquan Gu, Jiawei Han
ACM Transaction on Intelligent Systems and Technology 3(4), 2012

Distributed Learning, Communication Complexity and Privacy
Maria-Florina Balcan, Avrim Blum, Shai Fine, Yishay Mansour
The 25th Conference on Learning Theory (COLT) , 2012

EPIC: a multi-tiered approach to enterprise email prioritization
Jie Lu, Zhen Wen, Shimei Pan and Jennifer Lai
Proceedings of the 2012 ACM international conference on Intelligent User Interfaces (IUI), pp. 199--202


Managing data quality by identifying the noisiest data samples
K Hima Prasad, Snigdha Chaturvedi, Tanveer A Faruquie, L Venkata Subramaniam, Mukesh K Mohania
IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), pp. 90--95, 2012

Automated selection of blocking columns for record linkage
K Hima Prasad, Snigdha Chaturvedi, Tanveer A Faruquie, L Venkata Subramaniam, Mukesh K Mohania
IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), pp. 78--83, 2012

Data consolidation solution for internal security needs
K Hima Prasad, Soujanya Soni, Tanveer A Faruquie, L Venkata Subramaniam
IEEE International Conference on Service Operations and Logistics, and Informatics (SOLI), pp. 84--89, 2012

STATIC ANALYSIS OF CLIENT-SERVER APPLICATIONS USING FRAMEWORK INDEPENDENT SPECIFICATIONS
S. Artzi, R. Berg, Y.A. Haviv, J.T. Peyton Jr, M. Pistoia, M. Sridharan, B. Sharma, O. Weisman, R. Wiener
US Patent 20,120,102,474

Unchanged Object Management
P. Centonze, P.K. Malkin, M. Pistoia
US Patent 20,120,331,445

CONFIDENCE-BASED STATIC ANALYSIS
O. TRIPP, M. PISTOIA
WO Patent WO/2012/041,668

FAULT LOCALIZATION USING DIRECTED TEST GENERATION
S. Artzi, J. Dolby, M. Pistoia, F. Tip
US Patent 20,120,054,552

Global Variable Security Analysis
S. Artzi, R. Berg, J. Peyton, M. Pistoia, M. Sridharan, T. Tateishi, O. Tripp, R. Wiener
US Patent 20,120,131,670

GENERATING SPECIFICATIONS OF CLIENT-SERVER APPLICATIONS FOR STATIC ANALYSIS
S. Artzi, R. Berg, J.T. Peyton Jr, M. Pistoia, M. Sridharan, R. Wiener
US Patent 20,120,102,471

Verification of information-flow downgraders
Y.A. Haviv, R. Hay, M. Pistoia, A. Sharabani, T. Tateishi, O. Tripp, O. Weisman
US Patent 20,120,023,486


Simulating black box test results using information from white box testing
Stephen Fink, Yinnon A Haviv, Roee Hay, Marco Pistoia, Ory Segal, Adi Sharabani, Manu Sridharan, Frank Tip, Omer Tripp, Omri Weisman, others
US Patent App. 13/493,067

Power-Efficient Time-Sensitive Mapping in CPU/GPUHeterogeneous Systems
Cong Liu, Jian Li, Wei Huang, Juan Rubio, Evan Speight, Xiaozhu Felix Lin
International Conference on Parallel Architectures and Compilation Techniques (PACT 2012)

Best Faces Forward: A Large-scale Study of People Search in the Enterprise
I. Guy, S. Ur, I. Ronen, S. Weber, T. Oral
Proceedings of the 2012 annual conference on Human factors in computing systems (CHI'12), pp. 1775--1784

Prediction of low energy protein side chain configurations using Markov random fields
Chen Yanover, Menachem Fromer
Bayesian Methods in Structural Bioinformatics, Springer-Verlag, 2012

Fusing biographical and biometric classifiers for improved person identification
Vivek Tyagi, Hima P Karanam, Tanveer A Faruquie, LV Subramaniam, Nalini Ratha
International Conference on Pattern Recognition (ICPR) , pp. 2351--2354, 2012

A fast and scalable low dimensional solver for charged particle dynamics in large particle accelerators
Yves Ineichen, Andreas Adelmann, Costas Bekas, Alessandro Curioni, Peter Arbenz
Computer Science-Research and Development, 1-8, Springer, 2012

Latent Association Analysis of Document Pairs
Gengxin Miao, Ziyu Guan, Louise Moser, Xifeng Yan, Shu Tao, Nikos Anerousis, Jimeng Sun
The 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 1415--1423, 2012

Semantic model vectors for complex video event recognition
Michele Merler, Bert Huang, Lexing Xie, Gang Hua, Apostol Natsev
Multimedia, IEEE Transactions on 14(1), 88--101, IEEE, 2012

Hap-seq: An Optimal Algorithm for Haplotype Phasing with Imputation Using Sequencing Data
D. He, B. Han, E. Eskin
Research in Computational Molecular Biology, pp. 64--78, 2012

Levodopa and the feedback process on set-shifting in parkinson's disease
W.L. Au, J. Zhou, P. Palmes, Y.Y. Sitoh, L. Tan, J.C. Rajapakse
Human brain mapping, Wiley Online Library, 2012

Perfect hashing and cnf encodings of cardinality constraints
Yael Ben-Haim, Alexander Ivrii, Oded Margalit, Arie Matsliah
Theory and Applications of Satisfiability Testing--SAT 2012, pp. 397--409, Springer

Scene Aligned Pooling for Complex Video Recognition
Liangliang Cao, Yadong Mu, Paul Natsev, Shih-Fu Chang, Gang Hua, John R. Smith
ECCV, pp. 688--701, Springer, 2012

Submodular Video Hashing: A Unified Framework towards Video Pooling and Indexing
Liangliang Cao, Zhenguo Li, Yadong Mu, Shih-Fu Chang,
ACM Multimedia 2012

Observations regarding the immunogenicity of BDD-rFVIII derived from a mechanistic personalized medicine perspective
Zuben E. Sauna, Afshin Ameri, Benjamin Kim, Chen Yanover, Kevin R. Viel, Raja Rajalingam, Shelley A. Cole, Tom E. Howard
Journal of Thrombosis and Haemostasis 10(9), 1961--1965, Blackwell Publishing Ltd, 2012

Web-Scale Multimedia Information Networks
Guo-Jun Qi, Shen-Fu Tsai, Min-Hsuan Tsai, Liangliang Cao, and T. S. Huang,
Proceedings of the IEEE, 2012

Making sense of healthcare benefits
J. Bnayahu, M. Goldstein, M. Nisenson, Y. Simionovici
Software Engineering (ICSE), 2012 34th International Conference on, pp. 1034--1043

Efficient and Practical Stochastic Subgradient Descent for Nuclear Norm Regularization
Haim Avron, Satyen Kale, Shiva Kasiviswanathan, Vikas Sindhwani
Proceedings of the 29th International Conference on Machine Learning (ICML), 2012
Extended version appeared as an IBM Research Report (http://domino.research.ibm.com/library/cyberdig.nsf/papers/B6A6347CBFD55F4285257A1300500242)

Delta-SimRank Computing on MapReduce
Liangliang Cao, Hyun Duk Kim, Min-Hsuan Tsai, Brian Cho, Zhen Li, Indy Gupta, ChengXiang Zhai, and Thomas S. Huang
1st International Workshop on Big Data, Streams and Heterogeneous Source Mining (BigMine), 2012

Relation Extraction and Scoring in DeepQA.
Chang Wang, Aditya A. Kalyanpur, James Fan, Bran Boguraev, David Gondek
IBM Journal of Research and Development., 2012

DuSCA: A Multi-Channeling Strategy for Doubling Communication Capacity in Wireless NoC
Yi Wang, Danella Zhao and Jian Li
The 30th IEEE International Conference on Computer Design , 2012

Experimental Analysis of Different Optimization Techniques on Leakage Localization Using Series Alignment
Yu Wang, Junchi Yan, Chunhua Tian, Weishan Dong, and Jin Huang
IEEE International Conference on Service Operations, Logistics, and Informatics (SOLI), pp. 144--149, 2012

Point pattern analysis utilizing controlled randomization for police tactical planning
L. Li, W. Dong, C. Tian, W. Sun
Service Operations and Logistics, and Informatics (SOLI), 2012 IEEE International Conference on, pp. 13--18

Smarter City: Vision and Practice (in Chinese)
C Zhang, C Tian, Y Wang, J Huang, J Yan, W Sun, W Dong, L Li, X Fei, P Gao, F Cao, L Guan, Y Wu, Y Chen, J Sun, M Li, J Zhao, H Liu, and Y Dai
The Publishing House of Electronics Industry, 2012

Structured Data and Inference in DeepQA
A. Kalyanpur, B. Boguraev, S. Patwardhan, J.W. Murdock, A. Lally, C. Welty, J. Prager, B. Coppola, A. Fokoue
IBM Journal of Research and Development 56(3/4), 10:1 - 10:14, IBM, 2012

Temporal analytics on big data for web advertising
Badrish Chandramouli, Jonathan Goldstein, Songyun Duan
International Conference on Data Engineering (ICDE), Best Paper Award, 2012

Scalable multi-query optimization for SPARQL
Wangchao Le, Anastasios Kementsietsidis, Songyun Duan, Feifei Li
International Conference on Data Engineering (ICDE), 2012

Data and Task Parallelism in ILP using MapReduce
Ashwin Srinivasan, Tanveer A Faruquie, Sachindra Joshi
Machine Learning 86(1), 141-168, Springer, 2012

Layout decomposition for triple-patterning lithography
R. S. Ghaida, K. B. Agarwal, L. W. Liebmann, S. R. Nassif, and P. Gupta
SPIE Advanced Lithography - Design for Manufacturability through Design-Process Integration, 2012

Fault Localization for Dynamic Web Applications
S Artzi, J Dolby, F Tip, M Pistoia
IEEE Transactions on Software Engineering 38(2), 314-335, Published by the IEEE Computer Society, 2012

Privacy-Preserving Medical Data Sharing
A. Gkoulalas-Divanis, G. Loukides
SDM Tutorial, 2012

A Bayesian Markov-switching Model for Sparse Dynamic Network Estimation
H. Jiang, A.Lozano, F. Liu
SDM 2012: Proceedings of 2012 SIAM International Conference on Data Mining

Discovery of Generalized Spatial Association Rules
Weishan Dong, Li Li, Changjin Zhou, Yu Wang, Min Li, Chunhua Tian, Wei Sun
IEEE International Conference on Service Operations, Logistics, and Informatics (SOLI), pp. 60--65, 2012
Abstract

A General Framework to Encode Heterogeneous Information Sources for Contextual Pattern Mining
Weishan Dong, Wei Fan, Lei Shi, Changjin Zhou, and Xifeng Yan
The 21st ACM International Conference on Information and Knowledge Management (CIKM), pp. 65--74, 2012
Abstract   Full regular paper. Full paper acceptance rate: 13.4% (146/1088)

Relational Rule Learning in Decoupled Heterogeneous Subspaces
Xin Zhang, Ning Duan, Weishan Dong
IEEE International Conference on Service Operations, Logistics, and Informatics (SOLI), pp. 66--71, 2012

Detecting Irregularly Shaped Significant Spatial and Spatio-Temporal Clusters
W. Dong, X. Zhang, L. Li, C. Sun, L. Shi, W. Sun
SIAM International Conference on Data Mining (SDM), pp. 732--743, 2012
Abstract   Acceptance rate: 27% (99/363). GridScan code here.


2011

Guided Rule Discovery in XCS for High-Dimensional Classification Problems
M Abedini, M Kirley
AI 2011: Advances in Artificial Intelligence, 1--10, Springer

Hashing with Graphs
W. Liu and J. Wang and S. Kumar and S.-F. Chang
The 28th International Conference on Machine Learning, 2011





Processing call requests with respect to objects
M. Tatsubori, T. Takase, Y. Nakamura
US Patent 7,865,595

Ranking web-based partial orders by significance using a markov reference model
M. Speiser, G. Antonini, A. Labbi
Data Mining (ICDM), 2011 IEEE 11th International Conference on, pp. 665--674

Reliability models applied to a system of power converters in particle accelerators
D. Siemaszko, M. Speiser, S. Pittet
Power Electronics and Applications (EPE 2011), Proceedings of the 2011-14th European Conference on, pp. 1--9

Moving towards a collaborative decision support system for aeronautical data
L.I. Rusu, W. Rahayu, T. Torabi, F. Puersch, W. Coronado, A.T. Harris, K. Reed
Journal of Intelligent Manufacturing, 1--16, Springer, 2011

Privacy concerns in enterprise social travel: attitudes and actions
Netta Aizenbud-Reshef, Artem Barger, Yael Dubinsky, Ido Guy, Shiri Kremer-Davidson
Human-Computer Interaction--INTERACT 2011, 242--249, Springer

3rd workshop on recommender systems and the social web
J. Freyne, S.S. Anand, I. Guy, A. Hotho
Proceedings of the fifth ACM conference on Recommender systems, pp. 383--384, 2011

Triage of electronic mail
I. Guy
US Patent 7,890,596

AGGREGATION OF SOCIAL NETWORK DATA
I. GUY I. RONEN S. UR
WO Patent WO/2011/089,039

IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System
Liangliang Cao, Shih-Fu Chang, Noel Codella, Courtenay Cotton, Dan Ellis, Leiguang Gong, Matthew Hill, Gang Hua, John Kender, Michele Merler, Yadong Mu, Apostol Natseve, John R. Smith
NIST TRECVID Workshop, 2011


System, Method and Apparatus for Simultaneous Definition and Enforcement of Access-control and Integrity Policies
P. CENTONZE, Y.A. HAVIV, R. HAY, M. PISTOIA, A. SHARABANI, O. TRIPP
WO Patent WO/2011/062,674

Path-and index-sensitive string analysis based on monadic second-order logic
Winner of the ACM Distinguished Paper Award

T. Tateishi, M. Pistoia, O. Tripp
International Symposium on Software Testing and Analysis, 2011

Latent topic model-based group activity discovery
Tanveer A. Faruquie, Subhashis Banerjee, Prem Kalra
The Visual Computer 27(12), 1071-1082, Springer, 2011

Optimizing energy functions for protein-protein interface design
Oz Sharabi, Chen Yanover, Ayelet Dekel, Julia M Shifman
Journal of Computational Chemistry32, 23--32, 2011

Haemophilia Management: Time to Get Personal?
Tom E. Howard, Chen Yanover, Johnny Mahlangu, Amanda Krause, Kevin R. Viel, Carol K. Kasper, Kathleen P. Pratt
Haemophilia 17(5), 721--728, 2011

Extensive protein and DNA backbone sampling improves structure-based specificity prediction for C2H2 zinc fingers
Chen Yanover, Philip Bradley
Nucleic Acids Research 39(11), 4564-4576, 2011

Large-scale characterization of peptide-MHC binding landscapes with structural simulations
Chen Yanover, Philip Bradley
Proceedings of the National Academy of Sciences of the United States of America 108(17), 6981--6986, 2011

Pharmacogenetics and the immunogenicity of protein therapeutics
Chen Yanover, Nisha Jain, Glenn Pierce, Tom E Howard, Zuben E Sauna
Nature biotechnology 29(10), 870--873, 2011

HLA mismatches and hematopoietic cell transplantation: structural simulations assess the impact of changes in peptide binding specificity on transplant outcome
Chen Yanover, Effie W. Petersdorf, Mari Malkki, Ted Gooley, Stephen Spellman, Andrea Velardi, Peter Bardy, Alejandro Madrigal, Jean-Denis Bignon, Philip Bradley
Immunome Research7, 2:4, 2011

Machine learning competition in immunology: Prediction of HLA class I binding peptides
Guang Lan Zhang, Hifzur Rahman Ansari, Phil Bradley, Gavin C. Cawley, Tomer Hertz, Xihao Hu, Nebojsa Jojic, Yohan Kim, Oliver Kohlbacher, Ole Lund, Claus Lundegaard, Craig A. Magaret, Morten Nielsen, Harris Papadopoulos, G P S Raghava, Tal Vider-Shalit, L
Journal of Immunological Methods 374(1-2), 1--4, 2011

Analysis, Indexing and Visualization of Presentation Videos
Michele Merler, John R. Kender
ACM Multimedia Doctoral Symposium, pp. 871, 2011

Topical semantics of twitter links
M.J. Welch, U. Schonfeld, D. He, J. Cho
Proceedings of the fourth ACM international conference on Web search and data mining, pp. 327--336, 2011

Selecting the Best Faces to Index Presentation Videos
Michele Merler and John Kender
ACM Multimedia, pp. 1461 - 1464, 2011

An optimal weighted aggregated association test for identification of rare variants involved in common diseases
J.H. Sul, B. Han, D. He, E. Eskin
Genetics 188(1), 181--188, Genetics Soc America, 2011

Learning the funding momentum of research projects
D. He, D. Parker
Advances in Knowledge Discovery and Data Mining, 532--543, Springer, 2011

Efficient algorithms for tandem copy number variation reconstruction in repeat-rich regions
D. He, F. Hormozdiari, N. Furlotte, E. Eskin
Bioinformatics 27(11), 1513--1520, Oxford Univ Press, 2011

Using HLA binding prediction algorithms for epitope mapping in HIV vaccine clinical trials
D. He, P. Kunwar, E. Eskin, H. Horton, P. Gilbert, T. Hertz
Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine, pp. 594--601, 2011

MINING APPROXIMATE REPEATING PATTERNS FROM SEQUENCE DATA WITH GAP CONSTRAINTS
D. He, X. Zhu, X. Wu
Computational Intelligence 27(3), 336--362, Wiley Online Library, 2011

Genotyping common and rare variation using overlapping pool sequencing
D. He, N. Zaitlen, B. Pasaniuc, E. Eskin, E. Halperin
BMC Bioinformatics 12(Suppl 6), S2, BioMed Central Ltd, 2011

How Does Research Evolve? Pattern Mining for Research Meme Cycles
D. He, X. Zhu, D.S. Parker
Data Mining (ICDM), 2011 IEEE 11th International Conference on, pp. 1068--1073

Near-Optimal Column-Based Matrix Reconstruction
C. Boutsidis, P. Drineas, and M. Magdon-Ismail
Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2011

Sparse Features for PCA-like Linear Regression
C. Boutsidis, P. Drineas, and M. Magdon-Ismail
Annual Conference on Neural Information Processing Systems (NIPS), 2011

Atomic-level characterization of the ensemble of the Ab(1-42) monomer in water using unbiased molecular dynamics simulations and spectral algorithms
N. Sgourakis, M. Serrano, C. Boutsidis, P. Drineas, Z. Du, C. Wang, and A. Garcia.
Journal of Molecular Biology, 405(2):570-83, 2011.

Topics in Matrix Sampling Algorithms
C. Boutsidis
Ph.D Dissertation, Rensselaer Polytechnic Institute , 2011

System and method for failure association analysis
Wei Shan Dong, Rogerio S Feris, Arun Hampapur, Zhong Bo Jiang, Shilpa N Mahatma, Wei Sun, Lexing Xie
US Patent App. 12/984,019

Refinement of history-based policies
J Lobo, J Ma, A Russo, E Lupu, S Calo, M Sloman
Logic programming, knowledge representation, and nonmonotonic reasoning, 280--299, Springer, 2011

sslGolog: When conditional compositions of web services meet semantic links and causal laws
Freddy Lecue, Alexandre Delteil, Alain Leger
Web Intelligence and Agent Systems 9(1), 1-25, 2011

Adversaries' Holy Grail: access control analytics
I Molloy, J Lobo, S Chari
Proceedings of the First Workshop on Building Analysis Datasets and Gathering Experience Returns for Security, pp. 54--61, 2011

Seeking Quality of Web Service Composition in a Semantic Dimension
Freddy Lecue, Nikolay Mehandjiev
IEEE Trans. Knowl. Data Eng. 23(6), 942-959, 2011

A Hybrid Approach to Recommending Semantic Software Services
Liwei Liu, Freddy Lecue, Nikolay Mehandjiev
ICWS, pp. 379-386, 2011

Inferring Data Flow in Semantic Web Service Composition
Freddy Lecue
ICWS, pp. 347-354, 2011

Personalizing Access to Semantic Web Services
Freddy Lecue
ICWS, pp. 259-266, 2011

Personalizing Your Web Services with Constructive DL Reasoning Join
Freddy Lecue
AAAI, 2011

L1 vs. L2 Regularization in Text Classification when Learning from Labeled Features
S Mazilu, J Iria
Proceedings of the 10th IEEE International Conference on Machine Learning and Applications, 2011

Information Technology For Healthcare Transformation
J. P. Bigus, M. Campbell, B. Carmeli, M. Cefkin, H. Chang, C.-H. Chen-Ritzo, W. F. Cody, S. Ebadollahi, A. Evfimievski, A. Farkash, S. Glissmann, D. Gotz, T. W. A. Grandison, D. Gruhl, P. J. Haas, M. J. H. Hsiao, P.-Y. S. Hsueh, J. Hu, J. M. Jasinski, J.
IBM Journal of Research and Development - Special Issue on the Frontiers of IT, Vol 55, No 5, pp 6:1-6:14 55(5), 6--1, IEEE, 2011

HIgh-frequency graphene amplifier
S. -J. Han, K. A. Jenkins, A. V. Garcia, A. D. Franklin, A. A. Bol, and W. Haensch
Nano Lett11, 3690-3693, 2011

Medical Data Sharing: Privacy Challenges and Solutions
A Gkoulalas-Divanis, G Loukides
ECML/PKDD Tutorial, 2011

On balancing disclosure risk and data utility in transaction data sharing using RU confidentiality map
G Loukides, A Gkoulalas-Divanis, J Shao
Joint UNECE/Eurostat work session on statistical data confidentiality, 2011

Adaptable Fault Identification for Smart Buildings
Anika Schumann, Jer Hayes, Pascal Pompey, Olivier Verscheure
Artificial Intelligence and Smarter Living: The Conquest of Complexity, Papers from the 2011 AAAI Workshop, San Francisco, California, USA, August 8, 2011

A publication process model to enable privacy-aware data sharing
A Gkoulalas-Divanis, E W Cope
IBM Journal of Research and Development (Special Issue for the 100th Anniversary of IBM) 55(5), 8:1-8:10, IEEE, 2011

Recombination gives a new insight in the effective population size and the history of the Old World human populations
M Mel{'e}, A Javed, M Pybus, P Zalloua, M Haber, D Comas, M G Netea, O Balanovsky, E Balanovska, L Jin, others
Molecular Biology and Evolution, SMBE, 2011

Revisiting sequential pattern hiding to enhance utility
A Gkoulalas-Divanis, G Loukides
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1316--1324, 2011

RANDOM INJECTION-BASED DEACTIVATION OF WEB-SCRAPERS
V Bhagwan, T W A Grandison
US Patent 20,110,219,455

Security and privacy enforcement for discovery services in a network of electronic product code information repositories
Anthony C Asher, Steven P Beier, Christian C Clauss, Tyrone WA Grandison, Karin Kailing, Ralf Rantzau, Gary Robinson, others
US Patent 7,866,543

Using Syntactic and Semantic Structural Kernels for Classifying Definition Questions in Jeopardy!
A Moschitti, J Chu-Carroll, S Patwardhan, J Fan, G Riccardi
Proceedings of the Conference on Empirical Methods for Natural Language Processing, pp. 73--76, 2011

Leveraging Wikipedia Characteristics for Search and Candidate Generation in Question Answering
J Chu-Carroll, J Fan
Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Learning Dirichlet Processes from Partially Observed Groups
Avinava Dubey, Indrajit Bhattacharya, Mrinal Das, Tanveer A Faruquie, Chiranjib Bhattacharyya
IEEE International Conference on Data Mining (ICDM), 2011

Adapting a WSJ trained Part-of-Speech tagger to Noisy Text: Preliminary Results
Phani Gadde, L. Venkata Subramaniam, Tanveer A Faruquie
Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data, pp. 5, 2011

Using Text Reviews for Product Entity Completion
Mrinmaya Sachan, Tanveer A Faruquie, L V Subramaniam, Mukesh Mohania
International Joint Conference on Natural Language Processing, (IJCNLP), 2011

Discovering Customer Intent in Real-time for Streamlining Service Desk Conversations
Ullas Nambiar, Tanveer A Faruquie, L V Subramaniam, Sumit Negi, Ganesh Ramakrishnan
ACM Conference on Information and Knowledge Management (CIKM), pp. 1383--1388, 2011

Probabilistic model for discovering topic based communities in social networks
Mrinmaya Sachan, Danish Contractor, Tanveer Faruquie, Venkata Subramaniam
Proceedings of the 20th ACM international conference on Information and knowledge management, pp. 2349--2352, 2011

An OCL-compliant GELLO Engine.
J Mei, H Liu, G Xie, S Liu, B Zhou
XXIII International Conference of the European Federation for Medical Informatics (MIE 2011), pp. 130

WELLNESS DECISION SUPPORT SERVICES
P. Hsueh, S. Ramakrishnan, M. Hsiao

A system and method for in-take and location aware food recommdation for diabetes
M. Hsiao, P. Hsueh, L. Zeng, L. Liu, H. Chang

Health Management Application Development and Deployment Framework
L. Zeng, P. Hsueh, H. Chang

Latent Graphical Models for Quantifying and Predicting Patent Quality.
Liu Y., Hsueh, P. et al.
17th ACM Knowledge Discovery and Data Mining (KDD 2011)(TOP PIC CONFERENCE)

The New World Of Evidence-based Medicine: Thinking Beyond Randomized Controlled Trials.
Grandison et al.
Journal of American Medical Informatics (JAMIA), 2011

Outcome-driven Service Composition in GreenOlive Intelligent Living Platform
Zeng, L., Hsueh, P., and X. Zhu
The 13th International Congress on Medical Informatics (MIE 2011) (TOP PIC CONFERENCE)

Intelligent Nutrition Service for Personalized Dietary Guidelines and Lifestyle Intervention
Hsiao, R., Hsueh, P., et al.
International Joint Conference on Service Sciences, Computer Aided Service Experience Engineering Workshop. , 2011

Privacy Protection Issues for Healthcare Wellness Clouds
Grandison, T., Hsueh, P., Zeng, L., Chang, H.
Privacy Protection Measures and Technologies in Business Organizations , 2011

A novel hybrid gene prediction method employing protein multiple sequence alignments.
Oliver Keller, Martin Kollmar, Mario Stanke, Stephan Waack
Bioinformatics, 2011
Abstract

Cross-species protein sequence and gene structure prediction with fine-tuned Webscipio 2.0 and Scipio.
Klas Hatje, Oliver Keller, Björn Hammesfahr, Holger Pillmann, Stephan Waack, Martin Kollmar
BMC Res Notes4, 265, 2011
Abstract

The IBM 2009 GALE Arabic speech transcription system
Brian Kingsbury, Hagen Soltau, George Saon, Stephen Chu, Hong-Kwang Kuo, Lidia Mangu, Suman Ravuri, Nelson Morgan, Adam Janin
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 4672--4675


A Clustering-based Approach to Ontology Alignment
Songyun Duan, Achille Fokoue, Kavitha Srinivas, Brian Byrne
International Conference on Semantic Web (ISWC), pp. 146--161, Springer, 2011


Digital traces of interest: Deriving interest relationships from social media interactions
Michal Jacovi, Ido Guy, Inbal Ronen, Adam Perer, Erel Uziel, Michael Maslenko
ECSCW 2011: Proceedings of the 12th European Conference on Computer Supported Cooperative Work, 24-28 September 2011, Aarhus Denmark, pp. 21--40, Springer London

Do you want to know?: recommending strangers in the enterprise
Ido Guy, Sigalit Ur, Inbal Ronen, Adam Perer, Michal Jacovi
Proceedings of the ACM 2011 conference on Computer supported cooperative work, pp. 285--294, ACM

Mask assignment for multiple patterning
R. S. Ghaida, K. B. Agarwal, L. W. Liebmann, and S. R. Nassif (filed)

Pin-access maximization under manufacturing constraints
R. S. Ghaida, K. B. Agarwal, L. W. Liebmann, and S. R. Nassif (filed)

Double patterning layout decomposition for ease of conflict removal
R. S. Ghaida, K. B. Agarwal, L. W. Liebmann, and S. R. Nassif (filed)

Resolving double patterning conflict
R. S. Ghaida and K. B. Agarwal (filed)

Double-patterning conflict removal
R. S. Ghaida, K. B. Agarwal, S. R. Nassif, X. Yuan, L. W. Liebmann, and P. Gupta
IEEE/ACM Intl. Conf. on Computer-Aided Design (ICCAD), 2011

Collaborative research on emerging technologies and design
A. R. Neureuther et al.
SPIE Photomask Japan { Photomask and Next-Generation Lithography Mask Technology, 2011

Single-mask double-patterning lithography for reduced cost and improved overlay control
R. S. Ghaida, G. Torres, and P. Gupta
IEEE Trans. On Semiconductor Manufacturing (TSM) 24(1), 381-390, 2011

A Simulation Environment for Vehicle-to-Grid Integration Studies
C. Binding, O. Sundstroem
Summer Computer Simulation Conference (SCSC 2011)

Leveraging Cloud Computing and High Performance Computing Advances for Next-generation Architecture, Urban Design and Construction Projects
Francesco Iorio and Jane L. Snowdon
SimAUD 2011 Conference Proceedings: Symposium on Simulation for Architecture and Urban Design, pp. 69-76

Smarter Cities Series: Understanding the IBM Approach to Efficient Buildings
Brad Brech, Ravirajan Rajan, James Fletcher, Colin Harrison, John Hogan, Lisa Hopkins, Pamela K. Isom, John Meegan, Claire Penny, Jane L. Snowdon, Doug A. Wood
IBM Redbook, REDP-4735-0, IBM Corporation, 2011

Complexity of legacy city resource management and value modeling of interagency response
E M Huestis, J L Snowdon
IBM J. Res. Dev.55, 1--12, IBM Corp., 2011
Abstract

Surface Dynamics of Amorphous Polymers Used for High-Voltage Insulators
P.T. Shemella, T. Laino, O. Fritz, A. Curioni
The Journal of Physical Chemistry B 115(46), 13508-13512, ACS Publications, 2011

Controlling a Voice Site Using Non Standard Haptic Commands
Anupam Jain, Nitendra Rajput, Simon Robinson
Patent IN920110114US1

Efficient genomewide selection of PCA-correlated tSNPs for genotype imputation
Asif Javed, Petros Drineas, Michael W. Mahoney and Peristera Paschou
Annals of Human Genetics - in press, Wiley Online Library, 2011

IRiS*: Construction of ARG networks at genomic scales
A Javed, M Pybus, M Mel{'e}, F Utro, J Bertranpetit, F Calafell, L Parida
Bioinformatics, Oxford Univ Press, 2011

System and computer program product for protecting audio content
(United States: US7978853) R. Krishnapuram, Nandita Mahajan, L V Subramaniam, Vivek Tyagi, Tanveer A Faruquie
US Patent 7,978,853


Method for protecting audio content
(United States: US7974411) R. Krishnapuram, Nandita Mahajan, L V Subramaniam, Vivek Tyagi, Tanveer A Faruquie
US Patent 7,974,411

The Three Steps of Clustering in the Post-Genomic Era: A Synopsis
R Giancarlo, G Lo Bosco, L Pinello, F Utro
Computational Intelligence Methods for Bioinformatics and Biostatistics, pp. 13-30, Springer Berlin / Heidelberg, 2011
10.1007/978-3-642-21946-7_2

Relation Extraction with Relation Topics
Chang Wang, James Fan, Aditya Kalyanpur, and David Gondek
The 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011).

Sequencing of a QTL-rich Region of the Theobroma cacao Genome using Pooled BACs and the Identification of Trait Specific Candidate Genes
F A Feltus, C A Saski, K Mockaitis, N Haiminen, L Parida, Z Smith, J Ford, M E Staton, S P Ficklin, B P Blackmon, C-H Cheng, R J Schnell, D N Kuhn, J-C Motamayor
BMC Genomics 12(379), 379, BioMed Central Ltd, 2011

Folksonomy-Based Term Extraction for Word Cloud Generation
David Carmel, Ido Guy, Erel Uziel, Yosi Mass, Haggai Roitman
20th ACM Conference on Information and Knowledge Management (CIKM), pp. 2437--2440, [Text Mining, Social Media], 2011

Randomization techniques for assessing the significance of gene periodicity results
A Kallio, N Vuokko, M Ojala, N Haiminen, H Mannila
BMC Bioinformatics 12(330), 330, BioMed Central Ltd, 2011

IRiS: Construction of ARG networks at genomic scales
A Javed, M Pybus, M Melè, F Utro, J Bertranpetit, F Calafell, L Parida
Bioinformatics, 2011
Abstract

Discovering evolutionary patterns from gene order
L. Parida, N. Haiminen
Evolutionary genomics: statistical and computational methods (in series Methods in Molecular Biology), Springer Humana, in press, 2011

The relevance of the time domain to neural network models (Springer Series in Cognitive and Neural Systems)
A.R. Rao and G. A. Cecchi (editors)
Springer Verlag, 2011

Full-brain auto-regressive modeling (FARM) using fMRI
R. Garg, G.A. Cecchi, A.R. Rao
Neuroimage, 2011

The effects of feedback and lateral connections on perceptual
A.R. Rao, G.A. Cecchi
International Joint Conference on Neural Networks, IJCNN, IEEE, 2011

Brain as a self-predictor: Sparse full-brain auto-regressive modeling in fMRI
R. Garg, G.A. Cecchi, A.R. Rao
IEEE Symposium on Biomedical Imaging, ISBI, pp. 1581 - 1584, 2011

A spatio-temporal support vector machine searchlight for fMRI analysis,
A.R. Rao, R. Garg, G.A. Cecchi
IEEE Symposium on Biomedical Imaging ISBI, pp. 1023 - 1026, IEEE, 2011

Accelerating statistical image reconstruction algorithms for fan-beam x-ray CT using cloud computing
S. Srivastava, A. R. Rao, and V. Sheinin
SPIE Conference on Medical Imaging, SPIE, 2011

Characteristics of voxel prediction power in full-brain Granger causality analysis of fMRI data
R. Garg, G. A. Cecchi, and A. R. Rao
SPIE Conference on Medical Imaging, SPIE, 2011

An O (n^3/2 sqrt(n)) algorithm for sorting by reciprocal translocations
M Ozery-Flato, R Shamir
Journal of Discrete Algorithms, Elsevier, 2011

Large-scale analysis of chromosomal aberrations in cancer karyotypes reveals two distinct paths to aneuploidy
M Ozery-Flato, C Linhart, L Trakhtenbrot, S Izraeli, R Shamir
Genome Biology 12(6), R61, BioMed Central Ltd, 2011

An Open, Social Microcalendar for the Enterprise: Timely
Werner Geyer, Casey Dugan, Beth Brownholtz, Mikhil Masli, Elizabeth Daly, David R Millen
In Proceedings of the 2011 ACM conference on Human Factors in Computing (CHI '11), Vancouver, Canada. , pp. 247--256

Social Lens: Personalization Around User Defined Collections for Filtering Enterprise Message Streams
Elizabeth M. Daly, Michael Muller, Liang Gou, David R. Millen
In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM 2011), AAAI Publications

Data Augmentation as a Service for Single View Creation
Ullas Nambiar, Tanveer A. Faruquie, K. H. Prasad, L. V. Subramaniam, Mukesh K. Mohania.
IEEE International Conference on Services Computing (SCC), pp. 40--47, 2011

Scalable Proximity-Aware Cache Replication in Chip Multiprocessors (short paper)
Chongmin Li, Haixia Wang, Yibo Xue, Dongsheng Wang (Tsinghua University), Jian Li
The Twentieth International Conference on Parallel Architectures and Compilation Techniques (PACT) , 2011

Serving Information Needs In Business Process Consulting
M Gupta, D Mukherjee, S Mani, V S Sinha, S Sinha
Proceedings of the 9th International Conference on Business Process Management (BPM), pp. 231--247, Springer Link, 2011

System And Method For Collaborative Content Creation On The Telecom Web
Anupam Jain, Amit A Nanavati, Nitendra Rajput
Patent IN920100218US1

Relevance Feedback Exploiting Query-Specific Document Manifolds
Chang Wang, Emine Yilmaz, and Martin Szummer
The 20th ACM Conference on Information and Knowledge Management (CIKM2011)

A pattern discovery approach to retail fraud detection
Prasad Gabbur, Sharath Pankanti, Quanfu Fan, Hoang Trinh
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 307--315, 2011

Active online classification via information maximization
N Slonim, K Crammer, and E Yom-Tov
22nd International Joint Conference on Artificial Intelligence (IJCAI), 2011

Prognostic Data-Driven Clinical Decision Support - Formulation and Implications
R Rinott, B Carmeli, C Kent, D Landau, Y Maman, Y Rubin, and N Slonim
23rd International Conference of the European Federation for Medical Informatics (MIE), 2011

Fault localization in ABAP Programs
Diptikalyan Saha, Mangala Gowri Nanda, Pankaj Dhoolia, V Krishna Nandivada, Vibha Sinha, Satish Chandra
ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE) 2011, Technical Report RI11004, IBM, http://domino. research. ibm. com/library/cyberdig. nsf/index. html

Fast computation of functional networks from fMRI activity: a multi-platform comparison
A.R. Rao, R.Bordawekar, G.A. Cecchi
SPIE Conference on Medical Imaging, pp. 79624L, SPIE Press, 2011

TAPO: Thermal-Aware Power Optimization Techniques for Servers and Data Centers (Best Paper Award)
Wei Huang, Malcolm Allen-Ware, John Carter, Elmootazbellah Elnozahy, Hendrik Hamann, Tom Keller, Charles Lefurgy, Jian Li, Karthick Rajamani and Juan Rubio
Second International Green Computing Conference (IGCC'11), 2011

Managing Business Health in the Presence of Malicious Attacks
S A Zonouz, A Sharma, H V Ramasamy, Z T Kalbarczyk, B Pfitzmann, K McAuliffe, R K Iyer, W H Sanders, E Cope
Workshop on Recent Advances in Intrusion Tolerant Systems (WRAITS), To appear in Supplemental Proceedings of the IEEE Conference on Dependable Systems and Networks (DSN-2011)

A standard based approach for biomedical knowledge representation
Ariel Farkash, Hani Neuvirth, Yaara Goldschmidt, Costanza Conti, Federica Rizzi, Stefano Bianchi, Erika Salvi, Daniele Cusi, Amnon Shabo
Stud. Health Technol. Inform169, 689--693, 2011


Design and Analysis of Value Creation Networks
S Kameshwaran, S Mehta, V Pandit
AAAI, pp. 1551 - 1554, 2011

Cost-aware caching schemes in heterogeneous storage systems
Abhirup Chakraborty, Ajit Singh
Journal of Supercomputing 56(1), 56-78, 2011

Method and apparatus for determining decision points for streaming conversational data
(United States: US7904399) Ganesh Ramakrishnan, L V Subramaniam, Tanveer Faruquie
US Patent 7,904,399

Where is the Crowd?: Crowdedness Detection Scheme for Mobile CrowdSensing Applications
Desheng Zhang, Tian He, Fan Ye, Raghu Ganti, and Hui Lei
POSTER: IEEE Infocom 2011

Data Cleansing Techniques for Large Enterprise Datasets
K. H. Prasad, T. A. Faruquie, S. Joshi, S. Chaturvedi, L. V. Subramaniam, M. K. Mohania
SRII Global Conference, pp. 135--144, 2011

Optimal Training Data Selection for Rule-Based Cleansing Models
S. Chaturvedi, Tanveer A. Faruquie, L. Venkata Subramaniam, K. Hima Prasad, G. Venkatachaliah, S. Padmanabhan
SRII Global Conference, 2011

A Business Centric End-to-End Monitoring Approach for Service Composites
Geetika T. Lakshmanan, Paul T. Keyser, Aleksander Slominski, Francisco Curbera
IEEE Services Computing Conference (SCC'11), 2011


Discovering Event Correlation Rules for Semi-Structured Business Processes
Szabolcs Rozsnyai, Aleksander Slominski and Geetika Lakshmanan
Distributed Event Based Systems Conference (DEBS), pp. 75--86, 2011

Guest Editors' Introduction: Provenance in Web Applications
G T Lakshmanan, F Curbera, J Freire, A Sheth
Internet Computing, IEEE 15(1), 17--21, IEEE, 2011

Assessing Pooled BAC and Whole Genome Shotgun Strategies for Assembly of Complex Genomes
Niina Haiminen, F Alex Feltus, Laxmi Parida
BMC Genomics 12(194), 194, BioMed Central Ltd, 2011

Genomic regions tools for high-throughput analytics in genomics
A. Tsirigos, N. Haiminen, E. Bilal, F. Utro
IBM Research Report RC25125, 2011

Heterogeneous Domain Adaptation using Manifold Alignment
Chang Wang and Sridhar Mahadevan
The 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011)

Jointly Learning Data-Dependent Label and Locality-Preserving Projections
Chang Wang and Sridhar Mahadevan
The 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011)

Enterprise blogging in a global context: comparing Chinese and American practices
Qinying Liao, Shimei Pan, Jennifer C Lai, Chang Yang
Proceedings of the ACM 2011 conference on Computer supported cooperative work, pp. 35--44, ACM

Identifying Refactoring Opportunities in Process Model Repositories
Remco Dijkman, Beat Gfeller, Jochen Kuester, Hagen Voelzer
Information and Software Technology 53(9), 937-948, Elsevier , 2011

A minimal descriptor of an ancestral recombinations graph
L Parida, P Palamara, A Javed
BMC Bioinformatics 12(Suppl 1), S6, BioMed Central Ltd, 2011

Entering the circle of trust: Developer initiation as committers in open-source projects
V S Sinha, S Mani, S Sinha
Proceedings of the 8th Working Conference on Mining Software Repositories (MSR), pp. 133--142, ACM, 2011
Abstract

Using MATCON to generate CASE tools that guide deployment of pre-packaged applications
Elad Fein, Natalia Razinkov, Shlomit Shachor, Pietro Mazzoleni, Sweefen Goh, Richard Goodwin, Manisha Bhand, Shyh-Kwei Chen, Juhnyoung Lee, Vibha Singhal Sinha, others
Proceedings of the 33rd International Conference on Software Engineering, pp. 1016--1018, ACM, 2011
Abstract

Affinity Driven Distributed Scheduling Algorithm for Parallel Computations
A Narang, A Srivastava, N P Kumar, R K Shyamasundar
Distributed Computing and Networking: 12th International Conference, Icdcn 2011, Bangalore, India, January 2-5, 2011, Proceedings, pp. 167, Springer

Performance driven distributed scheduling of parallel hybrid computations
A Narang, R K Shyamasundar
Theoretical Computer Science 412(32), 4212--4225, Elsevier Science Publishers Ltd., 2011

High throughput data redundancy removal algorithm with scalable performance
Souvik Bhattacherjee, Ankur Narang, Vikas K Garg
Proceedings of the 6th International Conference on High Performance and Embedded Architectures and Compilers, pp. 87--96, ACM, 2011
Abstract

SLA-driven Applicability Analysis for Patch Management
Yang, B, Ayachitula, N., Zeng, S., Puri, R.,
Integrated Network Management, 2011

A performance study of an intelligent headlight control system
Ying Li, Sharath Pankanti
Applications of Computer Vision (WACV), 2011 IEEE Workshop on, pp. 440--447

Information at your fingertips: Contextual IR in enterprise email
J Lu, S Pan, J Lai, Z Wen
16th international conference on Intelligent User Interfaces (IUI), pp. 205--214, 2011

Multi-Channel Wireless Network-on-Chip: A New Approach to Improving On-Chip Communication Capacity
Dan Zhao, Yi Wang, Jian Li and Takamaro Kikkawa
International Symposium on Networks-on-Chip (NOCS), 2011

DEMO: DustDoctor: A Self-healing Sensor Data Collection System
Mohammad Maifi Hasan Khan, Hossein Ahmadi, Gulustan Dogan, Kannan Govindan, Raghu K. Ganti, Theodore Brown, Jiawei Han, Prasant Mohapatra, and Tarek Abdelzaher
The 10th International Conference on Information Processing in Sensor Networks (IPSN), 2011

The Sparse Regression Cube: A Reliable Modeling Technique for Open Cyber-physical Systems
Hossein Ahmadi, Tarek Abdelzaher, Jiawei Han, Nam Pham, and Raghu K. Ganti
ACM/IEEE Second International Conference on Cyber-Physical Systems, 2011


Use of Schema Associative Mapping for synchronization of the Virtual Machine Audit Logs
Sean Thorpe, Indrajit Ray, Tyrone Grandison
The 4th Intl Conference on Computational Intelligence in Security for Information Systems, 2011

Enforcing Data Quality Rules for a Synchronized VM Log Audit Environment using Transformation Mapping Techniques
Sean Thorpe, Indrajit Ray, Tyrone Grandison
The 4th Intl Conference on Computational Intelligence in Security for Information Systems, 2011

Apples and oranges: A comparison of RDF benchmarks and real RDF datasets
Songyun Duan, Anastasios Kementsietsidis, Kavitha Srinivas, Octavian Udrea
Proceedings of the 2011 international conference on Management of data (SIGMOD), pp. 145--156, ACM
Abstract

Helix: Online Enterprise Data Analytics
Oktie Hassanzadeh, Songyun Duan, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J Ward
WWW (Companion Volume), pp. 225-228, ACM, 2011
Abstract

PCTA: Privacy-constrained Clustering-based Transaction Data Anonymization
A. Gkoulalas-Divanis, G. Loukides
4th International Workshop on Privacy and Anonymity in the Information Society, pp. 5, ACM, 2011

Algorithm for Predicting Outcomes at Any Stage of a Document Content Driven Business Process Instance Execution
with G. Lakshmanan, D. Shamsi, R. Khalaf, P. Keyser

System and method for annotating schema elements based on associating data instances with knowledge base entities
with M. Ward, A. Fokoue, O. Hassanzadeh, A. Kementsietsidis, K. Srinivas

A Method and Apparatus for the Automatic and Dynamic Determination of Aggregation Hierarchies in Search Queries
with A. Kementsietsidis, K. Srinivas, M. Ward, A. Fokoue, O. Hassanzadeh

Method and Apparatus for Creating Benchmark Graph Data
with A. Kementsietsidis, K. Srinivas, O. Udrea

Rewriting queries on SPARQL views
Wangchao Le, Songyun Duan, Anastasios Kementsietsidis, Feifei Li, Min Wang
WWW, pp. 655-664, ACM, 2011
Abstract

Predicting completion times of batch query workloads using interaction-aware models and simulation
Mumtaz Ahmad, Songyun Duan, Ashraf Aboulnaga, Shivnath Babu
14th International Conference on Extending Database Technology (EDBT), 2011

Towards Efficient Resource Management for Data-Analytic Platforms
Claris Castillo, Mike Spreitzer, Malgorzata Steinder
IFIP/IEEE International Symposium on Integrated Network Management (IM), IEEE, 2011

Speeding up the Consensus Clustering methodology for microarray data analysis
R Giancarlo, F Utro
Algorithms for Molecular Biology 6(1), 1, BioMed Central Ltd, 2011

Enabling Security Uniformly Across Cloud Systems
Sean Thorpe, Indrajit Ray, Tyrone Grandison
ACM ASPLOS (Architectural Support for Programming Languages and Operating Systems) RESOLVE (Runtime Environments/Systems, Layering, and Virtualized Environments), ACM Press, 2011

Categorical Decision Making by People, Committees, and Crowds
Lav R. Varshney, Joong Bum Rhim, Kush R. Varshney, Vivek K. Goyal
Information Theory and its Applications Workshop, 2011

System And Method For Making User Generated Audio Content On The Spoken Web Navigable By Community Tagging
Sheetal K Agarwal, Anupam Jain, Arun Kumar, Amit A Nanavati, Nitendra Rajput
Patent IN920100223US1

Detecting human activities in retail surveillance using hierarchical finite state machine
Hoang Trinh, Quanfu Fan, Jiyan Pan, Prasad Gabbur, Sachiko Miyazawa, Sharath Pankanti
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 1337--1340, ICASSP

TapBack: Towards Richer Mobile Interfaces in Impoverished Contexts
Simon Robinson, Nitendra Rajput, Matt Jones, Anupam Jain, Shrey Sahay, Amit Nanavati
CHI, 2011

Using Standards to enable the transformation to Smarter Cities
J Hogan, J Meegan, R Parmar, V Narayan, R Schloss
IBM Journal of Research and Development 55(1), 4:1-4:10, IBM, 2011

Automatic Classification of Change Requests for Improved IT Service Quality
C Kadar, D Wiesmann, J Iria, D Husemann, M Lucic
Proceedings of the SRII Global Conference 2011, pp. 430--439

Domain Adaptation for Text Categorization by Feature Labeling
C Kadar, J Iria
Proceedings of the 33rd European Conference on Information Retrieval (ECIR'11) (runner up for best paper award), 2011

Data Kindness on the Internet
Christan Grant, Tyrone Grandison, Kun Liu
Richard Tapia Celebration of Diversity in Computing Conference., 2011

A Global Virtual Machine Attribute Access Control Policy for Auditing Federated Digital Identities within a Compute Cloud
Sean Thorpe, Indrajit Ray, Indrakshi Ray, Tyrone Grandison, Abbie Barbir
International Journal of Information Assurance and Security (JIAS)6, 2011

A Distributed Algorithm for Finding All Best Swap Edges of a Minimum Diameter Spanning Tree
B. Gfeller, N. Santoro, P. Widmayer
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING 8(1), 1--12, 2011

Large-scale vehicle detection in challenging urban surveillance environments
Rogerio Feris, James Petterson, Behjat Siddiquie, Lisa Brown, Sharath Pankanti
IEEE Workshop on Applications of Computer Vision (WACV), pp. 527--533, 2011

Soft margin keyframe comparison: Enhancing precision of fraud detection in retail surveillance
Jiyan Pan, Quanfu Fan, Sharath Pankanti, Hoang Trinh, Prasad Gabbur, Sachiko Miyazawa
Applications of Computer Vision (WACV), 2011 IEEE Workshop on, pp. 549--556, WACV

Power Shifting in Thrifty Interconnection Networks
Jian Li, Wei Huang, Lixin Zhang, Charles Lefurgy, Wolfgang Denzel, Richard Treumann and Kun Wang
To appear in International Symposiun on High Performance Computer Architecture (HPCA), 2011

COAT: COnstraint-based Anonymization of Transactions
G Loukides, A Gkoulalas-Divanis, B Malin
Knowledge and Information Systems: Special Issue on Context-Aware Data Mining 28(2), 251--282, Springer, 2011

A systematic framework for the analysis of movement
Gennady Andrienko, Natalia Andrienko, Peter Bak, Daniel A Keim, Slava Kisilevich, Stefan Wrobel
Journal of Visual Languages and Computing (JVLC), 2011
(in review)

Linearized Motion Estimation for Articulated Planes
A Datta, Y Sheikh, T Kanade
Pattern Analysis and Machine Intelligence, 2011

Automatic Boosting of Cross-Product Coverage Using Bayesian Networks
Dorit Baras, Shai Fine, Laurent Fournier, Dan Geiger, Avi Ziv
International Journal on Software Tools for Technology Transfer (STTT) 13(3), 247--261, Springer, 2011


Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Z Yin, L Cao, J Han, J Luo, T Huang
SIAM Conference on Data Mining (SDM), 2011

Citation recommendation without author supervision
Q He, D Kifer, J Pei, P Mitra, C L Giles
Proceedings of the fourth ACM international conference on Web search and data mining (WSDM), pp. 755--764, 2011

OOLAM: an opinion oriented link analysis model for influence persona discovery
Keke Cai, Shenghua Bao, Zi Yang, Jie Tang, Rui Ma, Li Zhang, Zhong Su
Proceedings of the Fourth International Conference on Web Search and Web Data Mining, WSDM, pp. 645-654, 2011

Privacy and Security Issues in Data Mining and Machine Learning: International ECML/PKDD Workshop, PSDML 2010, Barcelona, Spain, September 24, 2010. Revised Selected Papers
C. Dimitrakakis, A. Gkoulalas-Divanis, A. Mitrokotsa, V.S. Verykios, Y. Saygin
Divanis, A Mitrokotsa... - 2011 - books.google.com, Springer

Toward personalized care management of patients at risk: the diabetes case study
Hani Neuvirth, Michal Ozery-Flato, Jianying Hu, Jonathan Laserson, Martin S Kohn, Shahram Ebadollahi, Michal Rosen-Zvi
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 395--403, 2011

Smarter log analysis
Ehud Aharoni, Shai Fine, Yaara Goldschmidt, Ofer Lavi, Oded Margalit, Michal Rosen-Zvi, Lavi Shpigelman
IBM Journal of Research and Development 55(5)(10), 2011

Smarter log analysis
E Aharoni, S Fine, Y Goldschmidt, O Lavi, O Margalit, M Rosen-Zvi, L Shpigelman
IBM Journal of Research and Development 55(5), 10:1 - 10:10 , IBM, 2011

Detect irregularly shaped spatio-temporal clusters for decision support
Weishan Dong, Xin Zhang, Zhongbo Jiang, Wei Sun, Lexing Xie, Arun Hampapur
IEEE International Conference on Service Operations, Logistics, and Informatics (SOLI), Best Conference Paper Award, pp. 231--236, 2011
Abstract   Best Conference Paper Award

A general service-based spatio-temporal data analysis framework
J. Xu, W. Wang, W. Dong, L. Li, Z. Jiang
Energy Procedia13, 447--456, Elsevier, 2011


2010

Gene Expression Classification with a novel coevolutionary based learning classifier system on Public Clouds
C Vecchiola, M Abedini, M Kirley, X Chu, R Buyya
e-Science Workshops, 2010 Sixth IEEE International Conference on, pp. 92--97

A multiple population XCS: Evolving condition-action rules based on feature space partitions
M Abedini, M Kirleymkirley
Evolutionary Computation (CEC), 2010 IEEE Congress on, pp. 1--8

Sequential Projection Learning for Hashing with Compact Codes,
J. Wang, S. Kumar and S.-F. Chang
The 27th International Conference on Machine Learning, 2010

CROWD-SOURCING FOR GAP FILLING IN SOCIAL NETWORKS
Ohad Greenshpan, Ido Guy, Michal Jacovi, Itai Turbahn
US Patent App. 12/794,791

INCREMENTAL STATIC ANALYSIS
Daniel Kalman, Marco Pistoia, Guy Podjarny, Omer Tripp, Omri Weisman
US Patent App. 12/873,219

INJECTION CONTEXT BASED STATIC ANALYSIS OF COMPUTER SOFTWARE APPLICATIONS
Yinnon A Haviv, Roee Hay, Marco Pistoia, Ory Segal, Adi Sharabani, Takaaki Tateishi, Omer Tripp, Omri Weisman
US Patent App. 12/825,293

Enforcement Of Data Privacy To Maintain Obfuscation Of Certain Data
Michael G Burke, Igor Peshansky, Marco Pistoia, Omer Tripp
US Patent App. 12/776,465



A Cluster-Level Semi-supervision Model for Interactive Clustering
A Dubey, I Bhattacharya, S Godbole
ECMLPKDD , pp. 409--424, Springer, 2010

Predicting customer churn in mobile networks through analysis of social groups
Y Richter, N Slonim, and E Yom-Tov
SIAM International Conference on Data Mining (SDM), 2010

Twitterrank: finding topic-sensitive influential twitterers
J Weng, E P Lim, J Jiang, Q He
Proceedings of the third ACM international conference on Web search and data mining (WSDM), pp. 261--270, 2010

Author-Topic Models for Recommendation Tasks
Ya'ara Goldschmidt, Iris Eiron, Benjamin Sznajder, Michal Rosen-Zvi
Workshop on Topic Models, ICML, 2010

Prediction of response to antiretroviral therapy by human experts and by the EuResist data-driven expert system (the EVE study)
M. Zazzi, R. Kaiser, A. Sonnerborg, D. Struck, A. Altmann, M. Prosperi, M. Rosen-Zvi, A. Petroczi, Y. Peres, E. Schulter, C. Boucher, F. Brun-Vezinet, R. Harigan, L. Morris, M. Obermeier, C. F. Perno, R. Shafer, A. Vandamme, K. van Laethem, A. Wensing, T.
HIV Medicine pp. 1468-1293, 2010

Learning author-topic models from text corpora
M Rosen-Zvi, C Chemudugunta, T Griffiths, P Smyth, M Steyvers
ACM Transactions on Information Systems (TOIS) 28(1), 1--38, ACM, 2010

Improving Accessibility of Transaction-centric Web Objects.
M A Islam, F Ahmed, Y Borodin, J Mahmud, I. V. Ramakrishnan
SIAM International Conference on Data Mining (SDM), 2010

Towards a rigorous assessment of systems biology models: the DREAM3 challenges.
Robert J Prill, Daniel Marbach, Julio Saez-Rodriguez, Peter K Sorger, Leonidas G Alexopoulos, Xiaowei Xue, Neil D Clarke, Gregoire Altan-Bonnet, Gustavo Stolovitzky
PLoS One 5(2), e9202, 2010
Abstract

Revealing strengths and weaknesses of methods for gene network inference.
Daniel Marbach, Robert J Prill, Thomas Schaffter, Claudio Mattiussi, Dario Floreano, Gustavo Stolovitzky
Proc Natl Acad Sci U S A 107(14), 6286--6291, 2010
Abstract


2009

CoXCS: a coevolutionary learning classifier based on feature space partitioning
M Abedini, M Kirley
AI 2009: Advances in Artificial Intelligence, 360--369, Springer

Graph Construction and b-Matching for Semi-Supervised Learning
T. Jebara, J. Wang, and S.-F. Chang
The 26th International Conference on Machine Learning, 2009

SYSTEM AND METHOD FOR STATIC DETECTION AND CATEGORIZATION OF INFORMATION-FLOW DOWNGRADERS
Yinnon Haviv, Roee Hay, Marco Pistoia, Guy Podjarny, Adi Sharabani, Takaaki Tateishi, Omer Tripp, Omri Weisman
US Patent App. 12/575,647

SYSTEMS AND METHODS FOR ORGANIZING DOCUMENTED PROCESSES
B. Srivastava, D. Mukherjee
US Patent App. 12/608,435

Parallel Pairwise Clustering
E Yom-Tov, N Slonim
SIAM International Conference on Data Mining (SDM), 2009

Proximity-Based Anomaly Detection using Sparse Structure Learning
\bf Tsuyoshi Id\'e, Aurelie C. Lozano, Naoki Abe, Yan Liu
Proceedings of 2009 SIAM International Conference on Data Mining (SDM 09), pp. 97--108

Investigation of expert rule bases, logistic regression, and non-linear machine learning techniques for predicting response to antiretroviral treatment
M C F Prosperi, A Altmann, M Rosen-Zvi, E Aharoni, G Borgulya, F Bazso, A S{`a}nnerborg, E Schulter, D Struck, G Ulivi, others
Antiviral therapy 14(3), 433--442, International Medical Press, 2009

Extending Task Parallelism For Frequent Pattern Mining
P Kambadur, A Ghoting, A Gupta, A Lumsdaine
Proceedings of the International Conference on Parallel Computing (ParCO), 2009

The Parallel Machine Learning (PML) Framework and the Transform Regression Algorithm
S Asur, A Ghoting, R Natarajan, E Pednault
2009 - domino.watson.ibm.com

Lessons from the DREAM2 Challenges.
Gustavo Stolovitzky, Robert J Prill, Andrea Califano
Ann N Y Acad Sci1158, 159--195, 2009
Abstract


2008

Graph Transduction via Alternating Minimization
J. Wang, T. Jebara, and S.-F. Chang
The 25th International Conference on Machine Learning, 2008

Beyond basic faceted search
O. Ben-Yitzhak, N. Golbandi, N. Har'El, R. Lempel, A. Neumann, S. Ofek-Koifman, D. Sheinwald, E. Shekita, B. Sznajder, S. Yogev
WSDM '08: Proceedings of the international conference on Web search and web data mining, ACM, NY, 2008

Improved Prediction of HIV Resistance In-Vitro by Biochemically-Driven Models
H Neuvirth, M Rosen-Zvi, N Srebro, E Aharoni, M Zazzi, N Tishby
Zvi, N Srebro, E Aharoni, M Zazzi, ... - 2008 - Citeseer, Citeseer

The euresist approach for predicting response to anti hiv-1 therapy
A Altmann, M Rosen-Zvi, M Prosperi, E Aharoni, H Neuvirth, E Sch{"u}lter, J B{"u}ch, Y Peres, F Incardona, A S{"o}nnerborg, others
The 6th European HIV Drug Resistance Workshop, Cascais, Portugal, 2008


Consistent dimensionality reduction scheme and its application to clinical HIV data
M Rosen-Zvi, H Neuvirth, E Aharoni, M Zazzi
In other words34, 65879, Citeseer, 2008

Comparison of classifier fusion methods for predicting response to anti HIV-1 therapy
A Altmann, M Rosen-Zvi, M Prosperi, E Aharoni, H Neuvirth, E Sch{"u}lter, J B{"u}ch, D Struck, Y Peres, F Incardona, others
PLoS One 3(10), 3470, PLoS, 2008

Selecting anti-HIV therapies based on a variety of genomic and clinical factors
M Rosen-Zvi, A Altmann, M Prosperi, E Aharoni, H Neuvirth, A Sonnerborg, E Schulter, D Struck, Y Peres, F Incardona, others
Bioinformatics 24(13), i399, Oxford Univ Press, 2008

Fast mining of distance-based outliers in high-dimensional datasets
A Ghoting, S Parthasarathy, M E Otey
Data Mining and Knowledge Discovery 16(3), 349--364, Springer, 2008

Architecture Conscious Data Mining: Current Progress and Future Outlook
S Parthasarathy, S Tatikonda, G Buehrer, A Ghoting
Next Generation of Data Mining, 2008


2007

Computing Statistical Profiles of Active Sites in Proteins
Chang Zhao, Jalal Mahmud, IV Ramakrishnan, S Swaminathan
SDM , SIAM, 2007

Temporal causal modeling with graphical granger methods
A Arnold, Y Liu, N Abe
Proceedings of the 13th ACM SIGKDD international …, 2007 - portal.acm.org

Bursty feature representation for clustering text streams
Q He, K Chang, E P Lim, J Zhang
Proceedings of the SIAM International Conference on Data Mining (SDM), pp. 26--28, 2007

EuResist: From Research to Practice-Clinical Genomics of HIV
Y Peres, C Kent, M Rosen-Zvi, E Aharoni, H Neuvirth-Telem, A Altmann, T Lengauer, R Kaiser
Zvi, E Aharoni, H Neuvirth-Telem... - 2007 - getcited.org

Method and Computer Program Product for Wafer Manufacturing Process Abnormalities Detection
M Rosen-Zvi, J W Wong, Y Xu, E Yom-Tov
Zvi, JW Wong, Y Xu, E Yom- ... - US Patent App. 11/ ..., 2007 - Google Patents, Google Patents
US Patent App. 11/946,064

Cache-conscious frequent pattern mining on modern and emerging processors
Amol Ghoting, Gregory Buehrer, Srinivasan Parthasarathy, Daehyun Kim, Anthony Nguyen, Yen-Kuang Chen, Pradeep Dubey
The VLDB Journal16, 77--96, Springer-Verlag New York, Inc., 2007
Abstract

A survey of distributed mining of data streams
S Parthasarathy, A Ghoting, M E Otey
Data Streams, 289--307, Springer, 2007

An Introduction to the IBM Parallel Mining Toolkit
E. Yom-Tov, U. Aharoni, A. Ghoting, E. Pednault, D. Pelleg, H. Toledano, R. Natarajan
IBM developerWorks, 2007

Knowledge and Cache Conscious Algorithm Design and Systems Support for Data Mining Algorithms
A Ghoting, G Buehrer, M Goyder, S Tatikonda, X Zhang, S Parthasarathy, T Kurc, J Saltz
Parallel and Distributed Processing Symposium, 2007, pp. 1--6


2006

Why does Subsequence Time-Series Clustering Produce Sine Waves?
\bf Tsuyoshi Id\'e
Proceedings of the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 06), pp. 311-322, 2006

Consistent dimensionality reduction scheme and its application to clinical HIV data
M Rosen-Zvi, H Neuvirth, E Aharoni, M Zazzi, N Tishby
NIPS 2006 workshop, Novel Applications of Dimensionality Reduction

Fast distributed outlier detection in mixed-attribute data sets
M E Otey, A Ghoting, S Parthasarathy
Data mining and knowledge discovery 12(2), 203--228, Springer, 2006

Out-of-core frequent pattern mining on a commodity PC
Gregory Buehrer, Srinivasan Parthasarathy, Amol Ghoting
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 86--95, ACM, 2006
Abstract

Knowledge-conscious data clustering
A Ghoting, S Parthasarathy
Knowledge Discovery in Databases: PKDD 2006, 511--519, Springer

Efficient pattern mining on shared memory systems: implications for chip multiprocessor architectures
Gregory Buehrer, Yen-Kuang Chen, Srinivasan Parthasarathy, Anthony Nguyen, Amol Ghoting, Daehyun Kim
Proceedings of the 2006 workshop on Memory system performance and correctness, pp. 31--40, ACM
Abstract

I/O conscious algorithm design and systems support for data analysis on emerging architectures
G Buehrer, A Ghoting, X Zhang, S Tatikonda, S Parthasarathy, T Kurc, J Saltz
Parallel and Distributed Processing Symposium, 2006, pp. 8--pp


2005

Knowledge Discovery from Heterogeneous Dynamic Systems using Change-Point Correlations
\bf Tsuyoshi Id\'e, Keisuke Inoue
Proceedings of 2005 SIAM International Conference on Data Mining (SDM 05), pp. 571-575

Cache-conscious frequent pattern mining on a modern processor
Amol Ghoting, Gregory Buehrer, Srinivasan Parthasarathy, Daehyun Kim, Anthony Nguyen, Yen-Kuang Chen, Pradeep Dubey
Proceedings of the 31st international conference on Very large data bases, pp. 577--588, VLDB Endowment, 2005
Abstract

Loaded: Link-based outlier and anomaly detection in evolving data sets
A Ghoting, M E Otey, S Parthasarathy
Data Mining, 2004, pp. 387--390, 2005

A characterization of data mining algorithms on a modern processor
Amol Ghoting, Gregory Buehrer, Srinivasan Parthasarathy, Daehyun Kim, Anthony Nguyen, Yen-Kuang Chen, Pradeep Dubey
Proceedings of the 1st international workshop on Data management on new hardware, ACM, 2005

A services oriented framework for next generation data analysis centers
Huai Wang, Amol Ghoting, Gregory Buehrer, Shirish Tatikonda, Srinivasan Parthasarathy, T Kurc, J Saltz
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International, pp. 8--pp

An Empirical Comparison of Outlier Detection Algorithms
M E Otey, S Parthasarathy, A Ghoting
Data Mining Methods for Anomaly Detection, 45, Citeseer, 2005


2004

Eigenspace-based Anomaly Detection in Computer Systems
\bf Tsuyoshi Id\'e, Hisashi Kashima
Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '04), pp. 440-449, 2004

Probabilistic author-topic models for information discovery
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, Thomas L. Griffiths
KDD 2004, pp. 306-315

Facilitating interactive distributed data stream processing and mining
A Ghoting, S Parthasarathy
Parallel and Distributed Processing Symposium, 2004, pp. 86


2003

Multilayer Neural Networks with Extensively Many Hidden Units
A Engel, I Kanter, M Rosen-Zvi
Physical Review Letters 87(07), 78101, APS, 2003

Towards NIC-based intrusion detection
O P Ghoting, M Otey, S Parthasarathy, A Ghoting, G Li, S Narravula
In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 723--728, ACM, 2003


2001

Understanding of User Behavior in Immersive Environments
C Shahabi, L Kaghazian, S Mehta, A Ghoting, G Shanbhag, M L McLaughlin
Touch in Virtual Environments: Haptics and the Design of Interactive Systems, 2001

Analysis of haptic data for sign language recognition
C Shahabi, L Kaghazian, S Mehta, A Ghoting, G Shanbhag, M McLaughlin
Universal access in HCI: Towards and information society for all. Vol. 3. Proceedings, 441, Lawrence Erlbaum Associates, 2001


2000

On-line learning in the Ising perceptron
M Rosen-Zvi
Journal of Physics A: Mathematical and General33, 7277--7287, Institute of Physics Publishing, 2000