Data Management (discontinued)       

links

Data Management (discontinued) Publications



2015

Schema Management for Document Stores
Lanjun Wang, Oktie Hassanzadeh, Shuo Zhang, Juwei Shi
PVLDB 8(9), 922--933, 2015

Seamlessly integrating disk and tape in a multi-tiered distributed file system.
Ioannis Koltsidas, Slavisa Sarafijanovic, Martin Petermann, Nils Haustein, Harald Seipp, Robert Haas, Jens Jelitto, Thomas Weigold, Edwin R. Childers, David Pease, Evangelos Eleftheriou
31st IEEE International Conference on Data Engineering (ICDE 2015)

D^2WORM: A Management Infrastructure for Distributed Data-centric Workflows.
Martin Jergler, Mohammad Sadoghi, Hans-Arno Jacobsen
Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2015, Melbourne, Australia, May 31-4, 2015

The FQP Vision: Flexible Query Processing on a Reconfigurable Computing Fabric
Mohammadreza Najafi, Mohammad Sadoghi, Hans-Arno Jacobsen
SIGMOD Record - Special Issue on Visionary Ideas in Data Management, 2015

General Incremental Sliding-Window Aggregation
K. Tangwongsan, M. Hirzel, S. Schneider and K.-L. Wu
PVLDB 8(7), 702-713, 2015


2014

CaSSanDra: An SSD boosted key-value store
Prashanth Menon, Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen
IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014, pp. 1162--1167

Adaptive parallel compressed event matching
Mohammad Sadoghi, Hans-Arno Jacobsen
IEEE 30th International Conference on Data Engineering, Chicago, ICDE 2014, IL, USA, March 31 - April 4, 2014, pp. 364--375

IQR: An Interactive Query Relaxation System for the Empty-Answer Problem
Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis
ACM SIG International conference on Management of Data / Principles of Database Systems (SIGMOD), 2014

Dynamically optimizing queries over large scale data platforms
Konstantinos Karanasos, Andrey Balmin, Marcel Kutsch, Fatma Ozcan, Vuk Ercegovac, Chunyang Xia, Jesse Jackson
ACM SIGMOD, 2014

Cleaning inconsistencies in information extraction via prioritized repairs
Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 164--175, 2014

Exploring Big Data with Helix: Finding Needles in a Big Haystack
Jason B. Ellis, Achille Fokoue, Oktie Hassanzadeh, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J. Ward
SIGMOD Record 43(4), 43--54, 2014

Towards Building Wind Tunnels for Data Center Design
A. Floratou, F. Bertsch, J. M. Patel, G. Laskaris
PVLDB 7(9), 2014

MRTuner: A Toolkit to Enable Holistic Optimization for MapReduce Jobs
Juwei Shi, Jia Zou, Jiaheng Lu, Zhao Cao, Shiqiang Li, Chen Wang
Proceedings of the VLDB Endowment 7(13), 1319-1330, 2014

SQL-on-Hadoop:Full Circle Back to Shared-Nothing Database Architectures
Avrilia Floratou, Umar Farooq Minhas, Fatma Ozcan
PVLDB, 2014

SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures
Avrilia Floratou, Umar Farooq Minhas, and Fatma Ozcan
Proceedings of the VLDB Endowment (PVLDB) 7(12), 2014


VLDB 2014 Ph.D. Workshop - An Overview.
Yunyao Li, Erich J. Neuhold
Proceedings of the VLDB Endowment 7(13), 2014

Reducing Database Locking Contention Through Multi-version Concurrency
Mohammad Sadoghi, Mustafa Canim, Bishwaranjan Bhattacharjee, Fabian Nagel, Kenneth A. Ross
PVLDB 7(13), 1331--1342, 2014


Specifying ubiquitous systems through the Algebra of Contextualized Ontologies
Isabel Cafezeiro, Jos\'e Viterbo Filho, Alexandre Rademaker, Edward Hermann Haeusler, Markus Endler
Knowledge Engineering Review 29(2), 171--185, 2014
Abstract

Faster Set Intersection with SIMD instructions by Reducing Branch Mispredictions
Hiroshi Inoue, Moriyoshi Ohara, Kenjiro Taura
Proceedings of the VLDB Endowment 8(3), 2014


2013

Automating Pattern Discovery for Rule Based Data Standardization Systems
Snigdha Chaturvedi, K. Hima Prasad, Tanveer A Faruquie, Bhupesh Chawda, L. V. Subramaniam, Raghuram Krishnapuram
IEEE International Conference on Data Engineering (ICDE), pp. 1231--1241, 2013

Ferrari: Flexible and efficient reachability range assignment for graph indexing
Stephan Seufert, Avishek Anand, Srikanta Bedathur, Gerhard Weikum
Data Engineering (ICDE), 2013 IEEE 29th International Conference on, pp. 1009--1020

Identifying hot and cold data in main-memory databases
Justin J Levandoski, P-A Larson, Radu Stoica
Data Engineering (ICDE), 2013 IEEE 29th International Conference on, pp. 26--37


STEM: A Spatio-temporal Miner for Bursty Activity
Theodoros Lappas, Marcos R Vieira, Dimitrios Gunopulos, Vassilis J Tsotras
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 1021--1024

High-Performance Holistic XML Twig Filtering Using GPUs
Ildar Absalyamov, Roger Moussalli, Vassilis Tsotras, Walid Najjar
ADMS, In Conjunction With VLDB, 2013

Modeling the execution semantics of stream processing engines with SECRET
N Dindar, N Tatbul, RJ Miller, LM Haas, I Botan
VLDB Journal 22(4), 421-446, 2013

Streaming Algorithms for k-core Decomposition
A. E. Sariyuce, B. Gedik, G. Jacques-Silva, K.-L. Wu and U. V. Catalyurek
PVLDB 6(6), 2013

Counting and Sampling Triangles from a Graph Stream
A. Pavan, K. Tangwongsan, S. Tirthapura, K.-L. Wu
PVLDB 6(14), 2013

ENVIROMETER: A Platform for Querying CommunitySensed Data
A. Oviedo, S. Sathe, D. Chakraborty, K. Aberer
International Conference on Very Large Data Bases (VLDB) [demo paper], 2013

DB2 with BLU Acceleration: So Much More than Just a Column Store
Vijayshankar Raman, Gopi K. Attaluri, Ronald Barber, Naresh Chainani, David Kalmuk, Vincent KulandaiSamy, Jens Leenstra, Sam Lightstone, Shaorong Liu, Guy M. Lohman, Tim Malkemus, Ren\'{e} M\"{u}ller, Ippokratis Pandis, Berni Schiefer, David Sharpe, Richa
PVLDB 6(11), 1080--1091, 2013

Next Generation Data Analytics at IBM Research
Oktie Hassanzadeh, Anastasios Kementsietsidis, Benny Kimelfeld, Rajasekar Krishnamurthy, Fatma Ozcan, Ippokratis Pandis
PVLDB 6(11), 1174-1175, VLDB Endowment, 2013

Toward Scalable Transaction Processing - Evolution of Shore-MT
Anastasia Ailamaki, Ryan Johnson, Ippokratis Pandis, Pinar Tozun
PVLDB6, 2013

DB2 with BLU Acceleration: So Much More than Just a Column Store
Vijayshankar Raman, Gopi Attaluri, Ronald Barber, Naresh Chainani, David Kalmuk, Vincent Kulandai Samy, Jens Leenstra, Sam Lightstone, Shaorong Liu, Guy M. Lohman, Tim Malkemus, Rene Mueller, Ippokratis Pandis, Berni Schiefer, David Sharpe, Richard Sidle,
VLDB 2013
Abstract

Making updates disk-I/O friendly using SSDs
Mohammad Sadoghi, Kenneth A Ross, Mustafa Canim, Bishwaranjan Bhattacharjee
Proceedings of the VLDB Endowment 6(11), 997--1008, VLDB Endowment, 2013

Flexible query processor on FPGAs
Mohammadreza Najafi, Mohammad Sadoghi, Hans-Arno Jacobsen
Proceedings of the VLDB Endowment 6(12), 1310--1313, VLDB Endowment, 2013

A probabilistic optimization framework for the empty-answer problem
Davide Mottin, Alice Marascu, Senjuti Basu Roy, Gautam Das, Themis Palpanas, Yannis Velegrakis
Proceedings of the VLDB Endowment 6(14), 1762--1773, VLDB Endowment, 2013

Next Generation Data Analytics at IBM Research
Oktie Hassanzadeh, Anastasios Kementsietsidis, Benny Kimelfeld, Rajasekar Krishnamurthy, Fatma Ozcan, Ippokratis Pandis
Proceedings of the VLDB Endowment (PVLDB) 6(11), 2013

Improving flash write performance by using update frequency
Radu Stoica, Anastasia Ailamaki
Proceedings of the VLDB Endowment 6(9), 733--744, VLDB Endowment, 2013

From “think like a vertex” to “think like a graph”
Yuanyuan Tian, Andrey Balmin, Severin Andreas Corsten, Shirish Tatikonda, John McPherson
Proceedings of the VLDB Endowment 7(3), Citeseer, 2013

Understanding Vertical Scalability of I/O Virtualization for MapReduce Workloads: Challenges and Opportunities
Bogdan Nicolae
BigDataCloud'13: 2nd Workshop on Big Data Management in Clouds, 2013

SYSTEM AND METHOD FOR DETERMINING AND OPTIMIZING RESOURCES OF A DATA PROCESSING SYSTEM UTILIZED BY A SERVICE REQUEST
F Bernardini, R N Chang, C-S Perng, K Gomadam, E C So, C Tang, T Tao, C Zhang
US Patent 8472330

Cloud Computing and Scientific Applications - Big Data, Scalable Analytics, and Beyond
Suraj Pandey, Surya Nepal
Future Generation Computer Systems 29(7), 1774-1776, 2013

Discovering linkage points over web data
Oktie Hassanzadeh, Ken Q Pu, Soheil Hassas Yeganeh, Ren\'ee J Miller, Lucian Popa, Mauricio A Hern\'andez, Howard Ho
Proceedings of the VLDB Endowment 6(6), 445--456, VLDB Endowment, 2013

Boosting object detection performance in crowded surveillance videos
Rogerio Feris, Ankur Datta, Sharath Pankanti, Ming-Ting Sun
Workshop on Applications of Computer Vision (WACV), pp. 427--432, 2013

Hosting A Voice Response System On A Mobile Phone
Vivek Sanghi, Anupam Jain
Patent US201313930427

Leveraging Collaborative Content Exchange for On-Demand VM Multi-Deployments in IaaS Clouds
Bogdan Nicolae, Mustafa Rafique
Euro-Par '13: 19th International Euro-Par Conference on Parallel Processing, pp. 305-316, 2013

Mining Discriminative Subgraphs from Global-State networks
Sayan Ranu, Minh Hoang, Ambuj Singh
SIGKDD, 2013

Evaluating Multivariate Visualizations as Multi-objective Decision Aids
Meirav Taieb-Maimon, Lior Limonad, David Amid, David Boaz, Ateret Anaby-Tavor
Human-Computer Interaction--INTERACT 2013, pp. 419--436, Springer

A statistical approach to mining customers' conversational data from social media
Konopnicki, D.Shmueli-Scheuer, M. ; Cohen, D. ; Sznajder, B. ; Herzig, J. ; Raviv, A. ; Zwerling, N., Roitman, H., Mass, Y.
IBM Journal of Research and Development, [User Modeling, Social Media], 2013

Processing archive content based on hierarchical classification levels
Karen W Brannon, Wenling Cai, Sangeeta T Doraiswamy, Ryan John Minniear, David A Pease, Mark Andrew Smith
US Patent 8,442,951

Understanding election candidate approval ratings using social media data
Danish Contractor, Tanveer Afzal Faruquie
Proceedings of the 22nd international conference on World Wide Web companion, pp. 189--190, 2013

BLACK-BOX PERFORMANCE CONTROL FOR HIGH-VOLUME THROUGHPUT-CENTRIC SYSTEMS
R N Chang, K Sonnenleiter, C Tang, S Tara, C Zhang
US Patent 8387059

METHOD AND SYSTEM FOR MANAGING SERVICE LEVELS PROVIDED BY SERVICE PROVIDERS
J M Berthaud, M Buco, R N Chang, J D Dalsky, S C Fang, L Z Luan, L Tsiao, C Ward
US Patent 8438117

Cloud Analytics for Capacity Planning and Instant VM Provisioning
Yexi Jiang, Chang-Shing Perng, Tao Li, and Rong Chang
IEEE Transactions on Network and Service Management 10(3), IEEE, 2013

Dynamic Sharing of GPUs in Cloud Systems
Khaled Diab, M. Mustafa Rafique, Mohamed Hefeeda
High-Performance Grid and Cloud Computing Workshop, IEEE, 2013

Modeling the uniqueness of user preferences for recommendation systems
Haggai Roitman, David Carmel, Yosi Mass, Iris Eiron
ACM Special Interest Group on Information Retrieval (SIGIR), [User Modeling, Recommender Systems], 2013

Efficient Multifaceted Screening of Job Applicants
Sameep Mehta, Rakesh Pimplikar, Amit Singh, Lav R. Varshney, Karthik Visweswariah
Proceedings of the 16th International Conference on Extending Database Technology (EDBT), pp. 661--671, 2013

Vimprint: Exploring Alternative Learning through Low-end Mobiles
Sheetal K Agarwal, Jyoti Grover, Anupam Jain, Arun Kumar
Interact 2013

AI-Ckpt: Leveraging Memory Access Patterns for Adaptive Asynchronous Incremental Checkpointing
Bogdan Nicolae, Franck Cappello
HPDC '13: 22th International ACM Symposium on High-Performance Parallel and Distributed Computing, pp. 155-166, 2013

Predicting Knowledge in An Ontology Stream
Freddy Lecue, Jeff Z.Pan
In Proceedings of the 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013), AAAI

Westland row why so slow? Fusing social media and linked data sources for understanding real-time traffic conditions
Elizabeth M. Daly, Freddy Lecue, Veli Bicer
In Proceedings of the 18th International Conference on Intelligent User Interfaces (IUI 2013), pp. 203-212, ACM

Detecting factual inconsistencies between a document and a fact-base
Indrajit Bhattacharya, Tanveer A. Faruquie, Shantanu Godbole, Mukesh K. Mohania, Ullas B. Nambiar (United States: US8370275)
US Patent 8,370,275

Processing multi-way spatial joins on map-reduce
Himanshu Gupta, Bhupesh Chawda, Sumit Negi, Tanveer A. Faruquie, L. Venkata Subramaniam, Mukesh K. Mohania
International Conference on Extending Database Technology (EDBT), 2013

Hardware acceleration of an efficient and accurate proton therapy Monte Carlo
Thomas H. Osiecki, Min-yu Tsai, Anne E. Gattiker, Damir A. Jamsek, Sani R. Nassif, W. Evan Speight, and Cliff C. N. Sze
International Conference on Computational Science, 2013
To appear

The POWER 775 Architecture at Scale
Mark Stephenson, Ram Rajamony, Evan Speight
27th International Conference on Supercomputing, pp. 183--192, 2013
To Appear

Building an efficient RDF store over a relational database
Mihaela Bornea, Julian Dolby, Anastasios Kementsietsidis, Kavitha Srinivas, Patrick Dantressangle, Octavian Udrea, Bishwaranjan Bhattacharjee
ACM SIGMOD International Conference on Management of Data 2013, pp. 121--132

Pluggable Analysis Viewpoints for Design Space Exploration
Michael Masin, Lior Limonad, Aviad Sela, David Boaz, Lev Greenberg, Nir Mashkif, Ran Rinat
Procedia Computer Science 16(0), 226--235, Elsevier, 2013
2013 Conference on Systems Engineering Research

SOMMOS - Self-Organizing Maps for multi-objective Pareto frontiers
Chen, S. and Amid, D. and Shir, O. and Boaz, D. and Limonad, L. and Anaby-Tavor, A. and Schreck, Tobias
IEEE Pacific Visualization Symposium 2013

SYSTEM FOR SIMPLIFYING THE PROCESS OF CREATING XML DOCUMENT TRANSFORMATIONS
Joshua W Hui, Peter M Schwarz
US Patent 20,130,036,349

A Markov Prediction Model for Semi-Structured Business Processes
Geetika T. Lakshmanan, Davood Shamsi, Yurdaer Doganata, Merve Unuvar, and Rania Khalaf
Accepted on 8/20/2013 to journal: Knowledge and Information Systems (KAIS)

Discovery and Analysis of Evolving Topical Social Discussions on Unstructured Microblogs
Kanika Narang, Seema Nagar, Sameep Mehta, LV Subramaniam, Kuntal Dey
Advances in Information Retrieval, pp. 545--556, Springer Berlin Heidelberg, 2013

Utility-driven Evolution Recommender for a Constrained Ontology
Pramod Anantharam, Biplav Srivastava, Amit Sheth
3rd International Conference on Web Intelligence, Mining and Semantics (WIMS'13) , 2013

Eliminating unscalable communication in transaction processing
Ryan Johnson, Ippokratis Pandis, Anastasia Ailamaki
The VLDB Journal, 2013

Microvolunteering: Helping the Helpers in Development
Michael Bernstein, Mike Bright, Ed Cutrell, Steven Dow, Elizabeth Gerber, Anupam Jain, Anand Kulkarni
CSCW 2013, ACM

Extending the "Web of Drug Identity" with Knowledge Extracted from United States Product Labels
Oktie Hassanzadeh, Qian Zhu, Robert R. Freimuth, Richard D. Boyce
Proceedings of the 2013 AMIA Summit on Translational Bioinformatics

Towards Scalable Checkpoint Restart: A Collective Inline Memory Contents Deduplication Proposal
Bogdan Nicolae
IPDPS '13: The 27th IEEE International Parallel and Distributed Processing Symposium, pp. 19-28, 2013

From A to E: Analyzing TPC's OLTP Benchmarks - The obsolete, the ubiquitous, the unexplored
Pinar Tozun, Ippokratis Pandis, Cansu Kaynak, Djordje Jevdjic, Anastasia Ailamaki
EDBT, 2013

On the Equivalence of Incremental and Fixpoint Semantics for Business Artifacts with Guard-Stage-Milestone Lifecycles
Elio Damaggio, Richard Hull, Roman Vaculin
Information Systems 38(4), 561--584, Elsevier, 2013
Abstract

Dynamic enhancement of drug product labels to support drug safety, efficacy, and effectiveness
R.D. Boyce, J.R. Horn, O. Hassanzadeh, A. de Waard, J. Schneider, J.S. Luciano, M. Rastegar-Mojarad, M. Liakata
Journal of Biomedical Semantics 4(1), 5, BioMed Central Ltd, 2013

Static and Dynamic Semantics of NoSQL Languages
V. Benzaken, G. Castagna, K. Nguyen, J. Simeon
Principle of Programming Languages, 2013

A Framework for Mining Signatures from Event Sequences and Its Applications in Healthcare Data
Fei Wang, Noah Lee, Jianying hu, Jimeng Sun, Shahram Ebadollahi
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 35(2), 272-285, IEEE, 2013

Using Conceptual Models to Theorize about the Relationship Between Records and Risk in the Global Financial Crisis
Kafui Monu, Victoria Lemieux, Lior Limonad, Carson Woo
Financial Analysis and Risk Management, pp. 73-98, Springer Berlin Heidelberg, 2013

Taming Complex Healthcare Data Models with Dictionary Tooling
John T. E. Timm, Joshua Hui, Sarah Knoop, and Peter Schwarz
5th International Workshop on Software Engineering in Health Care (SEHC), 2013
Abstract

HIERARCHICAL RANKING OF FACIAL ATTRIBUTES
Ankur Datta, Rogerio S Feris, Sharathchandra U Pankanti, Daniel A Vaquero
US Patent 20,130,124,514

Querying Persistent Graphs using Solid State Storage
Manos Athanassoulis, Bishwaranjan Bhattacharjee, Mustafa Canim, Kenneth A Ross
The Fourth Annual NonVolatile Memories Workshop (NVMW) 2013

Towards Discovery-Oriented Patient Similarity Search
Haggai Roitman, Sivan Yogev, Yevgenia Tsimerman and Yardena Peres
Health Search and Discovery (HSD) workshop (@SIGIR), [Social-Medical Discovery], 2013

NUMA-aware algorithms: the case of data shuffling
Yinan Li, Ippokratis Pandis, Ren\'{e} M\"{u}ller, Vijayshankar Raman, Guy M. Lohman
CIDR 2013, Sixth Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 6-9, 2013, Online Proceedings

Evaluating Demand Response Programs By Means Of Key Performance Indicators
G. Thanos, M. Minou, G.D. Stamoulis, T. Ganu, V. Arya, D. Chakraborty
Energy in Communication, Information, and Cyber-physical Systems (E6) Workshop, IEEE COMSNETS 2013, pp. 1--6

The need for NUMA-aware algorithms: the case of data shuffling.
Yinan Li, Ippokratis Pandis, Rene Mueller, Vijayshankar Raman, Guy Lohman
CIDR, 2013

The bionic DBMS is coming, but what will it look like?
Ryan Johnson, Ippokratis Pandis
CIDR, 2013

BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds
Bogdan Nicolae, Franck Cappello
J. Parallel Distrib. Comput. 73(5), 698-711, Academic Press, Inc., 2013

Scalable Data Management for Map-Reduce-based Data-Intensive Applications: A View for Cloud and Hybrid Infrastructures
Gabriel Antoniu, Julien Bigot, Cristophe Blanchet, Luc Bouge, Francois Briant, Franck Cappello, Alexandru Costan, Frederic Desprez, Gilles Fedak, Sylvain Gault, Kate Keahey, Bogdan Nicolae, Christian Perez, Anthony Simonet, Frederic Suter et al.
International Journal of Cloud Computing 2(2), 150-170, Inderscience Publishers, 2013


2012

Relevance Matters: Capitalizing on Less (Top-k Matching in Publish/Subscribe)
Mohammad Sadoghi, Hans-Arno Jacobsen
IEEE 28th International Conference on Data Engineering (ICDE 2012), Washington, DC, USA (Arlington, Virginia), 1-5 April, 2012, pp. 786-797

Multi-query Stream Processing on FPGAs
Mohammad Sadoghi, Rija Javed, Naif Tarafdar, Harsh Singh, Rohan Palaniappan, Hans-Arno Jacobsen
IEEE 28th International Conference on Data Engineering (ICDE 2012), Washington, DC, USA (Arlington, Virginia), 1-5 April, 2012, pp. 1229-1232

Towards an extensible efficient event processing kernel
Mohammad Sadoghi
Proceedings of the ACM SIGMOD/PODS PhD Symposium 2012, Scottsdale, AZ, USA, May 20, 2012, pp. 3-8

Scalable and dynamically balanced shared-everything OLTP with physiological partitioning
Pinar Tozun, Ippokratis Pandis, Ryan Johnson, Anastasia Ailamaki
The VLDB Journal, 2012

Path processing using Solid State Storage
Manos Athanassoulis, Bishwaranjan Bhattacharjee, Mustafa Canim, Kenneth A Ross
3rdt International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures (ADMS) 2012 (collocated with VLDB)

Sorting networks on FPGAs
Rene Mueller, Jens Teubner, Gustavo Alonso
The VLDB Journal 21(1), 1--23, 2012
Abstract

gbase: an efficient analysis platform for large graphs
U. Kang, H. Tong, J. Sun, C.Y. Lin, C. Faloutsos
The VLDB Journal, 1--14, Springer, 2012


Solving Big Data Challenges for Enterprise Application Performance Management
Tilmann Rabl, Mohammad Sadoghi, Hans-Arno Jacobsen, Sergio G\'omez-Villamor, Victor Munt\'es-Mulero, Serge Mankowskii
PVLDB 5(12), 1724-1735, VLDB Endowment, 2012

On the Optimization of Schedules for MapReduce Workloads in the Presence of Shared Scans
J. L. Wolf, A. Balmin, D. Rajan, K. Hildrum, R. Khandekar, S. Parekh, K.-L. Wu, and R. Vernica
VLDB Journal 21(5), 589-609, 2012

Building User-defined Runtime Adaptation Routines for Stream Processing Applications
G. Jacques-Silva, B. Gedik, R. Wagle, K.-L. Wu and V. Kumar
PVLDB 5(12), 1826-1837, VLDB Endowment, 2012

On the spatiotemporal burstiness of terms
Theodoros Lappas, Marcos R Vieira, Dimitrios Gunopulos, Vassilis J Tsotras
Proceedings of the VLDB Endowment 5(9), 836--847, VLDB Endowment, 2012

Exploiting evidence from unstructured data to enhance master data management
Karin Murthy, Prasad M Deshpande, Atreyee Dey, Ramanujam Halasipuram, Mukesh Mohania, P Deepak, Jennifer Reed, Scott Schumacher
International Conference on Very Large Databases (VLDB'12), pp. 1862--1873, VLDB Endowment, 2012

Can the Elephants Handle the NoSQL Onslaught?
A. Floratou, N. Teletia, D. J. DeWitt, J. M. Patel, D. Zhang
PVLDB 5(12), 2012

The filter-placement problem and its application to minimizing information multiplicity
Dora Erdos, Vatche Ishakian, Andrei Lapets, Evimaria Terzi, Azer Bestavros
Proceedings of the VLDB Endowment 5(5), 418--429, VLDB Endowment, 2012

Data Management Issues on the Semantic Web
O. Hassanzadeh, A. Kementsietsidis, Y. Velegrakis
Data Engineering (ICDE), 2012 IEEE 28th International Conference on, pp. 1204--1206

Private-HERMES: A Benchmark Framework for Privacy-Preserving Mobility Data Querying and Mining Methods
N Pelekis, A Plemenos, A Gkoulalas-Divanis, D Kopanaki, M Vodas, Y Theodoridis
Divanis... - 2012 - edbt.org

Temporal analytics on big data for web advertising
Badrish Chandramouli, Jonathan Goldstein, Songyun Duan
International Conference on Data Engineering (ICDE), Best Paper Award, 2012

Using content and interactions for discovering communities in social networks
Mrinmaya Sachan, Danish Contractor, Tanveer A Faruquie, L Venkata Subramaniam
Proceedings of the 21st international conference on World Wide Web, pp. 331--340, 2012

Surfacing Time-critical Insights from Social Media
B. Alexe, M. A. Hernandez, K. W. Hildrum, R. Krishnamurthy, G. Koutrika, M. Nagarajan, H. Roitman, M. Shmueli-Scheuer,
ACM SIGMOD, pp. 657--660, [Social Media], 2012

Towards Expressive Exploratory Search Over Entity-Relationship Data
Sivan Yogev, Haggai Roitman, David Carmel, Naama Zwerdling
World Wide Web conference (WWW), [Entity Oriented Search], 2012

Scalable multi-query optimization for SPARQL
Wangchao Le, Anastasios Kementsietsidis, Songyun Duan, Feifei Li
International Conference on Data Engineering (ICDE), 2012

Semantic Link Discovery over Relational Data
Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renee J. Miller, and Min Wang
Semantic Search over the Web, pp. 193-224, Springer, 2012


2011

How soccer players would do stream joins
Jens Teubner, Rene Mueller
Proceedings of the ACM SIGMOD Conference on Management of Data (SIGMOD 2011), pp. 625--636
Abstract

Designing internal control points in partially managed processes by using business vocabulary (in conjunction with ICDE 2011)
Yurdaer Doganata
First International Workshop on Data Management and Analytics for Semi-Structured Business Processes, 2011

Massively Parallel XML Twig Filtering Using Dynamic Programming on FPGAs
Roger Moussalli, Mariam Salloum, Walid Najjar, Vassilis J Tsotras
Data Engineering (ICDE), 2011 IEEE 27th International Conference on, pp. 948--959

On query result diversification
Marcos R Vieira, Humberto L Razente, Maria CN Barioni, Marios Hadjieleftheriou, Divesh Srivastava, Caetano Traina, Vassilis J Tsotras
IEEE 27th International Conference on Data Engineering (ICDE), pp. 1163--1174, 2011

On Query Result Diversification
Marcos R Vieira, Humberto L Razente, Maria CN Barioni, Marios Hadjieleftheriou, Divesh Srivastava, Caetano Traina, Vassilis J Tsotras
IEEE 27th International Conference on Data Engineering (ICDE), 2011

The SystemT IDE: an integrated development environment for information extraction rules
Laura Chiticariu, Vivian Chu, Sajib Dasgupta, Thilo W. Goetz, Howard Ho, Rajasekar Krishnamurthy, Alexander Lang, Yunyao Li, Bin Liu, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu
SIGMOD (Demonstration), pp. 1291-1294, 2011

BE-Tree: an index structure to efficiently match boolean expressions over high-dimensional discrete space
Mohammad Sadoghi, Hans-Arno Jacobsen
Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2011, Athens, Greece, June 12-16, 2011, pp. 637-648

MaSM: efficient online updates in data warehouses
Manos Athanassoulis, Shimin Chen, Anastasia Ailamaki, Phillip B Gibbons, Radu Stoica
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, pp. 865--876

Data is dead... without what-if models
Haas, P., Maglio, P., Selinger, P., Tan, W-C
Proceedings of VLDB, 2011

HIWAS - Enabling Technology for Analysis of Clinical Data in XML Documents
Peter Schwarz, Joshua Hui, and Sarah Knoop
Proceedings of the VLDB Endowment 4(12), 2011

Auto-grouping emails for faster e-discovery
Sachindra Joshi, Danish Contractor, Kenney Ng, Prasad M Deshpande, Thomas Hampp
Proceedings of the VLDB Endowment 4(12), 1284--1294, 2011

Efficient XML Path Filtering Using GPUs
Roger Moussalli, Robert Halstead, Mariam Salloum, Walid Najjar, Vassilis J Tsotras
ADMS, In Conjunction With VLDB, 2011

DivDB: A System for Diversifying Query Results
Marcos R Vieira, Humberto L Razente, Maria CN Barioni, Marios Hadjieleftheriou, Divesh Srivastava, Caetano Traina Jr, Vassilis J Tsotras
Proceedings of the VLDB Endowment 4(12), 2011

Column-Oriented Storage Techniques for MapReduce
A. Floratou, J. M. Patel, S. Tata, E. Shekita
PVLDB 4(7), 2011

Online aggregation for large mapreduce jobs
Niketan Pansare, Vinayak R Borkar, Chris Jermaine, Tyson Condie
Proc. VLDB Endow 4(11), 1135--1145, 2011

On the Benefits of Transparent Compression for Cost-Effective Cloud Data Storage
Bogdan Nicolae
Transactions on Large-Scale Data- and Knowledge-Centered Systems 3(1), 167-184, Springer Berlin / Heidelberg, 2011

BlobSeer: Next-generation data management for large scale infrastructures
Bogdan Nicolae, Gabriel Antoniu, Luc Bouge, Diana Moise, Alexandra Carpen-Amarie
J. Parallel Distrib. Comput.71, 169--184, Academic Press, Inc., 2011

Apples and oranges: A comparison of RDF benchmarks and real RDF datasets
Songyun Duan, Anastasios Kementsietsidis, Kavitha Srinivas, Octavian Udrea
Proceedings of the 2011 international conference on Management of data (SIGMOD), pp. 145--156, ACM
Abstract

Emerging trends in the enterprise data analytics: connecting Hadoop and DB2 warehouse
F Ozcan, D Hoa, K S Beyer, A Balmin, C J Liu, Y Li
Proceedings of ACM SIGMOD 2011, pp. 1161--1164

The SystemT IDE: an integrated development environment for information extraction rules
Laura Chiticariu, Vivian Chu, Sajib Dasgupta, Thilo W Goetz, Howard Ho, Rajasekar Krishnamurthy, Alexander Lang, Yunyao Li, Bin Liu, Sriram Raghavan, others
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, pp. 1291--1294

Rewriting queries on SPARQL views
Wangchao Le, Songyun Duan, Anastasios Kementsietsidis, Feifei Li, Min Wang
WWW, pp. 655-664, ACM, 2011
Abstract

Helix: Online Enterprise Data Analytics
Oktie Hassanzadeh, Songyun Duan, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J Ward
WWW (Companion Volume), pp. 225-228, ACM, 2011
Abstract

Using Standards to enable the transformation to Smarter Cities
J Hogan, J Meegan, R Parmar, V Narayan, R Schloss
IBM Journal of Research and Development 55(1), 4:1-4:10, IBM, 2011

Enabling Integrated City Operations
D Bartlett, W Harthoorn, J Hogan, M Kehoe, R Schloss
IBM Journal of Research and Development 55(1), 15:1-15:10, 2011

A publication process model to enable privacy-aware data sharing
A Gkoulalas-Divanis, E W Cope
IBM Journal of Research and Development (Special Issue for the 100th Anniversary of IBM) 55(5), 8:1-8:10, IEEE, 2011

Information Technology For Healthcare Transformation
J. P. Bigus, M. Campbell, B. Carmeli, M. Cefkin, H. Chang, C.-H. Chen-Ritzo, W. F. Cody, S. Ebadollahi, A. Evfimievski, A. Farkash, S. Glissmann, D. Gotz, T. W. A. Grandison, D. Gruhl, P. J. Haas, M. J. H. Hsiao, P.-Y. S. Hsueh, J. Hu, J. M. Jasinski, J.
IBM Journal of Research and Development - Special Issue on the Frontiers of IT, Vol 55, No 5, pp 6:1-6:14 55(5), 6--1, IEEE, 2011

Practical computer vision: example techniques and challenges
S Pankanti, L Brown, J Connell, A Datta, Q Fan, R Feris, N Haas, Y Li, N Ratha, H Trinh
IBM Journal of Research and Development 55(5), 2011

Analytics-driven asset management
A Hampapur, H Cao, A Davenport, WS Dong, D Fenhagen, RS Feris, G Goldszmidt, ZB Jiang, J Kalagnanam, T Kumar, others
IBM Journal of Research and Development 55(1.2), 13--1, IBM, 2011

Distributed middleware reliability and fault tolerance support in system S
Rohit Wagle, Henrique Andrade, Kirsten Hildrum, Chitra Venkatramani, Michael Spicer
Proceedings of the 5th ACM international conference on Distributed event-based system, pp. 335--346, 2011

Rewrite rules for search database systems
R. Fagin, B. Kimmelfeld, Y.Li, S. Raghavan, and S. Vaithyanathan
PODS, pp. 271-282, 2011

Predicting completion times of batch query workloads using interaction-aware models and simulation
Mumtaz Ahmad, Songyun Duan, Ashraf Aboulnaga, Shivnath Babu
14th International Conference on Extending Database Technology (EDBT), 2011

Mining GPS data to determine interesting locations
S Khetarpaul, R Chauhan, SK Gupta, L V Subramaniam, U Nambiar
Proceedings of the 8th International Workshop on Information Integration on the Web: in conjunction with WWW 2011, pp. 8

Jaql: A scripting language for large scale semistructured data analysis
K S Beyer, V Ercegovac, R Gemulla, A Balmin, M Eltabakh, C C Kanne, F Ozcan, E J Shekita
Proceedings of the VLDB Endowment 4(12), 2011

SystemML: Declarative machine learning on MapReduce
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, S. Vaithyanathan
Data Engineering (ICDE), 2011 IEEE 27th International Conference on, pp. 231--242

Locality sensitive outlier detection: A ranking driven approach
Y Wang, S Parthasarathy, S Tatikonda
Data Engineering (ICDE), 2011 IEEE 27th International Conference on, pp. 410--421

Demo: Debugging Data Exchange with Vagabond
B Glavic, J Du, RJ Miller, G Alonso and LM Haas
PVLDB 4(12), 1383-1386, 2011

Load Shedding in Mobile Systems with MobiQual
B Gedik, K L Wu, L Liu, P S Yu
Knowledge and Data Engineering, IEEE Transactions on 23(2), 248--265, IEEE, 2011

HIWAS: Enabling Technology for Analysis of Clinical Data in XML Documents
Joshua Hui, Sarah Knoop, Peter Schwarz
37th International Conference on Very Large Data Bases (VLDB) 2011
Abstract

CoHadoop: flexible data placement and its exploitation in Hadoop
M.Y. Eltabakh, Y. Tian, F. Ozcan, R. Gemulla, A. Krettek, J. McPherson
VLDB Proceedings, pp. 575--585, VLDB Endowment, 2011

A Dual Framework and Algorithms for Targeted Data Delivery
Haggai Roitman, Avigdor Gal, Louiqa Raschid
IEEE Transactions on Knowledge and Data Engineering, [Online Data Delivery], 2011


2010

ROADTRACK: Scaling Location Updates for Mobile Clients on Road Networks with Query Awareness
P Pesti, L Liu, B Bamba, A Iyengar, M Weber
Proceedings of the VLDB Endowment 3(2), 2010

Ratio threshold queries over distributed data sources
R. Gupta, K. Ramamritham, M. Mohania
Data Engineering (ICDE), 2010 IEEE 26th International Conference on, pp. 581--584

Discovery-driven graph summarization
N. Zhang, Y. Tian, J.M. Patel
Data Engineering (ICDE), 2010 IEEE 26th International Conference on, pp. 880--891

Data cleansing as a transient service
Tanveer A Faruquie, K Hima Prasad, L Venkata Subramaniam, Mukesh Mohania, Girish Venkatachaliah, Shrinivas Kulkarni, Pramit Basu
Data Engineering (ICDE), 2010 IEEE 26th International Conference on, pp. 1025--1036

Enterprise information extraction: recent developments and open challenges
Laura Chiticariu, Yunyao Li, Sriram Raghavan, Frederick Reiss
SIGMOD (Tutorial), pp. 1257-1258, 2010

Durable top-k search in document archives
Nikos Mamoulis, Klaus Berberich, Srikanta Bedathur, others
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, pp. 555--566

The DataPath System: A Data-centric Analytic Processing Engine for Large Data Warehouses
Subi Arumugam, Alin Dobra, Christopher M. Jermaine, Niketan Pansare, Luis Perez
Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 519--530, ACM

SSD bufferpool extensions for database systems
Mustafa Canim, George A Mihaila, Bishwaranjan Bhattacharjee, Kenneth A Ross, Christian A Lang
36th International Conference on Very Large Databases (VLDB) 2010 , pp. 1435--1446, VLDB Endowment

Buffered Bloom filters on solid state storage
Mustafa Canim, George A Mihaila, B Bhattacharjee, CA Lang, KA Ross
1st International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures (ADMS) 2010 (collocated with VLDB)

Automatic Rule Refinement for Information Extraction
Bin Liu, Laura Chiticariu, Vivian Chu, H. V. Jagadish, Frederick Reiss
Proceedings of the VLDB Endowment Journal 3(1), 588-597, VLDB Endowment, 2010

Efficient Event Processing through Reconfigurable Hardware for Algorithmic Trading
Mohammad Sadoghi, Hans-Arno Jacobsen, Martin Labrecque, Warren Shum, Harsh Singh
PVLDB 3(2), 1525-1528, VLDB Endowment, 2010

Interesting-phrase mining for ad-hoc text analytics
Srikanta Bedathur, Klaus Berberich, Jens Dittrich, Nikos Mamoulis, Gerhard Weikum
Proceedings of the VLDB Endowment 3(1-2), 1348--1357, VLDB Endowment, 2010

InZeit: efficiently identifying insightful time points
Vinay Setty, Srikanta Bedathur, Klaus Berberich, Gerhard Weikum
Proceedings of the VLDB Endowment 3(1-2), 1605--1608, VLDB Endowment, 2010

BlobSeer: Bringing High Throughput under Heavy Concurrency to Hadoop Map/Reduce Applications
Bogdan Nicolae, Diana Moise, Gabriel Antoniu, Luc Bouge, Matthieu Dorier
IPDPS '10: 24th IEEE International Parallel and Distributed Processing Symposium, pp. 1-11, 2010

BlobSeer: Efficient Data Management for Data-Intensive Applications Distributed at Large-Scale
Bogdan Nicolae
IPDPS '10: 24th IEEE International Symposium on Parallel and Distributed Processing: Workshops and Phd Forum, pp. 1-4, 2010


Data Cleansing as a Transient Service
Tanveer. A. Faruquie, K. H. Prasad, L. V. Subramaniam, M. K. Mohania, G. Venkatachaliah, S. Kulkarni, P. Basu
IEEE International Conference on Data Engineering (ICDE), 2010

Hashing tree-structured data: Methods and applications
S Tatikonda, S Parthasarathy
Data Engineering (ICDE), 2010 IEEE 26th International Conference on, pp. 429--440

Efficient B-tree Based Indexing for Cloud Data Processing
S Wu, D Jiang, B C Ooi, K L Wu
Proceedings of the VLDB Endowment 3(1), 2010

Efficient RkNN retrieval with arbitrary non-metric similarity measures
P Deepak, Prasad M Deshpande
Proceedings of the VLDB Endowment 3(1-2), 1243--1254, VLDB Endowment, 2010
Abstract

From a stream of relational queries to distributed stream processing
Qiong Zou, Huayong Wang, Robert Soul'{e}, Martin Hirzel, Henrique Andrade, Buv{g}ra Gedik, Kun-Lung Wu
Proc. VLDB Endow.3, 1394--1405, VLDB Endowment, 2010
Abstract

TRAMP: understanding the behavior of schema mappings through provenance
B Glavic, G Alonso, R J Miller, L M Haas
Proceedings of the VLDB Endowment 3(1-2), 1314--1325, VLDB Endowment, 2010

Secret: A model for analysis of the execution semantics of stream processing systems
I Botan, R Derakhshan, N Dindar, L Haas, R J Miller, N Tatbul
Proceedings of the VLDB Endowment 3(1-2), 232--243, VLDB Endowment, 2010

Time for Our Field to Grow Up (Panel)
A Ailamaki, LM Haas, HV Jagadish, D Maier, MT Ozsu, M Winslett
PVLDB 3(2), 1658, 2010

Understanding queries in a search database system
R Fagin, B Kimelfeld, Y Li, S Raghavan, S Vaithyanathan
Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 273--284, ACM, 2010

On the Relationship Between Novelty and Popularity of User-Generated Content
David Carmel, Haggai Roitman, Elad Yom-Tov
19th ACM Conference on Information and Knowledge Management (CIKM), [User Modeling, Social Media], 2010

Estimating Accuracy for Text Classification Tasks on Large Unlabelled Data
Snigdha Chaturvedi, Tanveer A. Faruquie, L. Venkata Subramaniam, M. K. Mohania
ACM Conference on Information and Knowledge Management (CIKM), 2010

DiscoveryLink: A system for integrated access to life sciences data sources
L M Haas, P M Schwarz, P Kodali, E Kotlar, J E Rice, W C Swope
IBM systems Journal 40(2), 489--511, IBM, 2010

Interaction-aware prediction of business intelligence workload completion times
Mumtaz Ahmad, Songyun Duan, Ashraf Aboulnaga, Shivnath Babu
IEEE 26th International Conference on Data Engineering (ICDE), pp. 413--416, 2010

A Demonstration of the MaxStream Federated Stream Processing System
I Botan, Y Cho, R Derakhshan, N Dindar, A Gupta, L Haas, K Kim, C Lee, G Mundada, M C Shan, others
ICDE, IEEE Computer Society, 2010

The structure of inverses in schema mappings
R Fagin, A Nash
Journal of the ACM (JACM) 57(6), 31, ACM, 2010

Statistics-based parallelization of XPath queries in shared memory systems
R Bordawekar, L Lim, A Kementsietsidis, B W L Kok
Proceedings of the 13th International Conference on Extending Database Technology, pp. 159--170, ACM, 2010

Social Bookmark Weighting for Search and Recommendation
David Carmel, Haggai Roitman, Elad Yom-Tov
VLDB Journal , Content Analysis, Social Media, Recommender Systems, 2010

DEDUCE: at the intersection of MapReduce and stream processing
Vibhore Kumar, Henrique Andrade, Buu{g}ra Gedik, Kun-Lung Wu
Proceedings of the 13th International Conference on Extending Database Technology, pp. 657--662, ACM, 2010
Abstract


2009

An object placement advisor for DB2 using solid state storage
Mustafa Canim, George A Mihaila, Bishwaranjan Bhattacharjee, Kenneth A Ross, Christian A Lang
35th International Conference on Very Large Databases (VLDB) 2009 , pp. 1318--1329, VLDB Endowment

Efficient Index Compression in DB2 LUW
B. Bhattacharjee, L. Lim, T. Malkemus, G. Mihaila, K. Ross, S. Lau, C. McArthur, Z. Toth, R. Sherkat
35th International Conference on Very Large Databases (VLDB) 2009

Data processing on FPGAs
Rene Mueller, Jens Teubner, Alonso Gustavo
Proceedings of the VLDB Endowment 2(1), 910--921, VLDB Endowment, 2009
Abstract

Streams on wires: a query compiler for FPGAs
Rene Mueller, Jens Teubner, Gustavo Alonso
Proceedings for the VLDB Endowment 2(1), 229--240, VLDB Endowment, 2009
Abstract

Enabling High Data Throughput in Desktop Grids Through Decentralized Data and Metadata Management: The BlobSeer Approach
Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
Euro-Par '09: 15th International Euro-Par Conference on Parallel Processing, pp. 404-416, 2009

Towards A Grid File System Based On A Large-Scale BLOB Management Service
Viet-Trung Tran, Gabriel Antoniu, Bogdan Nicolae, Luc Bouge
Euro-Par '09: CoreGRID ERCIM Working Group on Grids, P2P and Service computing, 2009

BlobSeer: how to enable efficient versioning for large object storage under heavy access concurrency
Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
EDBT/ICDT '09 Workshops, pp. 18--25, ACM, 2009

Best Effort Top-K Query Processing Under Budgetary Constraints
M. Shmueli-Scheuer, Chen Li, Yosi Mass, Haggai Roitman, Ralf Schenkel, Gerhard Weikum
25th IEEE International Conference on Data Engineering (ICDE), [Query Processing], 2009

Web Monitoring 2.0 - Crossing Streams to Satisfy Complex Data Needs
Haggai Roitman, Avigdor Gal, Louiqa Raschid
25th IEEE International Conference on Data Engineering (ICDE), [Online Data Delivery, Web Monitoring], 2009

Fa: A system for automating failure diagnosis
Songyun Duan, Shivnath Babu, Kamesh Munagala
IEEE 25th International Conference on Data Engineering (ICDE), pp. 1012--1023, 2009

Automated diagnosis of system failures with Fa
Songyun Duan, Shivnath Babu
IEEE 25th International Conference on Data Engineering (ICDE), demo, pp. 1499--1502, 2009

Shaman: A self-Healing database system
Songyun Duan, Peter Franklin, Vamsi Thummala, Dongdong Zhao, Shivnath Babu
IEEE 25th International Conference on Data Engineering (ICDE), demo, pp. 1539--1542, 2009

Business Intelligence from Voice of Customer
L. Venkata Subramaniam, Tanveer A. Faruquie, Shajith Ikbal, Shantanu Godbole, Mukesh K. Mohania
IEEE International Conference on Data Engineering (ICDE), 2009


Federated Stream Processing Support for Real-Time Business Intelligence Applications
I Botan, Y Cho, R Derakhshan, N Dindar, L Haas, K Kim, N Tatbul
VLDB BIRTE Workshop, 2009

SMDM: enhancing enterprise-wide master data management using semantic web technologies
X Wang, X Sun, F Cao, L Ma, N Kanellos, K Zhang, Y Pan, Y Yu
Proceedings of the VLDB Endowment 2(2), 1594--1597, VLDB Endowment, 2009

Tuning database configuration parameters with iTuned
Songyun Duan, Vamsidhar Thummala, Shivnath Babu
Proc. VLDB Endow. 2(1), 1246--1257, VLDB Endowment, 2009
Abstract

Mashup by surfing a web of data APIs
H Chen, B Lu, Y Ni, G Xie, C Zhou, J Mi, Z Wu
Proceedings of the VLDB Endowment 2(2), 1602--1605, VLDB Endowment, 2009

CellJoin: a parallel stream join operator for the cell processor
B Gedik, R R Bordawekar, P S Yu
The VLDB Journal 18(2), 501--519, Springer, 2009

Creating Probabilistic Databases from Duplicated Data
O. Hassanzadeh, R. J. Miller
The VLDB Journal 18(5), 1141--1166, Springer-Verlag New York, Inc., 2009

Mining tree-structured data on multicore systems
S Tatikonda, S Parthasarathy
Proceedings of the VLDB Endowment 2(1), 694--705, VLDB Endowment, 2009

Serial and parallel methods for i/o efficient suffix tree construction
Amol Ghoting, Konstantin Makarychev
SIGMOD, pp. 827--840, ACM, 2009
Abstract

Top-k generation of integrated schemas based on directed and weighted correspondences
A Radwan, L Popa, I R Stanoi, A Younis
Proceedings of the 35th SIGMOD international conference on Management of data, pp. 641--654, 2009

Enabling enterprise mashups over unstructured text feeds with infosphere mashuphub and systemt
David E Simmen, Frederick Reiss, Yunyao Li, Suresh Thalamati
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pp. 1123--1126, ACM
Abstract

Reverse data exchange: coping with nulls
R Fagin, P G Kolaitis, L Popa, W C Tan
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 23--32, 2009

A gauss function based approach for unbalanced ontology matching
Q Zhong, H Li, J Li, G Xie, J Tang, L Zhou, Y Pan
Proceedings of the 35th SIGMOD international conference on Management of data, pp. 669--680, 2009

Highly scalable web applications with zero-copy data transfer
Toyotaro Suzumura, Michiaki Tatsubori, Scott Trent, Akihiko Tozawa, Tamiya Onodera
Proceedings of the 18th International Conference on World Wide Web, WWW 2009, Madrid, Spain, April 20-24, 2009, pp. 921--930

HTML Templates that Fly - a Template Engine Approach to Automated Offloading from Server to Client
Michiaki Tatsubori, Toyotaro Suzumura
Proceedings of the 18th international conference on World Wide Web (WWW 2009), Madrid, Spain, April 20-24, 2009, pp. 951-960, ACM

Who Tags the Tags? A Framework for Bookmark Weighting
David Carmel, Haggai Roitman, Elad Yom-Tov
18th ACM Conference on Information and Knowledge Management (CIKM), [Content Analysis, Social Media], 2009

Network Visualization and Analysis with Parallel Coordinates
Ohad Greenshpan, Yaron Singer, Al Inselberg
Book Chapter in "Parallel Coordinates: Visual Multidimensional Geometry with Applications" / Springer Verlag (In Press), 2009

Enriching One Taxonomy Using Another
L V Subramaniam, A A Nanavati, S Mukherjea
IEEE Transactions on Knowledge and Data Engineering, Published by the IEEE Computer Society, 2009

Exact Knowledge Hiding through Database Extension
A Gkoulalas-Divanis, V S Verykios
IEEE Transactions on Knowledge and Data Engineering 21(5), 699--713, IEEE, 2009

A survey of uncertain data algorithms and applications
C C Aggarwal, P S Yu
IEEE Transactions on Knowledge and Data Engineering 21(5), 1, 2009

On the efficiency of provenance queries
A Kementsietsidis, M Wang
Data Engineering, 2009, pp. 1223--1226

Linkage Query Writer
O Hassanzadeh, R Xin, R J Miller, A Kementsietsidis, L Lim, M Wang
Proceedings of the 35th International Conference on Very Large Data Bases (VLDB 2009)-Demonstrations Track, pp. 1590--1593, VLDB Endowment

Provenance query evaluation: what's so special about it?
A Kementsietsidis, M Wang
Proceeding of the 18th ACM conference on Information and knowledge management, pp. 681--690, 2009

Profile-based Retrieval of Records in Medical Databases
A Kementsietsidis, L Lim, M Wang
AMIA Annual Symposium Proceedings, pp. 312, 2009

Semantic Link Discovery
O. Hassanzadeh, A. Kementsietsidis, L. Lim, M. Wang
US Patent App. 12/609,657


Optimizing Queries to Hierarchically Structured Data
R Bordawekar, A Kementsietsidis, B W L Kok, L Lim
US Patent App. 12/624,675

ONTOLOGY-BASED SEARCHING IN DATABASE SYSTEMS
LIM Lipyeow, A Kementsietsidis, W Min
US Patent App. 12/481,009

Framework for Evaluating Clustering Algorithms in Duplicate Detection
O Hassanzadeh, F Chiang, H C Lee, R J Miller
Proceedings of the VLDB Endowment 2(1), 1282--1293, VLDB Endowment, 2009

A Declarative Framework for Semantic Link Discovery over Relational Data
O Hassanzadeh, L Lim, A Kementsietsidis, M Wang
Proceedings of the 18th International Conference on World wide web, pp. 1101--1102, ACM, 2009

XOntoRank: Ontology-Aware Search of Electronic Medical Records
F Farf{'a}n, V Hristidis, A Ranganathan, M Weiner
Proceedings of the 2009 IEEE International Conference on Data Engineering, pp. 820--831

Matchup: Autocompletion for mashups
S Abiteboul, O Greenshpan, T Milo, N Polyzotis
Proceedings of the 2009 IEEE International Conference on Data Engineering, pp. 1479--1482

A Declarative Framework for Semantic Link Discovery over Relational Data
O Hassanzadeh, L Lim, A Kementsietsidis, M Wang
Proceedings of the 18th International Conference on World wide web, pp. 1101--1102, ACM, 2009

Framework for Evaluating Clustering Algorithms in Duplicate Detection
O Hassanzadeh, F Chiang, H C Lee, R J Miller
Proceedings of the VLDB Endowment 2(1), 1282--1293, VLDB Endowment, 2009

MatchUp: Autocompletion for Mashups (demo)
Ohad Greenshpan, Serge Abiteboul, Neoklis Polyzotis, Tova Milo
ICDE, 2009

COBRA--mining web for COrporate Brand and Reputation Analysis
S Spangler, Y Chen, L Proctor, A Lelescu, A Behal, B He, T D Griffin, A Liu, B Wade, T Davis
Web Intelligence and Agent Systems 7(3), 243--254, IOS Press, 2009


2008

Muse: A System for Understanding and Designing Mappings
B. Alexe, L. Chiticariu, R. J. Miller, D. Pepper, W. Tan
SIGMOD (Demonstration), pp. 1281-1284, 2008

XArch: archiving scientific and reference data
Heiko M\"uller, Peter Buneman, Ioannis Koltsidas
Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 1295--1298

Periscope/gq: a graph querying toolkit
Y. Tian, J.M. Patel, V. Nair, S. Martini, M. Kretzler
Proceedings of the VLDB Endowment 1(2), 1404--1407, VLDB Endowment, 2008

Flashing up the storage layer
Ioannis Koltsidas, Stratis D Viglas
Proceedings of the VLDB Endowment 1(1), 514--525, VLDB Endowment, 2008

Sorting hierarchical data in external memory for archiving
Ioannis Koltsidas, Heiko M\"uller, Stratis D Viglas
Proceedings of the VLDB Endowment 1(1), 1205--1216, VLDB Endowment, 2008

A Probabilistic Model for Blocking Risks of Atomic Transactions in P2P Networks
Joos-Hendrik Boese, Juergen Bross, Heinz Schweppe
Proceedings of 6th International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2008) at VLDB2008

Enabling lock-free concurrent fine-grain access to massive distributed data: Application to supernovae detection
Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
Cluster '08: 10th IEEE International Conference on Cluster Computing, pp. 310-315, 2008

Distributed Management of Massive Data: An Efficient Fine-Grain Data Access Scheme
Bogdan Nicolae, Gabriel Antoniu, Luc Bouge
VecPar '08: 8th International Meeting on High Performance Computing for Computational Science, pp. 532-543, 2008

Grouping and Optimization of XPath Expressions in System RX
A Balmin, F Ozcan, A Singh, E Ting
ICDE 2008: Proceedings of the 24th International Conference on Data Engineering, pp. 1507--1509

Clip: a visual language for explicit schema mappings
A Raffio, D Braga, S Ceri, P Papotti, M A Hern{'a}ndez
Proceedings of International Conference on Data Engineering (ICDE), pp. 30--39, 2008

Capturing Approximated Data Delivery Trade-offs
Haggai Roitman, Avigdor Gal, Louiqa Raschid
24th IEEE International Conference on Data Engineering (ICDE), [Online Data Delivery], 2008

Satisfying Complex Data Needs using Pull-Based Online Monitoring of Volatile Data Sources
Haggai Roitman, Avigdor Gal, Louiqa Raschid
24th IEEE International Conference on Data Engineering (ICDE), [Online Data Delivery, Web Monitoring], 2008

Processing diagnosis queries: A principled and scalable approach
Shivnath Babu, Songyun Duan, Kamesh Munagala
IEEE 24th International Conference on Data Engineering (ICDE), pp. 1468--1470, 2008

SEDA: a system for search, exploration, discovery, and analysis of XML Data
A Balmin, L Colby, E Curtmola, Q Li, F Ozcan, S Srinivas, Z Vagena
Proceedings of VLDB 1(2), 1408--1411, VLDB Endowment, 2008

Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases
D Van de Craen, F Neven, S VANSUMMEREN, A KEMENTSIETSIDIS
Proceedings of the VLDB Endowment 1(1), 16--27, VLDB Endowment, 2008

Main-memory scan sharing for multi-core CPUs
Lin Qiao, Vijayshankar Raman, Frederick Reiss, Peter J. Haas, Guy M. Lohman
PVLDB 1(1), 610--621, 2008

Row-wise parallel predicate evaluation
R Johnson, V Raman, R Sidle, G Swart
Proceedings of the VLDB Endowment archive 1(1), 622--634, VLDB Endowment, 2008

Multidimensional content eXploration
Alkis Simitsis, Akanksha Baid, Yannis Sismanis, Berthold Reinwald
Proceedings of the VLDB Endowment 1(1), 660--671, VLDB Endowment, 2008

DBPubs: multidimensional exploration of database publications
Akanksha Baid, Andrey Balmin, Heasoo Hwang, Erik Nijkamp, Jun Rao, Berthold Reinwald, Alkis Simitsis, Yannis Sismanis, Frank van Ham
Proceedings of the VLDB Endowment 1(2), 1456--1459, VLDB Endowment, 2008

ManyAspects: a system for highlighting diverse concepts in documents
K Liu, E Terzi, T Grandison
Proceedings of the VLDB Endowment 1(2), 1444--1447, VLDB Endowment, 2008

Data exchange with data-metadata translations
M A Hernandez, P Papotti, W C Tan
Proceedings of the VLDB Endowment archive 1(1), 260--273, VLDB Endowment, 2008

Maintaining Dynamic Channel Profiles on the Web
Haggai Roitman, David Carmel, Elad Yom-Tov
34th International Conference on Very Large Data Bases (VLDB), [Content Analysis, Web Monitoring, Streams], 2008

LeeWave: level-wise distribution of wavelet coefficients for processing k NN queries over distributed streams
M Y Yeh, K L Wu, P S Yu, M S Chen
Proceedings of the VLDB Endowment 1(1), 586--597, VLDB Endowment, 2008

Efficiently approximating query optimizer plan diagrams
A Dey, S Bhaumik, others
Proceedings of the VLDB Endowment 1(2), 1325--1336, VLDB Endowment, 2008

Grouping and optimization of XPath expressions in DB2 pureXML
A Balmin, F Ozcan, A Singh, E Ting
SIGMOD 2008: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 1065--1074

A generic flow algorithm for shared filter ordering problems
Z Liu, S Parthasarathy, A Ranganathan, H Yang
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), pp. 79--88, 2008

Near-Optimal Algorithms for Shared Filter Evaluation in Data Stream Systems
Z. Liu, S. Parthasarathy, A. Ranganathan, H. Yang
ACM SIGMOD International Conference on Management of Data, pp. 133--146, 2008

http://portal.acm.org/citation.cfm?id=1376731&dl=GUIDE,
U Nambiar, H Gupta, R Balakrishnan, M Mohania
Proceedings of the 2008 ACM SIGMOD international conference …, 2008 - portal.acm.org

XML query optimization in the presence of side effects
Giorgio Ghelli, Nicola Onose, Kristoffer H\ogsbro Rose, Jerome Simeon
Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, Vancouver, BC, Canada, June 10-12, 2008, pp. 339--352

SPADE: the system s declarative stream processing engine
Bugra Gedik, Henrique Andrade, Kun-Lung Wu, Philip S Yu, Myungcheol Doo
Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 1123--1134, ACM
Abstract

Discovering topical structures of databases
Wensheng Wu, Berthold Reinwald, Yannis Sismanis, Rajesh Manjrekar
Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 1019--1030, Google Patents
US Patent 7,818,323

SystemT: a system for declarative information extraction
Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu
ACM SIGMOD Record 37(4), 7--13, ACM, 2008
Abstract

Towards a theory of schema-mapping optimization
R Fagin, P G Kolaitis, A Nash, L Popa
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 33--42, 2008


PODS '08: Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A Evfimievski, R Fagin, D P Woodruff
Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 171--180, ACM, 2008
Abstract   475080

SQAK: doing more with keywords
Sandeep Tata, Guy M. Lohman
Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, Vancouver, BC, Canada, June 10-12, 2008, pp. 889--902

XML query optimization in the presence of side effects
Giorgio Ghelli, Nicola Onose, Kristoffer Rose, Jerome Simeon
Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 339--352, ACM
Abstract

The Claremont Report on Database Research
R Agrawal, A Ailamaki, PA Bernstein, EA Brewer, MJ Carey, S Chaudhuri, A Doan, D Florescu, MJ Franklin, H Garcia-Molina, J Gehrke, L Gruenwald, LM Haas, AY Halevy, JM Hellerstein, Y Ioannidis, HF Korth, D Kossmann, S Madden, R Magoulas, BC Ooi, T O'Reilly
SIGMOD Record 37(3), 9-19, ACM, 2008

Efficient online top-K retrieval with arbitrary similarity measures
Prasad M Deshpande, Krishna Kummamuru, others
Proceedings of the 11th international conference on Extending database technology: Advances in database technology, pp. 356--367, 2008

EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology

... of the 11th international conference on ..., 2008 - portal.acm.org, ACM
Abstract

Exploiting Context to Detect Sensitive Information in Call Center Conversations
Tanveer A Faruquie, Sumit Negi, Anup K Chalamala, L V Subramaniam
ACM Conference on Information and Knowledge Management (CIKM), pp. 1513--1514, 2008

Evolution of Rule-Based Information Extraction (Tutorial)
Rajasekar Krishnamurthy, Sriram Raghavan, and Huaiyu Zhu
Conference on Information and Knowledge Management (CIKM), 2008

Conditional functional dependencies for capturing data inconsistencies
W Fan, F Geerts, X Jia, A Kementsietsidis
ACM Transactions on Database Systems (TODS) 33(2), 6, ACM, 2008

Quasi-inverses of schema mappings
R Fagin, P G Kolaitis, L Popa, W C Tan
ACM Transactions on Database Systems (TODS) 33(2), 11, ACM, 2008
US Patent App. 11/970,057

Commutativity analysis for XML updates
Giorgio Ghelli, Kristoffer H Rose, Jerome Simeon
ACM Transactions on Database Systems 33(4), 29, ACM, 2008


2007

Increasing Buffer-Locality for Multiple Index Based Scans through Intelligent Placement and Index Scan Speed Control
C. Lang, B. Bhattacharjee, T. Malkemus, K. Wong
33rd International Conference on Very Large Databases (VLDB) 2007

Efficient Bulk Deletes for Multi Dimensional Clustered Tables in DB2
" B. Bhattacharjee, T. Malkemus, S. Lau, S. McKeough, J. Kirton, R. Von Boeschoten, J. P. Kennedy
33rd International Conference on Very Large Databases (VLDB) 2007

The Omni-family of all-purpose access methods: a simple and effective way to make similarity search more efficient
Caetano Traina, Roberto F Santos Filho, Agma JM Traina, Marcos R Vieira, Christos Faloutsos
The VLDB Journal 16(4), 483--505, Springer, 2007



XQuery streaming 'a la carte'
M Fernandez, P Michiels, J Simeon, M Stark, TU Darmstadt
ICDE, Istanbul, Turkey, Citeseer, 2007

Challenges and experience in prototyping a multi-modal stream analytic and monitoring application on System S
K L Wu, K W Hildrum, W Fan, P S Yu, C C Aggarwal, D A George, B Gedik, E Bouillet, X Gu, G Luo, others
Proceedings of the 33rd international conference on Very large data bases, pp. 1185--1196, VLDB Endowment, 2007

OLAP over uncertain and imprecise data
TS Jayram, R Ramakrishnan, S Vaithyanathan
The VLDB Journal The International Journal on Very Large …, 2007 - Springer

On the correctness criteria of fine-grained access control in relational databases
Q W T Y N Li, J Lobo, E Bertino, K Irwin, J W Byun
33rd International Conference on Very Large Data Bases (Vldb 2007), pp. 555

Processing forecasting queries
Songyun Duan, Shivanath Babu
VLDB '07: Proceedings of the 33rd international conference on Very large data bases, pp. 711--722, VLDB Endowment, 2007
Abstract

Cache-conscious frequent pattern mining on modern and emerging processors
Amol Ghoting, Gregory Buehrer, Srinivasan Parthasarathy, Daehyun Kim, Anthony Nguyen, Yen-Kuang Chen, Pradeep Dubey
The VLDB Journal16, 77--96, Springer-Verlag New York, Inc., 2007
Abstract

CellSort: high performance sorting on the cell processor
Buv{g}ra Gedik, Rajesh R Bordawekar, Philip S Yu
Proceedings of the 33rd international conference on Very large data bases, pp. 1286--1297, VLDB Endowment, 2007
Abstract

Executing stream joins on the cell processor
Buv{g}ra Gedik, Philip S Yu, Rajesh R Bordawekar
Proceedings of the 33rd international conference on Very large data bases, pp. 363--374, VLDB Endowment, 2007
Abstract

Distributed query evaluation with performance guarantees
G Cong, W Fan, A Kementsietsidis
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 509--520, ACM

BIwTL: a business information warehouse toolkit and language for warehousing simplification and automation
B He, R Wang, Y Chen, A Lelescu, J Rhodes
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 1052

Highly distributed XQuery with DXQ
M F Fernandez, T Jim, K Morton, N Onose, J Simeon
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 1161

How to barter bits for chronons: compression and bandwidth trade offs for database scans
A L Holloway, V Raman, G Swart, D J DeWitt
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 400

Lazy, adaptive rid-list intersection, and its application to index anding
V Raman, L Qiao, W Han, I Narang, Y L Chen, K H Yang, F L Ling
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 784

Towards keyword-driven analytical processing
Ping Wu, Yannis Sismanis, Berthold Reinwald
International Conference on Management of Data: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 617--628

On synopses for distinct-value estimation under multiset operations
Kevin Beyer, Peter J Haas, Berthold Reinwald, Yannis Sismanis, Rainer Gemulla
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 199--210

Making database systems usable
HV Jagadish, A Chapman, A Elkiss, M Jayapandian, Y Li, A Nandi, C Yu
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 24

DaNaLIX: a domain-adaptive natural language interface for querying XML
Yunyao Li, Ishan Chaudhuri, Huahai Yang, Satinder Singh, HV Jagadish
Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 1165--1168

Leveraging Data and Structure in Ontology Integration.
Octavian Udrea, Lise Getoor, and Rene J. Miller
Proceedings of the the ACM SIGMOD International Conference on the Management Of Data, ACM, 2007

Inverting schema mappings
R Fagin
ACM Transactions on Database Systems (TODS) 32(4), 25, ACM, 2007

an adaptive, multiway, windowed stream join with time correlation-aware CPU load shedding
G Bugra, W Kun-Lung, S G J Yu Philip
IEEE Transactions on Knowledge and Data Engineering 19(10), 1363--1380, 2007

Grubjoin: An adaptive, multi-way, windowed stream join with time correlation-aware cpu load shedding
B Gedik, K L Wu, P S Yu, L Liu
IEEE Transactions on Knowledge and Data Engineering 19(10), 1363--1380, Published by the IEEE Computer Society, 2007

Toward Exploratory Test-Instance-Centered Diagnosis in High-Dimensional Classification
CC Aggarwal
IEEE Transactions on Knowledge and Data Engineering 19(8), 1001--1015, 2007

Rank Aggregation for Automatic Schema Matching
Carmel Domshlak, Avigdor Gal, Haggai Roitman
IEEE Transactions on Knowledge and Data Engineering 19(4), [Schema Matching], 2007

Beauty and the Beast: The Theory and Practice of Information Integration
L Haas
Int'l Conf on Database Theory (ICDT), pp. 28-43, Springer Lecture Notes in Computer Science, 2007


2006

Debugging Schema Mappings with Routes
L. Chiticariu, W. Tan
VLDB, pp. 79--90, 2006

SPIDER: a Schema mapPIng DEbuggeR
B. Alexe, L. Chiticariu, W. Tan
VLDB (Demonstration), pp. 1179-1182, 2006

MONDRIAN: Annotating and querying databases through colors and blocks
F Geerts, A Kementsietsidis, D Milano
ICDE06: Proceedings of the 22nd International Conference on … - doi.ieeecomputersociety.org, 2006

MONDRIAN: Annotating and querying databases through colors and blocks
F Geerts, A Kementsietsidis, D Milano
Proceedings of ICDE conference, Atlanta, GA, USA, 2006 - homepages.inf.ed.ac.uk


On the path to efficient XML queries
A Balmin, K S Beyer, F Ozcan, M Nicola
VLDB 2006: Proceedings of the 32nd international conference on Very large data bases, pp. 1117-1128

Using partial evaluation in distributed query evaluation
P Buneman, G Cong, W Fan, A Kementsietsidis
Proceedings of the 32nd international conference on Very large data bases, pp. 211--222, VLDB Endowment, 2006

SMOQE: a system for providing secure access to XML
W Fan, F Geerts, X Jia, A Kementsietsidis
Proceedings of the 32nd international conference on Very large data bases, pp. 1227--1230, VLDB Endowment, 2006

Intelligent system monitoring on large clusters
J Sun, E Hoke, J D Strunk, G R Ganger, C Faloutsos
Proceedings of the 3rd workshop on Data management for sensor networks: in conjunction with VLDB 2006, pp. 47--52

Avatar semantic search: a database approach to information retrieval
Eser Kandogan, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan, Huaiyu Zhu
Proceedings of the 2006 ACM SIGMOD international conference on Management of data, pp. 790--792

Query reformulation with constraints
A Deutsch, L Popa, V Tannen
ACM SIGMOD Record 35(1), 73, ACM, 2006

Proactive identification of performance problems
Songyun Duan, Shivnath Babu
Proceedings of the 2006 ACM SIGMOD international conference on Management of Data (SIGMOD), demo, pp. 766--768, ACM
Abstract

Design, implementation, and evaluation of the linear road bnchmark on the stream processing core
N Jain, L Amini, H Andrade, R King, Y Park, P Selo, C Venkatramani
Proceedings of the 2006 ACM SIGMOD international conference on Management of data, pp. 431--442, ACM

Commutativity analysis in XML update languages
G Ghelli, K Rose, J Sim{'e}on
Database Theory--ICDT 2007, 374--388, Springer, 2006

Incremental processing of continual range queries over moving objects
K L Wu, S K Chen, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 1560--1575, Published by the IEEE Computer Society, 2006

A framework for on-demand classification of evolving data streams
CC Aggarwal, J Han, J Wang, PS Yu
IEEE Transactions on Knowledge and Data Engineering 18(5), 577--589, 2006

Processing moving queries over moving objects using motion-adaptive indexes
B Gedik, K L Wu, P S Yu, L Liu
IEEE Transactions on Knowledge and Data Engineering 18(5), 651--668, Published by the IEEE Computer Society, 2006


2005

DBNotes: a Post-it System for Relational Databases based on Provenance
L. Chiticariu, W. Tan, G. Vijayvargiya
SIGMOD (Demonstration), pp. 942--944, 2005

An Annotation Management System for Relational Databases
D. Bhagwat, L. Chiticariu, W. Tan, G. Vijayvargiya
VLDB Journal, Best papers of VLDB 2004 14(4), 373--396, 2005
A preliminary version of this paper appeared in the VLDB 2004 proceedings

An effective and efficient algorithm for high-dimensional outlier detection
C C Aggarwal, P S Yu
The VLDB journal 14(2), 211--221, Springer, 2005

Practical methods for constructing suffix trees
Y. Tian, S. Tata, R.A. Hankins, J.M. Patel
The VLDB Journal 14(3), 281--299, Springer, 2005

Cache-conscious frequent pattern mining on a modern processor
Amol Ghoting, Gregory Buehrer, Srinivasan Parthasarathy, Daehyun Kim, Anthony Nguyen, Yen-Kuang Chen, Pradeep Dubey
Proceedings of the 31st international conference on Very large data bases, pp. 577--588, VLDB Endowment, 2005
Abstract

Hubble: An Advanced Dynamic Folder Technology for XML
N L J H Hui, I H K Beyer
Proceedings, very large data bases, pp. 541, VLDB Endowment, 2005

Clio grows up: from research prototype to industrial tool
L M Haas, M A Hern{'a}ndez, H Ho, L Popa, M Roth
Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp. 805--810

Extending XQuery for analytics
K Beyer, D Chambrlin, LS Colby, F Ozcan, H Pirahesh, Y Xu
SIGMOD 2005: Proceedings of the 2005 ACM SIGMOD international conference on management of data, pp. 503-514

DB2/XML: designing for evolution
K Beyer, F Ozcan, S Saiprasad, B Van der Linden
SIGMOD 2005: Proceedings of the 2005 ACM SIGMOD international conference on management of data, pp. 948-952

System RX: One Part Relational, One Part XML
Kevin S. Beyer, Roberta Cochrane, Vanja Josifovski, Jim Kleewein, George Lapis, Guy M. Lohman, Robert Lyle, Fatma \"{O}zcan, Hamid Pirahesh, Normen Seemann, Tuong C. Truong, Bert Van der Linden, Brian Vickery, Chun Zhang
Proceedings of the ACM SIGMOD International Conference on Management of Data, Baltimore, Maryland, USA, June 14-16, 2005, pp. 347--358

Composing schema mappings: Second-order dependencies to the rescue
R Fagin, P G Kolaitis, L Popa, W C Tan
Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 994--1055, ACM, 2005

Automated statistics collection in action
P Haas, M Kandil, A Lerner, V Markl, I Popivanov, V Raman, D Zilio
Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp. 933--935

NaLIX: an interactive natural language interface for querying XML
Yunyao Li, Huahai Yang, HV Jagadish
Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp. 900--902

Multi-structural databases
R Fagin, R Guha, R Kumar, J Novak, D Sivakumar, A Tomkins
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 184--195, 2005

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1411743
Z Nie, S Kambhampati, U Nambiar
IEEE Transactions on Knowledge and Data Engineering, 2005 - ieeexplore.ieee.org

On change diagnosis in evolving data streams
C C Aggarwal
IEEE Transactions on Knowledge and Data Engineering 17(5), 587--600, Institute of Electrical and Electronics Engineers, Inc, 445 Hoes Ln, Piscataway, NJ, 08854-1331, USA,, 2005

Data exchange: semantics and query answering
R Fagin, P G Kolaitis, R J Miller, L Popa
Proceedings of ICDT, pp. 89--124, Elsevier, 2005

An adaptive, fast, and safe XML parser based on byte sequences memorization
Toshiro Takase, Hisashi Miyashita, Toyotaro Suzumura, Michiaki Tatsubori
Proceedings of the 14th international conference on World Wide Web, WWW 2005, Chiba, Japan, May 10-14, 2005, pp. 692--701


2004

Computing Clusters of Correlation Connected Objects
Christian Boehm, Karin Kailing, Peer Kroeger, and Arthur Zimek
Proc. of ACM SIGMOD International Conference on Management of Data (SIGMOD'04), pp. 455--466, 2004

An Annotation Management System for Relational Databases
D. Bhagwat, L. Chiticariu, W. Tan, G. Vijayvargiya
VLDB, pp. 900-911, 2004

A framework for using materialized XPath views in XML query processing
A Balmin, F Ozcan, K S Beyer, R J Cochrane, H Pirahesh
VLDB 2004: Proceedings of the Thirtieth international conference on Very large data bases, pp. 60-71

Preserving mapping consistency under schema changes
Y Velegrakis, R J Miller, L Popa
The VLDB Journal 13(3), 274--293, Springer, 2004

Comparing and aggregating rankings with ties
R Fagin, R Kumar, M Mahdian, D Sivakumar, E Vee
Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, 2004, pp. 58

Locally consistent transformations and query answering in data exchange
M Arenas, P Barcelo, R Fagin, L Libkin
Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, 2004, pp. 240

Robust Query Processing through Progressive Optimization
Volker Markl, Vijayshankar Raman, David E. Simmen, Guy M. Lohman, Hamid Pirahesh
Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, June 13-18, 2004, pp. 659--670

Constraint-based XML query rewriting for data integration
C Yu, L Popa
Proceedings of the 2004 ACM SIGMOD international conference on Management of data, pp. 371--382

Comparing and aggregating with ties
R Fagin, R Kumar, M Mahdian
ACM PODS, pp. 47--58, 2004

WALRUS: A similarity retrieval algorithm for image databases
A Natsev, R Rastogi, K Shim
IEEE Transactions on Knowledge and Data Engineering, 301--316, Published by the IEEE Computer Society, 2004

Self-tuning of the relationships among rules' components in active databases systems
D Botzer, O Etzion
IEEE Transactions on Knowledge and Data Engineering 16(3), 375--379, Institute of Electrical and Electronics Engineers, Inc, 445 Hoes Ln, Piscataway, NJ, 08854-1331, USA,, 2004


On using partial supervision for text categorization
C C Aggarwal, S C Gates, P S Yu
IEEE Transactions on Knowledge and Data Engineering 16(2), 245--255, Institute of Electrical and Electronics Engineers, Inc, 445 Hoes Ln, Piscataway, NJ, 08854-1331, USA,, 2004

A human-computer interactive method for projected clustering
C C Aggarwal
IEEE transactions on knowledge and data engineering, 448--460, IEEE Computer Society, 2004

Concise Papers\_
D Botzer
IEEE Transactions on Knowledge and Data Engineering 16(3), 375, 2004


2003

A Nanotechnology-based Approach to Data Storage
E. Eleftheriou, P. Bachtold, G. Cherubini, A. Dholakia, C. Hagleitner, T. Loeliger, A. Pantazi, H. Pozidis, T. Albrecht, G. Binnig, M. Despont, U. Drechsler, U. Durig, B. Gotsmann, D. Jubin, W. Haberle, M. Lantz, H. Rothuizen, R. Stutz, P. Vettiger, et al
VLDB, pp. 3-7, 2003

Complex queries over web repositories
Sriram Raghavan, Hector Garcia-Molina
Proceedings of the 29th international conference on Very large data bases - Volume 29, pp. 33--44, VLDB Endowment, 2003
Abstract

Mapping data in peer-to-peer systems: semantics and algorithmic issues
A Kementsietsidis, M Arenas, RJ Miller
Proceedings of the 2003 ACM SIGMOD international conference …, 2003 - portal.acm.org

Efficient similarity search and classification via rank aggregation
R Fagin, R Kumar, D Sivakumar
Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp. 301--312, Google Patents
US Patent App. 10/458,512

Data exchange: getting to the core
R Fagin, P G Kolaitis, L Popa
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 90--101, ACM, 2003

On nearest neighbor indexing of nonlinear trajectories
C C Aggarwal, D Agrawal
Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 252--259, 2003

A framework for diagnosing changes in evolving data streams
C C Aggarwal
Proceedings of the 2003 ACM SIGMOD international conference on Management of data, pp. 575--586

A framework for change diagnosis of data streams
C C Aggarwal
SIGMOD Conference, pp. 575--586, 2003

Efficient similarity search and classification via rank aggregation In Proceedings of the 2003 ACM SIGMOD international conference on Management of data (San Diego, California, USA, June 09--12, 2003)
R Fagin, R Kumar, D Sivakumar
ACM, New York, NY, USA, 301--312, 2003

Analysis of predictive spatio-temporal queries
Y Tao, J Sun, D Papadias
ACM Transactions on Database Systems (TODS) 28(4), 295--336, ACM, 2003

Searching the workplace web
R Fagin, R Kumar, K McCurley, J Novak, D Sivakumar, J Tomlin, D Williamson
Proceedings of WWW2003, Budapest, Hungary, 2003, pp. 375

Searching the Corporate Web
R Fagin, R Kumar, K McCurley, J Novak, D Sivakumar, J Tomlin, D Williamson
Proceedings of WWW2003, Budapest, Hungary, pp. 366--375

Searching the Corporate Web Proceedings of WWW2003
R Fagin, R Kumar, K McCurley, J Novak, D Sivakumar, J Tomlin, D Williamson
Budapest, Hungary, 366--375, 2003

Quality driven web services composition
Z Liangzhao, B Benatallah, M Dumas, J Kalagnanam, Q Z Sheng
WWW2003

Information Extraction from Biomedical Literature: Methodology, Evaluation and an Application
L V Subramaniam, Sougata Mukherjea, P kankar, Biplav Srivastava, V S Batra, Pasumarti V Kamesam
CIKM 2003, Proceedings of the twelfth international conference on Information and Knowledge Managemen

Conflict resolution using logic programming
J Chomicki, J Lobo, S Naqvi
IEEE Transactions on Knowledge and Data Engineering 15(1), 244--249, 2003

The Yin/Yang Web: A unified model for XML syntax and RDF semantics
P F Patel-Schneider, J Sim{'e}on
IEEE Transactions on Knowledge and Data Engineering, 797--812, IEEE Computer Society, 2003

On the use of conceptual reconstruction for mining massively incomplete data sets
S Parthasarathy, C C Aggarwal
IEEE Transactions on Knowledge and Data Engineering, 1512--1521, IEEE Computer Society, 2003

Optimizing index allocation for sequential data broadcasting in wireless mobile computing
M S Chen, K L Wu, P S Yu
IEEE Transactions on Knowledge and Data Engineering 15(1), 161--173, Published by the IEEE Computer Society, 2003


2002

Garlic: a new flavor of federated query processing for DB2
Vanja Josifovski, Peter Schwarz, Laura Haas, Eileen Lin
Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 524--532, ACM
Abstract


StatiX: making XML count
J Freire, J R Haritsa, M Ramanath, P Roy, J Sim{'e}on
Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 181--191

Continuously adaptive continuous queries over streams
S Madden, M Shah, J M Hellerstein, V Raman
Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 49--60

Partial results for online query processing
V Raman, J M Hellerstein
Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 275--286

Dwarf: Shrinking the petacube
Yannis Sismanis, Antonios Deligiannakis, Nick Roussopoulos, Yannis Kotidis
Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 464--475

Data Exchange: Semantics and Query Answering., ICDT 2003
R Fagin, PG Kolaitis, RJ Miller, L Popa
Lecture Notes in Computer Science2572, 207--224, 2002


The Social Contract Core
James H. Kaufman, Stefan Edlund, Daniel Alexander Ford, Calvin Powers
WWW, pp. 210-220, 2002

Finding localized associations in market basket data
C C Aggarwal, C Procopiuc, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 51--62, IEEE Computer Society, 2002

Fast algorithms for online generation of profile association rules
C C Aggarwal, Z Sun, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 1017--1028, IEEE Computer Society, 2002

Redefining clustering for high-dimensional applications
C C Aggarwal, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 210--225, IEEE Computer Society, 2002

Attribute Classification Using Feature Analysis
Felix Naumann, Ching
Tien Ho (Howard), Xuqing Tian, Laura Haas, Nimrod Megiddo - ICDE 2002 - 18th International Conference on Data Engineering, 2002.


2001

Supporting incremental join queries on ranked inputs
A Natsev, Y C Chang, J R Smith, C S Li, J S Vitter
Proceedings of VLDB, pp. 281--290, 2001

Clio: A semi-automatic tool for schema mapping
M A Hernandez, R J Miller, L M Haas
Proceedings of the 2001 ACM SIGMOD international conference on Management of data

Data-driven understanding and refinement of schema mappings
L L Yan, R J Miller, L M Haas, R Fagin
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, pp. 485--496, ACM

The Clio project: managing heterogeneity
R J Miller, M A Hernandez, L M Haas, L Yan, CT Howard Ho, R Fagin, L Popa
ACM Sigmod Record 30(1), 78--83, ACM, 2001

The clio project: Managing heterogeneity.
MA Hernandez, LM Haas, L Yan, CTH Ho, R Fagin, L &
SIGMOD Record 30(1), 78-83, ACM, 2001

The Clio project: managing heterogeneity. 2001
MA Hernandez, LM Haas, L Yan, CTH Ho, R Fagin, L &
SIGMOD Record 30(1), 78-83, ACM, 2001

On the design and quantification of privacy preserving data mining algorithms
D Agrawal, C C Aggarwal
Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 247--255, 2001

Outlier detection for high dimensional data
C C Aggarwal, P S Yu
ACM Sigmod Record 30(2), 37--46, ACM New York, NY, USA, 2001

On the effects of dimensionality reduction on high dimensional similarity search
C C Aggarwal
Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 256--266, 2001


Personal Automation: Combining Personal Information Management Systems and Rule Engines
Edlund, S.
WWW 2001

Tempus Fugit: A System for Making Semantic Connections.
Daniel Alexander Ford, Joann Ruvolo, Stefan Edlund, Jussi Myllymaki, James Kaufman, Jared Jackson, Martin Gerlach
CIKM, pp. 520-522, 2001

A new approach to online generation of association rules
C C Aggarwal, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 527--540, IEEE Computer Society, 2001

Mining associations with the collective strength approach
C C Aggarwal, P S Yu
IEEE Transactions on Knowledge and Data Engineering, 863--873, IEEE Computer Society, 2001


2000

Panei: Is Generic Metadata Management Feasible?
PA Bernstein, L Haas, M Jarke, E Rahm, G …
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE …, 2000 - sigmod.org

The onion technique: indexing for linear optimization queries
Yuan-Chi Chang, Lawrence Bergman, Vittorio Castelli, Chung-Sheng Li, Ming-Ling Lo, John R Smith
SIGMOD Rec. 29(2), 391--402, ACM, 2000

Finding generalized projected clusters in high dimensional spaces
C C Aggarwal, P S Yu
Proceedings of the 2000 ACM SIGMOD international conference on Management of data, pp. 70--81

A chase too far?
L Popa, A Deutsch, A Sahuguet, V Tannen
ACM SIGMOD Record 29(2), 284, ACM, 2000



1998

Materialized view selection for multidimensional datasets
Amit Shukla, Prasad Deshpande, Jeffrey F Naughton, others
VLDB, pp. 488--499, 1998


1996

Storage estimation for multidimensional aggregates in the presence of hierarchies
Amit Shukla, Prasad Deshpande, Jeffrey F Naughton, Karthikeyan Ramasamy
VLDB, pp. 522--531, 1996


Year Unknown

Predictions and challenges for database systems in the year 2000

... OF THE INTERNATIONAL CONFERENCE ON VERY ..., 1993 - vldb.org, 0

Information Integration and XML in IBM's DB2

Proceedings of the 28th International Conference on ..., 2002 - vldb.org

Data is Dead… Without What-If Models

Proceedings of the VLDB ..., 2011 - almaden.ibm.com