DSL Research Areas
- Query Optimization:
Design of techniques for effectively reusing plans produced
by the query optimizer for earlier queries to generate the plans
for fresh queries.
Development of tools for characterizing the behavior of industrial-strength
database query optimizers.
Improving the speed and scaling of query optimizers.
Papers in VLDB 2002, SIGMOD 2003, VLDB 2004, VLDB 2005, ICDE 2007, VLDB 2007, VLDB 2008, VLDB 2010.
- Biological Databases:
Development of index structures and buffer management
policies for efficiently supporting genomic (DNA and protein) sequences.
Development of database systems for hosting bio-diversity information.
Papers in COMAD 2000, Information Systems Journal 2003, ICDE 2004, SIGMOD 2004, HiPC 2005.
In the Recent Past
- Multilingual Databases:
Characterization of the performance impacts of storing
data in multilingual format. Design of database architectures
that address multilingual performance drawbacks. Design of
multilingual SQL operators.
Papers in VLDB 2003, ICDE 2004, EDBT 2004, SIGMOD 2004, CIKM 2004, DASFAA 2005, ICDE 2006.
- Privacy-Preserving Mining:
Development of privacy-preserving techniques for
extraction of association rules on large boolean databases.
Papers in VLDB 2002, DASFAA 2004, ICDE 2005, DMKD 2009.
- XML Databases:
Design of translation mechanisms for storing XML documents
in relational engines. Design of XML compression techniques
that directly support querying in the compressed domain. Design
of XML indexing systems.
Papers in ICDE 2002, SIGMOD 2002, VLDB 2002, ICDE 2003, ICDE 2005, COMAD 2005b, DASFAA 2006, Information Systems Journal 2007.
- Web Databases:
Design of server-side cacheing mechanisms for dynamic Web content.
Papers in SIGMOD 2002, APWEB 2004, COMAD 2005b.
- Database Mining:
Design of efficient, incremental and concise data mining algorithms for boolean association rules.
Papers in Information Systems Journal 2000, SIGMOD 2000, ICDE 2002, PAKDD 2002, ICDE 2003, PAKDD 2003.
In the Previous Millenium
- Distributed Web Database Processing:
Designed a new Web querying system called Discover that processes
queries in a truly distributed manner, thereby deriving several
advantages, whereas most other systems proposed in the
literature have a centralized processing architecture.
- Database Compression:
Data compression has traditionally been used for reducing disk space
usage in database systems. A major problem, however, is that
processing queries on compressed databases becomes very slow. We have
developed novel compression algorithms that simultaneously provide
fast query processing and significant disk savings.
- Transaction Management in Distributed Databases:
Incorporating distributed data into the database framework incurs the
well-known additional complexities that are associated with transaction
concurrency control and recovery. We have designed
high-performance distributed concurrency control protocols and
distributed commit protocols.
- Object-oriented Database Solutions for Advanced Applications:
Modern applications place sophisticated demands on the functionality
expected from database systems. Solutions based on object-oriented
database systems are required to adequately support such applications.
The applications that we have built using OO technology include Flexible
Manufacturing, VLSI Design and Bio-Diversity.
- Resource Management Algorithms for Real-Time Database Systems:
In real-time database applications, transactions have completion
deadlines and the system performance is measured in terms of missed
deadlines, not response time. Therefore, new database resource
management algorithms have to be developed to efficiently support the
new metric. Our work includes concurrency control protocols, index access protocols, recovery
protocols and security protocols.
- Performance Guarantees in Overloaded Real-Time Systems:
Real-time systems may encounter emergency situations wherein the
processing required to handle the emergency exceeds the system
capacity, resulting in system overload and disastrous consequences. We
have designed real-time scheduling algorithms that provably
provide a guaranteed level of performance even under overload
- Scheduling in I/O Subsystems:
There is an increasing demand on computer systems to provide service
guarantees to end-users with regard to a variety of metrics.
Multimedia applications, for example, require meeting real-time and
synchronization constraints. Time-shared systems, on the other hand,
are usually expected to provide fair service. We have studied
this issue in detail for I/O subsystems and developed a new set of disk
scheduling algorithms that efficiently achieve the desired goals.