Overview
I joined the Department of Computer Science at Duke University in Fall 2015.
Before joining Duke, I was a postdoctoral research associate in the Department of Computer Science and Engineering,University of Washington where I worked with Prof. Dan Suciu and the database group.
I graduated from the University of Pennsylvania with a Ph.D. in Computer and Information Science where I was advised by Prof. Susan Davidson and Prof. Sanjeev Khanna. During my Ph.D., I did two internships at IBM Research, Almaden,and received a Google PhD fellowship in Structured Data in 2011.
I obtained my master's and bachelor's degrees in Computer Science from Indian Institute of Technology, Kanpur and Jadavpur University respectively.Research Interests I am broadly interested in data and information management with a focus on foundational aspects of big data analysis. My research objective is to help users with heterogenous backgrounds and interests leverage the maximum benefit from the available data. While my ongoing work on explanations in databases directly aims to assist users get deep insights into data by providing rich explanations to their questions, my work in the areas of data and workow provenance, probabilistic databases, and crowd-sourcing probes into compelling, fundamental questions that need to be answered to enable end-to-end processing and analysis of unstructured, noisy, and unreliable data in today's world while preserving its entire context.
Current Appointments & Affiliations
Associate Professor of Computer Science
·
2022 - Present
Computer Science,
Trinity College of Arts & Sciences
Recent Publications
Differentially private explanations for aggregate query answers
Journal Article VLDB Journal · March 1, 2025 Differential privacy (DP) is the state-of-the-art and rigorous notion of privacy for answering aggregate database queries while preserving the privacy of sensitive information in the data. In today’s era of data analysis, however, it poses new challenges f ... Full text CiteThe Cost of Representation by Subset Repairs
Journal Article Proceedings of the VLDB Endowment · January 1, 2025 Datasets may include errors, and specifically violations of integrity constraints, for various reasons. Standard techniques for "minimal cost" database repairing resolve these violations by aiming for a minimum change in the data, and in the process, may s ... Full text CiteWhat Teaching Databases Taught Us about Researching Databases: Extended Talk Abstract
Conference ACM International Conference Proceeding Series · June 9, 2024 Declarative querying is a cornerstone of the success and longevity of database systems, yet it is challenging for novice learners accustomed to different coding paradigms. The transition is further hampered by a lack of query debugging tools compared to th ... Full text CiteEducation, Training & Certifications
University of Pennsylvania ·
2012
Ph.D.