Publications

(2023). Join Path Based Data Augmentation for Decision Trees . DBML'22 workshop.

(2023). Amalur: Data Integration Meets Machine Learning.

(2023). An Empirical Performance Comparison between Matrix Multiplication Join and Hash Join on GPUs. HardBD & Active'23 @ICDE.

(2023). Metadata Representations for Queryable ML Model Zoos. Workshop on Benchmarking Data for Data-Centric AI (DataPerf) @ICML 2022.

(2021). Data lake concept and systems: a survey. arXiv preprint arXiv:2106.09592.

(2021). Amalur: Next-generation Data Integration in Data Lakes. CIDR'22, Abstract.