Kexin Rong

I am starting as an Assistant Professor in the School of Computer Science at Georgia Tech in Fall 2022. I am actively looking for master's and PhD students. Please reach out if you are interested in working with me!

I am currently a Postdoctoral Researcher in the VMware Research Group. I received my PhD in CS from Stanford, where I was fortunate to be co-advised by Peter Bailis and Philip Levis. My dissertation investigates systems and algorithms to improve the computational and human efficiency of large-scale data analytics, supporting applications such as earthquake detection, infrastructure monitoring, and OLAP queries on big-data clusters.

Previously, I received my bachelor's degree in Computer Science from Caltech (2015). I've also spent time at the DMX group at Microsoft Research in Redmond (2019).

Email  /  Google Scholar  /  CV  /  GitHub

Improving Computational and Human Efficiency in Large-Scale Data Analytics
Kexin Rong
PhD Thesis, 2021 (SIGMOD Doctoral Dissertation Award Honorable Mention)

Approximate Partition Selection for Big-Data Workloads using Summary Statistics
Kexin Rong, Yao Lu, Peter Bailis, Srikanth Kandula, Philip Levis
VLDB, 2020

A system that leverages summary statistics to select weighted, partition-level samples to approximate analytical queries on big-data clusters.

Rehashing Kernel Evaluation in High Dimensions
Paris Siminelakis*, Kexin Rong*, Peter Bailis, Moses Charikar, Philip Levis.
ICML, 2019 (Long talk)
[blog] [code] [supplementary]

LSH-based sketching and importance sampling algorithms to accelerate kernel evaluation in high dimensions.

Locality-Sensitive Hashing for Earthquake Detection: A Case Study of Scaling Data-Driven Science
Kexin Rong, Clara Yoon, Karianne Bergen, Hashem Elezabi, Peter Bailis, Philip Levis, Gregory Beroza.
VLDB, 2018
[blog] [video] [code] [seismology paper]

An unsupervised, end-to-end earthquake detection pipeline based on pairwise similarity search on seismic waveforms.

ASAP: Prioritizing Attention via Time Series Smoothing
Kexin Rong, Peter Bailis.
VLDB, 2017
[Datadog blog] [Timescale blog] [blog] [demo] [talk] [slides] [code]

An automatic smoothing algorithm for time series visualization that removes short-term fluctuations while preserving large-scale deviations.

MacroBase: Prioritizing Attention in Fast Data
Peter Bailis, Edward Gan, Samuel Madden, Deepak Narayanan, Kexin Rong, Sahaana Suri.
SIGMOD, 2017 (Invited to ACM TODS "Best of SIGMOD 2017" Special Issue.)
[website] [code] [journal paper] [vision paper] [demo paper]

A data analytics engine that highlights and aggregates important and unusual behavior in high-volume fast data streams.

Fall 2022: Human-in-the-loop Data Analytics (CS8803)
