Research Papers
Class 1
- P1 - MapReduce: Simplified Data Processing on Large Clusters
- P2 - Bigtable: A Distributed Storage System for Structured Data
- P3 - Pig Latin: A Not-So-Foreign Language for Data Processing
Class 2
- P4 - C-Store: A Column-oriented DBMS
- P5 - Column-stores vs. row-stores: how different are they really?
- P6 - An Empirical Evaluation of Columnar Storage Formats
Class 3
- P7 - Dynamo: Amazon’s Highly Available Key-value Store
- P8 - Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
- P9 - The Snowflake Elastic Data Warehouse
Class 4
- P10 - Resource Management in Aurora Serverless
- P11 - Milvus: A Purpose-Built Vector Data Management System
- P12 - Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples