Research Papers
Class 1
- P1 - MapReduce: Simplified Data Processing on Large Clusters
- P2 - Bigtable: A Distributed Storage System for Structured Data
- P3 - C-Store: A Column-oriented DBMS
Class 2
- P4 - Pig Latin: A Not-So-Foreign Language for Data Processing
- P5 - Dynamo: Amazon’s Highly Available Key-value Store
- P6 - Hive – A Petabyte Scale Data Warehouse Using Hadoop
Class 3
- P7 - A Comparison of Approaches to Large-Scale Data Analysis
- P8 - Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
- P9 - The Snowflake Elastic Data Warehouse