Schedule
*This syllabus is subject to change at the discretion of the instructor.
Introduction
Part I - Data Scientist
| Date | Topic | Content | Presentor |
|---|
| 08/29 | Overview | Part I topics overview CI notebook | Kexin |
| 08/31 | Interactive SQL | BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data (slides) | Eric, Sahil |
| 09/05 | No Class (Labor Day) | | |
| 09/07 | Interactive SQL | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics (slides) | Gaurav, Sankalp, Hamsika |
| 09/12 | Interactive SQL | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters (slides) | Ashmita, Aniruddha, Myna, Abhinav, Andrew |
| 09/14 | Interactive Viz | Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data (slides) | Hamsika, Jingfan |
| 09/19 | Interactive Viz | Hillview: A trillion-cell spreadsheet for big data (slides) | Akshay, Eric, Vishnu, Ashmita |
| 09/21 | Project Proposal | (slides) | |
| 09/26 | Interactive Viz | M4: A Visualization-Oriented Time Series Data Aggregation (slides) | Shubham, Bojun, Jingfan, Tanya |
| 09/28 | Data Science Tools | Benchmarking Spreadsheet Systems (slides) | Abhinav, Harshal, Qiandong, Ting, Cuong |
| 10/03 | Data Science Tools | Finding Related Tables in Data Lakes for Interactive Data Science (slides) | Qiandong, Shen En, Vishnu, Yanhao, Haotian |
| 10/05 | Data Science Tools | Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks (slides) | Bojun, Siddhi, Shubham, Shen En, Aniruddha, Jingfan, Ting |
| 10/10 | Data Science Tools | Towards Effective Foraging by Data Scientists to Find Past Analysis Choices (slides) | Myna, Sahil, Cangdi, Tanya, Siddhi |
Part II - Data Consumer
| Date | Topic | Content | Presentor |
|---|
| 10/12 | Overview | Research Skills Part II | Kexin |
| 10/17 | No Class (Fall Break) | | |
| 10/19 | Explanation | MacroBase: Prioritizing Attention in Fast Data (slides) | Haotian, Yiheng, Eric, Cuong |
| 10/24 | Explanation | Slice Finder: Automated Data Slicing for Model Validation (slides) | Andrew, Qiandong, Bojun, Shen En |
| 10/26 | Explanation | Domino: Discovering Systematic Errors with Cross-Modal Embeddings (slides) | Cuong, Jingfan, Sankalp, Tanya, Abhinav, Shubham |
| 10/31 | Project Update | (slides) | |
| 11/02 | Recommendation | SeeDB: efficient data-driven visualization recommendations to support visual analytics (slides) | Ting, Shen En, Harshal, Cangdi, Ashmita |
| 11/07 | Recommendation | Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows (slides) | Sahil, Gaurav, Bojun, Cuong |
| 11/09 | Recommendation | Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis (slides) | Tanya, Yanhao, Siddhi, Harshal, Cangdi, Akshay |
| 11/14 | Interfaces | Vega-lite: A grammar of interactive graphics (slides) | Yanhao, Yiheng, Aniruddha, Qiandong, Haotian |
| 11/16 | Interfaces | Expressive Time Series Querying with Hand-Drawn Scale-Free Sketches (slides) | Harshal, Cangdi, Haotian, Akshay, Siddhi |
| 11/21 | Interfaces | Falx: Synthesis-Powered Visualization Authoring (slides) | Vishnu, Sankalp, Yiheng |
| 11/23 | No Class (Thanksgiving) | | |
Project Presentations
| Date | Topic | Content | Presentor |
|---|
| 11/28 | Additional OH | | |
| 11/30 | Peer Review (form) | | |
| 12/05 | Final Project Presentation | (slides) | |
| 12/07 | Final Project Presentation | (slides) | |