Schedule
*This syllabus is subject to change at the discretion of the instructor.
Introduction
Part I - Data Scientist
Date | Topic | Content | Presentor |
---|
08/29 | Overview | Part I topics overview CI notebook | Kexin |
08/31 | Interactive SQL | BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data (slides) | Eric, Sahil |
09/05 | No Class (Labor Day) | | |
09/07 | Interactive SQL | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics (slides) | Gaurav, Sankalp, Hamsika |
09/12 | Interactive SQL | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters (slides) | Ashmita, Aniruddha, Myna, Abhinav, Andrew |
09/14 | Interactive Viz | Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data (slides) | Hamsika, Jingfan |
09/19 | Interactive Viz | Hillview: A trillion-cell spreadsheet for big data (slides) | Akshay, Eric, Vishnu, Ashmita |
09/21 | Project Proposal | (slides) | |
09/26 | Interactive Viz | M4: A Visualization-Oriented Time Series Data Aggregation (slides) | Shubham, Bojun, Jingfan, Tanya |
09/28 | Data Science Tools | Benchmarking Spreadsheet Systems (slides) | Abhinav, Harshal, Qiandong, Ting, Cuong |
10/03 | Data Science Tools | Finding Related Tables in Data Lakes for Interactive Data Science (slides) | Qiandong, Shen En, Vishnu, Yanhao, Haotian |
10/05 | Data Science Tools | Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks (slides) | Bojun, Siddhi, Shubham, Shen En, Aniruddha, Jingfan, Ting |
10/10 | Data Science Tools | Towards Effective Foraging by Data Scientists to Find Past Analysis Choices (slides) | Myna, Sahil, Cangdi, Tanya, Siddhi |
Part II - Data Consumer
Date | Topic | Content | Presentor |
---|
10/12 | Overview | Research Skills Part II | Kexin |
10/17 | No Class (Fall Break) | | |
10/19 | Explanation | MacroBase: Prioritizing Attention in Fast Data (slides) | Haotian, Yiheng, Eric, Cuong |
10/24 | Explanation | Slice Finder: Automated Data Slicing for Model Validation (slides) | Andrew, Qiandong, Bojun, Shen En |
10/26 | Explanation | Domino: Discovering Systematic Errors with Cross-Modal Embeddings (slides) | Cuong, Jingfan, Sankalp, Tanya, Abhinav, Shubham |
10/31 | Project Update | (slides) | |
11/02 | Recommendation | SeeDB: efficient data-driven visualization recommendations to support visual analytics (slides) | Ting, Shen En, Harshal, Cangdi, Ashmita |
11/07 | Recommendation | Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows (slides) | Sahil, Gaurav, Bojun, Cuong |
11/09 | Recommendation | Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis (slides) | Tanya, Yanhao, Siddhi, Harshal, Cangdi, Akshay |
11/14 | Interfaces | Vega-lite: A grammar of interactive graphics (slides) | Yanhao, Yiheng, Aniruddha, Qiandong, Haotian |
11/16 | Interfaces | Expressive Time Series Querying with Hand-Drawn Scale-Free Sketches (slides) | Harshal, Cangdi, Haotian, Akshay, Siddhi |
11/21 | Interfaces | Falx: Synthesis-Powered Visualization Authoring (slides) | Vishnu, Sankalp, Yiheng |
11/23 | No Class (Thanksgiving) | | |
Project Presentations
Date | Topic | Content | Presentor |
---|
11/28 | Additional OH | | |
11/30 | Peer Review (form) | | |
12/05 | Final Project Presentation | (slides) | |
12/07 | Final Project Presentation | (slides) | |