
Schedule

*This syllabus is subject to change at the discretion of the instructor.

Introduction

| Date | Topic | Content | Presenter |
|---|---|---|---|
| 08/22 | Introduction | Course Introduction and Logistics | Kexin |
| 08/24 | Introduction | Research Skills | Kexin |

Part I - Data Scientist

| Date | Topic | Content | Presenter |
|---|---|---|---|
| 08/29 | Overview | Part I topics overview; CI notebook | Kexin |
| 08/31 | Interactive SQL | BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data (slides) | Eric, Sahil |
| 09/05 | No Class (Labor Day) | | |
| 09/07 | Interactive SQL | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics (slides) | Gaurav, Sankalp, Hamsika |
| 09/12 | Interactive SQL | Experiences with Approximating Queries in Microsoft’s Production Big-Data Clusters (slides) | Ashmita, Aniruddha, Myna, Abhinav, Andrew |
| 09/14 | Interactive Viz | Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data (slides) | Hamsika, Jingfan |
| 09/19 | Interactive Viz | Hillview: A trillion-cell spreadsheet for big data (slides) | Akshay, Eric, Vishnu, Ashmita |
| 09/21 | Project Proposal | (slides) | |
| 09/26 | Interactive Viz | M4: A Visualization-Oriented Time Series Data Aggregation (slides) | Shubham, Bojun, Jingfan, Tanya |
| 09/28 | Data Science Tools | Benchmarking Spreadsheet Systems (slides) | Abhinav, Harshal, Qiandong, Ting, Cuong |
| 10/03 | Data Science Tools | Finding Related Tables in Data Lakes for Interactive Data Science (slides) | Qiandong, Shen En, Vishnu, Yanhao, Haotian |
| 10/05 | Data Science Tools | Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks (slides) | Bojun, Siddhi, Shubham, Shen En, Aniruddha, Jingfan, Ting |
| 10/10 | Data Science Tools | Towards Effective Foraging by Data Scientists to Find Past Analysis Choices (slides) | Myna, Sahil, Cangdi, Tanya, Siddhi |

Part II - Data Consumer

| Date | Topic | Content | Presenter |
|---|---|---|---|
| 10/12 | Overview | Research Skills Part II | Kexin |
| 10/17 | No Class (Fall Break) | | |
| 10/19 | Explanation | MacroBase: Prioritizing Attention in Fast Data (slides) | Haotian, Yiheng, Eric, Cuong |
| 10/24 | Explanation | Slice Finder: Automated Data Slicing for Model Validation (slides) | Andrew, Qiandong, Bojun, Shen En |
| 10/26 | Explanation | Domino: Discovering Systematic Errors with Cross-Modal Embeddings (slides) | Cuong, Jingfan, Sankalp, Tanya, Abhinav, Shubham |
| 10/31 | Project Update | (slides) | |
| 11/02 | Recommendation | SeeDB: efficient data-driven visualization recommendations to support visual analytics (slides) | Ting, Shen En, Harshal, Cangdi, Ashmita |
| 11/07 | Recommendation | Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows (slides) | Sahil, Gaurav, Bojun, Cuong |
| 11/09 | Recommendation | Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis (slides) | Tanya, Yanhao, Siddhi, Harshal, Cangdi, Akshay |
| 11/14 | Interfaces | Vega-lite: A grammar of interactive graphics (slides) | Yanhao, Yiheng, Aniruddha, Qiandong, Haotian |
| 11/16 | Interfaces | Expressive Time Series Querying with Hand-Drawn Scale-Free Sketches (slides) | Harshal, Cangdi, Haotian, Akshay, Siddhi |
| 11/21 | Interfaces | Falx: Synthesis-Powered Visualization Authoring (slides) | Vishnu, Sankalp, Yiheng |
| 11/23 | No Class (Thanksgiving) | | |

Project Presentations

| Date | Topic | Content | Presenter |
|---|---|---|---|
| 11/28 | Additional OH | | |
| 11/30 | Peer Review | (form) | |
| 12/05 | Final Project Presentation | (slides) | |
| 12/07 | Final Project Presentation | (slides) | |