Difference between revisions of "Course: Big Data 2014"

From VistrailsWiki
Jump to navigation Jump to search
Line 17: Line 17:
* Reading: Chapter 1 of Mining of Massive Data Sets (version 1.1)
* Reading: Chapter 1 of Mining of Massive Data Sets (version 1.1)
* Course survey: https://docs.google.com/spreadsheet/embeddedform?formkey=dDRoTVcyMnRQUXhFUjl0cFFuTEVER1E6MA
* Course survey: https://docs.google.com/spreadsheet/embeddedform?formkey=dDRoTVcyMnRQUXhFUjl0cFFuTEVER1E6MA
* Other useful reading:
** [http://philip.greenspun.com/sql/introduction.html Greenspun's SQL for Web Nerds Intro]
** [http://philip.greenspun.com/sql/data-modeling.html SQL/Nerds Modeling (parts)]


== Week 2 -- Feb 3: Introduction to Databases ==
== Week 2 -- Feb 3: Introduction to Databases ==

Revision as of 23:27, 2 February 2014

DS-GA 1004/CSCI-GA 2568 Big Data: Tentative Schedule -- subject to change

  • Lecture: Mondays, 7:10pm-9:00pm at Silver, room 207
  • Lab: Thursdays, 7:10pm-8:00pm at CIWW, room 109


Background (4 weeks)

Week 1 -- Jan 27: Course Overview; the evolution of Data Management

Week 2 -- Feb 3: Introduction to Databases

Week 3 -- Feb 10: Overview: Relational Model and SQL

Week 4 -- Feb 24: Overview: Advanced SQL and Query Optimization

Big Data Foundations and Infrastructure (4 weeks)

Week 5 -- Mar 3: Cloud computing, Map Reduce and Hadoop

Week 6 -- Mar 10: Data Management for Big Data

Week 7 -- Mar 24: No-SQL and NewSQL Systems

Week 8 -- Mar 31: Query Processing on Mapreduce and High-level Languages

Big Data Algorithms and Techniques (6 weeks)

Week 9 -- Apr 7: Map Reduce Algorithm Design

Week 10 -- Apr 14: Finding similar items and information integration

Week 11 -- Apr 21: Graph Analysis

Week 12 -- Apr 28: Frequent Itemset Mining

Week 13 -- May 5: Interactive Analysis and Visualization of Big Data

Week 14 -- May 12: Machine Learning for Big Data

Week 15 -- May 19: Final Exam