Difference between revisions of "Course: Big Data Analysis"

From VistrailsWiki
Jump to navigation Jump to search
Line 5: Line 5:
== Week 1: Monday Sept. 10th - Course Overview ==
== Week 1: Monday Sept. 10th - Course Overview ==


* Course overview  (First day of classes!)
* [http://vgc.poly.edu/~juliana/courses/cs9223/Lectures/intro.pdf Course overview  and introduction to Big Data Analysis]
* Student survey
* [https://docs.google.com/spreadsheet/viewform?fromEmail=true&formkey=dFdHT3BST2l1TW9KeHYzYjBDaTU0V1E6MQ Student survey] -- to be filled out today!
* Introduction to Big Data


=== Readings ===


=== Required Readings ===
* [http://i.stanford.edu/~ullman/mmds/book.pdf Mining of Massive Datasets, Chapter 1]
* [http://lintool.github.com/MapReduceAlgorithms/MapReduce-book-final.pdf Data-Intensive Text Processing with MapReduce, Chapter1]
=== Additional References ===
* [http://dilbert.com/strips/comic/2012-07-29/ Dilbert's BigData]
* [http://dilbert.com/strips/comic/2012-07-29/ Dilbert's BigData]
* [http://www.nytimes.com/2012/08/12/business/how-big-data-became-so-big-unboxed.html?ref=stevelohr New York Time's "How BigData Became so Big"]
* [http://www.nytimes.com/2012/08/12/business/how-big-data-became-so-big-unboxed.html?ref=stevelohr New York Time's "How BigData Became so Big"]
Line 16: Line 20:
* [http://www.analytics-magazine.org/november-december-2010/54-the-analytics-journey.html The Analytics Journey]
* [http://www.analytics-magazine.org/november-december-2010/54-the-analytics-journey.html The Analytics Journey]
* [http://practicalanalytics.wordpress.com/2011/12/12/big-data-analytics-use-cases/ BigData Analytics Usecases]
* [http://practicalanalytics.wordpress.com/2011/12/12/big-data-analytics-use-cases/ BigData Analytics Usecases]
* [http://lintool.github.com/MapReduceAlgorithms/MapReduce-book-final.pdf Data-Intensive Text Processing with MapReduce, Chapter1]
* [http://cacm.acm.org/magazines/2010/1/55743-mapreduce-and-parallel-dbmss-friends-or-foes/fulltext PDMBS vs. MapReduce]
* [http://cacm.acm.org/magazines/2010/1/55743-mapreduce-and-parallel-dbmss-friends-or-foes/fulltext PDMBS vs. MapReduce]
* [http://database.cs.brown.edu/sigmod09/benchmarks-sigmod09.pdf Benchmark DBMS vs MapReduce (2009)]
* [http://database.cs.brown.edu/sigmod09/benchmarks-sigmod09.pdf Benchmark DBMS vs MapReduce (2009)]

Revision as of 22:52, 9 September 2012

This schedule is tentative and subject to change

Make sure to check my.poly.edu for course announcements

Week 1: Monday Sept. 10th - Course Overview


Required Readings

Additional References

Week 2: Monday Sept. 17th - Map-Reduce

Readings

Week 3: Monday Sept. 24th - Databases and Big Data

Readings

Week 4: Monday Oct. 1st - Statistics is easy - Invited Speaker: Dennis Shasha

Readings


Week 5: Monday Oct. 8st - Finding Similar Items

  • Overview of information integration

Readings

Week 6: Monday Oct. 15st - Invited Speaker: Torsten Suel

  • Reading: inverted index and crawling (Lin chapter 4)
  • Ask Torsten (tentative, ask him for reading material)

Readings

Week 7: Monday Oct. 22st - Invited Speakers: Claudio Silva and Lauro Lins

  • Introduction to Visualization; Data stewardship and provenance
  • Guest lecture by Claudio Silva and Lauro Lins

Readings

  • Hellerstein (ask Claudio for additional references)
  • ADD: provenance and reproducibility

Week 8: Monday Oct. 29th - Graph Analysis

  • Graph algorithms, link analysis, social networks

Readings

  • Data-Intensive Text Processing with MapReduce, Chapter 4


Week 9: Monday Nov. 5th - Frequent Itemsets

Reading

  • Mining of Massive Datasets, Chapter 6


Week 10: Monday Nov. 12th - Mining Data Streams =

Readings

  • Mining of Massive Datasets, Chapter 4


Week 11: Monday Nov. 19th - Clustering

Readings

  • Mining of Massive Datasets, Chapter 7

Week 12: Monday Nov. 26th - Recommendation Systems

Readings

  • Mining of Massive Datasets, Chapter 9

Week 13 Monday Dec. 3rd - EM algorithms for text processing

  • Data-Intensive Text Processing with MapReduce, Chapter 6

Week 14: Monday Dec. 10th - Project presentation

Further Readings