User contributions
Jump to navigation
Jump to search
- 16:00, 15 April 2014 diff hist −3 Assignment 3 - FAQ →How do I specify which subset of a key to be used by the partitioner? current
- 03:07, 15 April 2014 diff hist +48 Course: Big Data 2014
- 18:09, 14 April 2014 diff hist +941 Course: Big Data 2014
- 17:58, 14 April 2014 diff hist −1 Assignment 3 - FAQ →Computing relative frequencies
- 17:57, 14 April 2014 diff hist 0 Assignment 3 - FAQ →Computing relative frequencies
- 17:56, 14 April 2014 diff hist +833 Assignment 3 - FAQ →Frequently Asked Questions
- 15:28, 14 April 2014 diff hist +5 Assignment 3 - FAQ →Associative Collections in Python
- 15:25, 14 April 2014 diff hist +682 Assignment 3 - FAQ →Frequently Asked Questions
- 13:44, 14 April 2014 diff hist +104 Assignment 3 - FAQ →Frequently Asked Questions
- 01:11, 14 April 2014 diff hist +353 Course: Big Data 2014 →Machine Learning and Big Data (3 weeks)
- 01:03, 14 April 2014 diff hist −105 Course: Big Data 2014
- 00:59, 14 April 2014 diff hist +1 Course: Big Data 2014 →Week 3.1 -- Feb 17 Holiday
- 00:59, 14 April 2014 diff hist +7 Course: Big Data 2014 →Week 3.1 -- Feb 17
- 04:24, 11 April 2014 diff hist +1,303 Assignment 3 - FAQ
- 04:12, 11 April 2014 diff hist +435 Assignment 3 - FAQ →Frequently Asked Questions
- 04:09, 11 April 2014 diff hist +74 Assignment 3 - FAQ →How do I specify which subset of a key to be used by the partitioner on AWS?
- 03:37, 11 April 2014 diff hist 0 N File:Emr-partitioner.png current
- 03:29, 11 April 2014 diff hist +8 Assignment 3 - FAQ
- 03:28, 11 April 2014 diff hist +1,859 N Assignment 3 - FAQ Created page with '== Frequently Asked Questions == === How do I specify which subset of a key to be used by the partitioner? === * Hadoop Streaming provides an option for you to modify the parti…'
- 03:13, 11 April 2014 diff hist +387 Course: Big Data 2014
- 18:08, 6 April 2014 diff hist +6 Assignment 3 - MapReduce algorithm design →When and What to submit current
- 18:07, 6 April 2014 diff hist +1 Assignment 3 - MapReduce algorithm design →When and What to submit
- 18:07, 6 April 2014 diff hist +116 Assignment 3 - MapReduce algorithm design →When and What to submit
- 18:06, 6 April 2014 diff hist +29 Assignment 3 - MapReduce algorithm design →When and What to submit
- 18:05, 6 April 2014 diff hist +95 Assignment 3 - MapReduce algorithm design →Requirements
- 18:25, 24 March 2014 diff hist −5 Assignment 3 - MapReduce algorithm design →Task
- 18:24, 24 March 2014 diff hist +5 Assignment 3 - MapReduce algorithm design →Task
- 18:24, 24 March 2014 diff hist +11 Assignment 3 - MapReduce algorithm design →Task
- 18:23, 24 March 2014 diff hist 0 N File:Relative-frequency.png current
- 18:23, 24 March 2014 diff hist +11 Assignment 3 - MapReduce algorithm design →Task
- 18:22, 24 March 2014 diff hist +3,047 N Assignment 3 - MapReduce algorithm design Created page with '== Assignment 3: Computing Relative Frequencies == === Dataset description === For this assignment you will explore a set of 100,000 Wikipedia documents: * s3://cs9223/wikitext…'
- 18:10, 24 March 2014 diff hist +69 Course: Big Data 2014
- 05:16, 24 March 2014 diff hist +121 Course: Big Data 2014 →Week 9 -- Apr 7: Large Scale Machine Learning in the Real World
- 05:14, 24 March 2014 diff hist +619 Course: Big Data 2014
- 22:06, 10 March 2014 diff hist +286 Course: Big Data 2014 →Big Data Foundations and Infrastructure (4 weeks)
- 18:17, 3 March 2014 diff hist +135 Course: Big Data 2014 →Week 5 -- Mar 3: Cloud computing, Map Reduce and Hadoop
- 18:16, 3 March 2014 diff hist +110 Course: Big Data 2014 →Week 5 -- Mar 3: Cloud computing, Map Reduce and Hadoop
- 17:48, 3 March 2014 diff hist +320 Course: Big Data 2014
- 15:13, 27 February 2014 diff hist +25 Assignment 2 - Data Exploration using SQL →Assignment Description current
- 19:16, 25 February 2014 diff hist −12 Lab notes 02/06/14 →The Problem: Analyzing MTA Fare Data current
- 19:15, 25 February 2014 diff hist −1 Lab notes 02/06/14 →The Problem: Analyzing MTA Fare Data
- 19:15, 25 February 2014 diff hist +226 Lab notes 02/06/14 →The Problem: Analyzing MTA Fare Data
- 17:21, 25 February 2014 diff hist −1 Big Data Lab notes 02/19/14 →Installing and starting mySQL current
- 03:39, 25 February 2014 diff hist −11 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 03:39, 25 February 2014 diff hist +36 Assignment 2 - Data Exploration using SQL →Assignment Description
- 23:12, 24 February 2014 diff hist +24 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 23:11, 24 February 2014 diff hist +139 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 23:09, 24 February 2014 diff hist +50 Assignment 2 - Data Exploration using SQL →Assignment Description
- 23:08, 24 February 2014 diff hist +369 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:37, 24 February 2014 diff hist +97 Big Data Lab notes 02/19/14 →Installing and starting mySQL
- 20:35, 24 February 2014 diff hist −47 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:34, 24 February 2014 diff hist −2 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:33, 24 February 2014 diff hist +2 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:31, 24 February 2014 diff hist +4 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:30, 24 February 2014 diff hist −15 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:30, 24 February 2014 diff hist −35 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:29, 24 February 2014 diff hist +1 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:29, 24 February 2014 diff hist +35 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:28, 24 February 2014 diff hist +529 Assignment 2 - Data Exploration using SQL →Assignment Description
- 20:25, 24 February 2014 diff hist +41 Assignment 2 - Data Exploration using SQL →Queries
- 20:18, 24 February 2014 diff hist +153 Assignment 2 - Data Exploration using SQL
- 20:14, 24 February 2014 diff hist +2 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 20:13, 24 February 2014 diff hist +2 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 20:12, 24 February 2014 diff hist +273 Assignment 2 - Data Exploration using SQL →Submission Instructions
- 20:09, 24 February 2014 diff hist +1,799 N Assignment 2 - Data Exploration using SQL Created page with '== Assignment Description == In your first assignment, you explored MTA data about subway fares using a data exploration tool. Now, you will further explore this data set using S…'
- 18:36, 24 February 2014 diff hist +69 Course: Big Data 2014
- 18:28, 24 February 2014 diff hist +178 Course: Big Data 2014
- 13:53, 19 February 2014 diff hist +342 Big Data Lab notes 02/19/14 →SQL Cheat Sheet
- 13:50, 19 February 2014 diff hist +14 Big Data Lab notes 02/19/14 →Our running example
- 13:50, 19 February 2014 diff hist +14 Big Data Lab notes 02/19/14 →Our running example
- 13:49, 19 February 2014 diff hist −17 Big Data Lab notes 02/19/14 →Our running example
- 13:49, 19 February 2014 diff hist +3 Big Data Lab notes 02/19/14 →Our running example
- 13:48, 19 February 2014 diff hist −27 Big Data Lab notes 02/19/14 →Our running example
- 13:47, 19 February 2014 diff hist −15 Big Data Lab notes 02/19/14 →Our running example
- 13:47, 19 February 2014 diff hist +8 Big Data Lab notes 02/19/14
- 13:47, 19 February 2014 diff hist +4 Big Data Lab notes 02/19/14
- 13:46, 19 February 2014 diff hist +44 Big Data Lab notes 02/19/14
- 13:45, 19 February 2014 diff hist +656 Big Data Lab notes 02/19/14
- 13:38, 19 February 2014 diff hist +1,379 N Big Data Lab notes 02/19/14 Created page with 'Today, we will work on SQL queries. Before the class, please download mySQL from from http://dev.mysql.com/downloads There are many different versions, select MySQL Community Se…'
- 13:34, 19 February 2014 diff hist +35 Course: Big Data 2014
- 03:41, 19 February 2014 diff hist +137 Course: Big Data 2014
- 18:10, 13 February 2014 diff hist +34 Development →Feb 11, 2014
- 20:02, 10 February 2014 diff hist +153 Course: Big Data 2014 →Week 3 -- Feb 10: Overview: Relational Model and SQL
- 19:34, 10 February 2014 diff hist +298 Course: Big Data 2014
- 17:22, 10 February 2014 diff hist +37 Lab notes 02/06/14
- 06:47, 9 February 2014 diff hist +169 Course: Big Data 2014 →News
- 06:37, 9 February 2014 diff hist +124 Assignment 1 - Data Exploration →Assignment Description current
- 06:33, 9 February 2014 diff hist −96 Assignment 1 - Data Exploration
- 06:33, 9 February 2014 diff hist +14 Assignment 1 - Data Exploration →Submission Instructions
- 06:32, 9 February 2014 diff hist +19 Assignment 1 - Data Exploration
- 06:28, 9 February 2014 diff hist +989 Assignment 1 - Data Exploration
- 06:11, 9 February 2014 diff hist +724 N Assignment 1 - Data Exploration Created page with 'During our lab, we went explored MTA data about subway fares. For your assignment, you will further explore this data set and try to find at least 3 ''interesting'' facts/observa…'
- 05:57, 9 February 2014 diff hist +401 Lab notes 02/06/14 →Provenance and Reproducibility
- 05:52, 9 February 2014 diff hist +2,982 N Lab notes 02/06/14 Created page with '== Provenance and Reproducibility == Data exploration is inherently a trial-and-error process -- as well formulate and test hypothesis, we often need to follow many different li…'
- 05:52, 9 February 2014 diff hist +9 Course: Big Data 2014 →Week 2 -- Feb 3: Introduction to Databases
- 05:51, 9 February 2014 diff hist +50 Course: Big Data 2014
- 02:52, 6 February 2014 diff hist +17 Lab notes →Installation current
- 02:40, 6 February 2014 diff hist +137 Lab notes
- 02:37, 6 February 2014 diff hist +38 Lab notes →Installation
- 02:36, 6 February 2014 diff hist +5 Lab notes →Installation