Difference between revisions of "Assignment 2 - Data Exploration using SQL"

From VistrailsWiki
Jump to navigation Jump to search
Line 32: Line 32:
* as2.log -- containing the answers to you queries  
* as2.log -- containing the answers to you queries  


== Queries ==


* 1. How many subway stations are there in New York City?
* 1. How many subway stations are there in New York City?
Line 40: Line 41:
* 6. Which station had the largest increase in the number of full-fare tickets sold between Jan 18 and Feb 1?
* 6. Which station had the largest increase in the number of full-fare tickets sold between Jan 18 and Feb 1?
* 7. Which station had the largest decrease in the number of full-fare tickets sold between Jan 18 and Feb 1?
* 7. Which station had the largest decrease in the number of full-fare tickets sold between Jan 18 and Feb 1?
* 8. What was the average difference in the number of full-fare tickets sold between Jan 18 and Feb 1 across the stations of the Broadway line?
* 8. List the names of the stations whose decrease in the number of full-fare tickets sold between Jan 18 and Feb 1 is greater than 1000.
* 9. How many full-fare tickets were sold in NYC in the week of Jan 18?
* 9. What was the average difference in the number of full-fare tickets sold between Jan 18 and Feb 1 across the stations of the Broadway line?
* 10. How many full-fare tickets were sold in NYC in the week of Jan 18?

Revision as of 20:18, 24 February 2014

Assignment Description

In your first assignment, you explored MTA data about subway fares using a data exploration tool. Now, you will further explore this data set using SQL.

You will use MySQL. Please follow the instructions in the lab notes to install the system.

You are encouraged to use the Web as a resource to find more information about MySQL.

You can exchange ideas with your classmates, but the work you submit should be your own. Copying is not allowed.

Submission Instructions

You will submit to NYU Classes two text files:

  • as2.sql -- containing the queries you will write to answer the questions below. Use the following template for this file:

-- remember to end each query with a semi-colon

use test;

-- ********************Q1*******************

SELECT 'Q1' AS ' '; <insert your query Q1>;

-- ********************Q2*******************

SELECT 'Q2' AS ' '; <insert your query Q1>;

....

  • as2.log -- containing the answers to you queries

Queries

  • 1. How many subway stations are there in New York City?
  • 2. List all stations in the Broadway Line.
  • 3. List the latitude, longitude and number of full-fare tickets sold during the week of Jan 18 at all stations in the Broadway line in decreasing order of number of tickets.
  • 4. List the difference between the number of full-fare tickets sold at the different stations of the Broadway line between Jan 18 and Feb 1.
  • 5. List the difference between the number of 7-day and 30-day tickets sold at the different stations in the Broadway line between on Jan 18 and Feb 1.
  • 6. Which station had the largest increase in the number of full-fare tickets sold between Jan 18 and Feb 1?
  • 7. Which station had the largest decrease in the number of full-fare tickets sold between Jan 18 and Feb 1?
  • 8. List the names of the stations whose decrease in the number of full-fare tickets sold between Jan 18 and Feb 1 is greater than 1000.
  • 9. What was the average difference in the number of full-fare tickets sold between Jan 18 and Feb 1 across the stations of the Broadway line?
  • 10. How many full-fare tickets were sold in NYC in the week of Jan 18?