Difference between revisions of "CS6093/Selected Papers and Topics"

From VistrailsWiki
Jump to navigation Jump to search
Line 36: Line 36:
== NoSQL Databases ==  
== NoSQL Databases ==  


* Intro to Hadoop
* Intro to Hadoop (TBD)
* Languages
 
* Automatic optimization for MapReduce programs. Eaman Jahani, Michael J. Cafarella, Christopher Ré. .PVLDB, 2011.
jahani-vldb2011.pdf
 
* Parallel data processing with MapReduce: a survey. Lee et al, SIGMOD Record 2011
 
* Scalable SQL and NoSQL Data Stores Rick Cattel, SIGMOD Record 2011. (overview of current data stores)
cattel-sigrec2011.pdf
 
* [http://infolab.stanford.edu/~usriv/papers/pig-latin.pdf Pig latin: a not-so-foreign language for data processing].C Olston, B Reed, U Srivastava, R Kuma, A. Tomkins. SIGMOD 2008.
 
* [http://infolab.stanford.edu/~usriv/papers/pnuts.pdf PNUTS : Yahoo !’ s Hosted Data Serving Platform.] Brian F Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-arno Jacobsen, et al. in Proceedings of the VLDB Endowment (2008).
 
Additional suggested reading:
 
* [http://wwwlgis.informatik.uni-kl.de/cms/fileadmin/publications/2010/SQLvsNoSQLDatabases.pdf SQL databases v. NoSQL databases.] Michael Stonebraker, CACM 2010.
 
* [http://www.christof-strauch.de/nosqldbs.pdf NoSQL Databases.] Christof Strauch. 2010.


== Relational Data on the Web ==  
== Relational Data on the Web ==  

Revision as of 21:04, 7 February 2012

Provenance and Databases

  • Peter Buneman, Sanjeev Khanna, Wang Chiew Tan: Why and Where: A Characterization of Data Provenance. ICDT 2001: 316-330

http://db.cis.upenn.edu/DL/whywhere.pdf

  • A. Das Sarma, M. Theobald, and J. Widom. LIVE: A Lineage-Supported Versioned DBMS. Proceedings of the 22nd International Conference on Scientific and Statistical Database Management, Heidelberg, Germany, June 2010.

http://ilpubs.stanford.edu:8090/926/1/versioning-TR.pdf

  • Total Recall | Oracle Database

http://www.oracle.com/technetwork/database/focus-areas/storage/total-recall-whitepaper-171749.pdf

Additional Suggested Reading:

Graph Indexing

  • Answering pattern match queries in large graph databases via graph embedding

Lei Zou, Lei Chen, M. Tamer Özsu and Dongyan Zhao [graph-matching-vldbj2011]

  • Chenghui Ren, Eric Lo, Ben Kao, Xinjie Zhu, Reynold Cheng: On Querying Historical Evolving Graph Sequences. PVLDB 4(11): 726-737 (2011)

evolving-graphs-vldb11.pdf

Provenance Applications: Reproducible Publications

- papers from challenge

Web Schema Matching and Integration

NoSQL Databases

  • Intro to Hadoop (TBD)
  • Automatic optimization for MapReduce programs. Eaman Jahani, Michael J. Cafarella, Christopher Ré. .PVLDB, 2011.

jahani-vldb2011.pdf

  • Parallel data processing with MapReduce: a survey. Lee et al, SIGMOD Record 2011
  • Scalable SQL and NoSQL Data Stores Rick Cattel, SIGMOD Record 2011. (overview of current data stores)

cattel-sigrec2011.pdf

Additional suggested reading:

Relational Data on the Web

Deep Web

Using and Analyzing Social Networking Data