Difference between revisions of "Reading List"

From VistrailsWiki
Jump to navigation Jump to search
Line 69: Line 69:
* [http://videolectures.net/mlg07_han_miasg/ Mining, Indexing, and Searching Graphs in Large Data Sets] by Jiawei Han Nature 2007
* [http://videolectures.net/mlg07_han_miasg/ Mining, Indexing, and Searching Graphs in Large Data Sets] by Jiawei Han Nature 2007


== Provenance Mining  ==
=== Provenance Mining  ===


* [http://www.cs.utah.edu/~juliana/pub/tvcg-recommendation2008.pdf VisComplete: Automating Suggestions for Visualization Pipelines.] David Koop, Carlos E. Scheidegger, Steven P. Callahan, Huy T. Vo, Juliana Freire and Claudio T. Silva. In IEEE Transactions on Visualization and Computer Graphics, 14(6), pp. 1691-1698, 2008.  
* [http://www.cs.utah.edu/~juliana/pub/tvcg-recommendation2008.pdf VisComplete: Automating Suggestions for Visualization Pipelines.] David Koop, Carlos E. Scheidegger, Steven P. Callahan, Huy T. Vo, Juliana Freire and Claudio T. Silva. In IEEE Transactions on Visualization and Computer Graphics, 14(6), pp. 1691-1698, 2008.  
Line 87: Line 87:
* [http://gking.harvard.edu/files/dvn.pdf An Introduction to the Dataverse Network as an Infrastructure for Data Sharing.] Gary King. Sociological Methods and Research. Vol. 32, No. 2 (November, 2007): Pp. 173--199,
* [http://gking.harvard.edu/files/dvn.pdf An Introduction to the Dataverse Network as an Infrastructure for Data Sharing.] Gary King. Sociological Methods and Research. Vol. 32, No. 2 (November, 2007): Pp. 173--199,


== Provenance: Security and Privacy ==
=== Provenance: Security and Privacy ===


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/braun-hotsec2008.pdf Securing provenance.]  Braun, A. Shinnar, and M. Seltzer.  In HotSec’08, 2008.  
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/braun-hotsec2008.pdf Securing provenance.]  Braun, A. Shinnar, and M. Seltzer.  In HotSec’08, 2008.  
Line 94: Line 94:


* [http://www.ragibhasan.com/publications/papers/storagess2007-rhasan.pdf Introducing Secure Provenance: Problems and Challenges], Ragib Hasan, Radu Sion, Marianne Winslett, in ACM StorageSS 2007.
* [http://www.ragibhasan.com/publications/papers/storagess2007-rhasan.pdf Introducing Secure Provenance: Problems and Challenges], Ragib Hasan, Radu Sion, Marianne Winslett, in ACM StorageSS 2007.


* [http://www.ragibhasan.com/research/provenance.html Secure Provenance Project at UIUC]
* [http://www.ragibhasan.com/research/provenance.html Secure Provenance Project at UIUC]
Line 143: Line 142:
* [http://portal.acm.org/citation.cfm?id=1107499.1107502 From databases to dataspaces: a new abstraction for information management] by Michael Franklin, Alon Halevy, David Maier, SIGMOD 2005
* [http://portal.acm.org/citation.cfm?id=1107499.1107502 From databases to dataspaces: a new abstraction for information management] by Michael Franklin, Alon Halevy, David Maier, SIGMOD 2005
* [http://portal.acm.org/citation.cfm?id=1454159.1454217 A first tutorial on dataspaces] by Michael Franklin, Alon Halevy, David Maier, VLDB 2008
* [http://portal.acm.org/citation.cfm?id=1454159.1454217 A first tutorial on dataspaces] by Michael Franklin, Alon Halevy, David Maier, VLDB 2008
=== NoSQL Databases ===
* [http://wwwlgis.informatik.uni-kl.de/cms/fileadmin/publications/2010/SQLvsNoSQLDatabases.pdf SQL databases v. NoSQL databases.] Michael Stonebraker, CACM 2010.
* [http://www.christof-strauch.de/nosqldbs.pdf NoSQL Databases.] Christof Strauch. 2010.
* [http://infolab.stanford.edu/~usriv/papers/pig-latin.pdf Pig latin: a not-so-foreign language for data processing].C Olston, B Reed, U Srivastava, R Kuma, A. Tomkins. SIGMOD 2008.
* [http://infolab.stanford.edu/~usriv/papers/pnuts.pdf PNUTS : Yahoo !’ s Hosted Data Serving Platform.] Brian F Cooper, Raghu Ramakrishnan, Utkarsh Srivastava, Adam Silberstein, Philip Bohannon, Hans-arno Jacobsen, et al. in Proceedings of the VLDB Endowment (2008).


=== Relational data on the Web ===
=== Relational data on the Web ===

Revision as of 17:50, 31 January 2012

Provenance

Overview

Provenance in Databases

  • Curated Databases W. Tan, P. Buneman, J. Cheney, S. Vansumerren. ACM Symposium on Principles of Database Systems (PODS), 2008.

Provenance Management: Storage, Indexing and Querying

  • Querying and Creating Visualizations by Analogy. Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire and Claudio T. Silva. IEEE Transactions on Visualization and Computer Graphics, 13(6), pp. 1560-1567, 2007. Best paper in IEEE Visualization 2007.

Provenance/Workflow/Graph Indexing


Additional papers:


Presentation:

Video lecture:

Provenance Mining

Provenance Applications: Publications

  • Reproducible Research Fomel, Sergey; Claerbout, Jon F. CiSE Volume: 11 Issue: 1 Date: Jan.-Feb. 2009 Page(s): 5-7 Digital Object Identifier 10.1109/MCSE.2009.14

Provenance: Security and Privacy


Data on the Web

Web Schema Matching and Integration


Additional: papers

Additional papers on Dataspaces:


NoSQL Databases

Relational data on the Web

  • [1] Information-theoretic tools for mining database structure from large data sets. Periklis Andritsos, Renee J. Miller and Panayiotis Tsaparas. SIGMOD 2004

Additional Papers:


Data integration on the fly (or almost...)


Usable query interfaces for structured data


Snippet Generation and Ranking

The Deep Web

  • Google's Deep Web crawl. Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy. PVLDB 1(2): 1241-1252 (2008) (*Ramesh will present this)

Information Extraction