Difference between revisions of "Reading List"

From VistrailsWiki
Jump to navigation Jump to search
Line 104: Line 104:
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/swamy-ieeessp2008 SELinks: End to end security for Web applications.] Hicks, Swamy, and Corcoran. [http://www.cs.umd.edu/projects/PL/selinks/ Project Web Site]
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/swamy-ieeessp2008 SELinks: End to end security for Web applications.] Hicks, Swamy, and Corcoran. [http://www.cs.umd.edu/projects/PL/selinks/ Project Web Site]


== Storing Scientific Data ==


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/sears-cidr2007 To BLOB or Not To BLOB: Large Object Storage in a Database or a Filesystem?] Russell Sears; Catharine Van Ingen; Jim Gray. CIDR 2007
== Data on the Web ==


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/thakar-cise2003.pdf The Sloan Digital Sky Survey Science Archive: Migrating a Multi-Terabyte Astronomical Archive from Object to Relational DBMS]. Aniruddha R. Thakar, Alexander S. Szalay, Peter Z. Kunszt, Jim Gray. CoRR cs.DB/0403020: (2004)
===Web Schema Matching and Integration ===
 
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/szlavecz-arvix2007.pdf Life Under Your Feet: An End-to-End Soil Ecology Sensor Network, Database, Web Server, and Analysis Service.] Katalin Szlavecz, Andreas Terzis, Stuart Ozer, Razvan Musaloiu-E, Joshua Cogan, Sam Small, Randal Burns, Jim Gray, Alex Szalay
 
== Web Schema Matching and Integration ==


* [http://portal.acm.org/citation.cfm?id=1007582 An interactive clustering-based approach to integrating source query interfaces on the deep Web] Wensheng Wu, Clement Yu, AnHai Doan, Weiyi Meng, SIGMOD 2004
* [http://portal.acm.org/citation.cfm?id=1007582 An interactive clustering-based approach to integrating source query interfaces on the deep Web] Wensheng Wu, Clement Yu, AnHai Doan, Weiyi Meng, SIGMOD 2004
Line 148: Line 143:
* [http://portal.acm.org/citation.cfm?id=1107499.1107502 From databases to dataspaces: a new abstraction for information management] by Michael Franklin, Alon Halevy, David Maier, SIGMOD 2005
* [http://portal.acm.org/citation.cfm?id=1107499.1107502 From databases to dataspaces: a new abstraction for information management] by Michael Franklin, Alon Halevy, David Maier, SIGMOD 2005
* [http://portal.acm.org/citation.cfm?id=1454159.1454217 A first tutorial on dataspaces] by Michael Franklin, Alon Halevy, David Maier, VLDB 2008
* [http://portal.acm.org/citation.cfm?id=1454159.1454217 A first tutorial on dataspaces] by Michael Franklin, Alon Halevy, David Maier, VLDB 2008
Presentation:
* [http://fleixeiras.cs.utah.edu/researchTopics/index.php/Image:WSM.pdf Thanh]
* Avishek
== Querying Diverse Data ==


=== Relational data on the Web ===
=== Relational data on the Web ===
Line 171: Line 160:
* [http://www.cs.toronto.edu/~periklis/pubs/edbt04.pdf LIMBO: Scalable Clustering of Categorical Data]Periklis Andritsos, Panayiotis Tsaparas, Ren´ee J. Miller, and Kenneth C. Sevcik. In EDBT 2004.
* [http://www.cs.toronto.edu/~periklis/pubs/edbt04.pdf LIMBO: Scalable Clustering of Categorical Data]Periklis Andritsos, Panayiotis Tsaparas, Ren´ee J. Miller, and Kenneth C. Sevcik. In EDBT 2004.


Presentation:
* [http://fleixeiras.cs.utah.edu/researchTopics/images/c/ca/Presentation1.pdf Pravin 1]
* [http://fleixeiras.cs.utah.edu/researchTopics/images/c/ca/Presentation2.pdf Pravin 2]
* [[Media:Mining database structure.pdf | Huong]]


=== Data integration on the fly (or almost...) ===
=== Data integration on the fly (or almost...) ===
Line 189: Line 174:
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/doan-cidr2009.pdf The Case for a Structured Approach to Managing Unstructured Data.]  A. Doan, J. F. Naughton, A. Baid, X. Chai, F. Chen, T. Chen, E. Chu, P. DeRose, B. Gao, C. Gokhale, J. Huang, W. Shen, B. Vuong. CIDR-09.
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/doan-cidr2009.pdf The Case for a Structured Approach to Managing Unstructured Data.]  A. Doan, J. F. Naughton, A. Baid, X. Chai, F. Chen, T. Chen, E. Chu, P. DeRose, B. Gao, C. Gokhale, J. Huang, W. Shen, B. Vuong. CIDR-09.


Presentation PPT:
* [[Media:indexing dataspaces (modified).ppt | Zhan]]


=== Usable query interfaces for structured data ===
=== Usable query interfaces for structured data ===
Line 202: Line 185:
* [http://www.cs.uic.edu/~fliu1/Sigmod06_Keyword_FangLiu_UIC.pdf Effective keyword search in relational databases.] Liu,, Fang and Yu,, Clement and Meng,, Weiyi and Chowdhury,, Abdur. SIGMOD 2006, pp 563--574.
* [http://www.cs.uic.edu/~fliu1/Sigmod06_Keyword_FangLiu_UIC.pdf Effective keyword search in relational databases.] Liu,, Fang and Yu,, Clement and Meng,, Weiyi and Chowdhury,, Abdur. SIGMOD 2006, pp 563--574.


Presentation:
* [[Media:Discover.ppt|Discover]]
* [[Media:Bidirectional_search.ppt|Bidirectional Search]]


=== Snippet Generation and Ranking ===
=== Snippet Generation and Ranking ===
Line 218: Line 196:
* [http://portal.acm.org/citation.cfm?id=1066220 Page quality: in search of an unbiased web ranking]. Junghoo Cho, Sourashis Roy, Robert E. Adams. SIGMOD, 2005
* [http://portal.acm.org/citation.cfm?id=1066220 Page quality: in search of an unbiased web ranking]. Junghoo Cho, Sourashis Roy, Robert E. Adams. SIGMOD, 2005


== The Deep Web ==
=== The Deep Web ===


* [http://www.cs.cornell.edu/~lucja/Publications/I03.pdf Google's Deep Web crawl.]  Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy. PVLDB 1(2): 1241-1252 (2008) (*Ramesh will present this)
* [http://www.cs.cornell.edu/~lucja/Publications/I03.pdf Google's Deep Web crawl.]  Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy. PVLDB 1(2): 1241-1252 (2008) (*Ramesh will present this)
Line 228: Line 206:
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/wu-icde2006.pdf Query Selection Techniques for Efficient Crawling of Structured Web Sources.] Ping Wu , Ji-Rong Wen , Huan Liu , Wei-Ying Ma. ICDE 2006
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/wu-icde2006.pdf Query Selection Techniques for Efficient Crawling of Structured Web Sources.] Ping Wu , Ji-Rong Wen , Huan Liu , Wei-Ying Ma. ICDE 2006


== Information Extraction ==
=== Information Extraction ===


* [http://www.it.iitb.ac.in/~sunita/papers/ieSurvey.pdf Information extraction] Sunita Sarawagi.  FnT Databases, 1(3), 2008.
* [http://www.it.iitb.ac.in/~sunita/papers/ieSurvey.pdf Information extraction] Sunita Sarawagi.  FnT Databases, 1(3), 2008.

Revision as of 14:48, 24 January 2012

Provenance

Overview

Provenance in Databases

  • Curated Databases W. Tan, P. Buneman, J. Cheney, S. Vansumerren. ACM Symposium on Principles of Database Systems (PODS), 2008.

Provenance Management: Storage, Indexing and Querying

  • Querying and Creating Visualizations by Analogy. Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire and Claudio T. Silva. IEEE Transactions on Visualization and Computer Graphics, 13(6), pp. 1560-1567, 2007. Best paper in IEEE Visualization 2007.

Provenance/Workflow/Graph Indexing


Additional papers:


Presentation:

Video lecture:

Provenance Mining

Provenance Applications: Publications

  • Reproducible Research Fomel, Sergey; Claerbout, Jon F. CiSE Volume: 11 Issue: 1 Date: Jan.-Feb. 2009 Page(s): 5-7 Digital Object Identifier 10.1109/MCSE.2009.14

Provenance: Security and Privacy



Data on the Web

Web Schema Matching and Integration


Additional: papers

Additional papers on Dataspaces:

Relational data on the Web

  • [1] Information-theoretic tools for mining database structure from large data sets. Periklis Andritsos, Renee J. Miller and Panayiotis Tsaparas. SIGMOD 2004

Additional Papers:


Data integration on the fly (or almost...)


Usable query interfaces for structured data


Snippet Generation and Ranking

The Deep Web

  • Google's Deep Web crawl. Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy. PVLDB 1(2): 1241-1252 (2008) (*Ramesh will present this)

Information Extraction