Difference between revisions of "Reading List"

From VistrailsWiki
Jump to navigation Jump to search
Line 157: Line 157:
=== Relational data on the Web ===
=== Relational data on the Web ===


Would like to present: <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>
Would like to critique: <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>
Pravin
* [http://fleixeiras.cs.utah.edu/researchTopics/images/e/e7/Webtables-vldb08.pdf WebTables: exploring the power of tables on the web. ] Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wang, Eugene Wu, Yang Zhang: PVLDB 1(1): 538-549 (2008)
* [http://fleixeiras.cs.utah.edu/researchTopics/images/e/e7/Webtables-vldb08.pdf WebTables: exploring the power of tables on the web. ] Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wang, Eugene Wu, Yang Zhang: PVLDB 1(1): 538-549 (2008)


* [http://fleixeiras.cs.utah.edu/researchTopics/images/0/0a/Relweb-webdb08.pdf Uncovering the Relational Web. ] Michael J. Cafarella, Alon Y. Halevy, Yang Zhang, Daisy Zhe Wang, Eugene Wu. WebDB 2008
* [http://fleixeiras.cs.utah.edu/researchTopics/images/0/0a/Relweb-webdb08.pdf Uncovering the Relational Web. ] Michael J. Cafarella, Alon Y. Halevy, Yang Zhang, Daisy Zhe Wang, Eugene Wu. WebDB 2008


Huong
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/dasu-sigmod2002 Mining database structure; or, how to build a data quality browser.] Tamraparni Dasu, Theodore Johnson, S. Muthukrishnan, Vladislav Shkapenyuk. SIGMOD 2002
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/dasu-sigmod2002 Mining database structure; or, how to build a data quality browser.] Tamraparni Dasu, Theodore Johnson, S. Muthukrishnan, Vladislav Shkapenyuk. SIGMOD 2002


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/andritsos-sigmod2004.pdf] Information-theoretic tools for mining database structure from large data sets. Periklis Andritsos, Renee J. Miller and Panayiotis Tsaparas. SIGMOD 2004
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/andritsos-sigmod2004.pdf] Information-theoretic tools for mining database structure from large data sets. Periklis Andritsos, Renee J. Miller and Panayiotis Tsaparas. SIGMOD 2004


Some More Papers
Additional Papers:
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/elmargamid-tkde2007.pdf Duplicate Record Detection: A Survey.] Ahmed K. Elmagarmid, Panagiotis G. Ipeirotis, Vassilios S. Verykios. IEEE TKDE, 2007
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/elmargamid-tkde2007.pdf Duplicate Record Detection: A Survey.] Ahmed K. Elmagarmid, Panagiotis G. Ipeirotis, Vassilios S. Verykios. IEEE TKDE, 2007
* [http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=655802&isnumber=14296 Efficient Discovery of Functional and Approximate Dependencies Using Partitions] Yka Huhtala, Juha Karkkainen, Pasi Porkka, and Hannu Toivonen. In Proc. IEEE Intl. conf. on Data Engineering, 1998.
* [http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=655802&isnumber=14296 Efficient Discovery of Functional and Approximate Dependencies Using Partitions] Yka Huhtala, Juha Karkkainen, Pasi Porkka, and Hannu Toivonen. In Proc. IEEE Intl. conf. on Data Engineering, 1998.
Line 183: Line 177:


=== Data integration on the fly (or almost...) ===
=== Data integration on the fly (or almost...) ===
Would like to present: <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>
Would like to critique:  <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>
Zhan:
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/franklin-sigmodrecord2005.pdf From databases to dataspaces: a new abstraction for information management.] Michael Franklin, Alon Halevy, David Maier. Sigmod Record, 2005
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/franklin-sigmodrecord2005.pdf From databases to dataspaces: a new abstraction for information management.] Michael Franklin, Alon Halevy, David Maier. Sigmod Record, 2005


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/dong-sigmod2007 Indexing dataspaces.] Xin Dong and Alon Halevy. SIGMOD 2007.
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/dong-sigmod2007 Indexing dataspaces.] Xin Dong and Alon Halevy. SIGMOD 2007.
Additional papers:


* [http://www.cs.utah.edu/~juliana/rtdb2008/References/jeffery-sigmod2008.pdf Pay-as-you-go user feedback for dataspace systems.] Shawn R. Jeffery, Michael J. Franklin, Alon Y. Halevy. SIGMOD Conference 2008: 847-860
* [http://www.cs.utah.edu/~juliana/rtdb2008/References/jeffery-sigmod2008.pdf Pay-as-you-go user feedback for dataspace systems.] Shawn R. Jeffery, Michael J. Franklin, Alon Y. Halevy. SIGMOD Conference 2008: 847-860
Line 206: Line 193:


=== Usable query interfaces for structured data ===
=== Usable query interfaces for structured data ===
Mark:


* [http://db.ucsd.edu/people/vagelis/publications/discover.pdf Discover: keyword search in relational databases.] Vagelis Hristidis, Yannis Papakonstantinou. VLDB 2002.
* [http://db.ucsd.edu/people/vagelis/publications/discover.pdf Discover: keyword search in relational databases.] Vagelis Hristidis, Yannis Papakonstantinou. VLDB 2002.


* [http://www.vldb2005.org/program/paper/wed/p505-kacholia.pdf Bidirectional Expansion For Keyword Search on Graph Databases.] Varun Kacholia, Shashank Pandit, Soumen Chakrabarti, S Sudarshan, Rushi Desai and Hrishikesh Karambelkar, VLDB 2005
* [http://www.vldb2005.org/program/paper/wed/p505-kacholia.pdf Bidirectional Expansion For Keyword Search on Graph Databases.] Varun Kacholia, Shashank Pandit, Soumen Chakrabarti, S Sudarshan, Rushi Desai and Hrishikesh Karambelkar, VLDB 2005
Komal:


* [http://www.cse.iitb.ac.in/~aru/Publications/BanksICDE2002.pdf Keyword Searching and Browsing in databases using BANKS.] Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, S. Sudarshan. ICDE 2002  
* [http://www.cse.iitb.ac.in/~aru/Publications/BanksICDE2002.pdf Keyword Searching and Browsing in databases using BANKS.] Gaurav Bhalotia, Arvind Hulgeri, Charuta Nakhe, Soumen Chakrabarti, S. Sudarshan. ICDE 2002  


* [http://www.cs.uic.edu/~fliu1/Sigmod06_Keyword_FangLiu_UIC.pdf Effective keyword search in relational databases.] Liu,, Fang and Yu,, Clement and Meng,, Weiyi and Chowdhury,, Abdur. SIGMOD 2006, pp 563--574.
* [http://www.cs.uic.edu/~fliu1/Sigmod06_Keyword_FangLiu_UIC.pdf Effective keyword search in relational databases.] Liu,, Fang and Yu,, Clement and Meng,, Weiyi and Chowdhury,, Abdur. SIGMOD 2006, pp 563--574.

Revision as of 14:24, 24 January 2012

Provenance

Overview

Provenance in Databases

  • Curated Databases W. Tan, P. Buneman, J. Cheney, S. Vansumerren. ACM Symposium on Principles of Database Systems (PODS), 2008.

Provenance Management: Storage, Indexing and Querying

  • Querying and Creating Visualizations by Analogy. Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire and Claudio T. Silva. IEEE Transactions on Visualization and Computer Graphics, 13(6), pp. 1560-1567, 2007. Best paper in IEEE Visualization 2007.

Provenance/Workflow/Graph Indexing


Additional papers:


Presentation:

Video lecture:

Provenance Mining

Provenance Applications: Publications

  • Reproducible Research Fomel, Sergey; Claerbout, Jon F. CiSE Volume: 11 Issue: 1 Date: Jan.-Feb. 2009 Page(s): 5-7 Digital Object Identifier 10.1109/MCSE.2009.14

Provenance: Security and Privacy


Storing Scientific Data

Web Schema Matching and Integration


Additional: papers

Additional papers on Dataspaces:

Presentation:

Querying Diverse Data

Relational data on the Web

  • [1] Information-theoretic tools for mining database structure from large data sets. Periklis Andritsos, Renee J. Miller and Panayiotis Tsaparas. SIGMOD 2004

Additional Papers:

Presentation:

Data integration on the fly (or almost...)

Presentation PPT:

Usable query interfaces for structured data

Presentation:

Snippet Generation and Ranking

The Deep Web

Would like to present: <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>

Would like to critique: <<add your name and rate your preference from 1-10, 1=least interested, 10 = very interested>>

  • Google's Deep Web crawl. Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy. PVLDB 1(2): 1241-1252 (2008) (*Ramesh will present this)

Information Extraction

Zhan will present (long long paper, but only need to have an overview):

Parasaran will present:

Mark will present:

Thanh will present:

--- --- ---