|
Edinburgh Research Archive >
Informatics, School of >
Informatics Publications >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/1842/4698
|
| Title: | Curated Databases |
| Authors: | Buneman, Peter Cheney, James Wang-Chiew Tan Stijn Vansummeren |
| Issue Date: | 28-Jan-2011 |
| Journal Title: | Proceedings of the Twenty-Seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2008 |
| Page Numbers: | 1-12 |
| Abstract: | Curated databases are databases that are populated and
updated with a great deal of human effort. Most reference
works that one traditionally found on the reference shelves
of libraries – dictionaries, encyclopedias, gazetteers etc. –
are now curated databases. Since it is now easy to publish
databases on the web, there has been an explosion in the
number of new curated databases used in scientific research.
The value of curated databases lies in the organization and
the quality of the data they contain. Like the paper reference works they have replaced, they usually represent the
efforts of a dedicated group of people to produce a definitive
description of some subject area.
Curated databases present a number of challenges for database research. The topics of annotation, provenance, and
citation are central, because curated databases are heavily
cross-referenced with, and include data from, other databases, and much of the work of a curator is annotating existing data. Evolution of structure is important because these
databases often evolve from semistructured representations,
and because they have to accommodate new scientific discoveries. Much of the work in these areas is in its infancy,
but it is beginning to provide suggest new research for both
theory and practice. We discuss some of this research and
emphasize the need to find appropriate models of the processes associated with curated databases. |
| URI: | http://users.soe.ucsc.edu/~wctan/papers/2008/curateddbs.pdf http://hdl.handle.net/1842/4698 |
| ISBN: | 978-1-60558-108-8 |
| Appears in Collections: | Informatics Publications
|
Items in ERA are protected by copyright, with all rights reserved, unless otherwise indicated.
|