An analysis of structured data on the web
Publication
, Journal Article
Dalvi, N; Machanavajjhala, A; Pang, B
Published in: Proceedings of the VLDB Endowment
January 1, 2012
In this paper, we analyze the nature and distribution of structured data on the Web. Web-scale information extraction, or the problem of creating structured tables using extraction from the entire web, is gathering lots of research interest. We perform a study to understand and quantify the value of Web-scale extraction, and how structured information is distributed amongst top aggregator websites and tail sites for various interesting domains. We believe this is the first study of its kind, and gives us new insights for information extraction over the Web. © 2012 VLDB Endowment.
Duke Scholars
Published In
Proceedings of the VLDB Endowment
DOI
EISSN
2150-8097
Publication Date
January 1, 2012
Volume
5
Issue
7
Start / End Page
680 / 691
Related Subject Headings
- 4605 Data management and data science
- 0807 Library and Information Studies
- 0806 Information Systems
- 0802 Computation Theory and Mathematics
Citation
APA
Chicago
ICMJE
MLA
NLM
Dalvi, N., Machanavajjhala, A., & Pang, B. (2012). An analysis of structured data on the web. Proceedings of the VLDB Endowment, 5(7), 680–691. https://doi.org/10.14778/2180912.2180920
Dalvi, N., A. Machanavajjhala, and B. Pang. “An analysis of structured data on the web.” Proceedings of the VLDB Endowment 5, no. 7 (January 1, 2012): 680–91. https://doi.org/10.14778/2180912.2180920.
Dalvi N, Machanavajjhala A, Pang B. An analysis of structured data on the web. Proceedings of the VLDB Endowment. 2012 Jan 1;5(7):680–91.
Dalvi, N., et al. “An analysis of structured data on the web.” Proceedings of the VLDB Endowment, vol. 5, no. 7, Jan. 2012, pp. 680–91. Scopus, doi:10.14778/2180912.2180920.
Dalvi N, Machanavajjhala A, Pang B. An analysis of structured data on the web. Proceedings of the VLDB Endowment. 2012 Jan 1;5(7):680–691.
Published In
Proceedings of the VLDB Endowment
DOI
EISSN
2150-8097
Publication Date
January 1, 2012
Volume
5
Issue
7
Start / End Page
680 / 691
Related Subject Headings
- 4605 Data management and data science
- 0807 Library and Information Studies
- 0806 Information Systems
- 0802 Computation Theory and Mathematics