Skip to main content

An analysis of structured data on the web

Publication ,  Journal Article
Dalvi, N; Machanavajjhala, A; Pang, B
Published in: Proceedings of the VLDB Endowment
January 1, 2012

In this paper, we analyze the nature and distribution of structured data on the Web. Web-scale information extraction, or the problem of creating structured tables using extraction from the entire web, is gathering lots of research interest. We perform a study to understand and quantify the value of Web-scale extraction, and how structured information is distributed amongst top aggregator websites and tail sites for various interesting domains. We believe this is the first study of its kind, and gives us new insights for information extraction over the Web. © 2012 VLDB Endowment.

Duke Scholars

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2012

Volume

5

Issue

7

Start / End Page

680 / 691

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Dalvi, N., Machanavajjhala, A., & Pang, B. (2012). An analysis of structured data on the web. Proceedings of the VLDB Endowment, 5(7), 680–691. https://doi.org/10.14778/2180912.2180920
Dalvi, N., A. Machanavajjhala, and B. Pang. “An analysis of structured data on the web.” Proceedings of the VLDB Endowment 5, no. 7 (January 1, 2012): 680–91. https://doi.org/10.14778/2180912.2180920.
Dalvi N, Machanavajjhala A, Pang B. An analysis of structured data on the web. Proceedings of the VLDB Endowment. 2012 Jan 1;5(7):680–91.
Dalvi, N., et al. “An analysis of structured data on the web.” Proceedings of the VLDB Endowment, vol. 5, no. 7, Jan. 2012, pp. 680–91. Scopus, doi:10.14778/2180912.2180920.
Dalvi N, Machanavajjhala A, Pang B. An analysis of structured data on the web. Proceedings of the VLDB Endowment. 2012 Jan 1;5(7):680–691.

Published In

Proceedings of the VLDB Endowment

DOI

EISSN

2150-8097

Publication Date

January 1, 2012

Volume

5

Issue

7

Start / End Page

680 / 691

Related Subject Headings

  • 4605 Data management and data science
  • 0807 Library and Information Studies
  • 0806 Information Systems
  • 0802 Computation Theory and Mathematics