Mining search and browse logs for web search: A survey
Huge amounts of search log data have been accumulated at Web search engines. Currently, a popular Web search engine may receive billions of queries and collect terabytes of records about user search behavior daily. Beside search log data, huge amounts of browse log data have also been collected through client-side browser plugins. Suchmassive amounts of search and browse log data provide great opportunities formining the wisdom of crowds and improvingWeb search. At the same time, designing effective and efficient methods to clean, process, and model log data also presents great challenges. In this survey,we focus on mining search and browse log data forWeb search.We start with an introduction to search and browse log data and an overview of frequently-used data summarizations in log mining. We then elaborate how log mining applications enhance the five major components of a search engine, namely, query understanding, document understanding, document ranking, user understanding, and monitoring and feedback. For each aspect, we survey the major tasks, fundamental principles, and state-of-the-art methods. © 2013 ACM.
Duke Scholars
Altmetric Attention Stats
Dimensions Citation Stats
Published In
DOI
EISSN
ISSN
Publication Date
Volume
Issue
Related Subject Headings
- 4611 Machine learning
- 4602 Artificial intelligence
- 0806 Information Systems
- 0801 Artificial Intelligence and Image Processing
Citation
Published In
DOI
EISSN
ISSN
Publication Date
Volume
Issue
Related Subject Headings
- 4611 Machine learning
- 4602 Artificial intelligence
- 0806 Information Systems
- 0801 Artificial Intelligence and Image Processing