TY - CHAP
T1 - Search engine-crawler symbiosis
T2 - Adapting to community interests
AU - Pant, Gautam
AU - Bradshaw, Shannon
AU - Menczer, Filippo
PY - 2003
Y1 - 2003
N2 - Web crawlers have been used for nearly a decade as a search engine component to create and update large collections of documents. Typically the crawler and the rest of the search engine are not closely integrated. If the purpose of a search engine is to have as large a collection as possible to serve the general Web community, a close integration may not be necessary. However, if the search engine caters to a specific community with shared focused interests, it can take advantage of such an integration. In this paper we investigate a tightly coupled system in which the crawler and the search engine engage in a symbiotic relationship. The crawler feeds the search engine and the search engine in turn helps the crawler to better its performance. We show that the symbiosis can help the system learn about a community's interests and serve such a community with better focus.
AB - Web crawlers have been used for nearly a decade as a search engine component to create and update large collections of documents. Typically the crawler and the rest of the search engine are not closely integrated. If the purpose of a search engine is to have as large a collection as possible to serve the general Web community, a close integration may not be necessary. However, if the search engine caters to a specific community with shared focused interests, it can take advantage of such an integration. In this paper we investigate a tightly coupled system in which the crawler and the search engine engage in a symbiotic relationship. The crawler feeds the search engine and the search engine in turn helps the crawler to better its performance. We show that the symbiosis can help the system learn about a community's interests and serve such a community with better focus.
UR - http://www.scopus.com/inward/record.url?scp=35048813582&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35048813582&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-45175-4_21
DO - 10.1007/978-3-540-45175-4_21
M3 - Chapter
AN - SCOPUS:35048813582
SN - 354040726X
SN - 9783540407263
T3 - Lecture Notes in Computer Science
SP - 221
EP - 232
BT - Research and Advanced Technology for Digital Libraries
A2 - Koch, Traugott
A2 - Sølvberg, Ingeborg Torvik
PB - Springer
ER -