MySpiders: Evolve your own intelligent web crawlers

Gautam Pant, Filippo Menczer

Research output: Contribution to journal › Article › peer-review

Abstract

The dynamic nature of the World Wide Web makes it a challenge to find information that is both relevant and recent. Intelligent agents can complement the power of search engines to meet this challenge. We present a Web tool called MySpiders, which implements an evolutionary algorithm managing a population of adaptive crawlers who browse the Web autonomously. Each agent acts as an intelligent client on behalf of the user, driven by a user query and by textual and linkage clues in the crawled pages. Agents autonomously decide which links to follow, which clues to internalize, when to spawn offspring to focus the search near a relevant source, and when to starve. The tool is available to the public as a threaded Java applet. We discuss the development and deployment of such a system.
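The agent life cycle described in the abstract (follow a link, gain or lose energy, spawn offspring near relevant pages, starve otherwise) can be illustrated with a minimal sketch. This is not the MySpiders implementation: the in-memory `TOY_WEB`, the term-overlap `relevance` function, the `crawl` function, and all numeric parameters (`cost`, `threshold`, `start_energy`, `max_agents`) are assumptions chosen for illustration. The sketch also omits the learning step by which agents internalize clues.

```python
from dataclasses import dataclass

# Hypothetical in-memory "web": page id -> (page text, outgoing links).
# Each outgoing link is a pair (anchor text, target page id).
TOY_WEB = {
    "home": ("welcome page", [("spider research", "lab"), ("cooking recipes", "food")]),
    "lab": ("web crawlers and spider agents", [("adaptive crawlers", "paper"), ("home", "home")]),
    "paper": ("adaptive web crawlers evolve", [("lab", "lab")]),
    "food": ("pasta recipes", [("home", "home")]),
}


def relevance(text, query):
    """Fraction of query terms found in the text (a crude stand-in for a real measure)."""
    words = set(text.lower().split())
    terms = query.lower().split()
    return sum(term in words for term in terms) / len(terms)


@dataclass
class Agent:
    page: str      # page the agent currently sits on
    energy: float  # the agent starves when this drops to zero or below


def crawl(query, seed, steps=10, cost=0.3, threshold=1.0, start_energy=0.5, max_agents=8):
    """Run a toy population of crawlers; return (pages visited in order, surviving agents)."""
    population = [Agent(seed, start_energy)]
    visited = []
    for _ in range(steps):
        survivors = []
        for agent in population:
            links = TOY_WEB[agent.page][1]
            if not links:
                continue  # dead end: the agent is dropped
            # Textual clue: follow the link whose anchor text best matches the query.
            _, target = max(links, key=lambda link: relevance(link[0], query))
            agent.page = target
            if target not in visited:
                visited.append(target)
            # Energy rises with page relevance and falls by a fixed cost per move.
            agent.energy += relevance(TOY_WEB[target][0], query) - cost
            if agent.energy > 0:
                survivors.append(agent)  # otherwise the agent starves
        offspring = []
        for agent in survivors:
            # Reproduction: a well-fed agent splits its energy with a clone on the same page.
            if agent.energy >= threshold and len(survivors) + len(offspring) < max_agents:
                agent.energy /= 2
                offspring.append(Agent(agent.page, agent.energy))
        population = survivors + offspring
        if not population:
            break
    return visited, population
```

With the query "adaptive crawlers" and the seed page "home", the population in this toy graph concentrates on the "lab" and "paper" pages and never visits "food". A query with no matching terms makes the single starting agent run out of energy after two moves.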

Original language: English (US)
Pages (from-to): 221-229
Number of pages: 9
Journal: Autonomous Agents and Multi-Agent Systems
Volume: 5
Issue number: 2
State: Published - 2002
Externally published: Yes

Keywords

  • Applet
  • InfoSpiders
  • MySpiders
  • Online search
  • Topic-driven crawlers
  • Web information retrieval

ASJC Scopus subject areas

  • Artificial Intelligence
