Distributed search engine - Document - PDFSEARCH.IO

First Page		Document Content
Date: 2013-09-23 08:37:31 World Wide Web Heritrix Focused crawler Web harvesting Web archiving Robots exclusion standard Web search engine Distributed web crawling Information science Web crawlers Information retrieval		Add to Reading List Source URL: www.ipsyp.gr Download Document from Source Website File Size: 149,31 KB Share Document on Facebook

	Legal deposit of the French Web: harvesting strategies for a national domain France Lasfargues, Clément Oury, and Bert Wendland Bibliothèque nationale de France Quai François MauriacParis Cedex 13 DocID: 1qJYU - View Document
	Adapting the Hypercube Model to Archive Deferred Representations and Their Descendants Justin F. Brunelle, Michele C. Weigle, and Michael L. Nelson Old Dominion University Department of Computer Science Norfolk, Virginia DocID: 1qeWd - View Document
	Proceedings Template - WORD DocID: 1pmUL - View Document
	Adapting the Hypercube Model to Archive Deferred Representations and Their Descendants Justin F. Brunelle, Michele C. Weigle, and Michael L. Nelson Old Dominion University Department of Computer Science Norfolk, Virginia DocID: 1pgFe - View Document
	Incremental crawling with Heritrix Kristinn Sigurðsson National and University Library of Iceland ArngrímsgötuReykjavík Iceland DocID: 1p7IJ - View Document