This class can be used to crawl a site and retrieve the URLs of all its links.
It retrieves a page of a site and follows all links recursively to collect every URL on the site.
The class can restrict crawling to URLs with a given extension, and it avoids accessing pages disallowed by the site's robots.txt file or marked with the noindex or nofollow meta tags.
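The class itself is not shown on this page, but the politeness rules it describes (honoring robots.txt and the robots meta tag) can be sketched in a minimal, self-contained way. The snippet below is an illustration, not the package's actual code: it parses a page for links and `<meta name="robots">` directives with Python's standard `html.parser`, and checks robots.txt rules with `urllib.robotparser`. All class and variable names are hypothetical.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser

class LinkCollector(HTMLParser):
    """Collects <a href> links and honors <meta name="robots"> directives.

    Hypothetical helper illustrating the checks described above; not the
    package's real implementation.
    """
    def __init__(self):
        super().__init__()
        self.links = []
        self.follow = True   # cleared when a "nofollow" meta directive is seen
        self.index = True    # cleared when a "noindex" meta directive is seen

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("name", "").lower() == "robots":
            content = attrs.get("content", "").lower()
            if "nofollow" in content:
                self.follow = False
            if "noindex" in content:
                self.index = False
        elif tag == "a" and "href" in attrs:
            self.links.append(attrs["href"])

# robots.txt rules can be checked with the standard library parser;
# parse() accepts the file's lines directly, so no network access is needed.
robots = RobotFileParser()
robots.parse(["User-agent: *", "Disallow: /private/"])

page = '<meta name="robots" content="noindex"><a href="/a.html">A</a>'
parser = LinkCollector()
parser.feed(page)
```

A real crawler would apply these checks before queueing each link: skip URLs for which `robots.can_fetch()` is false, drop a page's links when `follow` is false, and omit the page's own URL from results when `index` is false.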
Files