Crawler: Extract links and images from remote Web pages

Ratings	Unique User Downloads	Download Rankings
52%	Total: 6,443	All time: 327, This week: 488

Version	License	PHP version	Categories
`crawler` 1.1	Freely Distributable	4.0	HTML, Web services

Description

Author

Md. Shaiful islam

This class can be used to extract links and images from remote Web pages.

It can access Web pages, parse their HTML, and extract the URLs of the links and images they contain.
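The extraction step described above could look roughly like the following. This is a minimal sketch, not the code in `Crawler.php`: the function name `extractLinksAndImages` is hypothetical, it assumes PHP 7 or later (the package itself targets PHP 4), and it uses PHP's bundled DOM extension rather than whatever parsing the class performs internally.

```php
<?php
// Hypothetical helper, not part of Crawler.php: collects link and image URLs
// from an HTML string using PHP's bundled DOM extension.
function extractLinksAndImages(string $html): array
{
    $doc = new DOMDocument();
    // Real-world HTML is rarely valid, so collect libxml warnings quietly.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $links = [];
    foreach ($doc->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        if ($href !== '') {
            $links[] = $href;
        }
    }

    $images = [];
    foreach ($doc->getElementsByTagName('img') as $img) {
        $src = trim($img->getAttribute('src'));
        if ($src !== '') {
            $images[] = $src;
        }
    }

    // Drop duplicates and reindex so callers get plain lists.
    return [
        'links'  => array_values(array_unique($links)),
        'images' => array_values(array_unique($images)),
    ];
}

// Usage with an inline sample page, so no network access is needed.
$sample = '<html><body><a href="/about">About</a> <a href="/about">Again</a>'
        . '<img src="logo.png" alt=""></body></html>';
print_r(extractLinksAndImages($sample));
```

The URLs are returned exactly as written in the page, so relative values such as `/about` still need to be resolved against the page address before they can be fetched.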

If necessary, the class can access a login page and emulate the submission of a login form, so that subsequent requests are made on behalf of the logged-in user.
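One common way to emulate a login is to post the form fields with cURL and keep the session cookie in a file that later requests reuse. The sketch below illustrates that idea under stated assumptions: `sessionOptions` and `request` are hypothetical names, the URLs and field names in the usage comment are placeholders, PHP 7.1 or later with the cURL extension is assumed, and none of this is taken from the package's own API.

```php
<?php
// Hypothetical helper, not the package's API: builds cURL options that share
// one cookie file, so a session cookie set at login is sent on later requests.
function sessionOptions(string $url, string $cookieFile, ?array $postFields = null): array
{
    $options = [
        CURLOPT_URL            => $url,
        CURLOPT_RETURNTRANSFER => true,        // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true,        // follow the redirect many login forms issue
        CURLOPT_COOKIEJAR      => $cookieFile, // cookies are written here when the handle is freed
        CURLOPT_COOKIEFILE     => $cookieFile, // stored cookies are sent with the request
    ];
    if ($postFields !== null) {
        $options[CURLOPT_POST]       = true;
        $options[CURLOPT_POSTFIELDS] = http_build_query($postFields);
    }
    return $options;
}

// Hypothetical helper: runs one request and returns the body ('' on failure).
function request(array $options): string
{
    $handle = curl_init();
    curl_setopt_array($handle, $options);
    $body = curl_exec($handle);
    // The handle is released when this function returns, which flushes
    // any received cookies to the cookie jar file.
    return $body === false ? '' : $body;
}

// Usage (placeholder URLs and field names, adjust to the real login form):
//   $jar = tempnam(sys_get_temp_dir(), 'cookies');
//   request(sessionOptions('https://example.com/login', $jar,
//       ['username' => 'me', 'password' => 'secret']));
//   $html = request(sessionOptions('https://example.com/members', $jar));
```

Sites that add hidden tokens to the login form or rely on JavaScript need extra steps that this sketch does not cover.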

Innovation Award

March 2008
Number 7

Prize: One copy of Delphi for PHP

Retrieving Web pages from remote sites is a relatively easy task in PHP.

If you want to crawl a site to search for something in its pages, you only need to retrieve its pages, use regular expressions to extract their links, and retrieve the linked pages until every page has been visited.
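The loop described above can be sketched as a breadth-first traversal with a list of visited pages. This is only an illustration: the `crawl` function, its `$fetch` callback, and the `$maxPages` limit are hypothetical, PHP 7 or later is assumed, and the regular expression is deliberately simple (quoted `href` values only). Passing the fetcher in as a callback lets the example run against an in-memory site instead of the network.

```php
<?php
// Hypothetical sketch: breadth-first crawl limited to a page budget.
// $fetch is any callable that takes a URL and returns its HTML ('' on failure).
function crawl(string $startUrl, callable $fetch, int $maxPages = 50): array
{
    $parts   = parse_url($startUrl);
    $origin  = $parts['scheme'] . '://' . $parts['host'];
    $queue   = [$startUrl];
    $visited = [];

    while ($queue && count($visited) < $maxPages) {
        $url = array_shift($queue);
        if (isset($visited[$url])) {
            continue; // already retrieved, do not follow it again
        }
        $visited[$url] = true;

        $html = $fetch($url);
        // Capture quoted href values, ignoring pure fragment links.
        preg_match_all('/<a\s[^>]*href=["\']([^"\'#]+)/i', $html, $matches);

        foreach ($matches[1] as $href) {
            if (strncmp($href, '/', 1) === 0 && strncmp($href, '//', 2) !== 0) {
                // Root-relative link: resolve against the starting site.
                $queue[] = $origin . $href;
            } elseif (strpos($href, $origin . '/') === 0) {
                // Absolute link on the same site.
                $queue[] = $href;
            }
            // Assumption: everything else (other hosts, other relative forms) is skipped.
        }
    }

    return array_keys($visited);
}

// Usage with an in-memory "site" so the example runs offline.
$site = [
    'http://example.test/'  => '<a href="/a">A</a> <a href="http://other.test/x">X</a>',
    'http://example.test/a' => '<a href="/">Home</a>',
];
$fetch = function (string $url) use ($site): string {
    return $site[$url] ?? '';
};
print_r(crawl('http://example.test/', $fetch));
```

A real fetcher could wrap `file_get_contents` or cURL; the visited list is what stops the crawl from looping when pages link back to each other.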

However, if some pages can only be accessed by authenticated users, the problem is no longer so simple.

This package provides a more complete solution to the problem of crawling site pages by authenticating automatically, so it can also access pages restricted to logged-in users.

Manuel Lemos

Md. Shaiful islam

Name:	Md. Shaiful islam `<contact>`
Classes:	1 package by Md. Shaiful islam
Country:	United States

Innovation award

Nominee: 1x

Files (4)

File	Role	Description
`Crawler.php`	Class	The Class
`ExampleCrawlImage.php`	Example	Crawl images from the http://www.phpclasses.org/ site
`ExampleCrawlLink.php`	Example	Crawl links from the http://www.phpclasses.org/ site
`ExampleLoginCrawlLink.php`	Example	Log in and crawl links from a site

Download

crawler-2008-09-11.zip	2KB
crawler-2008-09-11.tar.gz	2KB

Unique User Downloads

Total:	6,443
This week:	0

Download Rankings

All time:	327
This week:	488

User Ratings

	All time
Utility:	75%
Consistency:	69%
Documentation:	-
Examples:	76%
Tests:	-
Videos:	-
Overall:	52%
Rank:	2462

User Comments (3)

exellent! 3 years ago (Jeff Dudas)	70%
Does not work for linked in 12 years ago (Mansoor Rana)	12%
Lacking recursion, it doesn't actually crawl. 16 years ago (wahoo frankinson)	32%

Pages that reference this package


Today I'm going to post a class for identifying links and images on sites...
I've just finished developing a stable release of quite an advanced SEO tool, I called it "position checker"...
