PHP Classes

Incorrect url for local links

Recommend this page to a friend!

      Crawler  >  All threads  >  Incorrect url for local links  >  (Un) Subscribe thread alerts  
Subject:Incorrect url for local links
Summary:Local links (without hostname) are incorrect
Messages:2
Author:Jeff Rice
Date:2008-08-11 19:58:17
Update:2008-08-11 20:21:18
 

  1. Incorrect url for local links   Reply   Report abuse  
Picture of Jeff Rice Jeff Rice - 2008-08-11 19:58:17

For local links, ie a href=index.php, the crawler returns the URL as http://index.html or in some cases, even stranger, http:/index.php.

  2. Re: Incorrect url for local links   Reply   Report abuse  
Picture of Jeff Rice Jeff Rice - 2008-08-11 20:21:18 - In reply to message 1 from Jeff Rice

Actually, more so the returning of URLs is fragile and prone to breakage. It does extract the links properly, but the returned URL list is badly broken and needs rewriting to work with sites that have links in anything other than http://x.y.z format.