PHP Classes
elePHPant
Icontem

Apache Log Iterator: Extract data from Apache log file lines and fields

Recommend this page to a friend!
  Info   View files View files (5)   DownloadInstall with Composer Download .zip   Reputation   Support forum   Blog    
Last Updated Ratings Unique User Downloads Download Rankings
2014-05-02 (2 years ago) RSS 2.0 feedNot yet rated by the usersTotal: 422 All time: 6,055 This week: 961Up
Version License PHP version Categories
apache-log-iterator 1.0.1GNU General Publi...5.2PHP 5, Logging, Parsers
Description Author

This package can extract data from Apache log file lines and fields.

It provides a class with an iterator interface that can extract the lines of a given Apache log file.

Another class provides an iterator interface to extract Apache log file line fields. The field values are returned in an associative array.

Picture of Simon Champion
Name: Simon Champion <contact>
Classes: 1 package by
Country: United Kingdom United Kingdom

Details
# ApacheLogIterator

### Version 1.0.1

A small PHP class intended to simplify the processing of Apache log files within a PHP program.

## Requirements

ApacheLogIterator has been tested under PHP 5.3. It is written with the intention of also working under 5.2, but this has not yet been tested.

This class has no external dependencies.

## Functionality

ApacheLogIterator is a parser that aims to make the reading of your Apache log file as simple and as efficient as possible.

It extends the SPLFileObject. Normal usage of the SPLFileObject would be to open a file and use a foreach() loop to load each line from the file in turn. The ApacheLogIterator class extends this, such that instead of simply returning the raw data as read from the file, it processes it into a structured array.

Because it's an iterator, you can loop through it using foreach() without having to load the whole file into memory. This is particularly useful because Apache log files can be very large. Only the current record is in memory at any given time.

In addition, ApacheLogIterator comes with a small helper class called ApacheLogFields. This defines the structure of the log records. If your Apache is configured to produce logs in a different format to the ones described in the code, you may override the ApacheLogFiles class to define your own log record format as required.

Filtering the output is trivial: Since it is an Iterator class, you can filter the output using PHP's built-in [FilterIterator](http://php.net/manual/en/class.filteriterator.php).

## Example for Standard Apache Logs

    $logFile = "/path/to/apache/log/file";
    $logIterator = new ApacheLogIterator($logFile);
    foreach ($logIterator as $logRecord) {
        print_r($logRecord); //do whatever you want to here with the output array.
    }

## Example for Apache Error Logs

    $logFile = "/path/to/apache/log/error_file";
    $logFields = new ApacheLogFields();
    $logFields->regex = '/^\[(... ... \d\d \d\d:\d\d:\d\d \d\d\d\d)\] \[(error)\] \[.+ (.*)\] (.*,) (.*$)$/';         
    $logFields->fieldArray = array(                                                                      
        'originalLogEntry',
        'logtime',
        'type',
        'remoteIP',                                                                                      
        'query',
        'referrer'                                                                                       
    );
    $logIterator = new ApacheLogIterator($logFile, $logFields);                                    
    foreach ($logIterator as $logRecord) {
        print_r($logRecord); //do whatever you want to here with the output array.
    }

The output array looks something like this:

    array (
        'originalLogEntry' => '*** the full log record, in case you need it ***',
        'localServer' => 'yourservername',
        'remoteIP' => '192.168.1.1',
        'datetime' => '21/Jul/2012:15:10:54 +0100',
        'method' => 'GET',
        'status' => '200',
        'bytes' => '32611',
        'referrer' => 'http://www.referralurl.com/',
        'userAgent' => 'Mozilla/5.0 (Windows NT 6.0; rv:13.0) Gecko/20100101 Firefox/13.0.1',
        'request' => array (
            'scheme' => 'http',
            'host' => 'mydomain.com',
            'path' => '/query.php',
            'query' => 'q=te57+1ng&submit=Search',
            'fullURL' => 'http://mydomain.com:/query.php?q=te57+1ng&submit=Search',
            'queryArgs' => array (
                'q' => 'te57 1ng',
                'submit' => 'Search',
            ),
        ),
    )

## Copyright and License

This class was written by Simon Champion, and is copyright [Connection Services Limited](http://www.connectionservices.com/).

It is released under the General Public License version 3 (GPLv3); see COPYING.txt for full license text. Please contact us if you require alternative licensing arrangements.
  Files folder image Files  
File Role Description
Plain text file ApacheLogFields.php Class Class source
Plain text file ApacheLogIterator.php Class Class source
Accessible without login Plain text file ApacheLogIteratorTest.php Test Unit test script
Accessible without login Plain text file COPYING.txt Lic. License text
Accessible without login Plain text file README.md Doc. Documentation

 Version Control Unique User Downloads Download Rankings  
 100%
Total:422
This week:0
All time:6,055
This week:961Up