  1. Pig
  2. PIG-4639

Add better parser for Apache HTTPD access log.

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.15.0
    • Fix Version/s: 0.16.0
    • Component/s: piggybank
    • Labels: None
    • Hadoop Flags: Reviewed
    • Release Note: In piggybank there is now a generic Apache httpd access log loader that supports (almost) all custom LogFormats.

    Description

      Currently there are two parsers for Apache HTTPD access log files in piggybank, and they only support the 'combined' and 'common' LogFormats. Both also parse only the basic fields.

      This is a proposed patch to add the existing https://github.com/nielsbasjes/logparser (Apache 2.0 license) as an 'out of the box' parser to piggybank.
      This parser parses the logfile using the LogFormat specification that was used to write it. Almost all LogFormat specifiers are supported, which makes it easy to parse (almost) all custom LogFormats used in production.
      This parser also goes much deeper: it can extract things like the value of a cookie or the value of a query string parameter.
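
      To illustrate the idea of deriving a parser from the LogFormat string itself, here is a minimal Python sketch. It is not the logparser library's implementation or API: the DIRECTIVES table, the field names, and the helper functions compile_logformat and parse_line are all hypothetical, and only a handful of directives are covered. It also shows the "deeper" step of pulling a query string parameter out of the request line.

```python
import re
from urllib.parse import urlsplit, parse_qs

# Hypothetical mapping from a few LogFormat directives to (field name, regex).
# A real parser would cover far more directives than this sketch does.
DIRECTIVES = {
    "%h": ("remote_host", r"\S+"),
    "%l": ("remote_logname", r"\S+"),
    "%u": ("remote_user", r"\S+"),
    "%t": ("time", r"\[[^\]]+\]"),
    "%r": ("request_line", r"[^\"]*"),
    "%>s": ("status", r"\d{3}"),
    "%b": ("response_bytes", r"\d+|-"),
}

# The 'common' LogFormat as it would appear in an httpd configuration.
COMMON = '%h %l %u %t "%r" %>s %b'


def compile_logformat(logformat):
    """Translate a LogFormat string into a regex with one named group per directive."""
    # Try longer directives first so that multi-character ones match as a whole.
    names = sorted(DIRECTIVES, key=len, reverse=True)
    token = re.compile("|".join(re.escape(d) for d in names))
    pattern, pos = "", 0
    for m in token.finditer(logformat):
        # Text between directives (spaces, quotes) must match literally.
        pattern += re.escape(logformat[pos:m.start()])
        name, regex = DIRECTIVES[m.group()]
        pattern += f"(?P<{name}>{regex})"
        pos = m.end()
    pattern += re.escape(logformat[pos:])
    return re.compile("^" + pattern + "$")


def parse_line(regex, line):
    """Return a dict of fields for one log line, or None if it does not match."""
    m = regex.match(line)
    if m is None:
        return None
    fields = m.groupdict()
    # Go one level deeper: split the request line and extract query parameters.
    parts = (fields.get("request_line") or "").split(" ")
    if len(parts) == 3:
        fields["method"], uri, fields["protocol"] = parts
        query = parse_qs(urlsplit(uri).query)
        fields["query"] = {key: values[0] for key, values in query.items()}
    return fields


line = ('127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
        '"GET /search?q=pig&page=2 HTTP/1.0" 200 2326')
record = parse_line(compile_logformat(COMMON), line)
print(record["remote_host"], record["status"], record["query"]["q"])
```

      Because the regex is generated from the format string, supporting a custom LogFormat in this sketch only means passing a different string, as long as its directives are in the table. Directives missing from the table are treated as literal text here, which a production parser would not do.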

      Attachments

        1. PIG-4639-20150723-classnotfound.patch
          12 kB
          Niels Basjes
        2. PIG-4639-20150725.patch
          13 kB
          Niels Basjes
        3. PIG-4639-20150805-1247.patch
          13 kB
          Niels Basjes

            People

              Assignee: Niels Basjes (nielsbasjes)
              Reporter: Niels Basjes (nielsbasjes)
              Votes: 0
              Watchers: 3
