Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1150

ARFF Integration does not support quoted identifiers

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.7
    • 0.8
    • classic
    • None
    • All

    Description

      I ran the NSL-KDD data set (http://nsl.cs.unb.ca/NSL-KDD/) through the ARFF integration. The process failed to parse the arff formatted file. The file has quoted identifiers:

      @relation 'KDDTrain-20Percent'
      @attribute 'duration' real
      @attribute 'protocol_type'

      {'tcp','udp', 'icmp'}

      The quotes caused the problem. The "official" arff BNF shows that quotes should be supported:

      https://list.scms.waikato.ac.nz/mailman/htdig/wekalist/2008-January/012153.html

      Attachments

        1. MAHOUT-1150.patch
          8 kB
          Marty Kube

        Activity

          People

            Unassigned Unassigned
            martykube Marty Kube
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: