Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.5
    • Component/s: None
    • Labels:
      None

      Description

      We should be able to read in and write out CSV files to Vectors

      1. MAHOUT-548.patch
        24 kB
        Grant Ingersoll

        Activity

        Hide
        Grant Ingersoll added a comment -

        Here's a first crack at support for reading in and putting out CSV. The output comes via the VectorDumper, but we really should have a more generic framework for output, me thinks. Input is handled through a new utils package called csv that defines a CSVVectorIterable. It uses Commons CSV (published via Solr into Maven)

        There's also a couple of other Iterables for things I had started on that I should have posted separately but got mixed in.

        Show
        Grant Ingersoll added a comment - Here's a first crack at support for reading in and putting out CSV. The output comes via the VectorDumper, but we really should have a more generic framework for output, me thinks. Input is handled through a new utils package called csv that defines a CSVVectorIterable. It uses Commons CSV (published via Solr into Maven) There's also a couple of other Iterables for things I had started on that I should have posted separately but got mixed in.
        Hide
        Dmitriy Lyubimov added a comment -
        +++ utils/pom.xml	(working copy)
        @@ -138,8 +138,13 @@
               <type>test-jar</type>
               <scope>test</scope>
             </dependency>
        -
        +    <!-- Solr publishes this, but it is Commons CSV -->
             <dependency>
        +      <groupId>org.apache.solr</groupId>
        +      <artifactId>solr-commons-csv</artifactId>
        +      <version>1.4.1</version>
        +    </dependency>
        +    <dependency>
               <groupId>junit</groupId>
               <artifactId>junit</artifactId>
               <scope>test</scope>
        

        Maven note:
        I am not sure if it is absolutely unavoidable to bring solr dependency in but if it is needed, please read MAHOUT-622, I am trying to fix instances like this there.
        Thanks.

        Show
        Dmitriy Lyubimov added a comment - +++ utils/pom.xml (working copy) @@ -138,8 +138,13 @@ <type>test-jar</type> <scope>test</scope> </dependency> - + <!-- Solr publishes this , but it is Commons CSV --> <dependency> + <groupId>org.apache.solr</groupId> + <artifactId>solr-commons-csv</artifactId> + <version>1.4.1</version> + </dependency> + <dependency> <groupId>junit</groupId> <artifactId>junit</artifactId> <scope>test</scope> Maven note: I am not sure if it is absolutely unavoidable to bring solr dependency in but if it is needed, please read MAHOUT-622 , I am trying to fix instances like this there. Thanks.
        Hide
        Hudson added a comment -

        Integrated in Mahout-Quality #708 (See https://hudson.apache.org/hudson/job/Mahout-Quality/708/)

        Show
        Hudson added a comment - Integrated in Mahout-Quality #708 (See https://hudson.apache.org/hudson/job/Mahout-Quality/708/ )

          People

          • Assignee:
            Grant Ingersoll
            Reporter:
            Grant Ingersoll
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development