  1. Zeppelin
  2. ZEPPELIN-3659

'Using Pig for querying data' tutorial is outdated

    Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: documentation
    • Labels: None

      Description

      The tutorial's third paragraph (the first one that is not a description) calls hadoop:

      hadoop fs -put bank.csv .

       

      This call assumes Hadoop is already installed, but Hadoop is not listed as a dependency in the preceding paragraphs, and it is not included or mentioned in the installation files or the quickstart. A user with a fresh install who hits 'run all paragraphs' gets an error at this paragraph.
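
      One possible mitigation is a guard in front of the upload, so a missing Hadoop install produces a readable message instead of a raw shell error. The following is only a sketch, assuming the paragraph runs in a POSIX-compatible shell; check_hadoop is a hypothetical helper name and the message wording is illustrative, neither comes from the tutorial.

```shell
# Hypothetical guard for the tutorial's upload step.
# check_hadoop is an illustrative helper, not part of Zeppelin or Hadoop.
check_hadoop() {
  # command -v succeeds only if a hadoop executable is found on PATH
  if command -v hadoop >/dev/null 2>&1; then
    return 0
  fi
  echo "hadoop not found on PATH; install Hadoop before running this tutorial" >&2
  return 1
}

# Only attempt the upload when the hadoop CLI is available.
if check_hadoop; then
  hadoop fs -put bank.csv .
fi
```

      This does not remove the dependency; it only makes the failure explain itself. The tutorial text would still need to list Hadoop as a prerequisite.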

       

      While this is not terribly difficult to overcome, it creates friction in getting up and running. The next question is which version of Hadoop to use. I've tested with Hadoop 2.7.7 and it appears to work fine, but I haven't fully vetted it yet.

            People

            • Assignee: Unassigned
            • Reporter: OptimizePrime Alex Byrd
            • Votes: 0
            • Watchers: 3
