Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4225

Allow users to specify Python executable for Pig streaming

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 0.12.0, 0.12.1
    • None
    • internal-udfs
    • None

    Description

      The current PythonScriptEngine uses whatever python is currently on the path in order to execute scripts.

      Python users are accustomed to creating virtual environments (virtualenvs) where associated requirements are installed without needing to worry about "global" installs via, for example, sudo pip install <some package>.

      Is it possible to have the Python executable specified either via the DEFINE command syntax or, in a hadoop job configuration variable? Perhaps pig.pythonstreaming.pythonpath?

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              msukmanowsky Mike Sukmanowsky
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: