Uploaded image for project: 'Apache Blur'
  1. Apache Blur
  2. BLUR-397

Improve data loading from M/R

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.2.4
    • Blur, Blur MapReduce
    • None

    Description

      There's an awkward permissions dilemma when writing data into Blur from Map/Reduce.

      A job would typically create a table, then load the data. The challenge is that the table itself is created through the controller, which means it's written to DFS as the user actually running the controller daemon - typically 'blur'. The Map/Reduce job may be run as some other user totally, but it may be a user that you don't want to have write access inside blur's directory paths. In other words, you'd like arbitrary user(s) to be able to create/populate table data without necessarily having write access to blur's internal stuffs.

      One approach is to have the user's job write to any location they have access to, the "tell" Blur to 'import' it - at which time, Blur would literally move the data into it's control.

      Attachments

        Activity

          People

            amccurry Aaron McCurry
            williamstw Tim Williams
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: