CarbonData / CARBONDATA-2848

Task failed during 1st load and restarted on same executor

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 1.4.1
    • Fix Version/s: None
    • Component/s: data-load
    • Labels: None
    • Environment: Spark 2.1

    Description

      Steps:

      Performed a very large data load (the table has 3.5 billion records).

      Actual Issue:

      One of the tasks failed during the first load and was restarted on the same executor, so the first segment took twice as long to load. The rest of the 5 loads ran properly.

      Because of this failure and the task rerun, Segment_0 contains double the expected number of .carbondata files, and the number of records loaded is also doubled.

      Expected:

      The task should not fail in the 1st load. Even if a failed task is restarted, the number of carbondata files and the number of records loaded should not double.
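
      To make the expected behavior concrete, here is a minimal sketch of one common way to keep a task rerun from doubling output: each attempt writes to its own staging directory, and only one attempt per task is published to the segment. This is an illustration only, not CarbonData's writer or commit code. The directory layout, the `.data` file names, and the functions `write_attempt`, `commit_attempt`, and `simulate_load` are all hypothetical.

```python
import os
import shutil
import tempfile


def write_attempt(segment_dir, task_id, attempt_id, rows):
    """Write one task attempt into its own staging directory (hypothetical layout)."""
    staging = os.path.join(segment_dir, "_staging", f"task{task_id}_attempt{attempt_id}")
    os.makedirs(staging, exist_ok=True)
    with open(os.path.join(staging, f"part-{task_id}.data"), "w") as f:
        for row in rows:
            f.write(row + "\n")
    return staging


def commit_attempt(segment_dir, task_id, staging):
    """Publish an attempt's output; only the first commit per task takes effect."""
    final_path = os.path.join(segment_dir, f"part-{task_id}.data")
    if os.path.exists(final_path):
        # Another attempt of this task already committed: discard this one.
        # (A real committer would need to make this check race-free.)
        shutil.rmtree(staging)
        return False
    os.replace(os.path.join(staging, f"part-{task_id}.data"), final_path)
    shutil.rmtree(staging)
    return True


def simulate_load():
    """Simulate a task that fails midway, is retried, and commits exactly once."""
    segment_dir = tempfile.mkdtemp(prefix="Segment_0_")
    rows = ["r1", "r2", "r3"]

    # Attempt 0 fails after writing part of its output; nothing is committed.
    failed = write_attempt(segment_dir, task_id=0, attempt_id=0, rows=rows[:2])
    shutil.rmtree(failed)  # the failed attempt's staging output is cleaned up

    # Attempt 1 reruns the whole task on the same "executor" and commits.
    retry = write_attempt(segment_dir, task_id=0, attempt_id=1, rows=rows)
    commit_attempt(segment_dir, 0, retry)

    # Count published files and records in the segment.
    files = [f for f in os.listdir(segment_dir) if f.endswith(".data")]
    with open(os.path.join(segment_dir, files[0])) as f:
        record_count = sum(1 for _ in f)
    shutil.rmtree(segment_dir)
    return len(files), record_count
```

      In this sketch, `simulate_load()` ends with one published file holding three records even though the task ran twice, which is the outcome expected for Segment_0 after a rerun.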
          People

            Assignee: Unassigned
            Reporter: Pawan Malwal (pawanmalwal)