Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-1115

parquet writer never finishes when the source contains huge number of small files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • None
    • 0.8.0
    • Storage - Writer
    • None

    Description

      git.commit.id.abbrev=790a2ad
      Build # 26246

      When we try to use 'create table as.....' and the source folder contains around 5000 text files, drill never completes. I left the query to tun overnight, but it still didn't complete. However I see nonstop activity in the log files which suggests drill is actually doing something.

      Cluster Size : 2
      Each file contains only a single number.

      Attachments

        Activity

          People

            rkins Rahul Kumar Challapalli
            rkins Rahul Kumar Challapalli
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: