Apache Drill / DRILL-1810

Scalability issues with kvgen

Details

    Description

      git.commit.id.abbrev=4a4f54a

      Memory Settings

      DRILL_MAX_DIRECT_MEMORY="32G"
      DRILL_MAX_HEAP="4G"
      

      Scalar dataset:

      {"col1":{"a":"b"}}
      

      The query below works fine for the data set above. However, if I copy the same record 100,000 times and run the same query, kvgen fails with memory-related errors.

      select kvgen(col1) from `json_kvgenflatten/kvgen-scalar-large.json`;
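
      To reproduce the larger input, the record has to be duplicated 100,000 times. A minimal sketch of one way to do that is shown here. The function name write_repeated_record is hypothetical, and writing one record per line is an assumption, since the report does not say how the copies were laid out. The file name is taken from the query above; its location relative to the Drill workspace is not stated in the report.

```python
import json


def write_repeated_record(path, copies=100_000):
    """Write `copies` identical JSON records to `path`, one per line.

    The one-record-per-line layout is an assumption; the report only
    says the same record was copied 100000 times.
    """
    # Compact separators reproduce the record exactly as shown in the report.
    record = json.dumps({"col1": {"a": "b"}}, separators=(",", ":"))
    with open(path, "w", encoding="utf-8") as out:
        for _ in range(copies):
            out.write(record + "\n")
    return copies


if __name__ == "__main__":
    # File name from the failing query; the directory is left to the reader.
    write_repeated_record("kvgen-scalar-large.json")
```

      The resulting file then needs to be placed where the query expects it (the json_kvgenflatten directory in the report) before the select is run again.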
      

      I have attached the log files. Let me know if you need anything else.

      Attachments

        1. complex.log
          142 kB
          Rahul Kumar Challapalli
        2. simple.log
          61 kB
          Rahul Kumar Challapalli

            People

              mehant Mehant Baid
              rkins Rahul Kumar Challapalli
              Votes: 0
              Watchers: 2
