AVRO-1089: Avro-C - Penalty 30x to 50x for using resolved writer on arrays

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.6.3, 1.7.0
    • Fix Version/s: 1.7.3
    • Component/s: c
    • Labels: None
    • Environment: Ubuntu Linux

    Description

      The new performance tests created in AVRO-1088 show that, for simple
      and nested arrays, using the resolved writer takes 30 to 50 times
      longer than using no schema resolution or using the resolved reader.

      For a simple array, using the resolved writer took ~30x longer than
      using the memory reader that assumed a matching schema (3641 vs.
      117739 tests/sec). For the nested array, using the resolved writer
      took ~50x longer (1504 vs. 82508 tests/sec).

      These results suggest that there is a bug in the resolved writer. I do
      not have a proposed fix at this time.

      **** Running simple array matched schemas ****
        250000 tests per run
        Run 1
        Run 2
        Run 3
        Average time: 2.123s
        Tests/sec: 117739
      **** Running simple array resolved writer ****
        10000 tests per run
        Run 1
        Run 2
        Run 3
        Average time: 2.747s
        Tests/sec: 3641
      **** Running nested array matched schemas ****
        250000 tests per run
        Run 1
        Run 2
        Run 3
        Average time: 3.030s
        Tests/sec: 82508
      **** Running nested array resolved writer ****
        10000 tests per run
        Run 1
        Run 2
        Run 3
        Average time: 6.650s
        Tests/sec: 1504
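
      The slowdown factors quoted in the description can be checked from the
      throughput figures in the output. The following is a minimal sketch of
      that arithmetic, not code from the Avro-C test suite; the function
      names tests_per_sec and slowdown are hypothetical and chosen only for
      illustration.

```c
#include <assert.h>

/* Hypothetical helper: throughput in tests/sec for a run of num_tests
 * iterations that took avg_seconds on average. */
double tests_per_sec(long num_tests, double avg_seconds)
{
    assert(avg_seconds > 0.0);   /* guard against division by zero */
    return (double) num_tests / avg_seconds;
}

/* Hypothetical helper: how many times slower the candidate is than the
 * baseline, given both throughputs in tests/sec. */
double slowdown(double baseline_tps, double candidate_tps)
{
    assert(candidate_tps > 0.0); /* guard against division by zero */
    return baseline_tps / candidate_tps;
}
```

      With the reported numbers, slowdown(117739.0, 3641.0) is roughly 32
      for the simple array and slowdown(82508.0, 1504.0) is roughly 55 for
      the nested array. The matched-schema runs use 250000 tests per run and
      the resolved-writer runs use 10000, so the comparison has to be made
      on tests/sec rather than on the average times.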

          People

            Assignee: Unassigned
            Reporter: Vivek Nadkarni (vnadkarni)
            Votes: 0
            Watchers: 1

              Time Tracking

                Original Estimate: 48h
                Remaining Estimate: 48h
                Time Spent: Not Specified