Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-9649

[Rust] [DataFusion] Investigate poor performance cited in towardsdatascience blog post

    XMLWordPrintableJSON

    Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 1.0.0
    • Fix Version/s: None
    • Component/s: Rust, Rust - DataFusion
    • Labels:
      None

      Description

      According to a recently published blog post [1] DataFuson is ~20x slower than Pandas for some simple queries against a tiny data set. I think it would be good to try and reproduce these results to understand why performance is so bad.

       [1] https://towardsdatascience.com/data-processing-in-rust-with-datafusion-arrow-56df5432de68

        Attachments

          Activity

            People

            • Assignee:
              andygrove Andy Grove
              Reporter:
              andygrove Andy Grove
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: