Apache Arrow, ARROW-13317

[Python] Improve documentation on what 'use_threads' does in 'read_feather'

Details

    Description

      The current documentation for read_feather states the following:

      use_threads (bool, default True) – Whether to parallelize reading using multiple threads.

      However, if the underlying file uses compression, multiple threads can still be spawned. The wording of use_threads is ambiguous: it is unclear whether the restriction on multiple threads applies only to the conversion from the pyarrow table to the pandas DataFrame, or also to the reading and decompression of the file itself, which might spawn additional threads.

      It might be good to mention set_cpu_count as a way to actually limit the number of threads spawned.

            People

              alenkaf Alenka Frim
              ajoseps Arun Joseph
              Votes: 0
              Watchers: 4

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1h 20m