  1. Apache Arrow
  2. ARROW-13207

[Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: 4.0.1
    • Fix Version/s: 5.0.0
    • Component/s: Documentation
    • Labels: None

    Description

      https://arrow.apache.org/docs/python/dataset.html#manual-scheduling mentions that to_table loads all data into memory and that the iterative approach should be used instead, but it then points to scan, which has been deprecated in favor of to_batches.

            People

              amol- Alessandro Molina
              amol- Alessandro Molina
