Details
-
Task
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
None
Description
The JS client lacks basic introductory material around creating the common basic data types such as turning JS arrays into ints, dicts, etc. There is no equivalent of Python's https://arrow.apache.org/docs/python/data.html . This has made use for myself difficult, and I bet for others.
As with prev tutorials, I started sketching on https://observablehq.com/@lmeyerov/rich-data-types-in-apache-arrow-js-efficient-data-tables-wit . When we're happy can make sense to export as an html or something to the repo, or just link from the main readme.
I believe the target topics worth covering are:
- Common user data types: Ints, Dicts, Struct, Time
- Common column types: Data, Vector, Column
- Going from individual & arrays & buffers of JS values to Arrow-wrapped forms, and basic inspection of the result
Not worth going into here is Tables vs. RecordBatches, which is the other tutorial.
1. Ideas of what to add/edit/remove?
2. And anyone up for helping with discussion of Data vs. Vector, and ingest of Time & Struct?
3. ... Should we be encouraging Struct or Map? I saw some PRs changing stuff here.
cc wesm bhulette paul.e.taylor
Attachments
Issue Links
- links to