The JS client lacks basic introductory material around creating the common basic data types such as turning JS arrays into ints, dicts, etc. There is no equivalent of Python's https://arrow.apache.org/docs/python/data.html . This has made use for myself difficult, and I bet for others.
As with prev tutorials, I started sketching on https://observablehq.com/@lmeyerov/rich-data-types-in-apache-arrow-js-efficient-data-tables-wit . When we're happy can make sense to export as an html or something to the repo, or just link from the main readme.
I believe the target topics worth covering are:
- Common user data types: Ints, Dicts, Struct, Time
- Common column types: Data, Vector, Column
- Going from individual & arrays & buffers of JS values to Arrow-wrapped forms, and basic inspection of the result
Not worth going into here is Tables vs. RecordBatches, which is the other tutorial.
1. Ideas of what to add/edit/remove?
2. And anyone up for helping with discussion of Data vs. Vector, and ingest of Time & Struct?
3. ... Should we be encouraging Struct or Map? I saw some PRs changing stuff here.