Spaces:
Running
Running
feature request: download as Parquet?
#1
by
julien-c
- opened
CSV is ok but Parquet is hub native:)
nice!!! will use it
julien-c
changed discussion status to
closed
Not sure what the app uses for the Parquet export. It's likely in JS and therefore unlikely to use the Parquet writer for Xet from Arrow
Exactly, all is TS code.
Current export is straightforward/simplistic.
- Data is stored on DuckDB.
- We export the file locally
COPY (SELECT ...) TO 'file.parquet' (FORMAT PARQUET)
- We push the file to the hub with the hub js library
Here's an example of the result: https://huggingface.co/datasets/dvilasuero/hands_playing_instruments
Thanks!
@lhoestq
. Does the datasets
library make any special preparations before sending Parquet files? Or is enough with using huggingface_hub
+ hf_xet
?