Comma-separated values (CSV) files have long been enormously popular and are used heavily for data storage and manipulation.
However, the introduction of WebAssembly boosted the processing capabilities of web browsers and opened new opportunities for efficient analytics in the browser. So, instead of CSV files, we suggest working with the Parquet and Feather formats, which can significantly speed up OLAP workloads in the browser.
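To illustrate why a columnar layout helps analytical queries, here is a minimal sketch in plain TypeScript. It does not use Parquet, Feather, or any Datagrok API; the sample data, column names, and helper functions are all hypothetical and exist only to contrast re-parsing row-oriented text on every query with scanning one pre-parsed numeric column.

```typescript
// Hypothetical sample data; the column names are illustrative only.
const csvText = "id,price,qty\n1,9.5,2\n2,20.0,1\n3,0.5,10\n";

// Row-oriented access: every query re-parses the text and splits every row,
// even though only one field per row is needed.
function sumColumnFromCsv(text: string, column: string): number {
  const lines = text.trim().split("\n");
  const idx = lines[0].split(",").indexOf(column);
  if (idx < 0) throw new Error("Unknown column: " + column);
  let total = 0;
  for (let i = 1; i < lines.length; i++) {
    total += Number(lines[i].split(",")[idx]);
  }
  return total;
}

// Column-oriented storage: each column is parsed once into a contiguous
// typed array, roughly the shape in which columnar formats hand data over.
function toColumns(text: string): Record<string, Float64Array> {
  const lines = text.trim().split("\n");
  const header = lines[0].split(",");
  const rowCount = lines.length - 1;
  const columns: Record<string, Float64Array> = {};
  for (const name of header) {
    columns[name] = new Float64Array(rowCount);
  }
  for (let r = 0; r < rowCount; r++) {
    const fields = lines[r + 1].split(",");
    for (let c = 0; c < header.length; c++) {
      columns[header[c]][r] = Number(fields[c]);
    }
  }
  return columns;
}

// An aggregate now touches only the values of the requested column.
function sumColumn(columns: Record<string, Float64Array>, column: string): number {
  const values = columns[column];
  if (!values) throw new Error("Unknown column: " + column);
  let total = 0;
  for (let i = 0; i < values.length; i++) {
    total += values[i];
  }
  return total;
}

const columns = toColumns(csvText);
console.log(sumColumnFromCsv(csvText, "price")); // 30
console.log(sumColumn(columns, "price"));        // 30
```

Both calls return the same result, but the columnar version pays the parsing cost once and then reads a single dense array per query. Real columnar formats add typed schemas and, in Parquet's case, compression and encoding, none of which this sketch attempts to model.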
Today we are excited to announce Parquet and Feather support on the Datagrok platform with the Arrow package, built on the parquet-wasm and apache-arrow libraries. It provides the following functions:
- exporting any Datagrok dataframe to the Parquet or Feather format;
- importing any existing file in either of these formats as a Datagrok dataframe.
It is super easy to use.
To save data in the Parquet or Feather format, choose “Save as Parquet” or “Save as Feather” from the drop-down list. The file is created automatically and saved to your device.
To open a local file, go to File -> Open -> File and choose the file in the required format.
Let’s take a look at how it works.