
hyparquet decompressors


This package exports a compressors object intended to be passed into hyparquet.

Apache Parquet is a columnar storage format widely used in data engineering, data science, and machine learning to store and process large datasets efficiently. It supports several compression formats, but most Parquet files use snappy compression.

By default, the hyparquet library supports only uncompressed and snappy-compressed files. The hyparquet-compressors package extends that support to all legal Parquet compression formats.

Usage

import { parquetRead } from 'hyparquet'
import { compressors } from 'hyparquet-compressors'

// `file` must already be defined: it is the parquet file to read
await parquetRead({ file, compressors, onComplete: console.log })
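
To illustrate what a compressors object might look like, here is a minimal sketch of a hand-written one with a single entry. It assumes that hyparquet looks up decompressors by Parquet codec name (here 'GZIP') and calls each one as (input, outputLength) => Uint8Array; both are assumptions, not confirmed by this README. The name customCompressors is hypothetical, and Node's built-in zlib stands in for the package's own decompressors. The sketch uses require so that it runs as a standalone Node script.

```javascript
// Sketch only: the 'GZIP' key and the (input, outputLength) => Uint8Array
// shape are assumptions about how hyparquet calls a decompressor.
const zlib = require('node:zlib')

// Hypothetical compressors object with one codec.
const customCompressors = {
  GZIP: (input, outputLength) => {
    // Decompress with Node's zlib and copy the result into a plain Uint8Array.
    const output = new Uint8Array(zlib.gunzipSync(input))
    // Fail loudly if the result does not match the size the caller expects.
    if (output.length !== outputLength) {
      throw new Error(`expected ${outputLength} bytes, got ${output.length}`)
    }
    return output
  },
}

// Round trip through Node's own gzip to exercise the function in isolation.
const original = new TextEncoder().encode('hello parquet')
const compressed = zlib.gzipSync(original)
const restored = customCompressors.GZIP(compressed, original.length)
```

If the assumed shape holds, an object like this could be passed in place of compressors in the call above. In practice the exported compressors object already covers the formats listed below.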

Supported compression formats

Parquet compression types supported with hyparquet-compressors:

  • Uncompressed
  • Snappy
  • GZip
  • LZO
  • Brotli
  • LZ4
  • ZSTD
  • LZ4_RAW

References