Decompressors for hyparquet
Go to file
Kenny Daniel 03befc14a5
Brotli
2024-05-19 23:57:44 -07:00
.github/workflows Initial project skeleton 2024-05-09 00:19:56 -07:00
src Brotli 2024-05-19 23:57:44 -07:00
test Brotli 2024-05-19 23:57:44 -07:00
.eslintrc.json rollup 2024-05-19 00:37:53 -07:00
.gitignore rollup 2024-05-19 00:37:53 -07:00
LICENSE Initial project skeleton 2024-05-09 00:19:56 -07:00
package.json Brotli 2024-05-19 23:57:44 -07:00
README.md Fix hadoop lz4 2024-05-19 21:16:29 -07:00
rollup.config.js Brotli 2024-05-19 23:57:44 -07:00
tsconfig.json Initial project skeleton 2024-05-09 00:19:56 -07:00

hyparquet decompressors

workflow status mit license coverage

This package exports a compressors object intended to be passed into hyparquet.

Apache Parquet is a popular columnar storage format that is widely used in data engineering, data science, and machine learning applications for efficiently storing and processing large datasets. It supports a number of different compression formats, but most parquet files use snappy compression.

The hyparquet library by default only supports uncompressed and snappy compressed files. The hyparquet-compressors package extends support for all legal parquet compression formats.

Usage

import { parquetRead } from 'hyparquet'
import { compressors } from 'hyparquet-compressors'

await parquetRead({ file, compressors, onComplete: console.log })

Supported compression formats

Parquet compression types supported with hyparquet-compressors:

  • Uncompressed
  • Snappy
  • GZip
  • LZO
  • Brotli
  • LZ4
  • ZSTD
  • LZ4_RAW

References