hyparquet/CHANGELOG.md

171 lines
4.4 KiB
Markdown
Raw Permalink Normal View History

2025-04-30 08:01:38 +00:00
# Changelog
2025-04-17 08:45:02 +00:00
2025-11-26 18:06:43 +00:00
## [1.22.0]
- Rename constants to plural, and remove LogicalTypeType
2025-11-26 06:29:17 +00:00
## [1.21.1]
- Add bigint to ParquetQueryValue type
2025-11-21 11:11:01 +00:00
## [1.21.0]
- Adds optional `filter` parameter for pushdown row group filtering (#141)
## [1.20.3]
- Fix extra fetch on the boundary of row groups
2025-11-04 06:27:10 +00:00
## [1.20.2]
- Support S3 presigned URLs in `asyncBufferFromUrl` (#137 thanks @EpsilonPrime)
## [1.20.1]
- Update geospatial and variant metadata
2025-10-16 08:29:57 +00:00
## [1.20.0]
- Mark geospatial columns in GeoParquet with geometry or geography data type (#133)
- Add `geoparquet` option to opt out marking GeoParquet columns (#133)
2025-10-01 06:00:06 +00:00
## [1.19.0]
- Parse geometry and geography data types to geojson geometry objects (#131)
2025-09-27 07:01:33 +00:00
## [1.18.1]
- Fix geospatial metadata parsing
- Custom `stringFromBytes` parser option (#129)
2025-09-16 22:36:33 +00:00
## [1.18.0]
- Refine `onComplete` types for `rowFormat: 'array'` and `rowFormat: 'object'` (#120)
- Only object format for `parquetReadObjects` and `parquetQuery` (#120)
- Parquet `__index__` column overrides `parquetQuery` index annotation (#120)
## [1.17.8]
- Export `readColumnIndex` and `readOffsetIndex` (#122)
## [1.17.7]
2025-09-07 00:55:52 +00:00
- Fix early page termination for non-flat columns (#119)
- Fix a bug in parquetQuery, when rowFormat is 'array' (#118 thanks @severo)
## [1.17.6]
2025-09-07 00:55:52 +00:00
- Revert "Fix onComplete return type (#104)" (#117)
## [1.17.5]
2025-09-07 00:55:52 +00:00
- Fix high-precision decimal parsing (#116)
2025-08-22 19:26:00 +00:00
## [1.17.4]
- Fix onComplete return type (#104 thanks @supermar1010)
## [1.17.3]
- Fix circular import (#111)
2025-08-15 20:24:44 +00:00
## [1.17.2]
- Export `ParquetQueryFilter` type (#105)
- Remove a circular dependency (#108)
2025-07-02 22:49:06 +00:00
## [1.17.1]
- Fix zero row file (#98 thanks @kroche98)
2025-06-30 08:48:51 +00:00
## [1.17.0]
- Require explicit `$eq` operator for `parquetQuery` filters.
2025-06-17 21:30:00 +00:00
## [1.16.2]
- Fix readRleBitPackedHybrid when length is zero
## [1.16.1]
- Fix duckdb empty block
2025-06-10 18:02:42 +00:00
## [1.16.0]
- New `parsers` option for custom date parsing.
- Breaking change: parquetMetadataAsync moved initialFetchSize into an options object.
2025-05-30 22:56:18 +00:00
## [1.15.0]
- Change packaging to have node-specific exports for `asyncBufferFromFile` (#80).
2025-05-27 00:58:43 +00:00
## [1.14.0]
- Refactor to use `AsyncRowGroup` and `AsyncColumn` abstractions for better performance and flexibility (#83).
2025-05-25 06:41:52 +00:00
## [1.13.6]
- Fix page continuation (#81 thanks @jpivarski).
## [1.13.5]
- Fast parquetQuery filter (#78)
2025-05-18 01:09:37 +00:00
## [1.13.4]
- parquetSchema more generic argument type.
2025-05-16 06:50:13 +00:00
## [1.13.3]
2025-05-18 01:09:37 +00:00
- Convert logical type 'STRING'.
- Side-effect-free in package.json.
2025-05-16 06:50:13 +00:00
2025-05-14 23:42:12 +00:00
## [1.13.2]
- Fix duckdb delta encoding (#77 thanks @mike-iqmo).
## [1.13.1]
- Throw exception for unsupported file_path.
2025-04-30 08:01:38 +00:00
## [1.13.0]
2025-05-14 23:42:12 +00:00
- Query planner: pre-fetch byte ranges in parallel (#75).
2025-04-30 08:01:38 +00:00
## [1.12.1]
2025-04-17 08:45:02 +00:00
- Fix conversion of unsigned types.
- Allow passing a custom fetch function to utilities (#73).
2025-04-30 08:01:38 +00:00
## [1.12.0]
2025-04-17 08:45:02 +00:00
- Add `onPage` callback to `parquetRead`.
2025-04-30 08:01:38 +00:00
## [1.11.1]
2025-04-17 08:45:02 +00:00
- Fix handling of dictionary pages from parquet.net.
2025-04-30 08:01:38 +00:00
## [1.11.0]
2025-04-17 08:45:02 +00:00
- Fix continued data pages
- Skip decoding unnecessary pages.
2025-04-30 08:01:38 +00:00
## [1.10.4]
2025-04-17 08:45:02 +00:00
- Add type definitions for thrift.
2025-04-30 08:01:38 +00:00
## [1.10.3]
2025-04-17 08:45:02 +00:00
- Internal refactor split out `readPage` function.
2025-04-30 08:01:38 +00:00
## [1.10.2]
2025-04-17 08:45:02 +00:00
- Export additional internal constants.
2025-04-30 08:01:38 +00:00
## [1.10.1]
2025-04-17 08:45:02 +00:00
- Fix parsing of `crypto_metadata` in thrift.
2025-04-30 08:01:38 +00:00
## [1.10.0]
2025-04-17 08:45:02 +00:00
- Map `src` files to TypeScript types via package exports (#70).
- Use `defaultInitialFetchSize` for both metadata and `cachedAsyncBuffer`.
2025-04-30 08:01:38 +00:00
## [1.9.x]
2025-04-17 08:45:02 +00:00
- Add `minSize` parameter to `cachedAsyncBuffer` for finer control.
- Return typed arrays in `onChunk` callbacks.
- Change `readColumn` to return an array of `DecodedArray` (#67).
2025-04-30 08:01:38 +00:00
## [1.8.x]
2025-05-14 23:42:12 +00:00
- Support endpoints without range requests in `asyncBufferFromUrl` (#57 thanks @swlynch99).
2025-04-17 08:45:02 +00:00
- Enhance error messages for common parsing issues.
2025-05-14 23:42:12 +00:00
- Mongo-style `filter` option in `parquetQuery` (#56 thanks @park-brian).
2025-04-17 08:45:02 +00:00
2025-04-30 08:01:38 +00:00
## [1.7.0]
2025-05-14 23:42:12 +00:00
- Enable `readColumn` to read all rows (#53 thanks @park-brian).
2025-04-17 08:45:02 +00:00
- Validate url in `asyncBufferFromUrl`.
2025-04-30 08:01:38 +00:00
## [1.6.x]
2025-05-14 23:42:12 +00:00
- Fix timestamp conversion in metadata parsing (#45 thanks @cbardasano).
2025-04-17 08:45:02 +00:00
- Build TypeScript types before publishing to npm.
2025-04-30 08:01:38 +00:00
## [1.5.0]
2025-04-17 08:45:02 +00:00
- Export `cachedAsyncBuffer` utility.
2025-04-30 08:01:38 +00:00
## [1.4.0]
2025-04-17 08:45:02 +00:00
- Add `parquetQuery` with `orderBy` option.
2025-04-30 08:01:38 +00:00
## [1.3.0]
2025-04-17 08:45:02 +00:00
- Promisify `parquetReadObjects` function.
- Add support for parsing column and offset indexes (#29).
2025-04-30 08:01:38 +00:00
## [1.2.0]
2025-05-14 23:42:12 +00:00
- Return columns in the requested order (#27 thanks @cstranstrum).
- Add option to return each row as an object keyed by column names (#25 thanks @cstranstrum).
2025-04-17 08:45:02 +00:00
2025-04-30 08:01:38 +00:00
## [1.1.0]
2025-04-17 08:45:02 +00:00
- Export `asyncBufferFromFile` and `asyncBufferFromUrl` utilities.
2025-04-30 08:01:38 +00:00
## [1.0.0]
2025-04-17 08:45:02 +00:00
- Initial stable release.