Commit Graph

43 Commits

Author SHA1 Message Date
mike-iqmo
dbf3065f8e
Addresses issues with duckdb use of delta encodings (#77)
* Addresses issues with duckdb use of delta encodings

* Shrunk size of test data
2025-05-14 16:28:58 -07:00
Kenny Daniel
4df7095ab4
Group column decoding params into an object 2025-04-10 19:30:25 -07:00
Kenny Daniel
1247f5d606
Split out readPage
Remove dict-page-offset-zero test because it's a malformed parquet file.
2025-04-02 20:27:10 -07:00
Kenny Daniel
4bbc7742e5
Comment out unnecessary length read in readRleBitPackedHybrid 2025-03-09 11:20:58 -07:00
Kenny Daniel
791a847e42
Revert "Simplify relative import paths"
This reverts commit e590f4ee03263460a389bdd29678015727cdcd5a.
2025-03-06 08:54:32 -08:00
Kenny Daniel
e590f4ee03
Simplify relative import paths 2025-03-05 14:03:17 -08:00
Kenny Daniel
2a302702d4
Fix handling of boolean rle 2025-02-22 13:29:29 -08:00
Johan Levin
bf268e141c
Use prepended length for bit-packed hybrid bool columns (#62) 2025-02-19 11:07:49 -08:00
Sylvain Lesage
09ae9400c5
build types before publishing to npm (#46)
* build types before publishing to npm

* use prepare instead of prepublishOnly + make it clear that we only build types

doc for prepare vs prepublishOnly is here: https://docs.npmjs.com/cli/v8/using-npm/scripts

* no jsx in this lib

* relative imports from the root, so that it works from types/

* remove unused hyparquet.d.ts + report differences to jsdoc in files

* try to understand if this is the cause of the failing CI check

tsc fails: https://github.com/hyparam/hyparquet/actions/runs/12040954822/job/33571851170?pr=46

* Revert "try to understand if this is the cause of the failing CI check"

This reverts commit 5e2fc8ca179064369de71793ab1cda3facefddc7.

* not sure what happens, but we just need to ensure the types are created correctly

* increment version

* Explicitly export types for use in downstream typescript projects

* Use new typescript jsdoc imports for smaller package

* Combine some files and use @import jsdoc

* use the local typescript

---------

Co-authored-by: Kenny Daniel <platypii@gmail.com>
2024-12-02 17:47:42 +01:00
Kenny Daniel
ddb8b16cd0
Fix handling of multiple pages 2024-06-07 23:16:04 -07:00
Kenny Daniel
36f5b4f043
Move decompressPage to avoid circular dependency chain 2024-05-27 12:54:42 -07:00
Kenny Daniel
f4877dcd49
Publish v0.9.6! 2024-05-24 00:19:37 -07:00
Kenny Daniel
b8e4496063
Upgrade dataPage to match dictionary type 2024-05-23 00:07:09 -07:00
Kenny Daniel
9369faad46
Code cleanup 🧹 2024-05-22 12:58:37 -07:00
Kenny Daniel
66b832d5bb
Fix RLE encoding length 2024-05-21 18:41:57 -07:00
Kenny Daniel
a1ca1ef785
byte_stream_split_extended.gzip.parquet 2024-05-21 17:21:36 -07:00
Kenny Daniel
70387fa345
repeated_no_annotation.parquet 2024-05-20 23:09:31 -07:00
Kenny Daniel
9cd09b8eed
Byte stream split encoding 2024-05-20 04:09:32 -07:00
Kenny Daniel
7639b8ca7f
Fix fixed length byte array type 2024-05-12 21:52:26 -07:00
Kenny Daniel
82db6a8017
Delta binary packed encoding 2024-05-12 15:47:16 -07:00
Kenny Daniel
e398e66dd4
No utf8 in plain decoding 2024-05-09 16:28:50 -07:00
Kenny Daniel
4d5c8324aa
TypedArrays 2024-05-01 23:23:55 -07:00
Kenny Daniel
b8660baea1
Split out plain encoding 2024-04-30 20:28:50 -07:00
Kenny Daniel
f16e23f501
No copy readRleBitPackedHybrid 2024-04-30 14:40:18 -07:00
Kenny Daniel
93ff9a9f99
Refactor isListLike and isMapLike to use schemaPath 2024-04-29 18:45:29 -07:00
Kenny Daniel
2c6a111113
Refactor to use schemaPath 2024-04-29 17:38:26 -07:00
Kenny Daniel
86273b110c
PageType enum to string 2024-04-18 00:21:13 -07:00
Kenny Daniel
f826bff757
Use DataReader over Decoded. Fewer allocations, slightly faster. 2024-04-17 23:43:04 -07:00
Kenny Daniel
52721a3d30
Split out assemble objects 2024-03-18 17:40:52 -07:00
Kenny Daniel
4654c5eddf
Fix tests 2024-03-15 00:17:35 -07:00
Kenny Daniel
c6ad30b59a
schemaElement returns trees 2024-03-12 20:39:15 -07:00
Kenny Daniel
319dbb124e
Hysnappy wasm for faster benchmark.js 2024-02-27 14:37:32 -08:00
Kenny Daniel
8b575ad2d8
ParquetType as string 2024-02-27 11:31:17 -08:00
Kenny Daniel
11f35c9e43
Encoding as string 2024-02-27 10:51:57 -08:00
Kenny Daniel
87d78ab06e
Oops fix the other tests 2024-02-26 22:51:57 -08:00
Kenny Daniel
a65132b79c
Data Page V2 2024-02-26 18:44:33 -08:00
Kenny Daniel
c70b3b2227
Prepare for data page v2 2024-02-26 11:44:28 -08:00
Kenny Daniel
e2b85304b3
Fix UTF8 decoding 2024-02-16 16:27:00 -08:00
Kenny Daniel
69d2e725dd
Fix plain_dictionary encoding 2024-02-11 20:45:53 -08:00
Kenny Daniel
17f7ace840
Fix bug in readData, support more parquet files 2024-02-11 14:27:38 -08:00
Kenny Daniel
6d03bd6d86
Fix definition level parsing 2024-01-20 18:18:32 -08:00
Kenny Daniel
647056da8b
Async parquetRead with options 2024-01-15 15:11:03 -08:00
Kenny Daniel
c384cf469b
Parquet data page parser 2024-01-07 15:33:24 -08:00