-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Issues: apache/parquet-java
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ProtoSchemaConverter renders invalid schema for oneof in unwrap mode
Type: bug
#3039
opened Oct 31, 2024 by
aka-peter
updated Oct 31, 2024
Cannot read parquet file that was generated from nanoparquet
Type: bug
#3043
opened Nov 6, 2024 by
RealTYPICAL
updated Nov 15, 2024
Is it possible to apply specific encodings on specific columns with ParquetWriter?
#3051
opened Nov 7, 2024 by
Selfeer
updated Nov 15, 2024
Include FAPEC compressor support to Parquet?
Type: enhancement
#3067
opened Nov 18, 2024 by
PortellJ
updated Nov 18, 2024
remove deprecated PathGlobPattern and DeprecatedFieldProjectionFilter to compile on hadoop 3.2+
Component: Parquet
Component: Thrift
Priority: Major
Type: enhancement
#2698
opened Jul 12, 2022 by
asfimport
updated Nov 19, 2024
Add Variant Logical Type
Type: enhancement
#3070
opened Nov 20, 2024 by
aihuaxu
updated Nov 20, 2024
Inconsistency between vectorized processing boundary calculation in benchmark and readBatchUsing512Vector calculation method in ParquetReadRouter class?
#3073
opened Nov 22, 2024 by
1111nit
updated Nov 22, 2024
read footer using 1 call readFully(byte[8]) instead of 5 calls ( 4 x read() for footer length + 1 x read(byte[4]) for magic marker )
Type: enhancement
#3074
opened Nov 23, 2024 by
Arnaud-Nauwynck
updated Nov 23, 2024
should not use seek() for skipping very small column chunks. better to read and ignore data.
Type: enhancement
#3076
opened Nov 23, 2024 by
Arnaud-Nauwynck
updated Nov 25, 2024
HadoopStreams to support ByteBufferPositionedReadable input streams
Type: enhancement
#3080
opened Nov 26, 2024 by
steveloughran
updated Nov 26, 2024
Make DELTA_LENGTH_BYTE_ARRAY default encoding for binary values
Type: enhancement
#3083
opened Nov 27, 2024 by
raunaqmorarka
updated Nov 27, 2024
Add documentation about how to verify the release
Type: enhancement
#3091
opened Nov 30, 2024 by
raulcd
updated Nov 30, 2024
2
Required field 'num_values' was not found in serialized data!
Type: bug
#3084
opened Nov 28, 2024 by
wardlican
updated Dec 2, 2024
Use FixedSizeBinary instead of Binary for int96 conversion when convertInt96ToArrowTimestamp is false
Type: enhancement
#3088
opened Nov 29, 2024 by
doki23
updated Dec 2, 2024
parquet-cli reports nested columns as null
Type: bug
#3095
opened Dec 3, 2024 by
acdha
updated Dec 3, 2024
make it easy to read and write parquet files in java without depending on hadoop
Component: Parquet
Priority: Major
Type: enhancement
#1497
opened Oct 7, 2017 by
asfimport
updated Dec 16, 2024
Fail to read google.protobuf.UInt32Value from parquet
Type: bug
#3112
opened Jan 3, 2025 by
0x26res
updated Jan 3, 2025
Implement the Variant binary encoding
Type: enhancement
#3116
opened Jan 7, 2025 by
gene-db
updated Jan 7, 2025
High Memory Usage and Long GC Times When Writing Parquet Files
#3102
opened Dec 10, 2024 by
ccl125
updated Jan 13, 2025
Parquet-java sometimes produces 0-size compressed data in data page v2
Type: bug
#3122
opened Jan 15, 2025 by
mapleFU
updated Jan 26, 2025
Add more constructors to ParquetFileReader
Type: enhancement
#3141
opened Jan 30, 2025 by
yuzhu
updated Feb 3, 2025
ExampleParquetWriter may write illegal values for uint_8 and uint_16
Type: bug
#3142
opened Jan 31, 2025 by
parthchandra
updated Feb 3, 2025
Optimize the binaryToDecimal function in the DecimalUtils class
Type: enhancement
#3146
opened Feb 6, 2025 by
qian0817
updated Feb 6, 2025
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.