-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Insights: apache/iceberg
Overview
Could not load contribution data
Please try again later
87 Pull requests merged by 34 people
-
API, Core: Add default value APIs and Avro implementation
#9502 merged
Oct 4, 2024 -
Build: Update baseline-java 5.69.0
#11252 merged
Oct 4, 2024 -
Build: Bump org.xerial:sqlite-jdbc from 3.46.1.0 to 3.46.1.3
#11231 merged
Oct 3, 2024 -
Build: Bump org.eclipse.microprofile.openapi:microprofile-openapi-api from 3.1.1 to 3.1.2
#11182 merged
Oct 3, 2024 -
Build: Bump mkdocs-material from 9.5.34 to 9.5.38
#11233 merged
Oct 3, 2024 -
Puffin: Document stats
ndv
value representation#10793 merged
Oct 3, 2024 -
AWS: Set better defaults for S3 retry behaviour
#11052 merged
Oct 1, 2024 -
Core: Deprecate legacy ways for loading position deletes
#11242 merged
Oct 1, 2024 -
ThreadPools introduce newExitingWorkerPool and newFixedThreadPool for clearer semantics
#11073 merged
Oct 1, 2024 -
Build: Bump nessie from 0.97.1 to 0.99.0
#11224 merged
Oct 1, 2024 -
Core: Add DataFileSet / DeleteFileSet
#11195 merged
Oct 1, 2024 -
Core: Support combining position deletes during writes
#11222 merged
Oct 1, 2024 -
REST: Handle Requests with Page Sizes Exceeding Available Number of Namespaces /Tables/Views
#11143 merged
Sep 30, 2024 -
Core: Improve error handling when parsing view representations
#11236 merged
Sep 30, 2024 -
Build: Bump io.delta:delta-spark_2.12 from 3.2.0 to 3.2.1
#11225 merged
Sep 30, 2024 -
Build: Bump junit-platform from 1.10.3 to 1.11.1
#11227 merged
Sep 30, 2024 -
Build: Bump io.delta:delta-standalone_2.12 from 3.2.0 to 3.2.1
#11228 merged
Sep 30, 2024 -
Build: Bump software.amazon.awssdk:bom from 2.28.5 to 2.28.11
#11229 merged
Sep 30, 2024 -
Build: Bump guava from 33.3.0-jre to 33.3.1-jre
#11230 merged
Sep 30, 2024 -
Spark: Deprecate SparkAppenderFactory
#11076 merged
Sep 27, 2024 -
[Minor][Test] Fix TestFastAppend.testAddManyFiles()
#11218 merged
Sep 27, 2024 -
Core: Replace use of CharSequenceMap in DeleteFileIndex with Map<String, PositionDeletes>
#11199 merged
Sep 26, 2024 -
Parquet: update PruneColumns to inherit from TypeWithSchemaVisitor to have Iceberg type
#11179 merged
Sep 26, 2024 -
Core: Add a util to compute partition stats
#11146 merged
Sep 26, 2024 -
Upgrade to Gradle 8.10.2
#11212 merged
Sep 26, 2024 -
Core: Remove unused code for streaming position deletes
#11175 merged
Sep 26, 2024 -
Core: Support merging in PositionDeleteIndex
#11208 merged
Sep 25, 2024 -
Spark: Added merge schema as spark configuration
#9640 merged
Sep 25, 2024 -
Core: Support iterating over positions in PositionDeleteIndex
#11202 merged
Sep 25, 2024 -
Core: Add rewritten delete files to write results
#11203 merged
Sep 25, 2024 -
Build: Bump mkdocs-macros-plugin from 1.0.5 to 1.2.0
#11189 merged
Sep 25, 2024 -
AWS: Fix AWS doc URL
#11198 merged
Sep 25, 2024 -
API, AWS: Retry S3InputStream reads
#10433 merged
Sep 24, 2024 -
[Docs] Update AWS docs to add more AWS engines that supports iceberg
#11192 merged
Sep 24, 2024 -
API: Deprecate ContentFile#path API and add location API which returns String
#11092 merged
Sep 23, 2024 -
Build: Bump org.roaringbitmap:RoaringBitmap from 1.2.1 to 1.3.0
#11187 merged
Sep 23, 2024 -
Build: Bump com.google.cloud:libraries-bom from 26.44.0 to 26.47.0
#11185 merged
Sep 23, 2024 -
Build: Bump nessie from 0.95.0 to 0.97.1
#11184 merged
Sep 23, 2024 -
Build: Bump tez010 from 0.10.3 to 0.10.4
#11183 merged
Sep 23, 2024 -
Build: Bump org.apache.httpcomponents.client5:httpclient5 from 5.3.1 to 5.4
#11186 merged
Sep 23, 2024 -
Docs: Uppercase keyword in branching
#11172 merged
Sep 20, 2024 -
Supplement test case for
RollbackToTimestampProcedure
#11171 merged
Sep 20, 2024 -
Build: Bump org.xerial.snappy:snappy-java from 1.1.10.6 to 1.1.10.7
#11140 merged
Sep 20, 2024 -
AWS: Bump AWS SDK to version 2.28.5
#11170 merged
Sep 20, 2024 -
Docs:
field_id
in name serialisation spec should readfield-id
#11135 merged
Sep 19, 2024 -
API, Core: Enable dropping rewritten delete files in RowDelta
#11166 merged
Sep 19, 2024 -
Docs: Clarified Partition Transform
#8337 merged
Sep 19, 2024 -
Build: Add .java-version to gitignore.
#11167 merged
Sep 19, 2024 -
Core: Add explicit JSON parser for LoadTableResponse
#11148 merged
Sep 19, 2024 -
Core: Move internal struct projection to SupportsIndexProjection
#11132 merged
Sep 18, 2024 -
Kafka Connect: separate CI workflow
#11075 merged
Sep 18, 2024 -
Core: Update metadata location without updating lastUpdatedMillis
#11151 merged
Sep 18, 2024 -
Build: switch to slf4j-simple 2.x for test implementation dependency
#11001 merged
Sep 17, 2024 -
Spark 3.4: Add utility to load table state reliably
#11115 merged
Sep 16, 2024 -
Docs: Backport fixes for remove_orphan_files procedure
#11133 merged
Sep 16, 2024 -
Flink: Increase the number of checkpoints from 4 to 6 to fix flakiness.
#11121 merged
Sep 16, 2024 -
Docs: Fix missing options for remove_orphan_files procedure
#11080 merged
Sep 14, 2024 -
Spark 3.4: Action to compute table stats
#11106 merged
Sep 13, 2024 -
Hive: Add View support for HIVE catalog
#9852 merged
Sep 13, 2024 -
Core: Allow servers to express supported endpoints via endpoint field in ConfigResponse
#10929 merged
Sep 13, 2024 -
API, Core: Add manifestLocation API to ContentFile
#11044 merged
Sep 13, 2024 -
Core: Parallelize manifest writing for many new files
#11086 merged
Sep 12, 2024 -
OpenAPI: Fix YAML example and value json formatting
#11119 merged
Sep 12, 2024 -
Flink: Port #10484 to v1.19 (#11010)
#11117 merged
Sep 12, 2024 -
Docs: Update Project links to include contributing and REST spec
#11114 merged
Sep 12, 2024 -
Flink: Maintenance - Lock remover
#11010 merged
Sep 12, 2024 -
Build: Upgrade google-java-format to 1.22.0
#11050 merged
Sep 11, 2024 -
Kafka Connect: Stop commits on terminated coordinator
#10814 merged
Sep 10, 2024 -
Kafka Connect: Docs on configuring the sink
#10746 merged
Sep 10, 2024 -
Add Scan Planning Endpoints to open api spec
#9695 merged
Sep 10, 2024 -
Build: Remove unused variables, fields and parameters
#11101 merged
Sep 10, 2024 -
Upgrade to Gradle 8.10.1
#11104 merged
Sep 10, 2024 -
Add rmoff blogs
#11069 merged
Sep 10, 2024 -
Core, Kafka, Spark: Use AssertJ instead of JUnit assertions
#11102 merged
Sep 10, 2024 -
Core: Fix the behavior of IncrementalFileCleanup when expire a snapshot
#10983 merged
Sep 9, 2024 -
Spark 3.3, 3.4: Fix incorrect catalog loaded in TestCreateActions
#11049 merged
Sep 9, 2024 -
Build: Bump io.netty:netty-buffer from 4.1.112.Final to 4.1.113.Final
#11097 merged
Sep 9, 2024 -
Build: Bump software.amazon.awssdk:bom from 2.27.12 to 2.27.21
#11098 merged
Sep 9, 2024 -
Spec: Fix rendering of partition stats file spec
#11068 merged
Sep 9, 2024 -
Build: Bump jetty from 11.0.23 to 11.0.24
#11096 merged
Sep 9, 2024 -
Docs: Document accessing instance variables
#11087 merged
Sep 7, 2024 -
Core: Fix setting hasNewDataFile flag in MergingSnapshotProducer
#11088 merged
Sep 7, 2024 -
Spark 3.5: Mandate identifier fields when create_changelog_view for table contain unsortable columns
#11045 merged
Sep 5, 2024 -
Spark 3.3, 3.4: Parallelize reading files in migrate procedures
#11043 merged
Sep 5, 2024 -
open-api: Fix compile warnings for testFixtures
#11071 merged
Sep 5, 2024 -
Build: Enable more error-prone checks
#11078 merged
Sep 5, 2024
45 Pull requests opened by 31 people
-
Spark: revert delete procedure
#11084 opened
Sep 5, 2024 -
Add REST Catalog tests to Spark 3.5 integration test
#11093 opened
Sep 6, 2024 -
Kafka Connect: add option to force columns to lowercase
#11100 opened
Sep 8, 2024 -
Core: Add internal Avro reader
#11108 opened
Sep 11, 2024 -
AWS: Introduce opt-in S3LocationProvider which is optimized for S3 performance
#11112 opened
Sep 11, 2024 -
Core: Fix caching table with metadata table names
#11123 opened
Sep 13, 2024 -
Spec: Adds Row Lineage
#11130 opened
Sep 13, 2024 -
Build: Bump org.apache.datasketches:datasketches-java from 6.0.0 to 6.1.0
#11137 opened
Sep 15, 2024 -
Build: Bump com.google.errorprone:error_prone_annotations from 2.31.0 to 2.32.0
#11139 opened
Sep 15, 2024 -
Flink: Maintenance - TableManager + ExpireSnapshots
#11144 opened
Sep 16, 2024 -
column to column comparisons for filtering file scans and row data
#11152 opened
Sep 17, 2024 -
OpenAPI: Add planning-mode to loadTable response
#11156 opened
Sep 18, 2024 -
Spark 3.5: Fix NotSerializableException when migrating partitioned Spark tables
#11157 opened
Sep 18, 2024 -
Core: Switch usage to DataFileSet / DeleteFileSet
#11158 opened
Sep 18, 2024 -
Docs: Add Bigquery Iceberg documentation, Update MRAP endpoint and add more docs
#11159 opened
Sep 18, 2024 -
Core: Fix UnicodeUtil#truncateStringMax returns malformed string.
#11161 opened
Sep 18, 2024 -
Feature/otf 1500 column comparisons 1521
#11164 opened
Sep 18, 2024 -
FIX: Exception Handling in AWS Glue renameTable Method
#11165 opened
Sep 18, 2024 -
Core: Add credentials to loadTable / loadView responses
#11173 opened
Sep 20, 2024 -
Config for deciding whether to use Iceberg Time type
#11174 opened
Sep 20, 2024 -
API, Core: Add scan planning api request and response models
#11180 opened
Sep 20, 2024 -
Always update table metadata when `refresh` is called
#11194 opened
Sep 24, 2024 -
update PartitionSpec with snapshot'schema
#11196 opened
Sep 24, 2024 -
Core: Add support for view-override property in catalog
#11200 opened
Sep 24, 2024 -
PoC: Add Variant type support in Iceberg
#11201 opened
Sep 24, 2024 -
Compatible with Spark4 (upgrade antlr4 to version 4.13.1 Compatible with jdk17 )
#11204 opened
Sep 25, 2024 -
Flink: id generation for schema starts from 1
#11209 opened
Sep 25, 2024 -
DO NOT MERGE WILL BREAK - Change BaseCatalog to Interface
#11210 opened
Sep 25, 2024 -
Nessie: respect the nearest namespace's `location` property when creating a table or view
#11215 opened
Sep 26, 2024 -
Data: Add partition stats writer and reader
#11216 opened
Sep 26, 2024 -
Flink: Tests alignment for the Flink Sink v2-based implemenation (IcebergSink)
#11219 opened
Sep 27, 2024 -
Core: Update TableMetadataParser to close streams
#11220 opened
Sep 27, 2024 -
Build: Bump junit from 5.10.1 to 5.11.1
#11223 opened
Sep 29, 2024 -
Build: Bump jackson-bom from 2.14.2 to 2.18.0
#11226 opened
Sep 29, 2024 -
Build: Bump datamodel-code-generator from 0.25.9 to 0.26.1
#11234 opened
Sep 29, 2024 -
Puffin: Add delete-vector-v1 blob type
#11238 opened
Sep 30, 2024 -
Spec v3: Add deletion vectors to the table spec
#11240 opened
Sep 30, 2024 -
Fflink: Add table.exec.iceberg.use-v2-sink option
#11244 opened
Oct 1, 2024 -
[DRAFT] Fix indexing in dictionary encoded Parquet readers
#11247 opened
Oct 2, 2024 -
Flink: FlinkSink & IcebergSink desynchronized tests alignment
#11249 opened
Oct 2, 2024 -
Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection
#11254 opened
Oct 4, 2024 -
Demonstrate bug for issue #11253
#11256 opened
Oct 4, 2024 -
Initial Support for Spark 4.0 preview2
#11257 opened
Oct 4, 2024 -
More accurate estimate on parquet row groups size
#11258 opened
Oct 4, 2024
186 Issues closed by 25 people
-
Build a util to read and write partition stats file for a table on a single node.
#8456 closed
Oct 5, 2024 -
Introduce PartitionEntry class to represent stats per partition
#8455 closed
Oct 5, 2024 -
Flaky test/env TestFileRewriteCoordinator
#8441 closed
Oct 5, 2024 -
Logic about Hive version checking in MetastoreLock
#8440 closed
Oct 5, 2024 -
Clarity on serialization format of schema.name-mapping.default in Iceberg metadata
#8437 closed
Oct 5, 2024 -
Can we do Client side Encryption with Iceberg format?
#8431 closed
Oct 5, 2024 -
ICEBERG_CANNOT_OPEN_SPLIT: Error opening Iceberg split s3
#8427 closed
Oct 5, 2024 -
spark-procedures migrating tables can pose fatal problems
#8425 closed
Oct 5, 2024 -
[Feature] Fast forward branch to a specific snapshot id
#8424 closed
Oct 5, 2024 -
appendManifest API is not thread safe
#8420 closed
Oct 5, 2024 -
Unable to write to iceberg table using spark
#8419 closed
Oct 5, 2024 -
Flink Iceberg
#8417 closed
Oct 5, 2024 -
Does expireSnapshotId delete older snapshots data files?
#8410 closed
Oct 5, 2024 -
SPJ joins in the outer join component of MERGE queries
#8387 closed
Oct 5, 2024 -
Apache Iceberg - Update one record in the table doubles the number of files in the whole table
#8378 closed
Oct 5, 2024 -
Cannot set a custom location for path based tables
#8377 closed
Oct 5, 2024 -
Docs: Improve branch/tagging branch fast-forward (branch always on HEAD) semantics
#8638 closed
Oct 4, 2024 -
spark read error with Failed to open file
#8635 closed
Oct 4, 2024 -
Large Iceberg Parquet file writes are (sometimes?) truncated
#8620 closed
Oct 4, 2024 -
pyiceberg 0.5.0 cli __init__() takes at least 1 positional argument (0 given)
#8606 closed
Oct 4, 2024 -
Debug sporadic structured streaming failures
#8603 closed
Oct 4, 2024 -
Ambiguiety around `list`, `map` and `struct` null counts
#8598 closed
Oct 4, 2024 -
Write format arrow
#8580 closed
Oct 4, 2024 -
Snapshot Expiration Behavior Inconsistency with TIMESTAMP AS OF and VERSION AS OF
#8565 closed
Oct 4, 2024 -
ConnectionUrl property for CatalogProperties
#8557 closed
Oct 4, 2024 -
CASCADE WITH Drop Namespace Gives exception
#8529 closed
Oct 4, 2024 -
Rest catalog server doesn't return table configuration as expected.
#8526 closed
Oct 4, 2024 -
ALTER TABLE ... DROP COLUMN allows dropping the last column of a table
#8522 closed
Oct 4, 2024 -
Discrepancy between table configuration default value and Spark documentation
#8516 closed
Oct 4, 2024 -
Iceberg table support specified column comments by flinksql create
#8511 closed
Oct 4, 2024 -
spark sql delete or merge can support run in batch?
#8509 closed
Oct 4, 2024 -
Build incremental update for a stats file based on incremental scan.
#8460 closed
Oct 4, 2024 -
Implement Synchronous partition stats writing during write operation (controlled by table property).
#8458 closed
Oct 4, 2024 -
Export to Long Term Storage and Re Loading
#8339 closed
Oct 4, 2024 -
spark write orc error: Java heap space
#8318 closed
Oct 4, 2024 -
Add the document for Spark properties
#8314 closed
Oct 4, 2024 -
Creating an existing database with spark sql command "Create database if exists" throws exception
#8298 closed
Oct 4, 2024 -
Supporting `double` type for `truncate` partitioning
#8275 closed
Oct 4, 2024 -
[FeatureRequest] Statistics: Average column width for dynamically-sized types
#8274 closed
Oct 4, 2024 -
DataTableScan may not include Unpartitioned data in the results
#8269 closed
Oct 4, 2024 -
Support rebase one branch onto other branch
#8268 closed
Oct 4, 2024 -
Provide `jsonschema` for the Metadata
#8266 closed
Oct 4, 2024 -
Default table properties not respected when using Spark DataFrame API
#8265 closed
Oct 4, 2024 -
Docs: document the compareWithFileList parameter
#8155 closed
Oct 4, 2024 -
iceberg materialized views
#8143 closed
Oct 4, 2024 -
Spark: Data cannot be written to iceberg using the spark v1 interface
#8124 closed
Oct 4, 2024 -
Compatibility between Flink connector and Iceberg table
#8115 closed
Oct 4, 2024 -
Reset data file's storage location after change partition
#8110 closed
Oct 4, 2024 -
Delete/Update fails for tables with more than 1000 columns
#6368 closed
Oct 4, 2024 -
Iceberg support ranger to make access data more safety
#3619 closed
Oct 4, 2024 -
Flink CDC job getting failed due to G1 old gc and large checkpointing time
#2900 closed
Oct 4, 2024 -
UUID write requires different record in Parquet and ORC/Avro
#1881 closed
Oct 4, 2024 -
Add view support for Hive catalog
#8698 closed
Oct 2, 2024 -
Why call deleteKey for Insert and Update After in Flink BaseDeltaTaskWriter?
#11081 closed
Oct 2, 2024 -
Flink: Maintenance - ExpireSnapshots
#10304 closed
Oct 1, 2024 -
Flink: Maintenance - TableMaintenanceBuilder
#10307 closed
Oct 1, 2024 -
Can sparksql ddl define primary key now?
#8508 closed
Oct 1, 2024 -
While decrypting iceberg table data using aws encyption sdk getting unsupported version error
#8497 closed
Oct 1, 2024 -
unify streaming and batch, combining FLink and iceberg.In case In pipeline, Is kafka necessary?
#8468 closed
Oct 1, 2024 -
multi-arg transform support
#8258 closed
Sep 27, 2024 -
RollingFileWriter Throws Exceptions if it Does Not Have Delete Permissions
#8253 closed
Sep 27, 2024 -
MERGE INTO number of affected rows
#8229 closed
Sep 27, 2024 -
Enabling schema evolution feature using spark configuration like we have in Delta Lake
#9651 closed
Sep 26, 2024 -
Kryo serialization problem for `GenericDataFile`
#11197 closed
Sep 25, 2024 -
BrotliDecompressor throwing precondition error on PySpark job with UDF and limit
#8211 closed
Sep 25, 2024 -
Request to add KLL Datasketch and hive ColumnStatisticsObj and as standard blob types to puffin file.
#8198 closed
Sep 25, 2024 -
NamedReference::bind performance issue
#8196 closed
Sep 25, 2024 -
Iceberg Java Api - S3 Session Token - 403 Forbidden exception
#8190 closed
Sep 25, 2024 -
javax.net.ssl.SSLException: Connection reset on S3 w/ S3FileIO and Apache HTTP client
#10340 closed
Sep 24, 2024 -
Create table should take in sort order/ distribution mode
#8179 closed
Sep 24, 2024 -
Why the logical types are handled differently between Iceberg-Avro and Iceberg-Parquet?
#8176 closed
Sep 24, 2024 -
Support partitioning and sorting on nested struct
#8175 closed
Sep 24, 2024 -
Can't clean up MetaData after modifying metadata.compression-codec.
#8162 closed
Sep 23, 2024 -
NullPointerException when doing FlinkEnvironmentContext.init() for Flink 1.17 and iceberg 1.3.0
#8159 closed
Sep 23, 2024 -
Api:Fix add the same listener to the same listeners queue multiple times
#8107 closed
Sep 21, 2024 -
CDC vectorized reader
#8089 closed
Sep 21, 2024 -
How to decide bucket number
#8087 closed
Sep 21, 2024 -
How Can safely delete small files after executed rewriteDataFiles
#8066 closed
Sep 21, 2024 -
The data of the same table is distributed across different file systems
#8055 closed
Sep 21, 2024 -
Flink: Implements SupportsDynamicFiltering interface
#8048 closed
Sep 21, 2024 -
Usage of Hidden Partitioning
#8031 closed
Sep 21, 2024 -
remove_orphan_files throws reached maximum depth exception in AWS EMR-6.11.0
#8022 closed
Sep 21, 2024 -
Write ordered by within unique physical partitions folder (exclude hash path).
#8008 closed
Sep 21, 2024 -
metadata.json delete
#8007 closed
Sep 21, 2024 -
performance degradation after migrating to spark 3.3.1 when using iceberg merge into
#7998 closed
Sep 21, 2024 -
use tez can't write data
#7990 closed
Sep 21, 2024 -
Docs: Add YouTube to the Apache website.
#7967 closed
Sep 21, 2024 -
Add FileIO docs
#7966 closed
Sep 21, 2024 -
I cannot package my application as uberjar using maven shade plugin.
#7953 closed
Sep 21, 2024 -
Migrate/ snapshot action should exclude file that does not contain any record
#7949 closed
Sep 21, 2024 -
Improve Documentation on getting started with GCS
#7948 closed
Sep 21, 2024 -
Iceberg does not trigger actually rewrite_data_files in certain situations
#8510 closed
Sep 20, 2024 -
Name Mapping Serialisation Spec lists field `field_id` but examples use `field-id`
#11134 closed
Sep 19, 2024 -
FlinkSQL Upsert did'nt support timestamp column as a primary key
#7707 closed
Sep 19, 2024 -
Support Rewrite Datafiles into a custom Partition Spec
#7557 closed
Sep 19, 2024 -
Duplicate records with MERGE command
#7005 closed
Sep 19, 2024 -
rewrite_data_files procedure is not compatible with ranger auth check
#11149 closed
Sep 18, 2024 -
Mixed usage of snapshotCreationTs, metadataCommitTs & tableAccessTs when using REST Catalog
#11103 closed
Sep 18, 2024 -
Iceberg spark procedure argument does not support empty map or empty array.
#8448 closed
Sep 18, 2024 -
IcebergParseException.getMessage does not show the below line
#8462 closed
Sep 18, 2024 -
How to remove orphan manifest and manifest list file
#7937 closed
Sep 18, 2024 -
Docs: Improve possible options/parameters for system procedures and usage.
#7934 closed
Sep 18, 2024 -
Read is not working on Iceberg Hive table
#7924 closed
Sep 18, 2024 -
Merge Small File Error
#7919 closed
Sep 18, 2024 -
Iceberg requiredNumOfPartitions method
#7918 closed
Sep 18, 2024 -
[Feature Request] Inspect partitions Metadata for Tables with Many Partitions
#7892 closed
Sep 18, 2024 -
Data files name collision written by Spark Streaming job after it's restart
#7890 closed
Sep 18, 2024 -
missing option in remove_orphan_files (prefix mismatch)
#7884 closed
Sep 18, 2024 -
Partition Filter returns incorrect results for decimal partition columns with trailing 0's
#7882 closed
Sep 18, 2024 -
DataFrame inconsistency after MERGE operation
#7863 closed
Sep 18, 2024 -
delete with clause IN
#7850 closed
Sep 18, 2024 -
PartitionSpec field name should be consistent for bucket and trunc in $partitions metadata table
#7849 closed
Sep 18, 2024 -
When iceberg stream reads table data, the data of update and delete operations will not be read out
#7835 closed
Sep 16, 2024 -
Implementing Storage Partition join and reducing the time for MERGE command
#7832 closed
Sep 16, 2024 -
CDC data inconsistencies with schema changes
#7822 closed
Sep 16, 2024 -
[bug] Spark SQL phase optimization failed on concurrent write attempt
#7800 closed
Sep 16, 2024 -
The Orc file (via iceberg)because large than Orc file(only via spark) ?
#7775 closed
Sep 16, 2024 -
Failed to check if LessThan(status,2) can be pushed down: Cannot find field 'status' in struct: struct<>
#7774 closed
Sep 16, 2024 -
BaseSparkAction should not override `spark.jobGroup.id` property
#8422 closed
Sep 14, 2024 -
what does value of partition mean in table dbxxx.tbxxx.partitions?
#11125 closed
Sep 13, 2024 -
[core] GSS initiate failed
#8342 closed
Sep 13, 2024 -
Streaming read from Iceberg table in S3 cause checkpoint related error
#11113 closed
Sep 12, 2024 -
iceberg mor table execute merge very very slow
#7431 closed
Sep 12, 2024 -
Using Iceberg from EKS to access resource in another aws account loads instance role by default
#7344 closed
Sep 12, 2024 -
location parameter of remove_orphan_files procedure relative to table location
#7334 closed
Sep 12, 2024 -
EMR 6.10.0 Cannot migrate a table from a non-Iceberg Spark Session Catalog. Found spark_catalog
#7317 closed
Sep 12, 2024 -
Support commit operations in pyiceberg
#7259 closed
Sep 12, 2024 -
Drop the SQL issue when attempting to drop an Iceberg table whose location does not exist
#7227 closed
Sep 12, 2024 -
Replace Thread.sleep() usage in test code with Awaitility
#7154 closed
Sep 12, 2024 -
Support bulk remove orphan files
#7111 closed
Sep 12, 2024 -
Iceberg add_files procedure with partition_filter scan non needed folders
#7027 closed
Sep 12, 2024 -
Integrate CRT with Iceberg S3 client
#6739 closed
Sep 12, 2024 -
StructCopy does not correctly Copy Fixed Data Type
#6685 closed
Sep 12, 2024 -
ManifestReader does not return metrics with null filters
#6658 closed
Sep 12, 2024 -
Unclear messaging about Glue catalog locking
#6636 closed
Sep 12, 2024 -
Purpose of MAX_CONTINUOUS_EMPTY_COMMITS in IcebergFilesCommitter
#6630 closed
Sep 12, 2024 -
Merge into does not work with spark temp table
#6615 closed
Sep 12, 2024 -
iceberg-1.1.0 - flink sql create hive catalog error
#6522 closed
Sep 12, 2024 -
KC Integration tests occasionally fail
#11111 closed
Sep 11, 2024 -
Add support for building runtime jars containing the sources
#1865 closed
Sep 11, 2024 -
[Docs]: improve ChangeLog
#6347 closed
Sep 11, 2024 -
Add ignoreDuplicates option for add_files procedure
#6306 closed
Sep 11, 2024 -
Use the Spark engine to delete from the error
#6255 closed
Sep 11, 2024 -
There is no data in the table, when insert data using Hive on Tez.
#6235 closed
Sep 11, 2024 -
rewriteDataFiles throws exception in spark 3.2
#6172 closed
Sep 11, 2024 -
How to write to a bucket-partitioned table using PySpark?
#5977 closed
Sep 11, 2024 -
Flink write iceberg bug(org.apache.iceberg.exceptions.NotFoundException)
#5846 closed
Sep 11, 2024 -
Registering BucketUDF on PySpark
#5721 closed
Sep 11, 2024 -
Error projecting nested structs from manifests table
#5649 closed
Sep 11, 2024 -
Use DefaultAWSCredentialsProviderChain for AWS credentials
#5608 closed
Sep 11, 2024 -
Support to write a custom partition transforms in iceberg
#5606 closed
Sep 11, 2024 -
read iceberg table by flink timeout
#5388 closed
Sep 11, 2024 -
Migrate to Spark DS V2 Filter
#5273 closed
Sep 11, 2024 -
java.lang.IllegalArgumentException: Table identifier not set
#5175 closed
Sep 11, 2024 -
Reduce CI Workload by Removing Some Spark Variants and Using Callable Workflows for Github Actions
#5153 closed
Sep 11, 2024 -
org.apache.flink.connectors.hive.FlinkHiveException: Unable to instantiate the hadoop input format
#5145 closed
Sep 11, 2024 -
Flink: FLIP-143 & FLIP-191 based Iceberg sink
#5119 closed
Sep 11, 2024 -
Proposal: FlinkSQL supports partition transform by computed columns
#5000 closed
Sep 11, 2024 -
rewritedatafile: Cannot commit, found new position delete for replaced data
#4996 closed
Sep 11, 2024 -
whether flink.actions.RewriteDataFilesAction does not implement option
#4970 closed
Sep 11, 2024 -
"Manifest is missing" ValidationException when there have Concurrent applications to rewrite manifests
#3466 closed
Sep 11, 2024 -
Getting the following error when using from spark thrift server
#3010 closed
Sep 11, 2024 -
Add metadata tables tests to make sure they don't break when reading different versions of tables
#2532 closed
Sep 11, 2024 -
Missing Types.UUIDType in SUPPORTED_PRIMITIVES
#1302 closed
Sep 11, 2024 -
Data files which are still useful are mistakenly cleaned up when trying to expire a specified snapshot
#10982 closed
Sep 9, 2024 -
Case sensitivity is not respected when using IcebergGenerics.ScanBuilder
#8178 closed
Sep 9, 2024 -
Kafka Connect: Record projection Index out of bounds error
#11099 closed
Sep 8, 2024 -
Iceberg to Redshift load
#6841 closed
Sep 8, 2024 -
Provide Puffin reader API allowing read without decompression
#6443 closed
Sep 7, 2024 -
[Feature Proposal] Log Store in Iceberg
#6429 closed
Sep 7, 2024 -
How compaction works along side incremental read
#6422 closed
Sep 7, 2024 -
Pyiceberg StaticTable use the last metadata json URL when the full path is not provided
#7979 closed
Sep 6, 2024 -
Limit in pyiceberg don't seems to be pushed down in the scan operator
#7965 closed
Sep 6, 2024 -
Flink: multiple sinks for the different iceberg tables in the same job?
#11074 closed
Sep 5, 2024
46 Issues opened by 41 people
-
Validation Error in ConfigResponse Model with RestCatalog in PyIceberg using Nessie REST API
#11255 opened
Oct 4, 2024 -
write.metadata.metrics.max-inferred-column-defaults doesn't respect nested columns
#11253 opened
Oct 3, 2024 -
Inexplainable behavior for SQLCatalog with Postgres and MinIO
#11250 opened
Oct 3, 2024 -
Flink: Maintenance - Add support for more kinds of scheduling
#11246 opened
Oct 1, 2024 -
Improve Memory Use in SparkScanBuilder
#11245 opened
Oct 1, 2024 -
Snapshot chain getting broken - data incorrectly removed
#11243 opened
Oct 1, 2024 -
[Deduplication]primary key working differently when running in same session vs running a new session
#11241 opened
Oct 1, 2024 -
BaseDeleteLoader may ignore delete records for binary columns
#11239 opened
Sep 30, 2024 -
Downloads link to Flink 1.20 runtime for Iceberg 1.6.1 leads to 404
#11237 opened
Sep 30, 2024 -
ManifestGroup::TaskContext should cache partition spec
#11235 opened
Sep 29, 2024 -
Spark vectorized read of Parquet produces incorrect result for a decimal column
#11221 opened
Sep 27, 2024 -
Provide option to specify user defined schema while reading from iceberg table
#11217 opened
Sep 26, 2024 -
[Parquet] When reading struct-type data without an id in iceberg-parquet, it returns null values.
#11214 opened
Sep 26, 2024 -
Before expiring snapshots is there need to provide history snapshot file statistics
#11213 opened
Sep 26, 2024 -
Move Writer classes from kafka-connect to core
#11207 opened
Sep 25, 2024 -
What's the use of old metadata file, why not delete by default?
#11206 opened
Sep 25, 2024 -
Spark SQL UI can't show scan metrics.
#11191 opened
Sep 23, 2024 -
Proposal: add Variant type to iceberg
#11178 opened
Sep 20, 2024 -
Deleting metadata(expire_snapshots doesn't help...)
#11169 opened
Sep 19, 2024 -
Iceberg Read is not working on Iceberg Hive table
#11168 opened
Sep 19, 2024 -
Kafka Connect: route to table using topic name
#11163 opened
Sep 18, 2024 -
Incorrect schema used when using time-travel
#11162 opened
Sep 18, 2024 -
Table rename in Glue Catalog throws Incorrect `AlreadyExistsException`
#11155 opened
Sep 17, 2024 -
REST Catalog does not validate "to" identifier on rename table
#11154 opened
Sep 17, 2024 -
s3:DeleteObject giving because no session policy allows the s3:DeleteObject action
#11153 opened
Sep 17, 2024 -
procedure add_files parallelism > 1 -> NotSerializableException
#11147 opened
Sep 16, 2024 -
REST Catalog pagination can throw IndexOutOfBoundsException
#11142 opened
Sep 15, 2024 -
Row Lineage for V3
#11129 opened
Sep 13, 2024 -
Inconsistent id definition on Flink resolvedSchema conversion to iceberg schema
#11128 opened
Sep 13, 2024 -
AWS: Glue ETL Job fails to create a table using lakeformation
#11126 opened
Sep 13, 2024 -
How to get the specific catalog config from Iceberg REST get config interface?
#11124 opened
Sep 13, 2024 -
Improve Position Deletes in V3
#11122 opened
Sep 12, 2024 -
support equality/positional deletes in vectorized arrow reader
#11120 opened
Sep 12, 2024 -
REST: Standardize vended credentials used in loadTable / loadView responses
#11118 opened
Sep 12, 2024 -
Iceberg defaulting to URLConnectionHttpClient instead of Apache HTTP Client
#11116 opened
Sep 11, 2024 -
Does main branch reference reset requiring a clean up of snapshot logs
#11109 opened
Sep 11, 2024 -
Kafka Connect: auto create with lowercase columns
#11091 opened
Sep 6, 2024 -
Table has more than one bucket keys, but "show create table xxx" only displays one
#11090 opened
Sep 6, 2024 -
Connection pool shut down for iceberg 1.5.0
#11089 opened
Sep 6, 2024 -
Cannot commit identity partition on datatypes time,timestamp* using 'fromPartitionString'
#11085 opened
Sep 5, 2024 -
Store min/max stats per column per partition
#11083 opened
Sep 5, 2024
177 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
OpenAPI: Standardize credentials in loadTable/loadView responses
#10722 commented on
Oct 2, 2024 • 65 new comments -
Arrow: add support for null vectors
#10953 commented on
Oct 2, 2024 • 62 new comments -
Spec: Support geo type
#10981 commented on
Oct 1, 2024 • 53 new comments -
Support changelog scan for table with delete files
#10935 commented on
Sep 25, 2024 • 41 new comments -
Spec: Add v3 types and type promotion
#10955 commented on
Oct 4, 2024 • 28 new comments -
Materialized View Spec
#11041 commented on
Oct 4, 2024 • 23 new comments -
Spark: Add RewriteTablePath action interface
#10920 commented on
Oct 4, 2024 • 17 new comments -
GCP: Add Iceberg Catalog for GCP BigQuery Metastore
#11039 commented on
Sep 18, 2024 • 14 new comments -
Kafka Connect: Include third party licenses and notices in distribution
#10829 commented on
Sep 18, 2024 • 13 new comments -
Manifest list encryption
#7770 commented on
Sep 27, 2024 • 12 new comments -
Spark partial limit push down
#10943 commented on
Sep 28, 2024 • 11 new comments -
API: Add RemoveUnusedSpecs in Table
#10755 commented on
Oct 2, 2024 • 9 new comments -
fix: fixing tests to work with s3Express
#11021 commented on
Sep 25, 2024 • 8 new comments -
Core: Add reference snapshot ID/timestamps to AllEntriesTable and AllManifestsTable
#9335 commented on
Oct 1, 2024 • 8 new comments -
Use Snapshot's statistics file in SparkScan
#11040 commented on
Oct 1, 2024 • 7 new comments -
Spark 3.5: Don't change table distribution when only altering local order
#10774 commented on
Sep 17, 2024 • 7 new comments -
WIP: Initial Support for Spark 4.0
#10622 commented on
Sep 30, 2024 • 7 new comments -
Remove Hive 2
#10996 commented on
Sep 27, 2024 • 6 new comments -
OpenAPI: Add AppendDataFile models to openapi spec for fine grained metadata commits
#10202 commented on
Sep 26, 2024 • 5 new comments -
Core/RewriteFiles: Duplicate Data Bug - Fixed dropping delete files that are still required
#10962 commented on
Sep 12, 2024 • 5 new comments -
Spec: Add expiry time config to REST table load
#10873 commented on
Sep 16, 2024 • 4 new comments -
Core: Add support for `view-default` property in catalog
#11064 commented on
Sep 12, 2024 • 4 new comments -
Core: Remove one comment from FastAppend
#10995 commented on
Sep 20, 2024 • 4 new comments -
Updating SparkScan to only read Apache DataSketches
#11035 commented on
Sep 24, 2024 • 4 new comments -
Core: fix NPE with HadoopFileIO because FileIOParser doesn't serialize Hadoop configuration
#10926 commented on
Sep 30, 2024 • 2 new comments -
OpenAPI: Add query param to control namespace separator
#10904 commented on
Sep 18, 2024 • 1 new comment -
Encryption integration and test
#5544 commented on
Sep 21, 2024 • 1 new comment -
Spark 3.5: Fix flaky test due to deleting temp directory failure
#10811 commented on
Sep 11, 2024 • 1 new comment -
Max number of columns
#9220 commented on
Oct 3, 2024 • 0 new comments -
Support Page Skipping in Iceberg Parquet Reader
#193 commented on
Oct 4, 2024 • 0 new comments -
Create empty snapshot for metadata operations
#7075 commented on
Oct 4, 2024 • 0 new comments -
Disaster Recovery Options for AWS Athena/Iceberg Integration
#6619 commented on
Oct 3, 2024 • 0 new comments -
com.esotericsoftware.kryo.KryoException: java.lang.ClassCastException: java.lang.Integer cannot be cast to java.nio.ByteBuffer
#9738 commented on
Sep 9, 2024 • 0 new comments -
Support Parquet v2 Spark vectorized read
#7162 commented on
Oct 4, 2024 • 0 new comments -
Does JDBC connector uses any retry mechanism?
#7173 commented on
Oct 4, 2024 • 0 new comments -
Data Deletion Issue with MERGE INTO ... WHEN MATCHED THEN DELETE statement in Iceberg 1.3 with Spark 3.4.1
#8126 commented on
Oct 4, 2024 • 0 new comments -
How to avoid partition key sorting when inserting data into a partitioned Iceberg table?
#10181 commented on
Oct 4, 2024 • 0 new comments -
DeleteOrphanFiles or ExpireSnapshots outofmemory
#3703 commented on
Oct 4, 2024 • 0 new comments -
Docs: Fix MkDocs ASF nav links
#8965 commented on
Oct 4, 2024 • 0 new comments -
Partition stats task tracker
#8450 commented on
Oct 5, 2024 • 0 new comments -
API,Core: Introduce metrics for data files by file format
#5837 commented on
Sep 20, 2024 • 0 new comments -
Core: Rollback compaction on conflicts
#5888 commented on
Oct 4, 2024 • 0 new comments -
Creating a hive Managed Table?
#9013 commented on
Sep 30, 2024 • 0 new comments -
Flink SQL SELECT ORDER BY clause caused data loss.
#9022 commented on
Sep 30, 2024 • 0 new comments -
Merge into second commit when with no changes
#9024 commented on
Sep 30, 2024 • 0 new comments -
org.apache.iceberg.hive.RuntimeMetaException: Failed to connect to Hive Metastore at
#9030 commented on
Oct 1, 2024 • 0 new comments -
how to update nested column value with spark
#10557 commented on
Oct 1, 2024 • 0 new comments -
Variant Data Type Support
#10392 commented on
Oct 1, 2024 • 0 new comments -
Unable to query iceberg table , getting unable to open manifest file "org.apache.avro.InvalidAvroMagicException: Not an Avro data file"
#11070 commented on
Oct 1, 2024 • 0 new comments -
Iceberg data file Not Found but have an entry in table.files catalog
#8338 commented on
Oct 1, 2024 • 0 new comments -
Error generating Go code from rest-catalog-open-api.yaml
#9070 commented on
Oct 2, 2024 • 0 new comments -
Iceberg: Partition-Level Tagging Support
#9060 commented on
Oct 2, 2024 • 0 new comments -
Failed to create namespace using spark sql based on iceberg hadoop catalog (rest catalog)
#9072 commented on
Oct 2, 2024 • 0 new comments -
Extend check-nullability parameter scope to allow writing optional list elements and map values to required elements and values
#9091 commented on
Oct 2, 2024 • 0 new comments -
It sometimes throws exception java.lang.AssertionError: assertion failed after upgrade to Iceberg 1.3.1 + Spark 3.4.1
#9092 commented on
Oct 3, 2024 • 0 new comments -
Metrics for Manifest file caching
#9093 commented on
Oct 3, 2024 • 0 new comments -
hive iceberg
#9094 commented on
Oct 3, 2024 • 0 new comments -
Add User Interface to Iceberg based lakehouse
#10980 commented on
Oct 3, 2024 • 0 new comments -
Spec: Fix table of content generation
#11067 commented on
Sep 27, 2024 • 0 new comments -
Build: Bump org.apache.hadoop.thirdparty:hadoop-shaded-guava from 1.2.0 to 1.3.0
#11061 commented on
Oct 1, 2024 • 0 new comments -
Build: Bump com.azure:azure-sdk-bom from 1.2.25 to 1.2.27
#11058 commented on
Oct 1, 2024 • 0 new comments -
Build: Bump net.snowflake:snowflake-jdbc from 3.18.0 to 3.19.0
#11057 commented on
Oct 1, 2024 • 0 new comments -
Build: Bump parquet from 1.13.1 to 1.14.2
#11054 commented on
Oct 1, 2024 • 0 new comments -
REST: Use HEAD request to check table existence
#10999 commented on
Sep 25, 2024 • 0 new comments -
DRAFT: DO NOT MERGE - create a NullVector instance as the dummy holder for null values
#10923 commented on
Sep 6, 2024 • 0 new comments -
DRAFT: DO NOT MERGE Create a reader for missing column in parquet file
#10922 commented on
Sep 6, 2024 • 0 new comments -
Core: Make namespace separator configurable
#10877 commented on
Sep 11, 2024 • 0 new comments -
Build: Bump antlr from 4.9.3 to 4.13.2
#10867 commented on
Sep 5, 2024 • 0 new comments -
REST: AuthManager API
#10753 commented on
Sep 18, 2024 • 0 new comments -
Flink-1.19: Fix the file offset mismatch when Flink reader first seek…
#10567 commented on
Sep 19, 2024 • 0 new comments -
Deprecate ContentCache.invalidateAll
#10494 commented on
Oct 3, 2024 • 0 new comments -
#10275 - fix NullPointerException
#10284 commented on
Sep 6, 2024 • 0 new comments -
Core: Allow manifest file cache to be configurable
#10118 commented on
Sep 20, 2024 • 0 new comments -
Iceberg/Comet integration POC
#9841 commented on
Oct 4, 2024 • 0 new comments -
[AWS] S3FileIO - Add Cross-Region Bucket Access
#9804 commented on
Oct 4, 2024 • 0 new comments -
Spark: Add serialzable isolation test for concurrent MERGE INTOs
#9050 commented on
Oct 1, 2024 • 0 new comments -
Core: Implement equals/hashCode method for RESTResponse
#9049 commented on
Oct 1, 2024 • 0 new comments -
Support tencent COS fileIO
#9048 commented on
Oct 1, 2024 • 0 new comments -
Docs: Update site-docs/spark-quickstart.md
#8991 commented on
Sep 29, 2024 • 0 new comments -
Spark: support rewrite on specified target branch
#8797 commented on
Sep 9, 2024 • 0 new comments -
Core: Avro writers use BlockingBinaryEncoder to enable array/map size calculations.
#8625 commented on
Sep 22, 2024 • 0 new comments -
Core, Hive, Nessie: Use ResolvingFileIO as default instead of HadoopFileIO
#8272 commented on
Sep 13, 2024 • 0 new comments -
Core: Make metrics reporter serializable (alternative impl)
#8032 commented on
Sep 6, 2024 • 0 new comments -
Use SupportsPrefixOperations for Remove OrphanFile Procedure
#7914 commented on
Oct 4, 2024 • 0 new comments -
Core: Fix retry behavior for Jdbc Client
#7561 commented on
Oct 4, 2024 • 0 new comments -
API: Add ParquetUtils.getSplitOffsets that takes an InputFile
#7267 commented on
Oct 4, 2024 • 0 new comments -
[Core][Spark] Improve DeleteOrphanFiles action to return additional details of deleted orphan files
#7127 commented on
Oct 4, 2024 • 0 new comments -
Core: Add Catalog Transactions API
#6948 commented on
Sep 10, 2024 • 0 new comments -
API,Core: Support Conditional Commits
#6513 commented on
Oct 4, 2024 • 0 new comments -
Flakiness in TestMetadataTableReadableMetrics#testPrimitiveColumns()
#8679 commented on
Sep 22, 2024 • 0 new comments -
TestIcebergSourceFailover fails when running with JDK17
#8680 commented on
Sep 22, 2024 • 0 new comments -
Various fields in Manifest & ManifestList not written in accordance to the Spec
#8699 commented on
Sep 22, 2024 • 0 new comments -
De-Duping Rows While Compacting
#8702 commented on
Sep 22, 2024 • 0 new comments -
spark read much volume of data from one source when storage partition join implemented
#8710 commented on
Sep 22, 2024 • 0 new comments -
Spec: Add `141: spec_id` and `142: schema_id` to the spec
#8712 commented on
Sep 22, 2024 • 0 new comments -
Optimize metadata tables?
#8714 commented on
Sep 22, 2024 • 0 new comments -
S3 compression Issue with Iceberg
#8713 commented on
Sep 22, 2024 • 0 new comments -
Support deletion in Apache Flink
#8718 commented on
Sep 22, 2024 • 0 new comments -
Upsert support for keyless Apache Flink tables
#8719 commented on
Sep 22, 2024 • 0 new comments -
Null support in Apache Flink
#8720 commented on
Sep 22, 2024 • 0 new comments -
Support Hudi `DeltaStreamer` compatible feature
#8724 commented on
Sep 22, 2024 • 0 new comments -
Implementation does not write `schema-id` into Manifest Avro headers
#8745 commented on
Sep 22, 2024 • 0 new comments -
Spec does not define which header fields to be present in ManifestLists
#8746 commented on
Sep 22, 2024 • 0 new comments -
Cannot create a V1 table with `CREATE OR REPLACE TABLE`
#8756 commented on
Sep 22, 2024 • 0 new comments -
Flaky test/env TestFlinkParquetReader, TestFlinkParquetWriter, TestIcebergSourceBoundedSql
#8761 commented on
Sep 22, 2024 • 0 new comments -
How is iceberg compatible with hive's tez engine
#8757 commented on
Sep 22, 2024 • 0 new comments -
is there anyway to rewrite onto a specific branch?
#8762 commented on
Sep 22, 2024 • 0 new comments -
Parquet.write to S3 with GlueCatalog requires commit
#8767 commented on
Sep 23, 2024 • 0 new comments -
How can I quickly insert data into an iceberg table in a Python environment?
#8801 commented on
Sep 23, 2024 • 0 new comments -
manifest lost
#8806 commented on
Sep 23, 2024 • 0 new comments -
Make iceberg an idempotent sink for Spark like delta lake
#8809 commented on
Sep 23, 2024 • 0 new comments -
Iceberg Glue Concurrent Update can result in missing metadata_location
#9411 commented on
Sep 9, 2024 • 0 new comments -
Iceberg does not work with Spark's default hive metastore (embedded Derby database)
#7847 commented on
Sep 10, 2024 • 0 new comments -
show table extended not supported for v2 table.
#5782 commented on
Sep 11, 2024 • 0 new comments -
Flink: add more sink shuffling support
#6303 commented on
Sep 11, 2024 • 0 new comments -
Type Promotion: Int/Long to String
#9064 commented on
Sep 11, 2024 • 0 new comments -
Multi-Column Transforms
#9132 commented on
Sep 11, 2024 • 0 new comments -
Location Ownership
#9133 commented on
Sep 11, 2024 • 0 new comments -
Issue with CALL parsing
#8343 commented on
Sep 12, 2024 • 0 new comments -
java.lang.NoClassDefFoundError: scala/jdk/CollectionConverters$
#10175 commented on
Sep 12, 2024 • 0 new comments -
Support partial insert in merge into command
#8199 commented on
Sep 13, 2024 • 0 new comments -
Huge amount of Aws s3 Exception "Unable to execute HTTP request: The target server failed to respond" during Iceberg v2 table merge with some DeleteFiles + DataFiles in a partition
#8218 commented on
Sep 14, 2024 • 0 new comments -
DOCS: Report CSS and styling issues on the new site.
#9643 commented on
Sep 16, 2024 • 0 new comments -
Table maintenace procedure(expire_snapshots) not work as expceted
#10907 commented on
Sep 17, 2024 • 0 new comments -
deletion & purge improvements for undelete feature in REST catalog
#11023 commented on
Sep 17, 2024 • 0 new comments -
append() fails with pyspark DataframeWriterV2's writeTo api
#9874 commented on
Sep 18, 2024 • 0 new comments -
List all AWS S3 properties in the docs
#10674 commented on
Sep 18, 2024 • 0 new comments -
Adding RESTCatalog based Spark Integ Test
#11079 commented on
Sep 18, 2024 • 0 new comments -
Cannot specify custom delete handler when using a DeleteFiles in a Transaction
#8642 commented on
Sep 20, 2024 • 0 new comments -
Document Azure and GCP integration
#8662 commented on
Sep 20, 2024 • 0 new comments -
How does iceberg solve the problem of small files? Is there any good solution?
#8663 commented on
Sep 20, 2024 • 0 new comments -
Could there be duplicate values in the result returned by the findOrphanFiles method?
#8670 commented on
Sep 20, 2024 • 0 new comments -
Support Zorder for data writes (not just rewrites)
#8674 commented on
Sep 20, 2024 • 0 new comments -
DELETE fails with "java.lang.IllegalArgumentException: info must be ExtendedLogicalWriteInfo"
#8926 commented on
Sep 26, 2024 • 0 new comments -
Spark write abort result in table miss metadata location file
#8927 commented on
Sep 26, 2024 • 0 new comments -
Missing serialVersionUID in Serializable implementation
#8929 commented on
Sep 26, 2024 • 0 new comments -
Parquet bloom filter doesn't work with nested fields
#9898 commented on
Sep 26, 2024 • 0 new comments -
Slow RewriteManifests due to Validation of Manifest Entries
#8932 commented on
Sep 27, 2024 • 0 new comments -
equality delete files can be removed immediately after rewrite?
#8933 commented on
Sep 28, 2024 • 0 new comments -
Long overflow when Iceberg reading INT96 timestamp column from Spark parquet table
#8949 commented on
Sep 28, 2024 • 0 new comments -
Does the Java API support primary keys for creating tables
#8950 commented on
Sep 28, 2024 • 0 new comments -
Why are updateSchema and UpdatePartitionSpec commit not retried?
#8964 commented on
Sep 28, 2024 • 0 new comments -
Question on BaseMetastoreViewCatalog#buildView
#8967 commented on
Sep 28, 2024 • 0 new comments -
flink1.13.2+iceberg0.13.0+hive-metastore3.0.0+minio(S3) Forbidden (Service: Amazon S3; Status Code: 403
#8968 commented on
Sep 28, 2024 • 0 new comments -
Support adding an additional `opType` column when creating a table
#8973 commented on
Sep 28, 2024 • 0 new comments -
Support MOR CDC view
#8975 commented on
Sep 28, 2024 • 0 new comments -
View is no longer in sync with table after catalog cache entry expires
#8977 commented on
Sep 28, 2024 • 0 new comments -
org.apache.iceberg.spark.source.SerializableTableWithSize cannot be cast to org.apache.iceberg.Table
#8978 commented on
Sep 28, 2024 • 0 new comments -
Support relative paths in Table Metadata
#1617 commented on
Sep 28, 2024 • 0 new comments -
Data duplicate after the partition is modified
#8979 commented on
Sep 29, 2024 • 0 new comments -
hive integration iceberg related problems
#8993 commented on
Sep 29, 2024 • 0 new comments -
manifest exception
#8994 commented on
Sep 29, 2024 • 0 new comments -
I can't find any detailed explanation about column metric options on the official docs for Iceberg configuration
#8995 commented on
Sep 29, 2024 • 0 new comments -
Doc Bug: Iceberg Flink Example uses unsupported UNIQUE constraint
#8997 commented on
Sep 29, 2024 • 0 new comments -
[Feature Request] Implement `equals` for `RESTMessage`
#9003 commented on
Sep 30, 2024 • 0 new comments -
Enable Partition Transforms and/or Spark SQL In Spark `rewrite_data_files` Procedure
#8846 commented on
Sep 23, 2024 • 0 new comments -
Query fails when executed without filter i.e. aggregate pushdown
#8859 commented on
Sep 23, 2024 • 0 new comments -
Distributed execution of DeleteReachableFilesSparkAction
#8862 commented on
Sep 23, 2024 • 0 new comments -
Spark sort/zorder rewrite data does not apply the expected SHUFFLE_PARTITIONS for each target group
#10716 commented on
Sep 23, 2024 • 0 new comments -
Support create table `PRIMARY KEY` column via Spark sql?
#5069 commented on
Sep 23, 2024 • 0 new comments -
Improve `All` Metadata Tables with Snapshot Information
#8856 commented on
Sep 24, 2024 • 0 new comments -
java.lang.IllegalArgumentException: requirement failed while read migrated parquet table
#8863 commented on
Sep 24, 2024 • 0 new comments -
support meta column query on staged scan
#8866 commented on
Sep 24, 2024 • 0 new comments -
Flink: OverflowError: value too large to convert to int32_t
#8874 commented on
Sep 24, 2024 • 0 new comments -
fast_forward command not merging branches within AWS Glue
#8881 commented on
Sep 24, 2024 • 0 new comments -
Apache hive 3 with Tez engine select table no empty
#8891 commented on
Sep 24, 2024 • 0 new comments -
DatasourceV2 does not prune columns after V2ScanRelationPushDown
#9268 commented on
Sep 24, 2024 • 0 new comments -
Hive's performance for querying the Iceberg table is very poor.
#8901 commented on
Sep 25, 2024 • 0 new comments -
Consumer Latency Monitoring Support in Iceberg ?
#8903 commented on
Sep 25, 2024 • 0 new comments -
operations fail after upgrading to spark 3.4
#8904 commented on
Sep 25, 2024 • 0 new comments -
Pushdown SUBSTRING filter when equivalent to STARTSWITH
#8911 commented on
Sep 25, 2024 • 0 new comments -
Schema issue between Arrow and PyIceberg
#8913 commented on
Sep 25, 2024 • 0 new comments -
Request Timeout API to RestCatalog's HTTPClient is provided by Iceberg SDK
#8915 commented on
Sep 25, 2024 • 0 new comments -
Flink SQL with Iceberg snapshots doesn't react if table has upsert
#9948 commented on
Sep 25, 2024 • 0 new comments -
`ALTER TABLE ... DROP COLUMN` allows dropping a column used by old PartitionSpecs
#4563 commented on
Sep 25, 2024 • 0 new comments -
Iceberg streaming using checkpoint does not ignore the stream-from-timestamp option
#8921 commented on
Sep 26, 2024 • 0 new comments -
Vulnerabilities found on latest version - jackson, avro, openssl
#8923 commented on
Sep 26, 2024 • 0 new comments