-
Notifications
You must be signed in to change notification settings - Fork 3k
Core: Scan only live entries for partitions table #8969
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
spark/v3.4/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestDelete.java
Show resolved
Hide resolved
Fokko
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
flyrain
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1 Thanks for the fix, @dramaticlly! Curious if we need the test for spark3.3 and 3.2.
spark/v3.4/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestDelete.java
Show resolved
Hide resolved
dramaticlly
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Added corresponding unit test for spark 3.2 and 3.3.
spark/v3.4/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestDelete.java
Show resolved
Hide resolved
|
Thanks, @dramaticlly! Thanks for reviewing, @ajantha-bhat @Fokko @flyrain! |
|
Hello, thanks for the fix! Will it be backported to 1.4.x? |
Co-authored-by: Steve Zhang <hongyue_zhang@apple.com>
…e#1297) backport apache#8969 Co-authored-by: Hongyue/Steve Zhang <steveiszhy@gmail.com>
close #8951
This help fix the bug introduced in #7581 where partition table scan entries instead of files to have partition level stats aggregagted over files. In order to have correct stats, only live entries need to be scanned instead of all entries