Spark: support use-table-distribution-and-ordering in session conf #8164

chenjunjiedada · 2023-07-27T08:11:42Z

In some cases of skew data, where a few partitions contain most records, we want to skip sorting operations and use fanout writing to accelerate the merge into, the later read performance wouldn't downgrade much if we apply the rebalance at first.

chenjunjiedada · 2023-07-27T08:16:38Z

This closes #5853. @aokolnychyi Could you please take a look?

aokolnychyi · 2023-07-28T20:14:04Z

Since recently, we will automatically skip the local sort if fanout writers are enabled and the table is unsorted. Applies to regular jobs as well as row-level operations (both CoW and MoR).

Let me take a closer look later today.

aokolnychyi · 2023-07-28T23:26:00Z

I have doubts about adding this config at the SQL level. It won't really help the use cause you mentioned above. It will disable both distribution and ordering. In regular writes, you can add a manual repartition step but not in row-level operations. Not doing a repartition/rebalance step is probably not a great idea.

I see multiple options:

Leave as is where no local sort is triggered if fanout writers are enabled and the table is unsorted.
Never request a local sort if fanout writers are enabled (even when the table is sorted).
Add a SQL property like spark.sql.iceberg.use-table-ordering-with-fanout-writers to control this behavior.

I am probably inclined to go with option 1 or 2. Any thoughts, @chenjunjiedada @RussellSpitzer @szehon-ho?

aokolnychyi · 2023-07-28T23:33:17Z

@chenjunjiedada, did the table have a proper sort order in the use case that hit this?

chenjunjiedada · 2023-07-29T01:44:32Z

@aokolnychyi The table doesn't have a sort order. I agree that without repartition/rebalance is not a good idea, it leads to small files problem. It just does not hurt that much if the data contains few partitions.

Leave as is where no local sort is triggered if fanout writers are enabled and the table is unsorted.

Never request a local sort if fanout writers are enabled (even when the table is sorted).

Add a SQL property like spark.sql.iceberg.use-table-ordering-with-fanout-writers to control this behavior.

I prefer option 1. Just tried to update SortOrderUtil.

chenjunjiedada · 2023-07-30T01:28:28Z

Hmm, it seems like we also need to take different distribution modes into account. Range distribution should apply local sort anyway, right?

chenjunjiedada · 2023-07-30T01:53:20Z

I found #7637 already contains the option 1 logic in Spark 3.4, the unit test testRangeCopyOnWriteMergePartitionedUnsortedTableFanout also verifies that. The issue mentioned is in our Spark 3.3 production env, so backporting #7637 should work. @aokolnychyi Do we have a plan to backport this? The AQE also exists in Spark 3.3, any other dependencies from Spark 3.4?

  // a local ordering within a task is beneficial in two cases:
  // - there is a defined table sort order, so it is clear how the data should be ordered
  // - the table is partitioned and fanout writers are disabled,
  //   so records for one partition must be co-located within a task
  private static SortOrder[] writeOrdering(Table table, boolean fanoutEnabled) {
    if (fanoutEnabled && table.sortOrder().isUnsorted()) {
      return EMPTY_ORDERING;
    } else {
      return ordering(table);
    }
  }

aokolnychyi · 2023-08-09T01:10:01Z

@chenjunjiedada, AQE is not supported for V2 writes in OSS Spark 3.3. It works for queries but not for writes. I was not planning to cherry-pick this to 3.3 but we can do it after #8042. I can help review if you want to do the cherry-pick.

chenjunjiedada · 2023-08-16T13:50:15Z

@aokolnychyi , Do we need to backport #7646 as well or #7637 can work independently?

javrasya · 2024-03-19T00:09:55Z

Any update on this. Due to the fact that Spark3.3 is the latest version supported by Glue, we are stuck with it and not being able to set this within SQL is unreasonable since it will sort before write for no reason, or am I missing something? 🤔

github-actions · 2024-09-09T00:15:24Z

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

github-actions · 2024-09-17T00:12:14Z

This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.

Spark: support use-table-distribution-and-ordering in session conf

7f33065

chenjunjiedada requested a review from aokolnychyi July 27, 2023 08:11

github-actions bot added the spark label Jul 27, 2023

github-actions bot added the core label Jul 29, 2023

chenjunjiedada force-pushed the add-session-conf branch 2 times, most recently from 720d42b to f60ce13 Compare July 29, 2023 15:56

address comments

bbc2ece

chenjunjiedada force-pushed the add-session-conf branch from f60ce13 to bbc2ece Compare July 29, 2023 15:58

github-actions bot added the stale label Sep 9, 2024

github-actions bot closed this Sep 17, 2024

eubnara mentioned this pull request Oct 7, 2024

How to avoid partition key sorting when inserting data into a partitioned Iceberg table? #10181

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spark: support use-table-distribution-and-ordering in session conf #8164

Spark: support use-table-distribution-and-ordering in session conf #8164

Uh oh!

chenjunjiedada commented Jul 27, 2023 •

edited

Loading

Uh oh!

chenjunjiedada commented Jul 27, 2023 •

edited

Loading

Uh oh!

aokolnychyi commented Jul 28, 2023 •

edited

Loading

Uh oh!

aokolnychyi commented Jul 28, 2023

Uh oh!

aokolnychyi commented Jul 28, 2023

Uh oh!

chenjunjiedada commented Jul 29, 2023 •

edited

Loading

Uh oh!

chenjunjiedada commented Jul 30, 2023

Uh oh!

chenjunjiedada commented Jul 30, 2023 •

edited

Loading

Uh oh!

aokolnychyi commented Aug 9, 2023

Uh oh!

chenjunjiedada commented Aug 16, 2023

Uh oh!

javrasya commented Mar 19, 2024

Uh oh!

github-actions bot commented Sep 9, 2024

Uh oh!

github-actions bot commented Sep 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Spark: support use-table-distribution-and-ordering in session conf #8164

Spark: support use-table-distribution-and-ordering in session conf #8164

Uh oh!

Conversation

chenjunjiedada commented Jul 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chenjunjiedada commented Jul 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aokolnychyi commented Jul 28, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aokolnychyi commented Jul 28, 2023

Uh oh!

aokolnychyi commented Jul 28, 2023

Uh oh!

chenjunjiedada commented Jul 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chenjunjiedada commented Jul 30, 2023

Uh oh!

chenjunjiedada commented Jul 30, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aokolnychyi commented Aug 9, 2023

Uh oh!

chenjunjiedada commented Aug 16, 2023

Uh oh!

javrasya commented Mar 19, 2024

Uh oh!

github-actions bot commented Sep 9, 2024

Uh oh!

github-actions bot commented Sep 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chenjunjiedada commented Jul 27, 2023 •

edited

Loading

chenjunjiedada commented Jul 27, 2023 •

edited

Loading

aokolnychyi commented Jul 28, 2023 •

edited

Loading

chenjunjiedada commented Jul 29, 2023 •

edited

Loading

chenjunjiedada commented Jul 30, 2023 •

edited

Loading