Conversation

pvary commented Oct 8, 2021

2nd implementation of #2935.
Based on #3248 I was able to implement the ORC delete writers.

pvary force-pushed the deleteStruct2 branch 2 times, most recently from 798ba0d to 660d660 on October 11, 2021.

pvary commented Oct 11, 2021

This seems like a better solution for #2935.
This PR is built on #3248.

The first commit in this PR contains the squashed commits of #3248; the next three commits are unique to this PR.

If you have time, could you please review: @kbendick, @aokolnychyi, @rdblue?

@openinx: Answered your comments and made the appropriate changes. Thanks for your time!


openinx commented Oct 12, 2021

Thanks @pvary for the work. I plan to check this again once the dependency PR #3248 is merged!


openinx commented Oct 14, 2021

@pvary, we've just merged #3248, so I think it's time to rebase this PR and take another round of review. Thanks!


pvary commented Oct 14, 2021

@openinx: Rebased and reran the flaky test (TestFlinkTableSink#testReplacePartitions). Now we have a clean run.
Could you please review?

Thanks,
Peter


// When a row writer factory is provided, wrap it in the position delete writer so that
// file paths are converted with pathTransformFunc before being written.
if (createWriterFunc != null) {
  appenderBuilder.createWriterFunc((schema, typeDescription) ->
      GenericOrcWriters.positionDelete(createWriterFunc.apply(deleteSchema, typeDescription), pathTransformFunc));
openinx commented:
If people don't provide a rowSchema via DeleteWriteBuilder#rowSchema, then we still use the Flink RowData writer to write the <path, pos> pair, and it is required to convert the path from CharSequence to RowData, I think. That's the hottest code path, because in most cases people won't need the extra rowSchema to attach the original row when writing a PositionDelete, and all the pos-delete writers will run into this line.

In my view, for the case without rowSchema, I think we can use the Record writer to avoid the extra conversion from CharSequence to RowData or InternalRow.
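A minimal sketch of that idea (not the actual code of this PR; GenericOrcWriter.buildWriter and the exact signatures used here are assumptions for illustration):

    // Sketch only: pick the writer based on whether row data needs to be stored.
    if (rowSchema != null && createWriterFunc != null) {
      // <path, pos, row>: wrap the engine-specific writer (Flink RowData / Spark InternalRow)
      // so that file paths are converted with pathTransformFunc.
      appenderBuilder.createWriterFunc((schema, typeDescription) ->
          GenericOrcWriters.positionDelete(createWriterFunc.apply(deleteSchema, typeDescription), pathTransformFunc));
    } else {
      // <path, pos> only: use the generic Record-based writer, so the hot path needs no
      // CharSequence -> RowData/InternalRow conversion and the path can be passed through as-is.
      appenderBuilder.createWriterFunc((schema, typeDescription) ->
          GenericOrcWriters.positionDelete(GenericOrcWriter.buildWriter(typeDescription), Function.identity()));
    }

pvary's later reply describes the approach actually taken in this PR: a dedicated Record-based OrcRowWriter for the pathPosSchema.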

openinx commented:

It would be better if we had a unit test to address this Parquet/ORC issue; I think it's fine to make it a separate issue or PR.

pvary commented:

I have changed the code to match the way it currently works with Parquet. I have created a new OrcRowWriter for the pathPosSchema and used that to write the data. Is this what you were suggesting for the case when no rowSchema is provided?

Are you suggesting that we should make sure the GenericOrcWriter writes the path as expected? Also, if I see correctly, the Parquet code also uses the identity transform for path values; is that why you are suggesting a test case specifically for this?

If I remove the pathTransform from the PositionDeleteStructWriter, then TestFlinkFileWriterFactory#testPositionDeleteWriterWithRow fails immediately with the following exception:

java.lang.String cannot be cast to org.apache.flink.table.data.StringData
java.lang.ClassCastException: java.lang.String cannot be cast to org.apache.flink.table.data.StringData
	at org.apache.iceberg.flink.data.FlinkOrcWriters$StringWriter.nonNullWrite(FlinkOrcWriters.java:96)
	at org.apache.iceberg.orc.OrcValueWriter.write(OrcValueWriter.java:42)
	at org.apache.iceberg.data.orc.GenericOrcWriters$StructWriter.write(GenericOrcWriters.java:492)

I expect that the same thing is already handled by the generic Record writer, which is why we do not get the same exception for TestFlinkFileWriterFactory#testPositionDeleteWriter.

Am I missing something?

openinx commented:

Sorry, I should describe this more clearly. I mean we may need to add a unit test for this case, to ensure that the Record PositionDeleteStructWriter is used to write the <path, pos> pair when no rowSchema is provided. That prevents future changes from introducing a write-path performance regression caused by the conversion from CharSequence to RowData or InternalRow.
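Such a test could look roughly like the following sketch (the fixture objects out, table, and partition, as well as the final assertion, are assumed for illustration; the builder calls mirror the ones discussed later in this thread):

    // Regression-test sketch: build a position delete writer WITHOUT a rowSchema and check
    // that plain <path, pos> deletes are written without any engine-specific path conversion.
    @Test
    public void testPositionDeletesWithoutRowSchema() throws IOException {
      PositionDeleteWriter<?> writer = Parquet.writeDeletes(out)
          .withSpec(table.spec())
          .withPartition(partition)
          .overwrite()
          // intentionally no rowSchema() / createWriterFunc(): only <path, pos> is stored
          .buildPositionWriter();

      try {
        writer.delete("/path/to/data-file.parquet", 0L);
      } finally {
        writer.close();
      }

      // A complete test would also assert that the generic Record-based writer was used,
      // for example by reading the delete file back.
      Assert.assertNotNull(writer.toDeleteFile());
    }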

openinx commented:

For these two cases:

  1. Write <path, pos, row-data> into position delete files, where row-data could be Flink's RowData or Spark's InternalRow.
  2. Write <path, pos> without any attached row-data into position delete files.

I think we could use the same PositionDeleteStructWriter to write either <path, pos, row-data> or <path, pos>; the difference is which pathTransformFunc we pass (see the sketch after this list):

  1. For Flink's <path, pos, RowData>, we should pass path -> StringData.fromString(path.toString());
  2. For Spark's <path, pos, InternalRow>, we should pass path -> UTF8String.fromString(path.toString());
  3. For both Spark's and Flink's <path, pos>, we should pass a dummy Function.identity() that does nothing, because we will just use the Record writer.
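As a concrete sketch of those three variants (illustrative only; the class and field names are made up, but StringData.fromString and UTF8String.fromString are the real Flink and Spark conversions):

    import java.util.function.Function;
    import org.apache.flink.table.data.StringData;
    import org.apache.spark.unsafe.types.UTF8String;

    class PathTransforms {
      // 1. Flink <path, pos, RowData>: the file path must become Flink's StringData.
      static final Function<CharSequence, StringData> FLINK = path -> StringData.fromString(path.toString());

      // 2. Spark <path, pos, InternalRow>: the file path must become Spark's UTF8String.
      static final Function<CharSequence, UTF8String> SPARK = path -> UTF8String.fromString(path.toString());

      // 3. Plain <path, pos> written with the Record writer: keep the CharSequence unchanged.
      static final Function<CharSequence, CharSequence> IDENTITY = Function.identity();
    }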

pvary commented:

OK, refactored to match how it works for Parquet.

There is one thing I am not entirely comfortable with:

  • If we provide rowSchema but do not provide the createWriterFunc, then we ignore the provided rowSchema. My understanding was that the value of rowSchema defines whether we write row data to the position delete file or just use it to store the filename and the position. It turns out this is defined by the combined values of these properties.

Wouldn't it be better to have a single storeRows boolean flag on the DeleteWriteBuilder class to define this behaviour, and to check when creating the writer whether every required parameter is set? I think this would make it easier to understand for the next contributors.
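Purely as a hypothetical sketch of that suggestion (no such flag exists in the current DeleteWriteBuilder; every name here is illustrative):

    // Hypothetical: one explicit flag decides whether row data is stored in position deletes.
    private boolean storeRows = false;

    public DeleteWriteBuilder storeRows(boolean shouldStoreRows) {
      this.storeRows = shouldStoreRows;
      return this;
    }

    public <T> PositionDeleteWriter<T> buildPositionWriter() throws IOException {
      if (storeRows) {
        // Writing <path, pos, row> requires both the row schema and the row writer factory.
        Preconditions.checkArgument(rowSchema != null, "Row schema is required when storing rows");
        Preconditions.checkArgument(createWriterFunc != null, "Writer function is required when storing rows");
      }
      // ... build either the <path, pos, row> writer or the plain <path, pos> writer ...
      throw new UnsupportedOperationException("sketch only");
    }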

openinx commented:

Thanks for sharing your thoughts. I think there are two cases that we did not describe clearly in the current Parquet/ORC/Avro position delete writer builders:

  1. rowSchema is null and createWriterFunc is not null. In this case the createWriterFunc is meaningless because we don't need to write any extra row records into position delete files. Using the default Record position delete writer is OK for me.

  2. rowSchema is not null and createWriterFunc is null. In this case, I think we should throw an IllegalArgumentException because we don't know how to construct the column writers for the rowSchema. Adding a Preconditions.checkArgument(rowSchema == null || createWriterFunc != null, "...") should be OK, but currently we fall back to skipping the rowSchema rows when writing the position delete files, which is what confused you, I think (a sketch follows below).

I think it's reasonable to make this a separate PR.
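For reference, a minimal sketch of the check for case 2, using the error message that shows up in the failing tests quoted below (where exactly it sits inside buildPositionWriter() is an assumption):

    // Fail fast instead of silently ignoring rowSchema when no writer function is given.
    Preconditions.checkArgument(rowSchema == null || createWriterFunc != null,
        "Create function should be provided if we write row data");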

pvary commented:

I tried to add a Preconditions.checkArgument to buildPositionWriter(), just like you suggested, but the DeleteReadTests.testPositionDeletes() and DeleteReadTests.testMixedPositionAndEqualityDeletes() tests were failing with this:

Create function should be provided if we write row data
java.lang.IllegalArgumentException: Create function should be provided if we write row data
	at org.apache.iceberg.relocated.com.google.common.base.Preconditions.checkArgument(Preconditions.java:142)
	at org.apache.iceberg.parquet.Parquet$DeleteWriteBuilder.buildPositionWriter(Parquet.java:600)
	at org.apache.iceberg.data.FileHelpers.writeDeleteFile(FileHelpers.java:59)
	at org.apache.iceberg.data.DeleteReadTests.testPositionDeletes(DeleteReadTests.java:287)
[..]

In the tests we use FileHelpers.writeDeleteFile with a table, which provides the schema, but without a createWriterFunc. So I am not sure if it is a test-only issue or a real use case. Any info around this?

openinx commented Oct 18, 2021:

Looks like it's here that we set the rowSchema by default, while in fact we shouldn't use table.schema() as the rowSchema when building the Parquet posDeleteWriter by default. I would suggest using the following to construct the position delete writer:

    PositionDeleteWriter<?> writer = Parquet.writeDeletes(out)
        .withSpec(table.spec())
        .setAll(table.properties())
        .metricsConfig(MetricsConfig.forTable(table))
        .withPartition(partition)
        .overwrite()
        .buildPositionWriter();

And if people plan to use forTable(table) to construct the position delete writer, then the Preconditions.checkArgument(rowSchema == null || createWriterFunc != null) will remind the devs to add a createWriterFunc or to fall back to the separate setters.

pvary commented:

Created #3305

openinx left a comment:

Thanks for the update, @pvary! I left several comments which I think we need to address.


pvary commented Oct 15, 2021

> Thanks for the update, @pvary! I left several comments which I think we need to address.

Thanks for the review @openinx!
Addressed your comments. I am not sure I understood your suggestion correctly in one case; could you please check it out?

Thanks,
Peter

openinx left a comment:

Almost looks great to me now; I left several minor comments! Thanks @pvary for the update!

github-actions bot added the core label on Oct 18, 2021.
openinx left a comment:

Looks great to me now, thanks @pvary for the patient work!

openinx merged commit 1b920e2 into apache:master on Oct 18, 2021.

pvary commented Oct 18, 2021

Thanks for the review and the merge @openinx!
