[Improvement] ParallelIterable#hasNext submit reading ManifestFile task slowly with DataTableScan#planTasks

This issue mainly aims to speed up reading the `Iterator<FileScanTask>` returned by `planTasks`.

### Background
We use Trino to query Iceberg tables, but the same query is sometimes very slow in the scheduling stage.
Using Arthas, we traced the slowness to `fileScanTasks.hasNext()` in [`io.trino.plugin.iceberg.IcebergSplitSource#getNextBatch`](https://github.com/trinodb/trino/blob/master/plugin/trino-iceberg/src/main/java/io/trino/plugin/iceberg/IcebergSplitSource.java).
This method traverses the `Iterator<FileScanTask>` to produce `IcebergSplit` instances.
![image](https://user-images.githubusercontent.com/39874737/145963998-73330aec-2bc0-4b06-b950-7b8146442b60.png)

`fileScanTasks` is generated by `DataTableScan.planTasks`. When `ParallelIterable.hasNext()` is called, it should submit Runnable tasks to the worker pool in parallel, but calling `tasks.next()` reads the manifest first, in the submitting thread.
When HDFS performs poorly, this step takes a long time and stalls task submission.
So it would be better to move the manifest-reading step into the Runnable rather than doing it in `Iterables.transform()` in `ManifestGroup#entries`.
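
The difference between the two placements can be illustrated with a minimal, self-contained sketch. It does not use Iceberg classes: `readManifest`, `plan`, and the class name are hypothetical stand-ins, and the "manifest read" only records which thread performed it. With `readInsideTask = false` the read runs in the thread that submits tasks (the behavior described above); with `readInsideTask = true` it runs inside the submitted Runnable on a pool thread.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class ManifestReadPlacementSketch {

    // Hypothetical stand-in for a slow manifest read from HDFS; not an Iceberg API.
    // It records the name of the thread that performed the "read".
    static List<String> readManifest(String manifest, Queue<String> readerThreads) {
        readerThreads.add(Thread.currentThread().getName());
        return List.of(manifest + "#entry-0", manifest + "#entry-1");
    }

    // Submits one task per manifest and waits for all of them.
    // readInsideTask = false: the read happens before submit, blocking the submitter.
    // readInsideTask = true:  the read happens inside the Runnable, on a pool thread.
    static List<String> plan(List<String> manifests, ExecutorService pool,
                             boolean readInsideTask, Queue<String> readerThreads) {
        Queue<String> results = new ConcurrentLinkedQueue<>();
        List<Future<?>> futures = new ArrayList<>();
        for (String manifest : manifests) {
            if (readInsideTask) {
                futures.add(pool.submit(() -> {
                    results.addAll(readManifest(manifest, readerThreads));
                }));
            } else {
                List<String> entries = readManifest(manifest, readerThreads);
                futures.add(pool.submit(() -> {
                    results.addAll(entries);
                }));
            }
        }
        for (Future<?> future : futures) {
            try {
                future.get();
            } catch (Exception e) {
                throw new RuntimeException(e);
            }
        }
        // Sort so both modes return entries in the same order.
        List<String> sorted = new ArrayList<>(results);
        Collections.sort(sorted);
        return sorted;
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            Queue<String> readerThreads = new ConcurrentLinkedQueue<>();
            List<String> entries = plan(List.of("m1", "m2", "m3"), pool, true, readerThreads);
            System.out.println(entries.size() + " entries, read on threads " + readerThreads);
        } finally {
            pool.shutdown();
        }
    }
}
```

Both modes return the same entries; only the thread that pays for the read changes. When each read is slow, the first mode serializes the reads in the submitting thread, while the second lets the pool overlap them.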

![image](https://user-images.githubusercontent.com/39874737/145956804-0d466e40-6877-4c35-baa9-cb3255474700.png)

Profiling confirms that `ManifestFiles.read()` and `ManifestReader.entries()` are reading Avro files from HDFS.
![image](https://user-images.githubusercontent.com/39874737/145945692-d8b03971-80f0-4ba7-8ee5-1740d6d7dfbd.png)

### Resolution
I'm not familiar with Iceberg, so the rewrite I propose may not be right. I hope everyone can help improve it. Thanks, all!
https://github.com/apache/iceberg/pull/3742

### Test Performance
Each test runs the same SQL query 100 times; the query reads 47 Avro files.
We measured the Trino scheduling time.

![image](https://user-images.githubusercontent.com/39874737/145999989-b955d29f-189e-4690-a1dd-ad8d4eaa8db3.png)



[Improvement] ParallelIterable#hasNext submit reading ManifestFile task slowly with DataTableScan#planTasks #3741
