Implement Adaptive query execution #387

Dandandan · 2022-10-18T09:12:03Z

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Adaptive query execution is the re-optimization of the query pipeline.
This allows for faster complex queries, as joins and other.

The pull/stage based model of Ballista allows for implementing a similar strategy as Spark.

Describe the solution you'd like

When a stage has been finished: provide/update the statistics (row count / byte size) for the remaining stages
Re-optimize the different stages based on (exact) stats. We can start with running only the physical optimization passes (join order, aggregate statistics, broadcast join Implement broadcast join optimization #348, etc.) as we already converted the logical plan to the physical plan.

Describe alternatives you've considered

Additional context

Dandandan added enhancement New feature or request performance labels Oct 18, 2022

Dandandan mentioned this issue Oct 24, 2022

Reorder joins after resolving stage inputs #441

Merged

Ted-Jiang assigned Ted-Jiang and unassigned Ted-Jiang Jun 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Adaptive query execution #387

Implement Adaptive query execution #387

Dandandan commented Oct 18, 2022 •

edited

Loading

Implement Adaptive query execution #387

Implement Adaptive query execution #387

Comments

Dandandan commented Oct 18, 2022 • edited Loading

Dandandan commented Oct 18, 2022 •

edited

Loading