Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Adaptive query execution #387

Open
Dandandan opened this issue Oct 18, 2022 · 0 comments
Open

Implement Adaptive query execution #387

Dandandan opened this issue Oct 18, 2022 · 0 comments
Labels
enhancement New feature or request performance

Comments

@Dandandan
Copy link
Contributor

Dandandan commented Oct 18, 2022

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Adaptive query execution is the re-optimization of the query pipeline.
This allows for faster complex queries, as joins and other.

The pull/stage based model of Ballista allows for implementing a similar strategy as Spark.

Describe the solution you'd like

  • When a stage has been finished: provide/update the statistics (row count / byte size) for the remaining stages
  • Re-optimize the different stages based on (exact) stats. We can start with running only the physical optimization passes (join order, aggregate statistics, broadcast join Implement broadcast join optimization #348, etc.) as we already converted the logical plan to the physical plan.

Describe alternatives you've considered

Additional context

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request performance
Projects
None yet
Development

No branches or pull requests

2 participants