Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable result validation for collect_list Spark function in the Fuzzer #9361

Closed

Conversation

liujiayi771
Copy link
Contributor

@liujiayi771 liujiayi771 commented Apr 4, 2024

In #9231, collect_list is added to the disable list of duckQueryRunner.
However, this is unnecessary because DuckDB does not have an aggregate function
named collect_list, hence it would not be compared against DuckDB. This
setting is redundant.

Other than this, the results verification of collect_list has been set to
nullptr, so its results are not verified. But we can use a custom array
verifier used by Presto's array_agg to check the results of itself.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 4, 2024
Copy link

netlify bot commented Apr 4, 2024

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit a8bb607
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/660ea23f4f72960007f3dee0

@liujiayi771 liujiayi771 changed the title Enable fuzzer test for spark collect_list Enable result validation for the collect_list function in the fuzzer test Apr 4, 2024
Copy link
Contributor

@mbasmanova mbasmanova left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@liujiayi771 Thank you for improving test coverage for the new aggregate function.

@mbasmanova mbasmanova changed the title Enable result validation for the collect_list function in the fuzzer test Enable result validation for collect_list Spark function in the Fuzzer Apr 4, 2024
@facebook-github-bot
Copy link
Contributor

@mbasmanova has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@mbasmanova merged this pull request in 3e702c7.

Copy link

Conbench analyzed the 1 benchmark run on commit 3e702c73.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

Joe-Abraham pushed a commit to Joe-Abraham/velox that referenced this pull request Jun 7, 2024
facebookincubator#9361)

Summary:
In facebookincubator#9231, `collect_list` is added to the disable list of `duckQueryRunner`.
However, this is unnecessary because DuckDB does not have an aggregate function
named `collect_list`, hence it would not be compared against DuckDB. This
setting is redundant.

Other than this, the results verification of `collect_list` has been set to
`nullptr`, so its results are not verified. But we can use a custom array
verifier used by Presto's `array_agg` to check the results of itself.

Pull Request resolved: facebookincubator#9361

Reviewed By: xiaoxmeng

Differential Revision: D55744044

Pulled By: mbasmanova

fbshipit-source-id: a1a94c58b2a01463261775d8b6e08b65fd986d29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants