[FEA] Story - Supporting Approximate Count Distinct #10792

Closed
ttnghia opened this issue May 4, 2022 · 11 comments

Comments

@ttnghia
Contributor

ttnghia commented May 4, 2022

This issue tracks the dependencies for supporting approximate count distinct using the HyperLogLog algorithm.

@ttnghia ttnghia added labels May 4, 2022: feature request (New feature or request); 2 - In Progress (Currently a work in progress); proposal (Change current process or code); libcudf (Affects libcudf (C++/CUDA) code); Python (Affects Python cuDF API); Java (Affects Java cuDF API); Spark (Functionality that helps Spark RAPIDS); non-breaking (Non-breaking change)
@ttnghia ttnghia self-assigned this May 4, 2022
@ttnghia ttnghia changed the title from "[FEA] Story - Supporting Approximate Count Distint" to "[FEA] Story - Supporting Approximate Count Distinct" May 4, 2022
@jrhemstad
Contributor

Given there was already sizable discussion in #10652, can we just use that issue? Or is there a different intent with this one?

@ttnghia
Contributor Author

ttnghia commented May 4, 2022

That issue has been diluted.
Here I'm going to add checklist items for the PRs/features we need to add, so people can keep track, as in #10186.

@revans2
Contributor

revans2 commented Jun 14, 2022

This is still wanted

@revans2
Contributor

revans2 commented Jul 14, 2022

This is still wanted

@jrhemstad jrhemstad added the 0 - Backlog (In queue waiting for assignment) label and removed the 2 - In Progress (Currently a work in progress) label Jul 14, 2022
@rapidsai rapidsai deleted a comment from github-actions bot Jul 21, 2022
@rapidsai rapidsai deleted a comment from github-actions bot Jul 21, 2022
@ttnghia ttnghia added the 2 - In Progress (Currently a work in progress) label and removed the 0 - Backlog (In queue waiting for assignment) label Jul 21, 2022
@github-actions

This issue has been labeled inactive-30d due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d if there is no activity in the next 60 days.

@GregoryKimball
Contributor

@etseidl FYI, our friends in Spark-RAPIDS are also interested in the HyperLogLog algorithm. 😄

@GregoryKimball
Contributor

Also see NVIDIA/cuCollections#429

@vyasr
Contributor

vyasr commented May 17, 2024

Echoing Jake's question from two years ago: do we need this issue? Can we consolidate discussion in #10652? At this point it seems like we're just forced to post updates in two places. @ttnghia WDYT?

@res-life res-life self-assigned this Sep 26, 2024
@res-life
Contributor

I'm working on this.

@res-life
Contributor

If no one has already started this, then let me pick this one.

@vyasr
Contributor

vyasr commented Sep 27, 2024

To avoid fragmentation, I'm going to close this as a dup of #10652 so that we can focus discussion there.

@vyasr vyasr closed this as completed Sep 27, 2024

7 participants