Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[gpuCI] Forward-merge branch-22.10 to branch-22.12 [skip gpuci] #11805

Merged
merged 1 commit into from
Sep 28, 2022

Commits on Sep 28, 2022

  1. Fix regex out-of-bounds write in strided rows logic (#11797)

    Fixes an out-of-bounds write error when a large number of strings requires a strided loop to meet an internal memory maximum. For row sizes that do not require strided loops, the row index never exceeds the size of the column preventing any out-of-bounds access. For large row counts, the CUDA `thread index` may be larger than the minimal count used for building the working-memory buffer. Since the kernel is launched with a thread-count with a specific block size, extra threads past the end of the minimal count are necessary to fill out the last block. These threads never contribute to the overall result but will attempt to access past the end of the working memory. Writing to this memory may corrupt memory for another kernel launched in parallel from another CPU thread. This change adds logic to prevent the extra threads from doing any work.
    
    Fixes #11768
    
    Authors:
      - David Wendt (https://github.com/davidwendt)
    
    Approvers:
      - MithunR (https://github.com/mythrocks)
      - Nghia Truong (https://github.com/ttnghia)
      - Mike Wilson (https://github.com/hyperbolic2346)
    
    URL: #11797
    davidwendt authored Sep 28, 2022
    Configuration menu
    Copy the full SHA
    da04725 View commit details
    Browse the repository at this point in the history