ConvTransposeGrad CUDA Kernel #17201
Thank you for the review @er3x3 :)
@baijumeswani, please take a look at the test error and update the test: https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1122071&view=logs&j=5076e696-f193-5f12-2d8a-703dda41a79b&t=318cbefa-99dd-503e-0fe4-581779019aa9&l=14091
Cherry-pick 1st round for rel-1.16.0 from https://github.com/microsoft/onnxruntime/issues?q=label%3Arelease%3A1.16+label%3Atriage%3Aapproved+is%3Aclosed, except #17201, which caused a unit test failure that is not fixed yet. PR list: #16417 #16936 #17000 #17236 #17238 #17240 #17252 #17255 #17258 #17265 #17267 #17277
This pull request adds a CUDA kernel implementation for the convolution transpose gradient operation to ONNX Runtime.
Work here is heavily derived from the previous work by @er3x3 in #8613 and from PyTorch convolution kernels.
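For readers unfamiliar with the operation, here is a minimal reference sketch of what a convolution transpose gradient computes. It is not the kernel added in this PR: it assumes a 1D, single-channel case with no padding or dilation, and the function names `conv_transpose_1d` and `conv_transpose_1d_grad` are hypothetical, chosen only for illustration. The input gradient is a strided correlation of the output gradient with the weights, the weight gradient correlates the input with the output gradient, and the bias gradient sums the output gradient.

```python
def conv_transpose_1d(x, w, b, stride=1):
    """Forward pass (illustrative): each x[i] scatters x[i] * w[j] into y[i * stride + j]."""
    n, k = len(x), len(w)
    y = [b] * ((n - 1) * stride + k)  # output length with no padding
    for i in range(n):
        for j in range(k):
            y[i * stride + j] += x[i] * w[j]
    return y


def conv_transpose_1d_grad(x, w, dy, stride=1):
    """Backward pass (illustrative): returns (dx, dw, db) given the output gradient dy."""
    n, k = len(x), len(w)
    # dx[i] gathers the positions that x[i] scattered into, weighted by w.
    dx = [sum(dy[i * stride + j] * w[j] for j in range(k)) for i in range(n)]
    # dw[j] accumulates x[i] * dy over every position where w[j] was applied.
    dw = [sum(x[i] * dy[i * stride + j] for i in range(n)) for j in range(k)]
    # The bias is added to every output element, so its gradient is the sum of dy.
    db = sum(dy)
    return dx, dw, db


x, w, b = [1.0, 2.0, 3.0], [0.5, -1.0], 0.1
dy = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]  # assumed upstream gradient, same length as y
dx, dw, db = conv_transpose_1d_grad(x, w, dy, stride=2)
```

A real implementation additionally handles batches, multiple channels, groups, padding, and dilation, which this sketch leaves out.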