-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
snnn/p202310301 #18169
Closed
Closed
snnn/p202310301 #18169
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Member
snnn
commented
Oct 30, 2023
- Cherry-pick 1st Round (Cherry-pick 1st Round #17308)
- Cherry-pick 2nd Round (Cherry-pick 2nd Round #17386)
- [rel-1.16.0] Use name of temporary provisioning profile. ([rel-1.16.0] Use name of temporary provisioning profile. #17456)
- [rel-1.16.0] Disable QNN QDQ test for release branch ([rel-1.16.0] Disable QNN QDQ test for release branch #17463)
- Remove 52 from CMAKE_CUDA_ARCHITECTURES to reduce Nuget package size (Remove 52 from CMAKE_CUDA_ARCHITECTURES to reduce Nuget package size #17461)
- [rel-1.16.0] Cherry-pick 17507 ([rel-1.16.0] Cherry-pick 17507 #17520)
- [rel-1.16.0] Cherry-pick 16940 and 17523 ([rel-1.16.0] Cherry-pick 16940 and 17523 #17506)
- Cherry-picks pipeline changes to 1.16.0 release branch (Cherry-picks pipeline changes to 1.16.0 release branch #17577)
- Cherry-picks for 1.16.1 release (cherry-picks for 1.16.1 release #17741)
- Cancel EP check in python for 1.16.1 (Cancel EP check in python for 1.16.1 #17768)
- [hotfix] fix session option access in Node.js binding ([hotfix] fix session option access in Node.js binding #17762)
- Fix onnx quantizer activation and weight type attribute
- Upgrade transformers to fix CI (Upgrade transformers to fix CI #17830)
- Increase version number for preparing the 1.16.2 release (Increase version number for preparing the 1.16.2 release #18070)
- Revert "Disable dml stage in windows GPU pipeline temporarily. (Disable dml stage in windows GPU pipeline temporarily. #18034)" (Revert "Disable dml stage in windows GPU pipeline temporarily. (#18034)" #18150)
Cherry-pick 1st round for rel-1.16.0 from https://github.com/microsoft/onnxruntime/issues?q=label%3Arelease%3A1.16+label%3Atriage%3Aapproved+is%3Aclosed except microsoft#17201 because it caused UT failure and is not fixed yet. PR list: microsoft#16417 microsoft#16936 microsoft#17000 microsoft#17236 microsoft#17238 microsoft#17240 microsoft#17252 microsoft#17255 microsoft#17258 microsoft#17265 microsoft#17267 microsoft#17277
Cherry-pick 2nd round for 1.16.0 release. PR List: microsoft#17201 microsoft#17270 microsoft#17311 microsoft#17315 microsoft#17320 microsoft#17326 microsoft#17355 microsoft#17227 microsoft#17380 microsoft#17386
…7456) ### Description <!-- Describe your changes. --> Use name of temporary provisioning profile. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. --> The old provisioning profile no longer works. Switched to a temporary one that we can use before a new one is available. The temporary one has a different name. Alternative to microsoft#17454.
Disable QNN QDQ test for release branch ### Description Disable QNN QDQ test for release branch to get rid of model test failure caused by new model update in build image.
…icrosoft#17461) ### Description Remove 52 from CMAKE_CUDA_ARCHITECTURES to reduce Nuget package size. ### Motivation and Context PR microsoft#17227 increased binary size by 20%. Right the package size is about 260MB. However, nuget has a hard limit of 250MB. Without this change we cannot publish the package.
Cherry-pick microsoft#17507 for rel-1.16.0. Note: The PR 17507 contains the part of engine decryption refactor that we don't want to include it in ORT 1.16 release. This cherry pick PR excludes this part.
### Description 1. Delete Prefast tasks (microsoft#17522) 2. Disable yum update (microsoft#17551) 3. Avoid calling patchelf (microsoft#17365 and microsoft#17562) we that we can validate the above fix The main problem I'm trying to solve is: our GPU package depends on both CUDA 11.x and CUDA 12.x . However, it's not easy to see the information because ldd doesn't work with the shared libraries we generate(see issue microsoft#9754) . So the patchelf change are useful for me to validate the "Disabling yum update" was successful. As you can see we call "yum update" from multiple places. Without some kind of validation it's hard to say if I have covered all of them. The Prefast change is needed because I'm going to update the VM images in the next a few weeks. In case of we need to publish a patch release after that. ### Motivation and Context Without this fix we will mix using CUDA 11.x and CUDA 12.x. And it will crash every time when we use TensorRT.
Cherry-pick the following PRs to the release branch: Fix: Fail to skip disabledmodel in winml (microsoft#17728) Move dotnet build and test into docker in Linux CPU CI (microsoft#17417) Run Nuget_Test_Linux_GPU in container (microsoft#17452) Run Final_Jar_Testing_Linux_GPU in docker (microsoft#17533) TreeEnsemble speed up (microsoft#17449) Remove onnxruntime extensions from list of gitmodules (microsoft#17615) Include onnxruntime_float16.h in the package. (microsoft#17637) Fix static quantization for QDQ and Percentile distribution (microsoft#17649) [TensorRT EP] Back out the PerThreadContext (microsoft#17690) Update nodejs to 18.x (microsoft#17657) Update linux-wasm-ci.yml: remove the ln command (microsoft#17735)
Remove the condition to allow an empty provide list. Co-authored-by: Randy Shuai <rashuai@microsoft.com>
### Description fix session option access in Node.js binding ### Motivation and Context This is a bug that affect transformer.js using ONNX Runtime Node.js binding. Issue: microsoft#17377 This bug is already fixed in main branch, but it is not picked in 1.16 release.
### Description Python package pipeline fails due to "tokenizers" compilation. Since "tokenizers" is a dep of "transformers", we update its version and hope a new solution had been there. ``` error: casting `&T` to `&mut T` is undefined behavior, even if the reference is unused, consider instead using an `UnsafeCell` --> tokenizers-lib/src/models/bpe/trainer.rs:517:47 ``` ### Motivation and Context Cherry-pick from microsoft#17823
…8070) 1. Increase version number for preparing the 1.16.2 release (microsoft#18070) 2. cherry-pick 18034
…soft#18034)" (microsoft#18150) This reverts commit 99b8dca. ### Description <!-- Describe your changes. --> ### Motivation and Context Restore the dml stage in windows GPU pipeline. Agent issue is solved by adding Feature.DisableGpuDriver in pool properties.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.