Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add docs on BackendAttributes #87

Merged
merged 2 commits into from
Aug 4, 2023
Merged

Add docs on BackendAttributes #87

merged 2 commits into from
Aug 4, 2023

Conversation

rmccorm4
Copy link
Contributor

@rmccorm4 rmccorm4 commented Aug 4, 2023

Adds generic docs on BackendAttributes as I didn't find any existing docs other than the API headers.

Just wanted to give a high level overview of attributes and improve searchability, detailed information about them can be found in headers.

Specifically call out the parallel instance loading attribute, and list the currently supported official backends related to ORT PR: triton-inference-server/onnxruntime_backend#208


Also noticed this mention of device memory tracking BackendAttribute here, but I think it's outdated from initial design. edit: Confirmed with Guan and removed it.

Copy link
Member

@Tabrizian Tabrizian left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice doc improvement!

@rmccorm4 rmccorm4 merged commit 504abc9 into main Aug 4, 2023
1 check passed
@rmccorm4 rmccorm4 deleted the rmccormick-parallel branch August 4, 2023 19:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants