Add LLM model server metrics #1103

Merged: 15 commits, Jun 27, 2024

Conversation

achandrasekar
Contributor

@achandrasekar achandrasekar commented May 31, 2024

This change adds common model server metrics that we want to standardize on. It starts off with two common latency metrics: time per output token and time to first token.

Fixes #1102

Changes

This change adds common model server metrics that we want to standardize on. It starts off with two common latency metrics: time per output token and time to first token.
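
For readers unfamiliar with the two latency metrics, here is a rough sketch of how they could be derived from per-request timestamps on a model server. The `RequestTiming` type, its field names, and both function names are hypothetical and not part of this PR. Averaging over the tokens generated after the first one is one common definition, assumed here for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RequestTiming:
    """Hypothetical timestamps (in seconds) recorded for one streamed request."""
    request_received: float     # when the server accepted the request
    first_token_emitted: float  # when the first output token was produced
    last_token_emitted: float   # when the final output token was produced
    output_tokens: int          # total number of output tokens generated


def time_to_first_token(t: RequestTiming) -> float:
    """Latency from request arrival until the first output token."""
    return t.first_token_emitted - t.request_received


def time_per_output_token(t: RequestTiming) -> Optional[float]:
    """Average latency per token generated after the first token.

    Returns None when fewer than two tokens were produced, because there is
    no decode interval to average over in that case.
    """
    if t.output_tokens < 2:
        return None
    decode_time = t.last_token_emitted - t.first_token_emitted
    return decode_time / (t.output_tokens - 1)
```

Both values are per-request durations, so a server would typically record each one into a histogram instead of exporting a single running average.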

Merge requirement checklist

@achandrasekar achandrasekar requested review from a team May 31, 2024 18:19
@achandrasekar achandrasekar requested a review from a team May 31, 2024 18:27
@achandrasekar
Contributor Author

cc @lmolkova @SergeyKanzhelev

@achandrasekar
Contributor Author

cc @jsuereth to take a look as well

@drewby drewby self-requested a review June 13, 2024 06:39
achandrasekar and others added 2 commits June 20, 2024 20:16
Co-authored-by: Drew Robbins <drew@drewby.com>
Co-authored-by: Liudmila Molkova <limolkova@microsoft.com>
Member

@drewby drewby left a comment

LGTM

@joaopgrassi joaopgrassi merged commit a328d73 into open-telemetry:main Jun 27, 2024
13 checks passed
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

Semantic conventions for LLM model server metrics
7 participants