Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query: high number of goroutines due to partial deadlock #4925

Closed
aymericDD opened this issue Dec 6, 2021 · 3 comments
Closed

Query: high number of goroutines due to partial deadlock #4925

aymericDD opened this issue Dec 6, 2021 · 3 comments

Comments

@aymericDD
Copy link
Contributor

aymericDD commented Dec 6, 2021

Thanos, Prometheus and Golang version used:

thanos, version 0.23.1 (branch: HEAD, revision: 5327cd8)
build user: root@0acc901868e9
build date: 20211005-12:08:29
go version: go1.16.8
platform: linux/amd64

Object Storage Provider:

  • S3

What happened:

The number of goroutines of the thanos query component explode when the number of requests is growing :
image
Selection_144

The Thanos query stack all queries and never return responses. No logs are returned because the goroutines are blocked due to a semaphore problem. Cf goroutine pprof :
pprof.thanos.goroutine.005.pb.gz

What you expected to happen:

All requests must work and the number of goroutines must not grow.

How to reproduce it (as minimally and precisely as possible):

It is difficult to reproduce it. The way to reproduce it is to spam the query with a lot of requests.

Full logs to relevant components:

No logs are displayed when the partial deadlock is in place.

Anything else we need to know:

@aymericDD
Copy link
Contributor Author

The issue is fixed by #4795
Can this fix be backported to v0.23.1 to avoid outages ? This is very critical for production environments. Thanks for your support and thanks @GiedriusS for the fix 👍

@aymericDD
Copy link
Contributor Author

PR to backport fix to release-23 #4926

@yeya24
Copy link
Contributor

yeya24 commented Dec 12, 2021

Close as the new release is out.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants