Support in-memory HTTP connections for scraping #3602

rfratto · 2023-04-23T19:59:33Z

This PR integrates github.com/rfratto/ckit/memconn into Flow. memconn was already in use by the integrations-next subsystem in static mode.

memconn allows establishing fully in-memory HTTP connections to avoid the network stack. This is primarily useful in the presence of mTLS, where otherwise Flow would need to know how to connect to itself as a client with the proper TLS certificates.
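To illustrate the idea, here is a minimal sketch of an in-memory HTTP round trip using only the Go standard library. It is not memconn's API: `memListener`, `newMemListener`, and `fetchInMemory` are hypothetical names, and the URL host and port are placeholders that are never resolved because the client's dialer ignores the address.

```go
package main

import (
	"context"
	"fmt"
	"io"
	"net"
	"net/http"
	"sync"
)

// memListener is a hypothetical in-memory net.Listener built on net.Pipe.
// It illustrates the technique only; it is not the memconn implementation.
type memListener struct {
	conns chan net.Conn
	done  chan struct{}
	once  sync.Once
}

type memAddr struct{}

func (memAddr) Network() string { return "memory" }
func (memAddr) String() string  { return "memory" }

func newMemListener() *memListener {
	return &memListener{conns: make(chan net.Conn), done: make(chan struct{})}
}

// Accept hands the server side of a pipe to the HTTP server.
func (l *memListener) Accept() (net.Conn, error) {
	select {
	case c := <-l.conns:
		return c, nil
	case <-l.done:
		return nil, net.ErrClosed
	}
}

func (l *memListener) Close() error   { l.once.Do(func() { close(l.done) }); return nil }
func (l *memListener) Addr() net.Addr { return memAddr{} }

// DialContext creates a pipe and gives the client side to the caller.
func (l *memListener) DialContext(ctx context.Context) (net.Conn, error) {
	client, server := net.Pipe()
	select {
	case l.conns <- server:
		return client, nil
	case <-l.done:
		client.Close()
		server.Close()
		return nil, net.ErrClosed
	case <-ctx.Done():
		client.Close()
		server.Close()
		return nil, ctx.Err()
	}
}

// fetchInMemory serves /metrics on the in-memory listener and fetches it
// with an HTTP client whose transport never touches the network stack.
func fetchInMemory() (string, error) {
	lis := newMemListener()
	defer lis.Close()

	mux := http.NewServeMux()
	mux.HandleFunc("/metrics", func(w http.ResponseWriter, _ *http.Request) {
		fmt.Fprint(w, "up 1\n")
	})
	srv := &http.Server{Handler: mux}
	go srv.Serve(lis)
	defer srv.Close()

	client := &http.Client{Transport: &http.Transport{
		// The network and address arguments are ignored: every
		// connection goes to the in-memory listener.
		DialContext: func(ctx context.Context, _, _ string) (net.Conn, error) {
			return lis.DialContext(ctx)
		},
	}}
	// Placeholder host and port (an assumption for this sketch).
	resp, err := client.Get("http://agent.internal:12345/metrics")
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	body, err := io.ReadAll(resp.Body)
	return string(body), err
}

func main() {
	body, err := fetchInMemory()
	if err != nil {
		panic(err)
	}
	fmt.Print(body)
}
```

Because the client never opens a socket, the TLS configuration of any real listener is irrelevant to this path, which is the property that matters in the mTLS case.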

As of this PR, in-memory HTTP connections are only used for prometheus.exporter.* components and prometheus.scrape, where prometheus.exporter.* components export an in-memory address, and prometheus.scrape can use a custom dialer to connect to that address.
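As a rough sketch of the dialer side (not the code in this PR), a custom dialer can send connections for the in-memory address to an in-memory dial function and everything else to a regular network dialer. The names `dialFunc`, `routeDialer`, and `routeFor` are hypothetical, and `agent.internal:12345` is an assumed address used only for illustration.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"net"
)

// dialFunc matches the signature of http.Transport.DialContext.
type dialFunc func(ctx context.Context, network, addr string) (net.Conn, error)

// routeDialer returns a dialFunc that sends connections for memAddr to
// memDial and all other addresses to fallback. In real use, fallback
// could be (&net.Dialer{}).DialContext.
func routeDialer(memAddr string, memDial func(context.Context) (net.Conn, error), fallback dialFunc) dialFunc {
	return func(ctx context.Context, network, addr string) (net.Conn, error) {
		if addr == memAddr {
			return memDial(ctx)
		}
		return fallback(ctx, network, addr)
	}
}

// routeFor reports which path a dial to addr takes, using stub dialers so
// the demo does not need network access.
func routeFor(addr string) string {
	used := ""
	memDial := func(context.Context) (net.Conn, error) {
		used = "memory"
		client, server := net.Pipe()
		server.Close() // no server is attached in this demo
		return client, nil
	}
	fallback := func(context.Context, string, string) (net.Conn, error) {
		used = "network"
		return nil, errors.New("network dialing disabled in this demo")
	}

	// "agent.internal:12345" is an assumed in-memory address.
	dial := routeDialer("agent.internal:12345", memDial, fallback)
	if conn, err := dial(context.Background(), "tcp", addr); err == nil {
		conn.Close()
	}
	return used
}

func main() {
	fmt.Println(routeFor("agent.internal:12345"))
	fmt.Println(routeFor("example.com:80"))
}
```

Matching on the exact address keeps ordinary targets on the network path, so only the exported in-memory targets bypass the network stack.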

I am not really happy with the API as-is, but @mattdurham is actively refactoring the controller/module API, which should allow this logic to be implemented in a cleaner way. For now, I think the slightly ugly API is acceptable.

Closes #2984.

Related to #2715, grafana/alloy#456.

This adds an in-memory HTTP traffic listener so components which communicate to other components through HTTP (e.g., for scraping Prometheus metrics endpoints) do not need to use the network. This is required to allow prometheus.scrape to continue to collect metrics from prometheus.exporter.* components if the HTTP server is protected by mTLS; otherwise, prometheus.scrape will not work until configured with support for mTLS. Closes grafana#2984.

This updates prometheus.scrape to use the custom dialer passed to it for opening connections to targets. This allows prometheus.scrape to properly scrape metrics from the in-memory HTTP listener.

cmd/internal/flowmode/cmd_run.go

mattdurham · 2023-04-24T14:13:02Z

We previously added this to the upgrade guide; do we want to do the same here?

rfratto · 2023-04-24T14:15:53Z

> We previously added this to the upgrade guide do we want to do the same here?

I don't think so. Before, we documented it because we removed a field in integrations relevant to self-scraping over the network, but there's nothing equivalent here.

This should be a fully transparent change to Flow users, and sets us up for the future where we have TLS support.

mattdurham

LGTM

CHANGELOG.md

tpaschalis · 2023-04-24T14:30:35Z

docs/sources/flow/reference/components/prometheus.exporter.snmp.md

+The exported targets will use the configured [in-memory traffic][] address
+specified by the [run command][].


I'm not sure why we document this phrase on the exporters.

Isn't prometheus.scrape the one initiating the connection towards the targets via the in-memory address?

My thought with documenting it was to make sure that people knew it was a special target and not something they could scrape from outside of the agent.

It threw me off a bit, since the fact that the target is even network-based was an implementation detail that the user didn't need to know, but I get it from that perspective.

Up to you whether you want it or not, it's okay by me.

Right now my gut is saying we should leave this in (at least for now), since it sits on the border between an implementation detail and something users need to know about, especially if agent.internal happens to be a real address on the user's DNS servers.

Co-authored-by: Paschalis Tsilias <tpaschalis@users.noreply.github.com>

* flow: support in-memory HTTP traffic

  This adds an in-memory HTTP traffic listener so components which communicate to other components through HTTP (e.g., for scraping Prometheus metrics endpoints) do not need to use the network. This is required to allow prometheus.scrape to continue to collect metrics from prometheus.exporter.* components if the HTTP server is protected by mTLS; otherwise, prometheus.scrape will not work until configured with support for mTLS. Closes #2984.

* prometheus.scrape: use custom dialer for opening conns to targets

  This updates prometheus.scrape to use the custom dialer passed to it for opening connections to targets. This allows prometheus.scrape to properly scrape metrics from the in-memory HTTP listener.

* docs: document in-memory traffic in Flow

* misc: add CHANGELOG entry for in-memory HTTP traffic

Co-authored-by: Paschalis Tsilias <tpaschalis@users.noreply.github.com>

rfratto added 4 commits April 23, 2023 13:58

prometheus.scrape: use custom dialer for opening conns to targets

3b18df1

This updates prometheus.scrape to use the custom dialer passed to it for opening connections to targets. This allows prometheus.scrape to properly scrape metrics from the in-memory HTTP listener.

docs: document in-memory traffic in Flow

b1e48da

misc: add CHANGELOG entry for in-memory HTTP traffic

76719be

rfratto mentioned this pull request Apr 23, 2023

Add Auth Basic to Grafana Agent API grafana/alloy#456

Open

rfratto requested review from tpaschalis and ptodev April 24, 2023 01:43

mattdurham reviewed Apr 24, 2023


cmd/internal/flowmode/cmd_run.go (outdated)

flow: log addr of closed HTTP listener

407450b

mattdurham approved these changes Apr 24, 2023


tpaschalis reviewed Apr 24, 2023


CHANGELOG.md (outdated)

tpaschalis reviewed Apr 24, 2023


tpaschalis approved these changes Apr 24, 2023


rfratto and others added 2 commits April 24, 2023 11:18

Update CHANGELOG.md

24ac591

Co-authored-by: Paschalis Tsilias <tpaschalis@users.noreply.github.com>

Merge branch 'main' into flow-memconn-support

def53d8

rfratto enabled auto-merge (squash) April 24, 2023 16:56

rfratto merged commit c5a8d95 into grafana:main Apr 24, 2023

github-actions bot added the frozen-due-to-age label Feb 29, 2024

github-actions bot locked as resolved and limited conversation to collaborators Feb 29, 2024
