
Iteratively execute synchronous ingest processors #84250

Merged
25 commits merged on Apr 12, 2022

Conversation

danhermann
Contributor

@danhermann danhermann commented Feb 22, 2022

Iteratively executes ingest processors that are not asynchronous. This resolves the issue of stack overflows that could occur in large ingest pipelines. As a secondary effect, it dramatically reduces the depth of the stack during ingest pipeline execution, which reduces the performance cost of gathering a stack trace for any exceptions instantiated during pipeline execution. The aggregate performance effect of these changes has been observed to be a 10-15% improvement in some larger pipelines.
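
As a rough illustration of the second point (not code from this PR; the class and method names below are made up for the example), exception construction gets more expensive as the call stack deepens, because the JVM captures the full stack trace when the throwable is created:

```java
// Illustrative micro-benchmark only: Throwable construction calls fillInStackTrace(),
// which walks every frame on the current thread's stack, so a deeper stack makes each
// exception more expensive even if it is never thrown.
public class StackDepthCost {

    public static void main(String[] args) {
        for (int depth : new int[] {10, 100, 1000}) {
            System.out.printf("depth %4d: %d ns for 10k exceptions%n", depth, atDepth(depth));
        }
    }

    // Recurse to the requested depth, then time exception creation from that depth.
    static long atDepth(int depth) {
        if (depth > 0) {
            return atDepth(depth - 1);
        }
        long start = System.nanoTime();
        for (int i = 0; i < 10_000; i++) {
            new Exception(); // construction captures the whole stack trace
        }
        return System.nanoTime() - start;
    }
}
```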

The primary changes required were adding iterative execution loops to CompoundProcessor and ForEachProcessor. Most of the other changes were consequences of changing those two processors.

Fixes #84274.

@danhermann danhermann added the >enhancement, :Data Management/Ingest Node (Execution or management of Ingest Pipelines including GeoIP), and v8.2.0 labels Feb 22, 2022
@elasticsearchmachine
Collaborator

Hi @danhermann, I've created a changelog YAML for you.

@danhermann danhermann marked this pull request as ready for review February 24, 2022 16:50
@elasticmachine elasticmachine added the Team:Data Management (Meta label for data/management team) label Feb 24, 2022
@elasticmachine
Collaborator

Pinging @elastic/es-data-management (Team:Data Management)

@danhermann danhermann marked this pull request as draft February 24, 2022 16:51
@danhermann danhermann marked this pull request as ready for review March 4, 2022 15:09
@danhermann danhermann marked this pull request as draft March 4, 2022 15:10
@danhermann danhermann changed the title Ingest pipeline performance improvements Iteratively execute synchronous ingest processors Mar 5, 2022
@elasticsearchmachine
Collaborator

Hi @danhermann, I've updated the changelog YAML for you.

@danhermann danhermann marked this pull request as ready for review March 5, 2022 18:40
@elastic elastic deleted a comment from elasticmachine Mar 7, 2022
@danhermann
Contributor Author

@elasticmachine update branch

@masseyke
Member

Adding a brief description of how CompoundProcessor works since I was initially thrown off:
It iterates through the non-async processors, executing them inline until it comes across an async processor. It then executes that one asynchronously and recurses one stack frame deeper, where it again iterates through the non-async processors until it hits the next async processor, and so on. So we still have recursion, but only one level per async processor. The synchronous processors won't all run at the same stack level in the same for loop, but we at least no longer add a level for each synchronous processor.
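
A minimal sketch of that control flow, assuming a simplified Processor interface with an isAsync() flag plus synchronous and asynchronous execute variants (the real CompoundProcessor also has to handle on_failure processors, conditionals, and metrics):

```java
import java.util.List;
import java.util.function.BiConsumer;

// Simplified sketch of the iterative pattern described above, not the actual
// CompoundProcessor implementation.
class CompoundProcessorSketch {

    interface Processor {
        boolean isAsync();
        IngestDocument execute(IngestDocument doc) throws Exception;                     // sync variant
        void execute(IngestDocument doc, BiConsumer<IngestDocument, Exception> handler); // async variant
    }

    record IngestDocument() {}

    private final List<Processor> processors;

    CompoundProcessorSketch(List<Processor> processors) {
        this.processors = processors;
    }

    void execute(int start, IngestDocument doc, BiConsumer<IngestDocument, Exception> handler) {
        for (int i = start; i < processors.size(); i++) {
            Processor processor = processors.get(i);
            if (processor.isAsync() == false) {
                try {
                    doc = processor.execute(doc); // synchronous: run inline, stay in this loop
                } catch (Exception e) {
                    handler.accept(null, e);
                    return;
                }
            } else {
                final int nextIndex = i + 1;
                // asynchronous: hand off, then resume the loop one recursion level deeper
                // when the callback fires, so only one extra frame per async processor
                processor.execute(doc, (resultDoc, e) -> {
                    if (e != null) {
                        handler.accept(null, e);
                    } else {
                        execute(nextIndex, resultDoc, handler);
                    }
                });
                return;
            }
        }
        handler.accept(doc, null);
    }
}
```

With this shape, a pipeline made up entirely of synchronous processors runs in a single loop regardless of its length, which is what removes the stack-overflow risk for large pipelines.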

@masseyke
Member

@elasticmachine update branch

@martijnvg
Member

@elasticmachine update branch

@masseyke
Member

@elasticmachine run elasticsearch-ci/bwc

@martijnvg
Member

@martijnvg martijnvg left a comment

LGTM

@masseyke
Member

I ran a couple of rally tests to make sure that we don't see a performance regression with this. For all tests I used a one-node Elasticsearch cluster and a separate node running rally.
First I ran the ingest_pipeline:baseline track that comes with rally. This doesn't have many processors, but it was a good check that we hadn't accidentally messed up performance:

esrally race --pipeline=benchmark-only --target-hosts="http://10.10.26.75:9200/" --track="http_logs" --challenge="append-index-only-with-ingest-pipeline" --track-params="ingest_pipeline:baseline" --client-options=~/cluster2-client-options.json --on-error=abort --track-params="bulk_indexing_clients:16,number_of_shards:1"

The median throughput improved by about 4.5%.
Next I ran the solutions/logs track in rally-internal-tracks, after configuring it to use apache access logs for 100% of the data.

esrally race --pipeline=benchmark-only --target-hosts="http://10.10.26.75:9200"  --track="solutions/logs" --client-options=~/cluster2-client-options.json --on-error=abort  --track-repository=rally-internal-tracks --track-params="bulk_indexing_clients:16,number_of_shards:1" --kill-running-processes

I ran that multiple times (since the first run on a fresh ES was always slowest), and on the subsequent runs I consistently saw the Total Ingest Pipeline time at around 190s before the change and around 170s after, roughly an 11% improvement.

@masseyke masseyke merged commit cce3d92 into elastic:master Apr 12, 2022
weizijun added a commit to weizijun/elasticsearch that referenced this pull request Apr 13, 2022
* upstream/master: (40 commits)
  Fix BuildTests serialization (elastic#85827)
  Use urgent priority for node shutdown cluster state update (elastic#85838)
  Remove Task classes from HLRC (elastic#85835)
  Remove unused migration classes (elastic#85834)
  Remove uses of Charset name parsing (elastic#85795)
  Remove legacy versioned logic for DefaultSystemMemoryInfo (elastic#85761)
  Expose proxy settings for GCS repositories (elastic#85785)
  Remove SLM classes from HLRC (elastic#85825)
  TSDB: fix the time_series in order collect priority (elastic#85526)
  Remove ILM classes from HLRC (elastic#85822)
  FastVectorHighlighter should use ValueFetchers to load source data (elastic#85815)
  Iteratively execute synchronous ingest processors (elastic#84250)
  Remove TransformClient from HLRC  (elastic#85787)
  Mute XPackRestIT deprecation/10_basic/Test Deprecations (elastic#85807)
  Unmute Lintian packaging test (elastic#85778)
  Add a highlighter unit test base class (elastic#85719)
  Remove NIO Transport Plugin (elastic#82085)
  [TEST] Remove token methods from HLRC SecurityClient (elastic#85515)
  [Test] Use thread-safe hashSet for result collection (elastic#85653)
  [TEST] Mute BuildTests.testSerialization (elastic#85801)
  ...

# Conflicts:
#	server/src/test/java/org/elasticsearch/search/aggregations/timeseries/TimeSeriesIndexSearcherTests.java
@DJRickyB
Contributor

Thanks @masseyke for pushing this across the finish line. It had a positive impact on the notoriously hard-to-pin-down conditional processor as well as script and geoip in our nightly benchmarks. I have annotated the charts: https://elasticsearch-benchmarks.elastic.co/#tracks/logging/nightly/default/30d

Labels
:Data Management/Ingest Node (Execution or management of Ingest Pipelines including GeoIP), >enhancement, release highlight, Team:Data Management (Meta label for data/management team), v8.3.0
Development

Successfully merging this pull request may close these issues.

Use of recursion rather than iteration in CompoundProcessor limits ingest pipeline length
8 participants