Make segment merging parallel in SegmentMerger [LUCENE-8580] #9626

Open
asfimport opened this issue Nov 30, 2018 · 4 comments

A placeholder issue stemming from the discussion on the mailing list [1]. Not of any high priority.

At the moment, merging N segments into one happens sequentially for each data structure involved in a segment (postings, norms, points, etc.). If the input segments are large, the CPU (and I/O) is mostly idle and the process takes a long time.

Merging these data structures is mostly independent of each other, so it'd be interesting to see if we can speed things up by allowing them to run concurrently. I investigated this on a 40GB index with 22 segments, force-merging it into one segment (of similar size). A quick-and-dirty patch is attached.
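
In spirit (a hedged sketch, not the attached patch), the approach fans each per-structure merge out as its own task and joins at the end; the merge*() methods below are hypothetical stand-ins for SegmentMerger's sequential subroutines:

```java
import java.util.concurrent.CompletableFuture;

public class ParallelMergeSketch {
  public static void main(String[] args) {
    // Fan out one task per data structure on the common ForkJoinPool.
    CompletableFuture<Void> norms     = CompletableFuture.runAsync(ParallelMergeSketch::mergeNorms);
    CompletableFuture<Void> docValues = CompletableFuture.runAsync(ParallelMergeSketch::mergeDocValues);
    CompletableFuture<Void> points    = CompletableFuture.runAsync(ParallelMergeSketch::mergePoints);
    CompletableFuture<Void> stored    = CompletableFuture.runAsync(ParallelMergeSketch::mergeStoredFields);
    CompletableFuture<Void> postings  = CompletableFuture.runAsync(ParallelMergeSketch::mergePostings);
    // Wait for all of them; total time is bounded by the slowest structure.
    CompletableFuture.allOf(norms, docValues, points, stored, postings).join();
  }

  // Hypothetical stand-ins for SegmentMerger's per-structure subroutines.
  private static void mergeNorms() {}
  private static void mergeDocValues() {}
  private static void mergePoints() {}
  private static void mergeStoredFields() {}
  private static void mergePostings() {}
}
```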

I see some improvement, although not by much; the largest component dominates everything else.

Results from an 8-core CPU.
Before:

SM 0 [2018-11-30T09:21:11.662Z; main]: 347237 msec to merge stored fields [41922110 docs]
SM 0 [2018-11-30T09:21:18.236Z; main]: 6562 msec to merge norms [41922110 docs]
SM 0 [2018-11-30T09:33:53.746Z; main]: 755507 msec to merge postings [41922110 docs]
SM 0 [2018-11-30T09:33:53.746Z; main]: 0 msec to merge doc values [41922110 docs]
SM 0 [2018-11-30T09:33:53.746Z; main]: 0 msec to merge points [41922110 docs]
SM 0 [2018-11-30T09:33:53.746Z; main]: 7 msec to write field infos [41922110 docs]

IW 0 [2018-11-30T09:33:56.124Z; main]: merge time 1112238 msec for 41922110 docs

After:

SM 0 [2018-11-30T10:16:42.179Z; ForkJoinPool.commonPool-worker-1]: 8189 msec to merge norms
SM 0 [2018-11-30T10:16:42.195Z; ForkJoinPool.commonPool-worker-3]: 0 msec to merge doc values
SM 0 [2018-11-30T10:16:42.195Z; ForkJoinPool.commonPool-worker-3]: 0 msec to merge points
SM 0 [2018-11-30T10:16:42.211Z; ForkJoinPool.commonPool-worker-1]: merge store matchedCount=22 vs 22
SM 0 [2018-11-30T10:23:24.574Z; ForkJoinPool.commonPool-worker-1]: 402381 msec to merge stored fields [41922110 docs]
SM 0 [2018-11-30T10:32:20.862Z; ForkJoinPool.commonPool-worker-2]: 938668 msec to merge postings

IW 0 [2018-11-30T10:32:23.513Z; main]: merge time 950249 msec for 41922110 docs

Ideally, one would push fork-join parallelism down into the individual subroutines so that, for example, postings merging utilizes concurrency internally (pulling blocks of terms concurrently from the input, calculating statistics, etc., and then pushing them to the codec in an ordered fashion).
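
To illustrate the ordered hand-off this would need, here's a hedged sketch in which blocks of terms are merged concurrently but written in submission order, so the codec still sees terms sorted (TermBlock, MergedBlock, and both methods are hypothetical):

```java
import java.util.ArrayDeque;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.CompletableFuture;

public class OrderedPostingsMergeSketch {
  // Hypothetical unit of work and its merged result.
  record TermBlock(String firstTerm) {}
  record MergedBlock(String firstTerm) {}

  // CPU-heavy part: merge postings for a block of terms, compute stats.
  static MergedBlock mergeBlock(TermBlock b) {
    return new MergedBlock(b.firstTerm());
  }

  // I/O part: must see blocks in term order, single-threaded.
  static void writeToCodec(MergedBlock b) {}

  public static void merge(List<TermBlock> blocks) {
    Queue<CompletableFuture<MergedBlock>> inFlight = new ArrayDeque<>();
    for (TermBlock b : blocks) {
      inFlight.add(CompletableFuture.supplyAsync(() -> mergeBlock(b)));
    }
    // Drain in submission order: blocks merge concurrently, but the codec
    // receives them in the original (sorted) order.
    while (!inFlight.isEmpty()) {
      writeToCodec(inFlight.poll().join());
    }
  }
}
```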

[1] https://markmail.org/thread/dtejwq42qagykeac


Migrated from LUCENE-8580 by Dawid Weiss (@dweiss), updated Apr 26 2022
Attachments: LUCENE-8580.patch

Dawid Weiss (@dweiss) (migrated from JIRA)

About time, eh?

Vigya Sharma (@vigyasharma) (migrated from JIRA)

I started looking into this and had some early questions. What would be the right benchmark for measuring any improvements here? Would it be the indexing-throughput benchmark?

Should we add a benchmark specifically for merge-related performance, like triggering a force merge? I suppose it wouldn't measure QPS, but rather the time to complete a merge on a given index. Do we have such benchmarks already?

Apologies if these are obvious things; I've only recently started looking at benchmarks. I'll update here if I figure any of this out myself.

Vigya Sharma (@vigyasharma) (migrated from JIRA)

I'm thinking of tackling this one data structure at a time, starting with postings.

Extending @dweiss's patch, I was wondering if we could merge each field in parallel, in a separate fork-join task. Merging terms in parallel is tricky because we want to retain their sorted order, but going one level deeper, we could parallelize merging the postings within a term: every ReaderSlice maps to a separate chunk of docIds in the target segment, so we could write postings for a term from each subreader in parallel, as in the sketch below.
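
As a hedged sketch of that idea (all names hypothetical): since slices map to disjoint, increasing docID ranges in the target segment, each slice's postings for a term can be pulled concurrently and then concatenated in slice order:

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.stream.IntStream;

public class PerSliceTermSketch {
  // Hypothetical: one sub-reader's postings for a single term, already
  // rebased into the target segment's docID space.
  record SlicePostings(int sliceOrd, int[] docs) {}

  static SlicePostings readSlice(int sliceOrd) {
    return new SlicePostings(sliceOrd, new int[0]); // placeholder
  }

  public static List<SlicePostings> mergeTerm(int numSlices) {
    // Pull each slice's postings concurrently; slices cover disjoint,
    // increasing docID ranges, so slice order == docID order.
    List<CompletableFuture<SlicePostings>> tasks =
        IntStream.range(0, numSlices)
            .mapToObj(i -> CompletableFuture.supplyAsync(() -> readSlice(i)))
            .toList();
    // Concatenate in slice order to get the final, ordered postings.
    return tasks.stream().map(CompletableFuture::join).toList();
  }
}
```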

For the fields part first, would it work to just spawn off multiple FieldsConsumer.write(...) calls in parallel, each with a different subset of fields (maybe one field per task)?
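
To make the question concrete, a purely illustrative fan-out (the FieldWriter interface is hypothetical; a real FieldsConsumer writes shared files, so each task would likely need its own consumer and output files):

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;

public class PerFieldWriteSketch {
  // Hypothetical per-field writer; stands in for whatever per-task
  // consumer a real implementation would need.
  interface FieldWriter {
    void write(String field);
  }

  public static void writeAll(List<String> fields, FieldWriter writer) {
    // One task per field, all running concurrently.
    CompletableFuture<?>[] tasks =
        fields.stream()
            .map(f -> CompletableFuture.runAsync(() -> writer.write(f)))
            .toArray(CompletableFuture[]::new);
    CompletableFuture.allOf(tasks).join();
  }
}
```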

The terms index has an FST per field, and I see we go field by field while writing terms and merging their postings, which makes this look viable. I'm not yet clear on whether it would require writing to multiple (tim/tip/tmd/other?) files, and whether that is a problem: do we need to stitch them together (which would drain away all our concurrency gains), or can we find a way to work with multiple files?

I'll explore how the files get written. I'd like to check with the community for any pointers: am I on the right track here, and are there obvious wrinkles I should look at?

Michael Sokolov (@msokolov) (migrated from JIRA)

It seems we have generally stuck with one or a handful of files per format, presumably to avoid fragmenting into many small files so that the OS can handle I/O more optimally? I think the key is not to impact search performance. But maybe start by writing many files in parallel just to see the best we can achieve for parallel merging with this approach, and worry about connecting the files together later? The stitching shouldn't be hard: just block copies plus rewriting the metadata.
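
For the stitching itself, a minimal sketch over Lucene's Directory API (the part-file naming and the metadata step are assumptions, not how any codec currently works):

```java
import java.io.IOException;
import java.util.List;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.IOContext;
import org.apache.lucene.store.IndexInput;
import org.apache.lucene.store.IndexOutput;

public class StitchSketch {
  // Concatenate per-task part files into one destination file.
  public static void stitch(Directory dir, List<String> parts, String dest) throws IOException {
    try (IndexOutput out = dir.createOutput(dest, IOContext.DEFAULT)) {
      for (String part : parts) {
        try (IndexInput in = dir.openInput(part, IOContext.DEFAULT)) {
          out.copyBytes(in, in.length()); // raw block copy of one part
        }
      }
    }
    // Per-field metadata (start offsets into the combined file) would be
    // rewritten separately; omitted here.
  }
}
```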

benwtrent added a commit that referenced this issue Mar 14, 2024
…ngle merge action (#13124)

This commit adds a new interface to all MergeScheduler classes that allows the scheduler to provide an Executor for intra-merge parallelism. The first sub-class to satisfy this new interface is the ConcurrentMergeScheduler (CMS). In particular, the CMS will limit the amount of parallelism based on the current number of mergeThreads and the configured maxThread options. 

Parallelism is now added by default to the SegmentMerger where each merging task is executed in parallel with the others. 

Additionally, the Lucene99 HNSW codec will use the provided MergeScheduler executor if no executor is provided directly.

Relates to: #12740
Relates to: #9626
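
As a small configuration sketch (the thread counts are illustrative, not recommendations), opting into the ConcurrentMergeScheduler that supplies the intra-merge executor:

```java
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.index.ConcurrentMergeScheduler;
import org.apache.lucene.index.IndexWriterConfig;

public class CmsConfigSketch {
  public static IndexWriterConfig newConfig() {
    ConcurrentMergeScheduler cms = new ConcurrentMergeScheduler();
    // Bound overall merge concurrency; intra-merge parallelism is then
    // limited by the scheduler based on these settings.
    cms.setMaxMergesAndThreads(6, 3); // maxMergeCount, maxThreadCount
    return new IndexWriterConfig(new StandardAnalyzer()).setMergeScheduler(cms);
  }
}
```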
benwtrent added a commit that referenced this issue Mar 21, 2024
…ngle merge action (#13190)

This commit adds a new interface to all MergeScheduler classes that allows the scheduler to provide an Executor for intra-merge parallelism. The first sub-class to satisfy this new interface is the ConcurrentMergeScheduler (CMS). In particular, the CMS will limit the amount of parallelism based on the current number of mergeThreads and the configured maxThread options. 

Parallelism is now added by default to the SegmentMerger where each merging task is executed in parallel with the others. 

Additionally, the Lucene99 HNSW codec will use the provided MergeScheduler executor if no executor is provided directly.

Relates to: #12740
Relates to: #9626

This is a take 2 of: #13124
sherman pushed a commit to sherman/lucene that referenced this issue Jun 3, 2024
…ngle merge action (apache#13190)
