Add support for S3 failover #4553

steven-sheehy · 2022-09-26T16:47:44Z

Description:

Add a ConsensusNode abstraction and use it everywhere instead of node account IDs or calling the address book multiple times per file
Add a connection timeout property and increase it to 5s to avoid occasional connection errors.
Add hedera.mirror.importer.downloader.sources properties that grandfathers legacy hedera.mirror.importer.downloader properties as the primary source in list
Change stream file persistence to write files to a node ID based path
Increase downloader batch size to 100 to improve historical sync speed
Refactor Downloader to push S3 logic into StreamFileProvider abstraction
Replace hedera.mirror.importer.downloader.*.batchSize with generic hedera.mirror.importer.downloader.batchSize
Replace hedera.mirror.importer.downloader.*.threads with generic hedera.mirror.importer.downloader.threads
Remove hedera.mirror.importer.downloader.*.prefix in favor of existing StreamType properties

Related issue(s):

Fixes #54
Fixes #4538

Notes for reviewer:

Checklist

Documented (Code comments, README, etc.)
Tested (unit, integration, etc.)

codecov · 2022-09-26T16:58:45Z

Codecov Report

Base: 92.71% // Head: 92.51% // Decreases project coverage by -0.20% ⚠️

Coverage data is based on head (17fbb83) compared to base (a483c5b).
Patch coverage: 88.60% of modified lines in pull request are covered.

Additional details and impacted files

@@             Coverage Diff              @@
##               main    #4553      +/-   ##
============================================
- Coverage     92.71%   92.51%   -0.21%     
- Complexity     2895     2926      +31     
============================================
  Files           532      535       +3     
  Lines         17172    17188      +16     
  Branches       1805     1810       +5     
============================================
- Hits          15921    15901      -20     
- Misses          904      940      +36     
  Partials        347      347

Impacted Files	Coverage Δ
...era/mirror/importer/config/CacheConfiguration.java	`100.00% <ø> (ø)`
...ownloader/balance/BalanceDownloaderProperties.java	`90.90% <ø> (-3.21%)`	⬇️
...er/downloader/event/EventDownloaderProperties.java	`90.90% <ø> (-3.21%)`	⬇️
.../downloader/record/RecordDownloaderProperties.java	`90.90% <ø> (-3.21%)`	⬇️
...ror/importer/addressbook/ConsensusNodeWrapper.java	`43.75% <43.75%> (ø)`
...ror/importer/config/CloudStorageConfiguration.java	`53.19% <53.33%> (-12.60%)`	⬇️
...ror/importer/downloader/NodeSignatureVerifier.java	`89.65% <66.66%> (+5.78%)`	⬆️
...ra/mirror/importer/domain/FileStreamSignature.java	`84.09% <78.57%> (-2.03%)`	⬇️
...or/importer/downloader/ConsensusValidatorImpl.java	`92.68% <80.00%> (-3.55%)`	⬇️
.../hedera/mirror/importer/domain/StreamFileData.java	`79.41% <87.50%> (+5.21%)`	⬆️
... and 26 more

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

...orter/src/main/java/com/hedera/mirror/importer/downloader/provider/S3StreamFileProvider.java

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

steven-sheehy · 2022-09-27T03:23:49Z

hedera-mirror-importer/src/main/java/com/hedera/mirror/importer/downloader/Downloader.java

-                    }
-                })
-                .filter(s -> s != null && s.getFileType() == SIGNATURE)
-                .collect(groupingBy(StreamFilename::getInstant, maxBy(StreamFilename.EXTENSION_COMPARATOR)));


I removed the grouping by instant functionality. This would only occur when the account balance csv and pg.gz file were uploaded simultaneously for a short period of time.

It's fine to download either one as they're both equally valid so we can simplify our logic to just rely on the single layer of grouping by StreamFilename (which uses filename without compression suffix as distinct key) via the multimap. That way we don't pay the cost of nested grouping all files for a temporary window.

edwin-greene

Looks good to me.

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

xin-hedera

looks good!

...or-importer/src/main/java/com/hedera/mirror/importer/addressbook/AddressBookServiceImpl.java

...ror-importer/src/main/java/com/hedera/mirror/importer/downloader/ConsensusValidatorImpl.java

...irror-importer/src/main/java/com/hedera/mirror/importer/downloader/DownloaderProperties.java

pom.xml

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

xin-hedera

LGTM

sonarcloud · 2022-09-29T18:28:43Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
5 Code Smells

No Coverage information
0.0% Duplication

edwin-greene

Looks good

mgoelswirlds

lgtm

* Add a ConsensusNode abstraction and use it everywhere instead of node account IDs or calling the address book multiple times per file * Add a connection timeout property and increase it to 5s to avoid occasional connection errors. * Add hedera.mirror.importer.downloader.sources properties that grandfathers legacy hedera.mirror.importer.downloader properties as the primary source in list * Change stream file persistence to write files to a node ID based path * Increase downloader batch size to 100 to improve historical sync speed * Refactor Downloader to push S3 logic into StreamFileProvider abstraction * Replace hedera.mirror.importer.downloader.*.batchSize with generic hedera.mirror.importer.downloader.batchSize * Replace hedera.mirror.importer.downloader.*.threads with generic hedera.mirror.importer.downloader.threads * Remove hedera.mirror.importer.downloader.*.prefix in favor of existing StreamType properties Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com> Signed-off-by: mgoelswirlds <mugdha.goel@swirldslabs.com>

steven-sheehy added enhancement Type: New feature downloader Area: S3 downloader breaking Contains a breaking change that warrants mention in the release notes labels Sep 26, 2022

steven-sheehy added this to the 0.66.0 milestone Sep 26, 2022

steven-sheehy self-assigned this Sep 26, 2022

edwin-greene reviewed Sep 26, 2022

View reviewed changes

...orter/src/main/java/com/hedera/mirror/importer/downloader/provider/S3StreamFileProvider.java Show resolved Hide resolved

steven-sheehy added 2 commits September 26, 2022 16:50

Add support for S3 failover

50faa27

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

Fix tests and add license

21779a8

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

steven-sheehy force-pushed the 54-s3-failover branch from 8ade773 to 21779a8 Compare September 26, 2022 21:50

Fix code smells

257acb2

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

steven-sheehy marked this pull request as ready for review September 26, 2022 22:55

steven-sheehy requested a review from a team September 26, 2022 22:55

steven-sheehy commented Sep 27, 2022

View reviewed changes

edwin-greene previously approved these changes Sep 27, 2022

View reviewed changes

Fix node stake cache eviction

6785dcc

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

steven-sheehy dismissed edwin-greene’s stale review via 6785dcc September 28, 2022 17:04

xin-hedera requested changes Sep 29, 2022

View reviewed changes

steven-sheehy added 2 commits September 29, 2022 11:50

Merge remote-tracking branch 'origin/main' into 54-s3-failover

7c3a861

Address review feedback

17fbb83

Signed-off-by: Steven Sheehy <steven.sheehy@swirldslabs.com>

xin-hedera approved these changes Sep 29, 2022

View reviewed changes

edwin-greene approved these changes Sep 29, 2022

View reviewed changes

mgoelswirlds approved these changes Sep 29, 2022

View reviewed changes

steven-sheehy merged commit 7a96ddb into main Sep 29, 2022

steven-sheehy deleted the 54-s3-failover branch September 29, 2022 20:35

steven-sheehy linked an issue Sep 30, 2022 that may be closed by this pull request

Use NodeId instead of NodeAccountId in Node Signature Verification Flow #1976

Closed

This was referenced Sep 30, 2022

Use NodeId instead of NodeAccountId in Node Signature Verification Flow #1976

Closed

Fix GCS downloader regression #4571

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for S3 failover #4553

Add support for S3 failover #4553

steven-sheehy commented Sep 26, 2022 •

edited

Loading

codecov bot commented Sep 26, 2022 •

edited

Loading

steven-sheehy Sep 27, 2022 •

edited

Loading

edwin-greene left a comment

xin-hedera left a comment

xin-hedera left a comment

sonarcloud bot commented Sep 29, 2022

edwin-greene left a comment

mgoelswirlds left a comment

Add support for S3 failover #4553

Add support for S3 failover #4553

Conversation

steven-sheehy commented Sep 26, 2022 • edited Loading

codecov bot commented Sep 26, 2022 • edited Loading

Codecov Report

steven-sheehy Sep 27, 2022 • edited Loading

Choose a reason for hiding this comment

edwin-greene left a comment

Choose a reason for hiding this comment

xin-hedera left a comment

Choose a reason for hiding this comment

xin-hedera left a comment

Choose a reason for hiding this comment

sonarcloud bot commented Sep 29, 2022

edwin-greene left a comment

Choose a reason for hiding this comment

mgoelswirlds left a comment

Choose a reason for hiding this comment

steven-sheehy commented Sep 26, 2022 •

edited

Loading

codecov bot commented Sep 26, 2022 •

edited

Loading

steven-sheehy Sep 27, 2022 •

edited

Loading