CLI: fast summary section rewrites #677

wkalt · 2022-10-25T02:54:04Z

Currently the add attachment and add metadata subcommands rewrite the entire file, which takes time proportional to the size of the file. It would be much faster to rewrite only the summary section. This needs some tricky code to bump all the appropriate summary section indexes, depending on where the summary offsets for the metadata/attachment sections reside, but it should be possible.
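To make the "bump the indexes" step concrete, here is a minimal sketch in Python (not the CLI's actual code). It assumes each summary offset entry carries an absolute byte offset, and that inserting `delta` bytes at position `insert_at` shifts every group that starts at or after that position. The `SummaryOffset` class and `bump_offsets` function are hypothetical names for illustration, and the opcode values in the usage lines are placeholders.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class SummaryOffset:
    """Hypothetical stand-in for one summary offset entry."""
    group_opcode: int   # which record type the group holds (placeholder values below)
    group_start: int    # absolute byte offset of the group in the file
    group_length: int   # byte length of the group


def bump_offsets(offsets: List[SummaryOffset], insert_at: int, delta: int) -> List[SummaryOffset]:
    """Return new entries with every group at or after insert_at shifted by delta bytes.

    Groups that start before insert_at are returned unchanged.
    """
    return [
        replace(o, group_start=o.group_start + delta) if o.group_start >= insert_at else o
        for o in offsets
    ]


# Usage: 64 bytes are inserted at offset 1050, so only the second group moves.
before = [SummaryOffset(0x08, 1000, 50), SummaryOffset(0x0A, 1050, 30)]
after = bump_offsets(before, insert_at=1050, delta=64)
```

Any other absolute offsets that point at or past the insertion point (for example the footer's summary start fields) would presumably need the same treatment; the sketch only covers the offset entries themselves.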

One thing to be aware of: we will lose crash safety compared with the current tmpfile + move approach. Options to mitigate this seem to be a) copying the original file to an "original file" location, which has the downside of still being proportional to file size, or b) pairing this with a robust "reindex" subcommand that regenerates a valid summary section from the presumed-valid original data section (we should not need to touch the data section). Option b) seems best IMO, since it keeps the average case very fast.
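For illustration, the trade-off between the two strategies can be sketched in Python as follows. This is a simplified model, not the CLI's code: the file is treated as an untouched head followed by a replaceable tail, and `rewrite_via_tmpfile`, `rewrite_tail_in_place`, `tail_start`, and `new_tail` are hypothetical names. The first function does work proportional to file size but never leaves a half-written file at `path`; the second only touches the tail, but a crash mid-write leaves the file without a valid tail.

```python
import os
import tempfile


def rewrite_via_tmpfile(path: str, tail_start: int, new_tail: bytes) -> None:
    """Crash-safe variant: copy the head to a temp file, append the new tail, rename."""
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temp file in the same directory so the final rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as dst, open(path, "rb") as src:
            remaining = tail_start
            while remaining > 0:
                chunk = src.read(min(remaining, 1 << 20))
                if not chunk:
                    raise ValueError("tail_start is past the end of the file")
                dst.write(chunk)
                remaining -= len(chunk)
            dst.write(new_tail)
            dst.flush()
            os.fsync(dst.fileno())
        # The original is replaced only after the new file is fully written.
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def rewrite_tail_in_place(path: str, tail_start: int, new_tail: bytes) -> None:
    """Fast variant: drop everything from tail_start onward and write the new tail.

    Cost depends only on the tail size, but a crash between truncate and the
    end of the write leaves the file with a missing or partial tail.
    """
    with open(path, "r+b") as f:
        f.seek(tail_start)
        f.truncate()
        f.write(new_tail)
        f.flush()
        os.fsync(f.fileno())
```

Both functions produce the same final bytes when they complete; they differ only in cost and in what is left on disk after a crash, which is the gap option b) is meant to cover.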


jtbandes · 2022-10-25T23:19:59Z

> a robust "reindex" subcommand that will regenerate a valid summary section from the presumed-valid original data section

Isn't this what you get when you run recover on a file with a valid data section? Is your point that reindex should also be in-place?

wkalt · 2022-10-28T01:11:48Z

@jtbandes I don't think there's any reason "reindex" needs to be in place, since the expectation is that it would only be run after uncommon failures of this function. If the recover command already does this, I think that's sufficient.

wkalt · 2023-03-17T17:40:17Z

internal ticket: https://linear.app/foxglove/issue/FG-2526/mcap-cli-fast-summary-section-rewrites

jtbandes · 2024-05-24T23:02:25Z

Looks like this was done in #915

wkalt added the feature label Oct 25, 2022

wkalt mentioned this issue Oct 28, 2022

CLI: Ability to write in columnar order #686

Closed

jtbandes added the cli label Nov 11, 2022

jtbandes closed this as completed May 24, 2024
