Replace S3Boto3Storage._compress_content with a streaming implementation #1061

vainu-arto · 2021-09-21T12:28:46Z

The original version reads the entire content in one go, compresses and
places it into a buffer in memory. This limits both the possible size of
the file that can be saved and compressed, and also the performance of the
transfer.

The original version reads the entire content in one go, compresses and places it into a buffer in memory. This limits both the possible size of the file that can be saved and compressed, and also the performance of the transfer.

jschneier · 2021-10-03T19:32:25Z

This handles setting mtime=0, right?

vainu-arto · 2021-10-04T04:11:29Z

Yes, mtime in the header will always be zero. In my understanding it isn't actually possible to set it to any other value when allowing zlib to generate the header itself.

jschneier · 2021-10-07T02:01:40Z

How was this tested?

vainu-arto · 2021-10-07T04:52:59Z

I ran random sets of data through it and the stdlib GzipFile, decompressed (with GzipFile and system gzip) and compared the output between each other and the original. I couldn't find a way to make it produce output that could not be decompressed or would differ from the original.

jschneier · 2021-10-07T15:53:46Z

Thanks

…#1061) The original version reads the entire content in one go, compresses and places it into a buffer in memory. This limits both the possible size of the file that can be saved and compressed, and also the performance of the transfer.

vainu-arto force-pushed the streaming-compression branch 2 times, most recently from 34e5c95 to 6f273e9 Compare September 21, 2021 12:45

vainu-arto force-pushed the streaming-compression branch from 6f273e9 to 38b32bf Compare September 21, 2021 12:46

jschneier merged commit 544a9f9 into jschneier:master Oct 7, 2021

vainu-arto deleted the streaming-compression branch October 8, 2021 04:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace S3Boto3Storage._compress_content with a streaming implementation #1061

Replace S3Boto3Storage._compress_content with a streaming implementation #1061

vainu-arto commented Sep 21, 2021

jschneier commented Oct 3, 2021

vainu-arto commented Oct 4, 2021

jschneier commented Oct 7, 2021

vainu-arto commented Oct 7, 2021

jschneier commented Oct 7, 2021

Replace S3Boto3Storage._compress_content with a streaming implementation #1061

Replace S3Boto3Storage._compress_content with a streaming implementation #1061

Conversation

vainu-arto commented Sep 21, 2021

jschneier commented Oct 3, 2021

vainu-arto commented Oct 4, 2021

jschneier commented Oct 7, 2021

vainu-arto commented Oct 7, 2021

jschneier commented Oct 7, 2021