Use correct byte size for truncation #810

waltjones · 2019-12-05T00:00:20Z

When calculating payload length for truncation, payload.length was being used. This returns the character count (or more accurately the UTF-16 code point count), rather than the UTF-8 byte count. This leads to non-ascii payloads being insufficiently truncated and then rejected by the API.

This PR uses a minimal implementation for counting UTF-8 bytes that never undercounts bytes, and rarely overcounts*. This is safe and valid to be used for truncation, and is significantly smaller and faster than a complete UTF-8 encoder.

It will overcount when it encounters UTF-16 surrogates (two code point chars.) These should be rare in practice, but specific support for these could be added if/when desired.

Use correct byte size for truncation

fix: use correct byte size for truncation

309080c

waltjones merged commit b63fb20 into master Dec 5, 2019

snyk-bot mentioned this pull request May 7, 2020

[Snyk] Upgrade rollbar from 2.15.0 to 2.15.1 CodeOtter/ecma6-web-app#4

Closed

renovate bot mentioned this pull request Jun 6, 2021

Update dependency rollbar to v2.26.4 spencermize/Veload#81

Open

1 task

mudetroit pushed a commit that referenced this pull request Mar 14, 2024

Merge pull request #810 from rollbar/wj-truncation-byte-count

5c30a6a

Use correct byte size for truncation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use correct byte size for truncation #810

Use correct byte size for truncation #810

waltjones commented Dec 5, 2019

Use correct byte size for truncation #810

Use correct byte size for truncation #810

Conversation

waltjones commented Dec 5, 2019