
Add byteLength method and hasState property #258

Open

wants to merge 19 commits into master
Conversation

Meigyoku-Thmn

I would like to add the byteLength method and the hasState property.

byteLength method:

  • Guaranteed to be faster than iconv.encode(...).length, because it doesn't allocate an entire buffer.
  • Use case: you might have a very long string and not want to allocate an equally long buffer just to encode it, yet you still want to write that string into a binary file with a byte-length prefix (perhaps in 7-bit format, like the corresponding method in .NET). You can use byteLength to compute and write the prefix first, then encode the string gradually over its substrings with an encoder (see the sketch below).
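
A hypothetical usage sketch: the byteLength name comes from this PR, but the exact signature (iconv.byteLength(str, encoding)) is an assumption, and out/writeStringRecord/writeLengthPrefix are stand-ins for your own stream and length-prefix logic.

```js
const iconv = require('iconv-lite');

// out is assumed to be a writable stream; writeLengthPrefix is a stand-in
// for whatever prefix format you use (e.g. a 7-bit varint as in .NET).
function writeStringRecord(out, str) {
  const len = iconv.byteLength(str, 'utf8'); // no intermediate buffer allocated
  writeLengthPrefix(out, len);
  const encoder = iconv.getEncoder('utf8');  // then encode gradually, chunk by chunk
  const CHUNK = 64 * 1024;
  for (let i = 0; i < str.length; i += CHUNK) {
    out.write(encoder.write(str.slice(i, i + CHUNK)));
  }
  const tail = encoder.end();                // flush any remaining encoder state
  if (tail && tail.length) out.write(tail);
}
```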

hasState property:

  • When decoding binary data, or encoding text to write to a binary file, you might want to know whether the decoder or encoder has any accumulated state inside it, so you can decide whether there is an error (e.g. a truncated byte sequence) or not (see the sketch below).
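
A hypothetical sketch of the error-detection use case; it assumes hasState is exposed on decoder instances (my reading of the proposal), and readBinaryChunks is a stand-in for your input source.

```js
const iconv = require('iconv-lite');

const decoder = iconv.getDecoder('utf8');
let text = '';
for (const chunk of readBinaryChunks()) {  // stand-in for reading a file
  text += decoder.write(chunk);
}
// Leftover internal state here means a multi-byte sequence was cut off,
// i.e. the input was truncated or malformed:
if (decoder.hasState) {
  throw new Error('incomplete byte sequence at end of input');
}
const tail = decoder.end();
if (tail) text += tail;
```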

ashtuchkin and others added 19 commits July 14, 2020 13:13
 * Add two backends: node & web (a rough sketch of the idea follows this list)
 * Convert core lib files to use the backends (and not use Buffer)
 * Convert the utf16 codec as an example
 * Add testing for both the node side and webpack
 * Bump the Node.js minimum supported version to 4.5.0 and modernize some
   existing code. This will allow us to get rid of
   safer-buffer, our only dependency.
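
Purely illustrative sketch of the two-backend idea; the property names (allocBytes, bytesToResult) are hypothetical, not iconv-lite's actual backend API. The point is that codecs ask the backend for byte storage instead of touching Buffer directly, so the same codec code can run in Node and in browsers.

```js
const nodeBackend = {
  allocBytes: (len) => Buffer.alloc(len),    // Node: back storage with Buffer
  bytesToResult: (bytes, len) => bytes.slice(0, len),
};

const webBackend = {
  allocBytes: (len) => new Uint8Array(len),  // Browser: plain Uint8Array
  bytesToResult: (bytes, len) => bytes.subarray(0, len),
};
```
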
Three major reasons for reimplementing UTF-16 rather than using the native codec:
 1. We want to remove StringDecoder & Buffer references due to ashtuchkin#235.
 2. StringDecoder handles surrogates inconsistently on Node v6-9.
 3. The string_decoder NPM module gives strange results when processing chunks -
    it sometimes prepends '\u0000', likely due to a bug.

Performance was and is a major concern here. The decoder shouldn't be affected because it uses
backend methods directly. The encoder is affected by the introduction of a character-level loop
(sketched below). It's still very fast (~450Mb/s), so I'm not too worried. If needed, we can make it
about 4x faster in Node.js by introducing a dedicated backend method. Browser speeds will be the same.
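
For reference, a character-level UTF-16-LE encode loop of the kind described might look like the following; this is an illustrative sketch only, not the actual patch (the real encoder goes through the backend abstraction).

```js
// JS strings are already sequences of UTF-16 code units, surrogates included,
// so each unit is simply written out as two little-endian bytes.
function encodeUtf16le(str) {
  const bytes = new Uint8Array(str.length * 2);
  for (let i = 0; i < str.length; i++) {
    const unit = str.charCodeAt(i);
    bytes[2 * i] = unit & 0xff;     // low byte first (little-endian)
    bytes[2 * i + 1] = unit >>> 8;  // then high byte
  }
  return bytes;
}
```
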
…uite

To do that, I've added a generation step and stored the data in the test/tables/ folder.