StringDecoder : Buffer to string decoding
The string_decoder
module provides an API for decoding Buffer
objects into strings in a manner that preserves encoded multi-byte UTF-8 and UTF-16(for future) characters. It can be accessed using:
const { StringDecoder } = require('string_decoder');
The following example shows the basic use of the StringDecoder class.
Example
const { StringDecoder } = require('string_decoder');
const decoder = new StringDecoder('utf8');
const cent = Buffer.from([0xC2, 0xA2]);
console.log(decoder.write(cent));
const euro = Buffer.from([0xE2, 0x82, 0xAC]);
console.log(decoder.write(euro));
When a Buffer
instance is written to the StringDecoder
instance, an internal buffer is used to ensure that the decoded string does not contain any incomplete multibyte characters. These are held in the buffer until the next call to decoder.write()
or until decoder.end()
is called.
In the following example, the three UTF-8 encoded bytes of the European Euro symbol (€
) are written over three separate operations:
Example
const { StringDecoder } = require('string_decoder');
const decoder = new StringDecoder('utf8');
decoder.write(Buffer.from([0xE2]));
decoder.write(Buffer.from([0x82]));
console.log(decoder.end(Buffer.from([0xAC])));
Support
The following shows StringDecoder
module APIs available for each permissions.
User Mode | Privilege Mode | |
---|---|---|
StringDecoder | ● | ● |
decoder.write | ● | ● |
decoder.end | ● | ● |
StringDecoder Class
new StringDecoder([encoding])
encoding
{String} The character encoding theStringDecoder
will use. default: 'utf8'.- Returns: {Object}
StringDecoder
object.
Creates a new StringDecoder
instance.
Example
const decoder = new StringDecoder('utf8');
StringDecoder Object
stringDecoder.write(buffer)
buffer
{Buffer} | {TypedArray} | {DataView} ABuffer
, orTypedArray
, orDataView
containing the bytes to decode.- Returns: {String} decode result.
Returns a decoded string, ensuring that any incomplete multibyte characters at the end of the Buffer
, or TypedArray
, or DataView
are omitted from the returned string and stored in an internal buffer for the next call to decoder.write()
or decoder.end()
.
stringDecoder.end([buffer])
buffer
{Buffer} | {TypedArray} | {DataView} ABuffer
, orTypedArray
, orDataView
containing the bytes to decode, default: undefined.- Returns: {String} decode result.
Returns any remaining input stored in the internal buffer as a string. Bytes representing incomplete UTF-8 and UTF-16 characters will be replaced with substitution characters appropriate for the character encoding.
If the buffer
argument is provided, one final call to decoder.write()
is performed before returning the remaining input.