Base16384

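A Unicode-based encoding scheme that represents binary data (a sequence of 8-bit bytes) as a sequence of printable Chinese characters, each carrying 14 bits. Stored as UTF-16, its output is about 14% smaller than the equivalent base64 text stored as ASCII (put the other way, the base64 text is about 17% larger).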

Inspired by fumiama/base16384.

Description

Base16384 uses 16384 (2^14) Chinese characters (from \u4E00 to \u8DFF) to represent binary data.

If the length of the binary data is not a multiple of 7, a \u3D0x character (where x is the length modulo 7) is appended to the output.
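The packing of one full 7-byte group can be sketched as follows. This is a minimal illustration, not this package's implementation: `encodeGroup` and `BASE` are hypothetical names, the most-significant-bit-first order is an assumption, and the trailing \u3D0x character is left out entirely.

```javascript
// Hypothetical helper: packs exactly 7 bytes (56 bits) into four 14-bit
// values and maps each onto the U+4E00..U+8DFF range.
// Assumption: bits are consumed most-significant first.
const BASE = 0x4e00

function encodeGroup(bytes) {
  if (bytes.length !== 7) throw new RangeError('expected exactly 7 bytes')
  const out = new Uint16Array(4)
  let acc = 0  // bit accumulator
  let bits = 0 // number of valid bits currently held in acc
  let j = 0
  for (const byte of bytes) {
    acc = (acc << 8) | byte
    bits += 8
    if (bits >= 14) {
      bits -= 14
      out[j++] = BASE + ((acc >> bits) & 0x3fff)
      acc &= (1 << bits) - 1 // keep only the leftover low bits
    }
  }
  return out
}

// The seven UTF-8 bytes of 'Example' become four code units.
const units = encodeGroup(new TextEncoder().encode('Example'))
console.log(Array.from(units, (u) => u.toString(16))) // [ '5f5e', '5416', '83c1', '7a65' ]
```

Under these assumptions the four code units correspond to the first four characters of the Example row in the comparison table.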

Comparison

| | Base64 | Base16384 |
| --- | --- | --- |
| Overhead | 33% | 14% |
| Charset | `[0-9a-zA-Z+/]` | `[\u4E00-\u8DFF]` |
| Example | `RXhhbXBsZQ==` | `彞吖菁穥㴀` |
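The overhead figures can be reproduced with a little arithmetic. The sketch below rests on assumptions not stated in the table: base64 output is counted at 1 byte per character (ASCII), base16384 output at 2 bytes per character (UTF-16), and the input length is chosen to be divisible by both 3 and 7 so that padding and the trailing marker play no role. The `overhead` function is a hypothetical helper.

```javascript
// Relative size increase of the encoded form over the raw input.
function overhead(encodedBytes, rawBytes) {
  return encodedBytes / rawBytes - 1
}

const raw = 21 // divisible by 3 and 7, so no padding or tail marker
const base64Bytes = (raw / 3) * 4 * 1    // 4 one-byte characters per 3 input bytes
const base16384Bytes = (raw / 7) * 4 * 2 // 4 two-byte characters per 7 input bytes

console.log((overhead(base64Bytes, raw) * 100).toFixed(1))    // "33.3"
console.log((overhead(base16384Bytes, raw) * 100).toFixed(1)) // "14.3"
```

With a different storage encoding such as UTF-8, where these characters take 3 bytes each, the comparison would come out differently.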

Usage

```javascript
import { decode, encode } from 'base16384'

const buffer = encode('Example') // Uint16Array
new TextDecoder().decode(decode(buffer)) // 'Example'
```
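Because `encode` returns a `Uint16Array` rather than a string, a conversion step may be needed before the result can be stored or displayed as text. The helpers below are a sketch, not part of the package API: `unitsToString` and `stringToUnits` are hypothetical names, and the code assumes each array element is one UTF-16 code unit.

```javascript
// Hypothetical helpers, not part of the package API.
// Assumption: each Uint16Array element is one UTF-16 code unit.
function unitsToString(units) {
  let text = ''
  // Convert in chunks so very large inputs do not exceed argument limits.
  for (let i = 0; i < units.length; i += 0x8000) {
    text += String.fromCharCode(...units.subarray(i, i + 0x8000))
  }
  return text
}

function stringToUnits(text) {
  const units = new Uint16Array(text.length)
  for (let i = 0; i < text.length; i++) units[i] = text.charCodeAt(i)
  return units
}

const sample = Uint16Array.of(0x4e00, 0x8dff, 0x3d01)
console.log(unitsToString(sample) === '\u4e00\u8dff\u3d01') // true
```

The API section lists `string` as an accepted input for `decode`, so `stringToUnits` is shown only to make the round trip explicit.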

API

encode(data)

data (string | Uint8Array): original binary data
returns (Uint16Array): base16384-encoded data

Encode binary data to base16384.

decode(data)

data (string | Uint16Array): base16384-encoded data
returns (Uint8Array): original binary data

Decode base16384 to binary data.
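To illustrate what decoding involves for one full group, here is a minimal sketch that turns four code units back into 7 bytes. It is not this package's implementation: `decodeGroup` and `OFFSET` are hypothetical names, the most-significant-bit-first layout is an assumption, and \u3D0x tail markers are not handled.

```javascript
// Hypothetical helper: unpacks four code units in U+4E00..U+8DFF into 7 bytes.
// Assumption: mirrors a most-significant-bit-first packing.
const OFFSET = 0x4e00

function decodeGroup(units) {
  if (units.length !== 4) throw new RangeError('expected exactly 4 code units')
  const out = new Uint8Array(7)
  let acc = 0  // bit accumulator
  let bits = 0 // number of valid bits currently held in acc
  let j = 0
  for (const unit of units) {
    const value = unit - OFFSET
    if (value < 0 || value > 0x3fff) throw new RangeError('code unit out of range')
    acc = (acc << 14) | value
    bits += 14
    while (bits >= 8) {
      bits -= 8
      out[j++] = (acc >> bits) & 0xff
    }
    acc &= (1 << bits) - 1 // keep only the leftover low bits
  }
  return out
}

const bytes = decodeGroup(Uint16Array.of(0x5f5e, 0x5416, 0x83c1, 0x7a65))
console.log(new TextDecoder().decode(bytes)) // 'Example'
```

Rejecting code units outside the 16384-character range keeps malformed input from silently producing garbage bytes; whether the package itself validates input this way is not stated here.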
