Map of invalid numeric character references to their replacements, according to HTML
MIT License
Map of invalid numeric character references to their replacements, according to HTML.
This is a map from the HTML spec of C1 ASCII/Unicode control
characters (which are disallowed by HTML) to the characters those code points
would have in Windows 1252.
For example, U+0080 (Padding Character) maps to €
, because that’s used for
0x80 in Windows 1252.
Probably never, unless you’re dealing with parsing HTML or similar XML-like things, or in a place where Unicode is not the primary encoding (it is in most places).
This package is ESM only. In Node.js (version 14.14+, 16.0+), install with npm:
npm install character-reference-invalid
In Deno with esm.sh
:
import {characterReferenceInvalid} from 'https://esm.sh/character-reference-invalid@2'
In browsers with esm.sh
:
<script type="module">
import {characterReferenceInvalid} from 'https://esm.sh/character-reference-invalid@2?bundle'
</script>
import {characterReferenceInvalid} from 'character-reference-invalid'
console.log(characterReferenceInvalid[0x80]) // => '€'
console.log(characterReferenceInvalid[0x89]) // => '‰'
console.log(characterReferenceInvalid[0x99]) // => '™'
This package exports the identifier characterReferenceInvalid
.
There is no default export.
characterReferenceInvalid
Map of invalid numeric character references to their replacements, according to
HTML (Record<number, string>
).
See html.spec.whatwg.org
.
This package is fully typed with TypeScript. It exports no additional types.
This package is at least compatible with all maintained versions of Node.js. As of now, that is Node.js 14.14+ and 16.0+. It also works in Deno and modern browsers.
This package is safe.
wooorm/character-entities
wooorm/character-entities-html4
wooorm/character-entities-legacy
wooorm/parse-entities
wooorm/stringify-entities
Yes please! See How to Contribute to Open Source.