Python utilities for filtering nasty characters in Unicode strings.
A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and sy...
Fast percent encoding and decoding for Python
Paste in some broken Unicode text and FTFY will tell you how to fix it!
Truly universal encoding detector in pure Python
[ab]using Unicode to create tragedy
Python character encoding detector
A page listing all the Unicode characters that are valid in Python variable names
Python module for Unicode blocks.
A repository of letters and look-alikes to create text normalization capable filters
Rescue of Mark Pilgrim's chardet library
Doing dirty (but extremely useful) things with equals.
Han character dictionary
Python extension for dealing with validation and cleanup of UTF-8 strings
Python library and CLI for validating UTF-8 text