Functions for filtering potentially harmful unicode characters.
Currently just has bidiclean, which ensures Unicode bidirectional characters are correctly balanced, as explained by Cal Henderson here:
http://www.iamcal.com/understanding-bidirectional-text/