budou

Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean).

APACHE-2.0 License

Downloads
802
Stars
1.2K
Committers
15
budou - python2 compatibility improvement Latest Release

Published by tushuhei almost 5 years ago

budou - CLI bug fix

Published by tushuhei almost 5 years ago

budou - Inline styling support

Published by tushuhei almost 5 years ago

With --inlinestyle option in the command line tool, you can populate display:inline-block in the style attribute of the output tags. i.e. you don't need additional CSS rules to make them inline blocks.

budou - Initialization with a given Resource

Published by tushuhei about 5 years ago

  • Backward compatibility made better by supporting parse initialization with a given Resource object.
budou - Tinysegmenter backend support

Published by tushuhei about 6 years ago

  • Added tinysegmenter backend support, which means Japanese sentences are parsable without any dictionary now.
budou - Multiple segmenter support

Published by tushuhei about 6 years ago

  • MeCab segmenter support
  • Documentation update
  • Interface update
    • parse and get_parser methods are introduced
    • authenticate method is now deprecated
  • Command line tool!
budou -

Published by tushuhei almost 7 years ago

  • fixed #32
budou - 0.8.1 release

Published by tushuhei almost 7 years ago

budou - 0.7.5

Published by tushuhei about 7 years ago

budou - 0.7.4

Published by tushuhei about 7 years ago

budou - html5lib 1.0b8 (=0.9999999) compatible

Published by tushuhei about 7 years ago

budou - Simpler and more secure.

Published by tushuhei about 7 years ago

  • Drop the support for inputs which include HTML snippets
  • Sanitize HTML output with lxml.html.clean.clean_html()
  • Wrap the output with one parent SPAN tag to make it as a complete tree structure and support ARIA attributes on the parent tag in the future fix.
budou - Small patch

Published by tushuhei about 7 years ago

PyPi serves this package in wheel format now.

budou - Bring back python-api-client

Published by tushuhei about 7 years ago

budou - googleapiclient -> google-cloud-language

Published by tushuhei about 7 years ago

Replaced googleapiclient with google-cloud-language library.

budou - google auth enabled.

Published by tushuhei about 7 years ago

oauth2client was recently deprecated in favor of the release of google-auth library. Accordingly, we updated Budou to use google-auth instead of oauth2client.

budou - Improve entity handling.

Published by tushuhei over 7 years ago

  • Overall refactoring.
  • Add entity mode which uses entities retrieved by Google Cloud Natural Language API.
budou - Improve punctuation mark handling

Published by tushuhei over 7 years ago