Quickly preprocesses Japanese text using NLP/NER from SpaCy for Japanese translation or other NLP tasks.
LGPL-2.1 License
Fixed a bug in honorific processing, moved to version 1.6.5, and introduced a space-correction post-processing step.
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/1.6.4...1.6.5
Published by Bikatr7 4 months ago
Added hyphen correction in post-processing for the Kudasai JSON type.
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/v1.6.3...v1.6.4
Published by Bikatr7 4 months ago
Pinned the exact spaCy version required by the package.
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/v1.6.2...v1.6.3
Published by Bikatr7 4 months ago
Introduced countermeasures against loading spaCy excessively, to offset potential memory leaks.
Noticed in:
https://github.com/explosion/spaCy/discussions/9369
and
https://github.com/Kakusui/kakusui-org/commit/cc4e15545282ef62be8b6472e7597be972143e6a
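One common way to avoid reloading a heavy model on every call is to load it lazily once and reuse the same instance. The sketch below only illustrates that general idea; it is an assumption, not Kairyou's actual implementation, and `LazyPipeline` and its `loader` argument are hypothetical names.

```python
from typing import Any, Callable, Optional


class LazyPipeline:
    """Load an expensive object once and return the same instance afterwards."""

    def __init__(self, loader: Callable[[], Any]) -> None:
        self._loader = loader
        self._pipeline: Optional[Any] = None
        self.load_count = 0  # how many times the loader actually ran

    def get(self) -> Any:
        # Only call the loader on first use; later calls reuse the cached object.
        if self._pipeline is None:
            self._pipeline = self._loader()
            self.load_count += 1
        return self._pipeline


# With spaCy installed, the loader could look like this (assumption, not taken
# from Kairyou's source):
#   import spacy
#   holder = LazyPipeline(lambda: spacy.load("ja_core_news_lg"))
#   nlp = holder.get()
```

Passing the loader in as a callable keeps the sketch importable without spaCy and makes the "load once" behavior easy to check.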
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/v1.6.1...v1.6.2
Published by Bikatr7 6 months ago
Improved error handling when Kairyou is run without spaCy's Japanese (ja) model installed.
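A check of this kind can be sketched as follows. This is a minimal illustration, assuming the model is installed as a regular Python package (which is how `python -m spacy download` installs models); `ModelNotInstalledError` and `ensure_model_installed` are hypothetical names, not Kairyou's API.

```python
import importlib.util


class ModelNotInstalledError(Exception):
    """Hypothetical error raised when the required spaCy model package is missing."""


def ensure_model_installed(model_name: str = "ja_core_news_lg") -> None:
    # Look the model package up without importing or loading it.
    if importlib.util.find_spec(model_name) is None:
        raise ModelNotInstalledError(
            f"spaCy model '{model_name}' is not installed. "
            f"Install it with: python -m spacy download {model_name}"
        )
```

Failing early with the install command in the message is usually friendlier than letting the lower-level load error surface on its own.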
Published by Bikatr7 6 months ago
Added proper testing and improved exception handling.
Moved to the proper Python version, 3.10.
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/v1.5.0...v1.6.0
Published by Bikatr7 7 months ago
Switched the license to LGPL to make the package easier to use in other projects.
Published by Bikatr7 7 months ago
Kairyou Patch Notes - Minor Update
Enhancements:
Added a base exception class, KairyouException, for better error handling.
Exposed all internal exceptions for clarity and debugging ease.
Bug Fixes:
Resolved an issue where JSON strings were incorrectly allowed during indexing.
Improved internal API to follow PEP standards, ensuring private functions and variables are properly marked.
Please update to the latest version for an improved experience.
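The benefit of a shared base exception can be shown with a short sketch. `KairyouException` is the name given in the notes above; the two subclasses and the `run_safely` helper are hypothetical and exist only for illustration, so they should not be read as the library's actual exception list.

```python
class KairyouException(Exception):
    """Base class: catching it catches every library-specific error."""


# Hypothetical subclasses, for illustration only.
class InvalidInputError(KairyouException):
    pass


class MissingModelError(KairyouException):
    pass


def run_safely(action):
    """Run a callable and report any library-specific error in one place."""
    try:
        return action()
    except KairyouException as exc:
        # One except clause covers all subclasses of the base exception.
        return f"handled: {type(exc).__name__}"
```

Callers that need finer control can still catch a specific subclass before the base class.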
Published by Bikatr7 7 months ago
Fixed a typo in a function name and changed the internal function naming convention.
Published by Bikatr7 8 months ago
Proper release: functionality is unchanged, but spaCy is now installed alongside the package. You still need to install the Japanese model yourself:
python -m spacy download ja_core_news_lg
Published by Bikatr7 8 months ago
Fixed a bad TOML file and moved to a new version.
Published by Bikatr7 8 months ago
Fixed bad requirements in the TOML file.
Published by Bikatr7 8 months ago
Names must now contain no English characters to be flagged.
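A rule like this can be sketched as below. It assumes "English characters" means ASCII letters, which is an interpretation rather than something the note states; both function names are hypothetical.

```python
def contains_english_letters(text: str) -> bool:
    # Counts ASCII letters only. Full-width Latin letters such as "Ａ" are not
    # ASCII, so this check does not treat them as English characters.
    return any(ch.isascii() and ch.isalpha() for ch in text)


def may_flag_as_name(candidate: str) -> bool:
    # A non-empty candidate is eligible only if it has no English letters.
    return bool(candidate) and not contains_english_letters(candidate)
```

For example, `may_flag_as_name("田中")` is true, while a mixed string such as `"田中A"` is rejected.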
Published by Bikatr7 8 months ago
Finalized the indexer and added logging.
Published by Bikatr7 8 months ago
Fixed a bug where the indexer did not read the given replacements JSON properly, and corrected a faulty type hint.
Added support for passing in a blacklist of entries to ignore when indexing.
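The blacklist idea can be illustrated with a small filter. This sketch assumes exact string matching against the blacklist, which the note does not specify; `filter_blacklisted` is a hypothetical helper and not the indexer's real interface.

```python
from typing import Iterable, List


def filter_blacklisted(candidates: Iterable[str], blacklist: Iterable[str]) -> List[str]:
    # A set gives constant-time membership checks; input order is preserved.
    blocked = set(blacklist)
    return [name for name in candidates if name not in blocked]
```

For instance, filtering `["田中", "東京", "佐藤"]` with the blacklist `["東京"]` leaves the two remaining entries in their original order.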
Published by Bikatr7 8 months ago
Added Indexer
Published by Bikatr7 8 months ago
Full Changelog: https://github.com/Bikatr7/Kairyou/compare/v1.0.1...v1.1.0
Published by Bikatr7 8 months ago
Fixed a bug in preprocessing skipping and slightly adjusted the punctuation character set.
Published by Bikatr7 8 months ago
Initial Release.
Documentation:
https://github.com/Bikatr7/Kairyou/blob/main/README.md
Please report issues to:
https://github.com/Bikatr7/Kairyou/issues
or
Published by Bikatr7 8 months ago