aim to use JapaneseTokenizer as easy as possible
MIT License
Japanese morphological analysis engine written in pure Python
Python boilerplate using uv, pre-commit, prettier, pytest, GitHub Actions, mypy, ruff, bandit & d...
👖 Conformal Tights adds conformal prediction of coherent quantiles and intervals to any scikit-le...
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
🎀 JavaScript API for spaCy with Python REST API
NLP预/后处理工具。
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Python project and library template for clean, reliable, open-source projects.
A lean web application template with FastAPI, Uvicorn, Docker, and GitHub Actions.
🔍 Lookup classes and instantiate them with style
A project template for Python package with heavy use of Github actions
Argcomplete support to tab completion of python and xonsh scripts in xonsh shell.
Simple Python package (CLI/Python API) for getting japanese readings (yomigana) and accents using...
yet another text augmentation python package
Split strings into (character-based) k-shingles