Semantic Chunker
MIT License
A Python tool for splitting large Markdown files into smaller sections based on a specified token...
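A minimal sketch of the splitting approach described above: break the Markdown at heading boundaries, then pack sections into chunks that respect a token budget. The function name, the paragraph-level fallback for oversized sections, and the whitespace-based token count are illustrative assumptions, not the tool's actual API; a real implementation would likely count tokens with a proper tokenizer such as tiktoken.

```python
import re

def split_markdown(text: str, max_tokens: int = 200) -> list[str]:
    """Split Markdown into chunks of at most max_tokens tokens.

    Hypothetical sketch: sections are cut at heading lines and merged
    greedily; a section that alone exceeds the budget is further split
    on blank lines. Token counting is a naive whitespace split.
    """
    def count(s: str) -> int:
        return len(s.split())

    # Split at heading lines (#, ##, ...), keeping each heading
    # attached to the text that follows it.
    sections = [s for s in re.split(r"(?m)^(?=#{1,6}\s)", text) if s.strip()]

    chunks: list[str] = []
    current = ""
    for section in sections:
        if count(current) + count(section) <= max_tokens:
            current += section  # still fits in the running chunk
            continue
        if current.strip():
            chunks.append(current)
        if count(section) <= max_tokens:
            current = section
            continue
        # Oversized section: fall back to splitting on paragraphs.
        part = ""
        for para in section.split("\n\n"):
            if count(part) + count(para) <= max_tokens:
                part += para + "\n\n"
            else:
                if part.strip():
                    chunks.append(part)
                part = para + "\n\n"
        current = part
    if current.strip():
        chunks.append(current)
    return chunks
```

Splitting at headings first keeps each chunk semantically coherent; the paragraph fallback only triggers when a single section would blow the budget on its own.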