spaCy Ecosystem

spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.

Created by
Explosion
Community Repos
1,643
Total GitHub Stars
48,551

pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

02 Oct 2016 2,132

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

14 Oct 2016 17,893

textacy

NLP, before and after spaCy

03 Feb 2016 2,208

scispacy

A full spaCy pipeline and models for scientific/biomedical documents

24 Sep 2018 1,601

refinery

The data scientist's open-source choice to scale, assess and maintain natural language data

04 Jul 2022 1,393

skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

16 Mar 2021 917

neuralcoref

✨Fast Coreference Resolution in spaCy with Neural Networks

03 Jul 2017 2,849

sampleproject

A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"

03 Dec 2013 5,003

medspacy

Library for clinical NLP with spaCy

16 Jun 2020 525

TextDescriptives

A Python library for calculating a large variety of metrics from text

28 Jan 2020 309

zshot

Zero and Few shot named entity & relationships recognition

11 Feb 2022 343

spaczz

Fuzzy matching and more functionality for spaCy

14 Jun 2020 249

edsnlp

Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes

08 Mar 2022 112

ner-annotator

Named Entity Recognition (NER) Annotation tool for SpaCy

08 Nov 2020 550

Klayers

Python Packages as AWS Lambda Layers

06 Jan 2019 2,085

SpanMarkerNER

SpanMarker for Named Entity Recognition

28 Mar 2023 386

contextualSpellCheck

✔️Contextual word checker for better suggestions

10 Apr 2020 395

cltk

The Classical Language Toolkit

11 Jan 2014 818

cleanNLP

R package providing annotators and a normalized data model for natural language processing

14 Oct 2016 209

DadmaTools

DadmaTools is a Persian NLP tools developed by Dadmatech Co

12 Oct 2021 166