bm25_pt

Minimal PyTorch implementation of BM25 (with sparse tensors)
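For readers unfamiliar with BM25, the scoring idea can be sketched in a few lines. This is an illustrative sketch only, not the bm25_pt API: it uses plain Python dictionaries as the sparse term-count structure in place of PyTorch sparse tensors, whitespace tokenization, and the common non-negative IDF variant. The function names `build_index` and `bm25_scores`, and the defaults `k1=1.5` and `b=0.75`, are assumptions chosen for the example.

```python
import math
from collections import Counter


def build_index(docs):
    """Tokenize on whitespace and keep sparse per-document term counts."""
    term_counts = [Counter(doc.lower().split()) for doc in docs]
    doc_lens = [sum(tc.values()) for tc in term_counts]
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tc in term_counts:
        doc_freq.update(tc.keys())
    return term_counts, doc_lens, doc_freq


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one BM25 score per document for the given query string."""
    term_counts, doc_lens, doc_freq = build_index(docs)
    n_docs = len(docs)
    avg_len = sum(doc_lens) / n_docs
    scores = [0.0] * n_docs
    for term in query.lower().split():
        n_t = doc_freq.get(term, 0)
        if n_t == 0:
            continue  # term appears in no document, contributes nothing
        # Non-negative IDF variant (hypothetical choice for this sketch).
        idf = math.log((n_docs - n_t + 0.5) / (n_t + 0.5) + 1.0)
        for i, tc in enumerate(term_counts):
            tf = tc.get(term, 0)
            if tf == 0:
                continue  # only non-zero entries are visited (sparsity)
            # Length normalization: longer documents are penalized via b.
            norm = k1 * (1.0 - b + b * doc_lens[i] / avg_len)
            scores[i] += idf * tf * (k1 + 1.0) / (tf + norm)
    return scores
```

Calling `bm25_scores("cat mat", ["the cat sat on the mat", "dogs chase cats", "the quick brown fox"])` gives a positive score for the first document only, since the other two contain neither query token. A tensor-based implementation would typically store the same term counts as a sparse document-by-vocabulary matrix and score all documents with a single sparse matrix product.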

MIT License

Ecosystems: Python

Issue Statistics

Time to Close Issues: 2 minutes

Related Projects

jax-lm-training

Generative language model training on top of JAX and Hugging Face 🤗

05 May 2022 8

starcoder2

Home of StarCoder2!

08 Dec 2023 1,732

bpe-summarizer

Auto summarization from BPE tokenization

17 Jun 2020 3

BertSum

Code for paper Fine-tune BERT for Extractive Summarization

25 Mar 2019 1,464

progressive-generation

NAACL 2021 - Progressive Generation of Long Text

08 Jun 2020 75

ToolkenGPT

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 20...

27 May 2023 191

starcoder

Home of StarCoder: fine-tuning & inference!

24 Apr 2023 7,267

bert4keras

Keras implementation of transformers for humans

26 Aug 2019 5,363

quantized-training

Explore training for quantized models

16 Jul 2024 7

bilm-tf

TensorFlow implementation of contextualized word representations from bi-directional language models

29 Sep 2017 1,620

minimal-gpt-neox-20b

09 Mar 2022 126

DocumentSearchEngine

Document Search Engine project with TF-IDF and Google's Universal Sentence Encoder model

26 Jan 2020 53

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

02 Jul 2021 1,323

transformer-smaller-training-vocab

Temporarily remove unused tokens during training to save RAM and improve speed.

01 Jan 2023 20

LongMamba

Some preliminary explorations of Mamba's context scaling.

03 Feb 2024 177