Transformer with Untied Positional Encoding (TUPE). Code for the paper "Rethinking Positional Encoding in Language Pre-training". Improves existing models such as BERT.
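As a rough illustration of the "untied" idea from that paper: instead of adding positional embeddings to token embeddings before a shared attention projection, content-to-content and position-to-position attention logits are computed with separate projections and summed. A minimal NumPy sketch under that assumption (all matrix names here are illustrative, not taken from the repo):

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, d = 4, 8

x = rng.normal(size=(seq_len, d))  # token (content) embeddings
p = rng.normal(size=(seq_len, d))  # absolute positional embeddings

# Separate query/key projections for content and position (untied)
W_q, W_k = rng.normal(size=(d, d)), rng.normal(size=(d, d))
U_q, U_k = rng.normal(size=(d, d)), rng.normal(size=(d, d))

# Each term is scaled by sqrt(2d) so the summed logits keep comparable variance
content = (x @ W_q) @ (x @ W_k).T / np.sqrt(2 * d)
position = (p @ U_q) @ (p @ U_k).T / np.sqrt(2 * d)
logits = content + position

# Row-wise softmax to get attention weights
attn = np.exp(logits - logits.max(axis=-1, keepdims=True))
attn /= attn.sum(axis=-1, keepdims=True)
print(attn.shape)  # (4, 4)
```

The position term depends only on positions, so it can be computed once per layer and shared across all heads' token content, which is part of the efficiency argument in the paper.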
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
Unsupervised Language Modeling at scale for robust sentiment classification
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
Code for NAACL 2024 main conference paper "An Empirical Study of Consistency Regularization for E...
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Py...
Code for the ICLR'22 paper "On Robust Prefix-Tuning for Text Classification"
Code for the paper "Fine-tune BERT for Extractive Summarization"
Train vision models using JAX and 🤗 transformers
GLM (General Language Model)
Implementation of Recurrent Memory Transformer (NeurIPS 2022 paper) in PyTorch
Chain-of-Hindsight, a scalable RLHF method
A concise but complete implementation of CLIP with various experimental improvements from recent ...
Implementation and replication of ProGen, Language Modeling for Protein Generation, in JAX
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...