gpt2-tensorflow-to-pytorch-converter

This repository contains a quick-to-use script to convert GPT-2 models from TensorFlow to PyTorch model format.

Usage

Collect all your TensorFlow model files into a single directory, e.g. these files:

model-<number>.meta
vocab.bpe
model-<number>.data-00000-of-00001
model-<number>.index
checkpoint
counter
encoder.json
hparams.json
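
Before running the conversion, it can help to confirm that the directory is complete. The following is a minimal sketch, not part of this repository: the helper name missing_model_files and the REQUIRED_PATTERNS list are made up for illustration. It assumes every file listed above is needed, which the script itself may not strictly require.

```python
from pathlib import Path

# Glob patterns for the files listed above; "model-*" stands in for
# model-<number>. Assumption: every listed file is required.
REQUIRED_PATTERNS = [
    "model-*.meta",
    "model-*.index",
    "model-*.data-00000-of-00001",
    "checkpoint",
    "counter",
    "encoder.json",
    "hparams.json",
    "vocab.bpe",
]


def missing_model_files(model_dir):
    """Return the patterns from REQUIRED_PATTERNS that match nothing in model_dir."""
    root = Path(model_dir)
    return [pattern for pattern in REQUIRED_PATTERNS if not any(root.glob(pattern))]
```

Calling missing_model_files("/path/to/your/model/files") returns an empty list when every pattern has at least one match, and otherwise the patterns that still need a file.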

Clone the repo and, if needed, install the prerequisites, e.g. with pip install -r requirements.txt.

Run the script:

python convert_model.py /path/to/your/model/files

The converted PyTorch model will be saved in the ./converted_model directory.

Notes

Have fun, I probably won't be updating this one much.

License

This project is licensed under the MIT License.

Contribute

All code improvements are welcome. This should at least work on all TF1.x-based GPT-2 architecture models.
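
One rough way to judge whether a checkpoint follows the GPT-2 architecture is to look at its hparams.json. The sketch below is not part of this repository, and the helper name describe_hparams is hypothetical. It assumes the file uses the key names from OpenAI's original GPT-2 release (n_vocab, n_ctx, n_embd, n_head, n_layer); checkpoints from other forks may name them differently.

```python
import json
from pathlib import Path

# Key names as used in the hparams.json of OpenAI's original GPT-2 release.
# Assumption: the checkpoint being converted follows the same naming.
GPT2_HPARAM_KEYS = {"n_vocab", "n_ctx", "n_embd", "n_head", "n_layer"}


def describe_hparams(model_dir):
    """Load hparams.json from model_dir and return the GPT-2 hyperparameters.

    Raises ValueError if any of the expected keys is absent.
    """
    hparams = json.loads((Path(model_dir) / "hparams.json").read_text())
    missing = GPT2_HPARAM_KEYS - set(hparams)
    if missing:
        raise ValueError(f"hparams.json lacks GPT-2 keys: {sorted(missing)}")
    return {key: hparams[key] for key in sorted(GPT2_HPARAM_KEYS)}
```

A ValueError here does not prove the model is incompatible, only that its hparams.json differs from the layout assumed above.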

About

Flying from the mind of FlyingFathead
Digital ghost code by ChaosWhisperer
