A demo of GPT2 model training and inference with LightSeq
This demo has been tested on A100 and T4 GPUs. The training data is `data/train.txt`, which contains 500 samples.
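The exact contents of `data/train.txt` are not shown here. Below is a minimal sketch for a smoke test, assuming the file is plain text with one training sample per line (the sentences are made up):

```shell
# Hypothetical: build a tiny plain-text training file, one sample per line.
# Replace with your real corpus; the path matches --train_file below.
mkdir -p data
printf '%s\n' \
    "今天天气真好。" \
    "我喜欢自然语言处理。" \
    "轻量级的推理引擎可以加速文本生成。" \
    > data/train.txt
```

With the data in place, first fine-tune the pretrained model in fp16 with quantization disabled: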
```shell
python3 -m torch.distributed.launch \
--nproc_per_node=8 \
train.py \
--model_name_or_path uer/gpt2-chinese-cluecorpussmall \
--train_file data/train.txt \
--per_device_train_batch_size 16 \
--per_device_eval_batch_size 8 \
--num_train_epochs 150 \
--learning_rate 1.5e-4 \
--output_dir model/fp16 \
--overwrite_output_dir \
--fp16 \
--logging_steps 10 \
--enable_quant false
```
After training finishes, export the fp16 checkpoint to an HDF5 file for LightSeq inference. The `-l` flag specifies the maximum sequence length (500 here):

```shell
python3 export.py \
-m model/fp16/pytorch_model.bin \
-l 500
```
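To sanity-check the export, you can list the top-level groups of the HDF5 file. A minimal sketch, assuming `h5py` is installed (the actual group names depend on LightSeq's export format):

```shell
# Hypothetical sanity check: print the top-level groups of the exported file.
python3 -c "
import h5py
with h5py.File('model/fp16/pytorch_model.hdf5', 'r') as f:
    print(list(f.keys()))
"
```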
Then run generation with the exported model. `-i` is the input prefix (empty here) and `-p` is the Hugging Face model name, used to load the tokenizer:

```shell
python3 generate.py \
-m model/fp16/pytorch_model.hdf5 \
-i "" \
-p "uer/gpt2-chinese-cluecorpussmall"
To obtain an int8 model, quantization-aware training runs in two stages. First, train in fp16 with quantization disabled (the same command as above):

```shell
python3 -m torch.distributed.launch \
--nproc_per_node=8 \
train.py \
--model_name_or_path uer/gpt2-chinese-cluecorpussmall \
--train_file data/train.txt \
--per_device_train_batch_size 16 \
--per_device_eval_batch_size 8 \
--num_train_epochs 150 \
--learning_rate 1.5e-4 \
--output_dir model/fp16 \
--overwrite_output_dir \
--fp16 \
--logging_steps 10 \
--enable_quant false
```
Then fine-tune with quantization enabled (`--enable_quant true`), resuming from the fp16 checkpoint and using a much smaller learning rate:

```shell
python3 -m torch.distributed.launch \
--nproc_per_node=8 \
train.py \
--model_name_or_path uer/gpt2-chinese-cluecorpussmall \
--train_file data/train.txt \
--per_device_train_batch_size 16 \
--per_device_eval_batch_size 8 \
--num_train_epochs 200 \
--learning_rate 5e-6 \
--output_dir model/int8 \
--overwrite_output_dir \
--resume_from_checkpoint model/fp16 \
--fp16 \
--logging_steps 10 \
--enable_quant true
```
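After both stages finish, each output directory should contain a checkpoint: the fp16 stage wrote to `model/fp16`, and the int8 finetune wrote to `model/int8`. A quick way to confirm before exporting:

```shell
# Both training stages should have produced a pytorch_model.bin checkpoint.
ls model/fp16/pytorch_model.bin model/int8/pytorch_model.bin
```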
When exporting the int8 model, add the `-q` flag:

```shell
python3 export.py \
-m model/int8/pytorch_model.bin \
-l 500 \
-q
```
Likewise, add the `-q` flag when running generation with the int8 model:

```shell
python3 generate.py \
-m model/int8/pytorch_model.hdf5 \
-i "" \
-p "uer/gpt2-chinese-cluecorpussmall" \
-q
```
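To get a rough sense of the int8 speedup, you can time the two generation commands. A sketch only; wall-clock time here includes model loading and tokenizer setup, so treat the numbers as indicative:

```shell
# Hypothetical rough timing comparison of fp16 vs. int8 generation.
time python3 generate.py -m model/fp16/pytorch_model.hdf5 -i "" -p "uer/gpt2-chinese-cluecorpussmall"
time python3 generate.py -m model/int8/pytorch_model.hdf5 -i "" -p "uer/gpt2-chinese-cluecorpussmall" -q
```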