bert4torch底层训练框架,用keras风格写torch代码
APACHE-2.0 License
Bot releases are visible (Hide)
Published by Tongjilibo over 1 year ago
增加auto_set_cuda_devices自动选择显卡,增加log_info,log_warn, log_error等小函数
Published by Tongjilibo over 1 year ago
增加EmailCallback和WandbCallback, 增加AccelerateTrainer和DeepSpeedTrainer, grad_accumulation_steps内算一个batch,修改Trainer中部分成员函数
Published by Tongjilibo over 1 year ago
修复保存scheduler
Published by Tongjilibo over 1 year ago
Checkpoint Calback增加保存scheduler, save_weights可自行创建目录,Logger, Tensorboard模块加入lr, 修改predict和add_trainer
Published by Tongjilibo over 1 year ago
独立出callbacks.py文件, fit允许输入形式为字典,load_weights支持list输入,save_weights支持仅保存可训练参数
Published by Tongjilibo over 1 year ago
增加hf的accelerator测试用例, ddp需要外部控制执行callback, 混合精度支持bf16, 增加add_trainer方法
Published by Tongjilibo almost 2 years ago
增加Summary的Callback, 增加Tqdm的进度条展示,保留原有BaseModel的同时,增加Trainer(不从nn.Module继承), 从bert4torch的snippets迁移部分通用函数
Published by Tongjilibo almost 2 years ago
为callback增加on_train_step_end方法, 修复BaseModel(net)方式的bug
Published by Tongjilibo almost 2 years ago
修复DDP下只有master_rank打印
Published by Tongjilibo almost 2 years ago
参考Keras修改了callback的逻辑
Published by Tongjilibo almost 2 years ago
增加Checkpoint, Evaluator等自带Callback, 修改BaseModel(net)方式
Published by Tongjilibo about 2 years ago
初始版本,从bert4torch中剥离出来的普通Trainer,支持单机多卡,混合精度,梯度裁剪,梯度累积等