DeBERTa | Playwright Ecosystem Directory

Bot releases are hidden (Show)

DeBERTa - DeBERTa pre-trained models Latest Release

Published by BigBird01 about 4 years ago

DeBERTa pre-trained models

base is the model with same config as BERT base, e.g. 12 layers, 12 heads, 768 hidden dimension
base_mnli is the base model fine-tuned with MNLI data
large is the model with same config as BERT large, e.g. 24 layers, 16 heads, 1024 hidden dimension
large_mnli is the large model fine-tuned with MNLI data
xlarge is the model with 48 layers, 16 heads, 1024 hidden dimension
xlarge_mnli is the xlarge model fine-tuned with MNLI data
bpe_encoder is the GPT2 vocabulary package

Package Rankings

Top 6.75% on Proxy.golang.org

Top 10.78% on Pypi.org

Related Projects

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

08 Feb 2022 1,164

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

23 Jul 2019 19,644

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

27 May 2019 1,116

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

18 Jun 2021 10,449

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

30 Mar 2023 23,583

GLIP

Grounded Language-Image Pre-training

24 Nov 2021 2,182

fastseq

An efficient implementation of the popular sequence models for text generation, summarization, an...

15 Jul 2020 431

torchscale

Foundation Architecture for (M)LLMs

17 Nov 2022 3,006

AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Mo...

24 May 2022 126

MathOctopus

This repository contains resources for accessing the official benchmarks, codes, and checkpoints ...

25 Oct 2023 41

Industrial-Foundation-Models

Dedicated to building industrial foundation models for universal data intelligence across industr...

22 Mar 2024 32

DialoGPT

Large-scale pretraining for dialogue

29 Aug 2019 2,353

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...

20 May 2023 1,483

CodeBERT

17 Jun 2020 2,192

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...

28 Nov 2022 1,287