Guide to using pre-trained large language models of source code
MIT License
GLM (General Language Model)
Ongoing research training transformer models at scale
Unsupervised Language Modeling at scale for robust sentiment classification
A lightweight library that leverages Language Models (LLMs) to enable natural language interactio...
Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
Home of StarCoder2!
Ongoing research training transformer language models at scale, including: BERT & GPT-2
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A framework for few-shot evaluation of language models.
CodeGeeX2: A More Powerful Multilingual Code Generation Model
utilities for decoding deep representations (like sentence embeddings) back to text
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
TensorFlow code for the neural network presented in the paper: "code2vec: Learning Distributed Re...