APACHE-2.0 License
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning fr...
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.
Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2...
Geom3D: Geometric Modeling on 3D Structures, NeurIPS 2023
A curated list of useful Python packages for data geeks
Train vision models using JAX and 🤗 transformers
My Digital Palace - A Personal Journal for Reflection - A place to store all my thoughts
Official release of InternLM2 7B and 20B base and chat models. 200K context support
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
Implementation of Alphafold 3 in Pytorch