Retrieval and Retrieval-augmented LLMs
MIT License
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Underst...
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似...
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多...
Mixture-of-Experts for Large Vision-Language Models
a state-of-the-art-level open visual language model | 多模态预训练模型
Netease Youdao's open-source embedding and reranker models for RAG products.
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings