Netease Youdao's open-source embedding and reranker models for RAG products.
APACHE-2.0 License
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
MTEB: Massive Text Embedding Benchmark
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critiqu...
Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多...
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by ...
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language ...
Question and Answer based on Anything.
Retrieval and Retrieval-augmented LLMs
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Mixture-of-Experts for Large Vision-Language Models