Toolkit for Visual7W visual question answering dataset
MIT License
The official repo for the CVPR 2021 paper ViPNAS: Efficient Video Pose Estimation via Neural Architecture S...
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity...
[CVPR 2024 Highlight] Monkey (LMM): Image Resolution and Text Label Are Important Things for Large...
A state-of-the-art-level open visual language model | Multimodal pre-trained model
Reading Wikipedia to Answer Open-Domain Questions
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
The Natural Language Decathlon: A Multitask Challenge for NLP
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities...
[NAACL 2021] QAGNN: Question Answering using Language Models and Knowledge Graphs 🤖
TensorFlow Models for the Stanford Question Answering Dataset
Most popular metrics used to evaluate object detection algorithms.
A task generation and model evaluation system.
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR...