YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

GPL-3.0 License

Downloads

160

Stars

4.4K

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

115

Time to Close Issues

11 days

Package Rankings

Top 35.91% on PyPI.org

Related Projects

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. Approaching GPT-4o performance, an open-source multi...

22 Nov 2023 5,641

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

12 Oct 2023 2,138

MAF-YOLO

Implementation of paper - Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneo...

25 Jun 2024 45

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large...

09 Nov 2023 1,314

yolov3

YOLOv3 in PyTorch > ONNX > CoreML > TFLite

26 Aug 2018 10,166

yoloair2

☁️💡🎈 Focused on improving YOLOv7; supports improving Backbone, Neck, Head, Loss, IoU, NMS and other modules

25 Aug 2022 197

PyTorch-YOLOv3

Minimal PyTorch implementation of YOLOv3

21 May 2018 7,314

F-LMM

Code Release of F-LMM: Grounding Frozen Large Multimodal Models

28 Mar 2024 28

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...

23 Feb 2024 1,061

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

17 Jun 2024 1,703

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

29 May 2022 7,818

yolov5_obb

yolov5 + csl_label. (Oriented Object Detection) (Rotation Detection) (Rotated BBox) Rotated object detection based on yolov5

17 Mar 2021 1,813

Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

23 Oct 2023 2,881

yolov7_d2

🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance Segmentation, with Ten...

23 Jun 2021 3,125