Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detection and segmentation.
MIT License
Statistics for this project are still being loaded, please check back later.