laion-prepro

Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them.

Stars: 201

Ecosystems: Python

Related Projects

sygil-webui

Stable Diffusion web UI

24 Aug 2022 7,850

Dromedary

Dromedary: towards helpful, ethical and reliable LLMs.

03 May 2023 1,114

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...

23 Feb 2024 1,061

mm-cot

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...

02 Feb 2023 3,760

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M u...

11 Aug 2021 3,256

CLAP

Contrastive Language-Audio Pretraining

06 Mar 2022 1,358

simple-image-recaptioning

Recaption large (Web)Datasets with vllm and save the artifacts.

11 Sep 2024 4

cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/tex...

29 Nov 2022 303

DB

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

18 Nov 2019 2,084

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on di...

06 Nov 2023 2,650

deepzoo

Deep Learning model Zoo

10 Aug 2018 20

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

01 Aug 2023 1,792

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

18 Sep 2023 4,242

caption-by-committee

Using LLMs and pre-trained caption models for super-human performance on image captioning.

14 Dec 2022 27

video-nonlocal-net

Non-local Neural Networks for Video Classification

26 Jan 2018 1,971