Combine CNN and RNN knowledge to build a network that automatically produces captions when given an input image. Python, PyTorch.
The LSTM model generates captions for the input images after extracting features from pre-trained...
Semantic Image Synthesis with SPADE
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Deep learning model zoo
A Python library built to empower developers to build applications and systems with self-contain...
Text to Image Generation (Reverse image captioning): This task is just the reverse of image capti...
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...
A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.
Using LLMs and pre-trained caption models for super-human performance on image captioning.
I decided to sync up this repo and self-critical.pytorch. (The old master is in old master branch ...
PyTorch code for our CVPR 2018 paper "Neural Baby Talk"