CVND-Image-Captioning-Project

Combine CNN and RNN knowledge to build a network that automatically generates a caption for a given input image. Built with Python and PyTorch.
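
As a rough illustration of the CNN-plus-RNN design described above, here is a minimal PyTorch sketch. It is not the project's actual code: the class names `EncoderCNN` and `DecoderRNN`, the layer sizes, and the tiny convolutional stack (a stand-in for a pre-trained CNN) are all assumptions made for the example.

```python
import torch
import torch.nn as nn


class EncoderCNN(nn.Module):
    """Hypothetical encoder: a tiny conv stack standing in for a pre-trained CNN."""

    def __init__(self, embed_size):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # collapse spatial dimensions to 1x1
        )
        self.fc = nn.Linear(16, embed_size)

    def forward(self, images):
        x = self.features(images).flatten(1)  # (batch, 16)
        return self.fc(x)                     # (batch, embed_size)


class DecoderRNN(nn.Module):
    """Hypothetical decoder: an LSTM that emits one word index per step."""

    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Teacher forcing: the image feature is the first input, followed by
        # every caption token except the last one.
        tokens = self.embed(captions[:, :-1])                       # (batch, T-1, embed)
        inputs = torch.cat([features.unsqueeze(1), tokens], dim=1)  # (batch, T, embed)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)                                      # (batch, T, vocab)

    @torch.no_grad()
    def sample(self, features, max_len=20):
        # Greedy decoding: feed the most likely word back in at each step.
        # For brevity this sketch does not stop early at an end token.
        inputs, state, words = features.unsqueeze(1), None, []
        for _ in range(max_len):
            hidden, state = self.lstm(inputs, state)
            word = self.fc(hidden.squeeze(1)).argmax(dim=1)  # (batch,)
            words.append(word)
            inputs = self.embed(word).unsqueeze(1)           # (batch, 1, embed)
        return torch.stack(words, dim=1)                     # (batch, max_len)


# Usage with random data (sizes and vocabulary are made up for illustration).
encoder = EncoderCNN(embed_size=32)
decoder = DecoderRNN(embed_size=32, hidden_size=64, vocab_size=100)
images = torch.randn(2, 3, 64, 64)
captions = torch.randint(0, 100, (2, 7))
logits = decoder(encoder(images), captions)           # training-time scores
sampled = decoder.sample(encoder(images), max_len=5)  # inference-time word indices
```

In training, `logits` would typically be compared against `captions` with a cross-entropy loss; at inference, the indices in `sampled` would be mapped back to words through the vocabulary.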

Ecosystems: Python, PyTorch

The LSTM model generates captions for the input images after extracting features from pre-trained...

Semantic Image Synthesis with SPADE

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Deep Learning model Zoo

A Python library built to empower developers to build applications and systems with self-contain...

Text to Image Generation (Reverse image captioning): This task is just the reverse of image capti...

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...

A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.

Using LLMs and pre-trained caption models for super-human performance on image captioning.

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch ...

PyTorch code for our CVPR 2018 paper "Neural Baby Talk"