image-captioning

PyTorch implementation of image captioning

Neural networks and the "genes" of Chinese characters

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Recognize CAPTCHAs using deep learning.

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...

The LSTM model generates captions for the input images after extracting features from pre-trained...

Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.

Using LLMs and pre-trained caption models for super-human performance on image captioning.

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with ...

My solution to the Image Captioning Final Project of the Coursera "Introduction to Deep Learning"...

Recognition of click-the-text, character-selection, selection, and touch-point CAPTCHAs, trained with PyTorch

Official TensorFlow implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with...