xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

OTHER License

Stars: 1K

Issue Statistics (past year / all time)

Total Pull Requests: 0 / 4
Merged Pull Requests: 0 / 3
Total Issues: 6 / 62
Time to Close Issues: about 1 hour / about 2 months

Related Projects