Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
MIT License
Given a long video, we turn it into a document containing visual and audio information. By sending this document to ChatGPT, we can chat about the video!
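To illustrate the idea of a "video document", here is a minimal sketch that merges timestamped visual captions and speech transcripts into one chronologically ordered text. The `Segment` class, the `build_video_document` function, the line format, and the sample data are all hypothetical and chosen only for illustration; the actual document format produced by this codebase may differ.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One timestamped piece of information extracted from a video (hypothetical type)."""
    start: float  # seconds from the beginning of the video
    end: float    # seconds from the beginning of the video
    source: str   # "caption" for visual info, "speech" for audio info
    text: str


def build_video_document(segments: list[Segment]) -> str:
    """Merge visual and audio segments into one chronologically ordered text."""
    # Sort by start time; ties are broken by source name for a stable layout.
    ordered = sorted(segments, key=lambda s: (s.start, s.source))
    lines = [
        f"[{s.start:.1f}s - {s.end:.1f}s] {s.source}: {s.text}"
        for s in ordered
    ]
    return "\n".join(lines)


# Made-up sample data, not real output from any model.
segments = [
    Segment(5.0, 9.0, "speech", "How much is this watermelon?"),
    Segment(0.0, 5.0, "caption", "A man stands at a fruit stall."),
]
document = build_video_document(segments)
print(document)
```

A plain-text document like this can then be placed in a ChatGPT prompt as context for questions about the video.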
Done
Doing
Please find installation instructions in install.md.
python main.py --video_path examples/buy_watermelon.mp4 --openai_api_key xxxxx
The generated video document is saved to examples/buy_watermelon.log
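If you prefer not to type the API key on the command line, the same invocation can be assembled from Python. This is only a sketch: the helper names `build_command` and `expected_log_path` are hypothetical, reading the key from an `OPENAI_API_KEY` environment variable is an assumption (the script itself only documents the `--openai_api_key` flag), and the rule that the log sits next to the video with a `.log` suffix is inferred from the single example above.

```python
import os
from pathlib import Path


def build_command(video_path: str, api_key: str) -> list[str]:
    """Assemble the documented main.py invocation as an argument list."""
    return [
        "python", "main.py",
        "--video_path", video_path,
        "--openai_api_key", api_key,
    ]


def expected_log_path(video_path: str) -> Path:
    """Guess the log location (assumption: same path with a .log suffix)."""
    return Path(video_path).with_suffix(".log")


video = "examples/buy_watermelon.mp4"
# Assumption: the key is stored in OPENAI_API_KEY; "xxxxx" is a placeholder.
api_key = os.environ.get("OPENAI_API_KEY", "xxxxx")

command = build_command(video, api_key)
log_path = expected_log_path(video)

# Print the command without the key so it does not leak into logs.
print(" ".join(command[:-1]), "<key hidden>")
print(log_path)
```

To actually run it, call `subprocess.run(command, check=True)` from the repository root, then read the file at `log_path`.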
python main_gradio.py --openai_api_key xxxxx
Stay tuned for our project 🔥
If you have suggestions or features you would like implemented in this codebase, feel free to email us at [email protected], [email protected] or open an issue.
This work is based on ChatGPT, BLIP2, GRIT, KTS, Whisper, LangChain, Image2Paragraph.