You shouldnโt need a data center to understand your own videos.
Introducing gUrrt - a way to talk to your videos, running fully locally, without the soul-crushing VRAM requirements of large video-language models.
built by me and @muffBozo
https://t.co/gF0B8oKJxt
A thread ๐งต
@CodeLvM interview was related to AI engineering more accurately inference eng, They asked me some question related to my project , different inference engine when to use which one , paged and flash attention , vllm ,some agentic ai question
@TheAhmadOsman I was using ollama from past 7-8 month on my laptop 6gb vram and just switched to llama.cpp for my project (https://t.co/mAU42Tuq7K ) batch video captioning pipeline .
If any one want to give a read here is the link
https://t.co/8p7XSyyBFk