@theishanvyas Check out my GitHub - https://t.co/3Jk3B05Q5i
One of my projects - https://t.co/JwWXobepJV
I specialize in multimodal agents - https://t.co/5akXYyLp6a
Deployed the Qwen 2.5 3B model with vLLM.
Dealt with a lot of package compatibility issues due to an older GPU and CUDA version, but figured it out eventually.
Gives around 35-40 tokens/sec
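For anyone who wants to sanity-check a tokens/sec number like the one above, here is a minimal sketch of how it could be measured. It is not the script used for this deployment: `tokens_per_second`, the `generate` callable, and the injectable `clock` are hypothetical names made up for illustration.

```python
import time
from typing import Callable, Sequence


def tokens_per_second(
    generate: Callable[[str], Sequence[int]],
    prompt: str,
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return generated tokens divided by wall-clock seconds for one call.

    `generate` is a hypothetical wrapper around whatever backend is in use;
    it must return the generated token ids for `prompt`.
    """
    start = clock()
    token_ids = generate(prompt)
    elapsed = clock() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return len(token_ids) / elapsed


if __name__ == "__main__":
    # Stand-in generator: sleeps briefly, then returns 20 fake token ids.
    def fake_generate(prompt: str) -> list[int]:
        time.sleep(0.05)
        return list(range(20))

    print(f"{tokens_per_second(fake_generate, 'hello'):.1f} tokens/sec")
```

With vLLM, `generate` could wrap `LLM.generate` and return `outputs[0].token_ids` from the first result. A single-request number like this will generally differ from batched throughput, so it is worth stating which one is being reported.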
Building a timeline-aware, hybrid RAG system over public government audit data.
Focus: retrieval quality, metadata-aware search, reranking, and evaluation.
Goal: Make policy documents actually searchable and analyzable.
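A rough sketch of what "timeline-aware" plus "hybrid" could look like in code, under two assumptions that are mine and not necessarily how this project works: timeline awareness is modeled as a publication-date filter on chunk metadata, and the keyword and dense rankings are merged with reciprocal rank fusion (RRF). `Chunk`, `filter_by_period`, and `reciprocal_rank_fusion` are hypothetical names.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Chunk:
    doc_id: str
    text: str
    published: date  # metadata used for the timeline filter (assumed field)


def filter_by_period(chunks: list[Chunk], start: date, end: date) -> list[Chunk]:
    """Keep only chunks published within [start, end], inclusive."""
    return [c for c in chunks if start <= c.published <= end]


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of doc ids into one list.

    Each doc scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. Ties are broken by doc id for determinism.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


if __name__ == "__main__":
    keyword_hits = ["a", "b", "c"]  # e.g. from a BM25-style index
    dense_hits = ["b", "c", "a"]    # e.g. from an embedding index
    print(reciprocal_rank_fusion([keyword_hits, dense_hits]))
```

RRF only needs ranks, not comparable scores, which makes it a convenient default when the keyword and embedding retrievers score on different scales. A reranker would then reorder the top of the fused list.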