📚 🔍 bRAG: Complete RAG Guide
A comprehensive project showcasing RAG implementations with LangChain - from basics to advanced features like multi-query retrieval, ColBERT indexing, and RAG-Fusion.
Check out this 1.7K+ starred guide 🚀
https://t.co/RgmhWR9LqT
微软出了一门给初学者学习的 AI 智能体课程:AI Agents for Beginners。
共 10 节课程,涵盖构建 AI 智能体的所有基础知识,旨在教授我们从零开始构建一个AI智能体。
GitHub:https://t.co/aHW9oAgtzt
课程内容已做了中文翻译,学习起来更加轻松,同时提供每节课所使用的示例代码,方便我们运行。
I built a Deepseek R1 RAG Reasoning Agent running locally on my computer.
It's an Agentic RAG reasoning agent that can think, reason and fall back to web search if needed.
100% Opensource code with step-by-step tutorial.
🔗We open-source the training recipes and technical details in the blog post: https://t.co/dgoMehyCvQ.
All the related code for SFT, RL, and evaluation: https://t.co/YzD6wd5g7t.
Model weights and data: https://t.co/cwgLx3OB2P.
Collaborate, replicate, and innovate!
(1/5)🚨LLMs can now self-improve to generate better citations✅
📝We design automatic rewards to assess citation quality
🤖Enable BoN/SimPO w/o external supervision
📈Perform close to “Claude Citations” API w/ only 8B model
📄https://t.co/FHj54HiC6i
🧑💻https://t.co/nQa87KkYMo
Github 🤖: Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📊
Helps you chat with data to generate SQL, charts, and reports, using your choice of LLM. It provides an open-source GenBI solution for data-driven teams seeking insights without code.
What it offers:
→ Wren AI is an open-source GenBI AI Agent that enables data-driven teams to interact with their data through chat.
→ It generates Text-to-SQL queries, charts, spreadsheets, reports, and BI insights.
→ It supports multiple LLMs including OpenAI, Azure OpenAI, DeepSeek, Google Gemini, Vertex AI, Bedrock, Anthropic, Groq, Ollama, and Databricks.
→ Wren AI allows users to ask data questions in multiple languages and provides AI-generated summaries and visualizations of query results.
→ It features AI-powered data exploration, semantic indexing for context, and allows exporting data to Excel and Google Sheets.
You can try it here in their playground and even build with their API:
https://t.co/RlTGqgLxmf
And as usual, available on Hugging Face:
- Hybrid: https://t.co/6mwII1jS4i
- Transformer: https://t.co/W1WAURDJtF
I recently gave a tutorial on knowledge distillation for LLMs, explaining the mathematical derivations behind the commonly used methods. Sharing the slides here given the recent interest in this topic.
https://t.co/u1LYcY4s7G
DeepSeek by hand is just another level, @ProfTomYeh made a lecture on it:
- Multi-Head Attention
- Multi-Head Latent Attention
- Single Expert
- Mixture of Experts
- Sparse Mixture of Experts
- Shared+Routed Mixture of Experts
- RoPE
Nobody can explain Transformers and Self-Attention like professor Bryce...
He's one of the Hidden gems of YouTube, his all videos are packed with knowledge and he teach with enthusiasm and dedication.
Below 👇🏻 link in comments.
Run DeepSeek-R1 (671B) locally on @OpenWebUI - Full Guide
No GPU required.
Using our 1.58-bit Dynamic GGUF and llama.cpp.
Tutorial: https://t.co/xaR9KpJzcj
A quick summary of deepseek-v3 model
(there may be wrong or missing details as this is based on a quick skim)
tons of respect to their engineering team...