@flynas @flynascare My flight XY16 on 09/03/2026 was rescheduled
from 2 o'clock to 4 o'clock,
so I changed my flight to 10/03/2026,
flight number XY54.
I could not find my bag. I went to the baggage office at Jeddah airport twice and they told me it had arrived in Riyadh, but I did not find it at Riyadh airport, and the employee refused to file a report because the flight was different.
AI Agents collapse when given access to hundreds of tools at once.
This unified MCP server uses progressive discovery to guide AI Agents through 1000s of tools instead of loading everything upfront.
100% open source.
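A minimal sketch of what progressive discovery could look like, assuming a three-step flow: list categories, then list tools in one category, then fetch a single tool's schema. The `ProgressiveRegistry` class, its method names, and the sample tools are all hypothetical and are not the API of the server mentioned above or of MCP itself.

```python
from dataclasses import dataclass, field


@dataclass
class Tool:
    name: str
    description: str
    schema: dict = field(default_factory=dict)  # argument schema, loaded last


class ProgressiveRegistry:
    """Hypothetical registry that reveals tools in stages, not all at once."""

    def __init__(self):
        self._by_category = {}

    def register(self, category, tool):
        self._by_category.setdefault(category, {})[tool.name] = tool

    def list_categories(self):
        # Step 1: the agent sees only a short list of category names.
        return sorted(self._by_category)

    def list_tools(self, category):
        # Step 2: names and one-line descriptions for a single category.
        tools = self._by_category[category]
        return {name: tools[name].description for name in sorted(tools)}

    def get_schema(self, category, name):
        # Step 3: the full schema for the one tool the agent picked.
        return self._by_category[category][name].schema


def demo_registry():
    # Illustrative data only; these tools do not exist anywhere.
    reg = ProgressiveRegistry()
    reg.register("crm", Tool("find_contact", "Look up a contact by email", {"email": "string"}))
    reg.register("crm", Tool("add_note", "Attach a note to a contact", {"id": "string", "text": "string"}))
    reg.register("calendar", Tool("create_event", "Create a calendar event", {"title": "string"}))
    return reg
```

With this shape the agent's context holds a couple of category names first, and a full schema only for the tool it actually picks, rather than every schema upfront.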
we hijacked Microsoft's Copilot Studio agents and got them to spill their private knowledge, reveal their tools, and let us use them to dump full CRM records
these are autonomous agents.. no human in the loop
#DEFCON #BHUSA @tamirishaysh
Is Chain-of-Thought Reasoning of LLMs a Mirage?
... Our results reveal that CoT reasoning is a brittle mirage that vanishes when it is pushed beyond training distributions. This work offers a deeper understanding of why and when CoT reasoning fails, emphasizing the ongoing challenge of achieving genuine and generalizable reasoning.
... Our findings reveal that CoT reasoning works effectively when applied to in-distribution or near in-distribution data but becomes fragile and prone to failure even under moderate distribution shifts.
In some cases, LLMs generate fluent yet logically inconsistent reasoning steps. The results suggest that what appears to be structured reasoning can be a mirage, emerging from memorized or interpolated patterns in the training data rather than logical inference.
... Together, these findings suggest that LLMs are not principled reasoners but rather sophisticated simulators of reasoning-like text.
A RAG engine for deep document understanding!
RAGFlow lets you build enterprise-grade RAG workflows on complex docs with well-founded citations.
Supports multimodal data understanding, web search, deep research, etc.
100% local & open-source with 55k+ stars!