Claude Code creator:
"100% of our pull requests at Anrtopic are run by Claude Code. 80–90% of code review too.
The feature I’m using the most today is /loops. I’m not prompting Claude anymore - I’m building loops"
in 1-hour interview, Boris reveals his setup, which helps him build the #1 coding tool of this year.
Worth more than a $500 vibe-coding course.
Anthropic shipped a CLI for Claude Platform every API endpoint now runs straight from your terminal.
Your terminal just became a direct line into Claude's infrastructure.
MANUALLY DRAGGING BOXES FOR ARCHITECTURE DIAGRAMS IS FINALLY DEAD
There is a new open-source agent skill that turns raw codebases into cleanly routed https://t.co/f9zjBxfJLp diagrams without you placing a single coordinate.
The project, drawio-skill, runs directly inside Claude Code, Cursor, or Copilot.
Instead of opening a blank canvas, you just ask your agent to map the repo.
Here is what it actually does:
→ Extracts the module structure (supports Python, JS/TS, Go, Rust)
→ Uses Graphviz for auto-layout and routing
→ Drops redundant edges so the graph stays readable
→ Builds native, editable https://t.co/f9zjBxfJLp files
But the standout feature is visual self-checking.
Once it generates the diagram, the agent "looks" at the resulting PNG. If it sees stacked edges or clipped text, it auto-fixes the layout across up to 5 iterative rounds.
It runs from a single file. No MCP server. No background daemon.
Best part?
It's 100% free and open-source.
repo link in 🧵↓
geçen hafta claude ile 6 saat dijital ayak izimi sildim.
sonuç: 47 data broker listingi kaldırıldı. 12 ölü hesap silindi. 3 arama sonucu bastırıldı.
nasıl yaptım, adım adım. thread.
Hackers can use a crafted GGUF file to leak private information you put into your local LLM or agent.
Many people may not have a good understanding of what GGUF is, so here is a simple primer.
GGUF is easier to understand if you stop thinking of it as “a model” and start thinking of it as a container file.
A local LLM is mostly a giant pile of numbers: the weights. But the runtime (llama.cpp for example) also needs a map:
what architecture is this?
what tokenizer should be used?
what chat template applies?
where does each tensor start?
what shape is it?
what quantization type is it?
GGUF packs that into one file.
The most important part is the tensor table. It is basically a manifest that says:
tensor: blk.0.attn_q.weight
shape: [4096, 4096]
type: Q4_K_M
offset: 1835008
The loader reads that and says: “go to this byte position, interpret these bytes as this kind of tensor, and attach it to this layer.”
That’s why GGUF is so useful for local inference: one portable file can include weights + metadata + tokenizer info + quantization layout.
And that’s also why GGUF parsing matters for security. A GGUF file is structured input to a model loader. If the file lies about offsets, sizes, or shapes, the loader has to catch it.
GGUF is normally associated with llama.cpp and most commonly stands for GGML Universal Format. The “GG” comes from the initials of llama.cpp creator Georgi Gerganov. Some people may also call it GPT-Generated Unified Format.
Ollama uses llama.cpp under the hood. Many tools support GGUF because they wrap or interoperate with llama.cpp. Hugging Face Transformers can load GGUF for specialized workflows, and some runtimes/tools expose GGUF import/export. MLX has GGUF-related save/export utilities, but MLX models usually use MLX-native weights, not GGUF as the main runtime format.
Çin, 7/24 kendi kendine çalışan ve %100 yerel olarak çalışan bir yapay zekâ çalışanı piyasaya sürdü.
Araştırma yapıyor, kod yazıyor, web siteleri oluşturuyor, slayt sunumları hazırlıyor ve videolar üretiyor. Hepsi kendi başına. Hepsi bilgisayarınızda.
%100 Açık Kaynak.
Higgsfield releases Virality Predictor
What does it mean:
> Upload any clip up to 15s
> Get viral potential, hook score & hold rate
> See a heatmap of brain regions your clip activates
> Pair with Ad Reference for recreated videos
Available via MCP/CLI and on the platform.
// Scalable Patterns for Agentic AI Workflows //
Besides context engineering, we should be putting a lot more system engineering efforts around agents.
This paper shows an example of why it matters.
(bookmark it)
Let's start with an important question: Where does your agentic RAG pipeline actually lose time?
It's almost never the LLM call. It's usually the data plane underneath. Serialization between preprocessing, embedding, and vector retrieval, plus coordination overhead between distributed services.
New work introduces AAFLOW, a unified distributed runtime that models agentic workflows as an operator abstraction over Apache Arrow and Cylon. A zero-copy data plane connects preprocessing, embedding, and retrieval directly. Resource-deterministic scheduling and async batching cut coordination cost.
The result: up to 4.64× pipeline speedup and 2.8× gains in embedding and upsert phases, with comparable LLM throughput.
None of that comes from LLM inference acceleration. It all comes from cleaner data flow.
Paper: https://t.co/9fqkRRsV39
Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX
THIS GUY JUST BUILT A DRONE THAT TRACKS TARGETS WITH A LASER USING CLAUDE
> No robotics team
> No engineering degree
> No budget
He used Claude Code to write the entire system in Python - a drone that locks onto a target, follows it with a laser and corrects its own aim automatically after every attempt.
The AI gets more accurate the longer it runs because it learns from every single shot.
He's still improving it and this is only the beginning.
You have the same tool he used.
The article below is about how to actually use it to its full potential.
Someone just built a tool that turns any GitHub repo into an interactive knowledge graph.
You just paste any repo, and it builds a live D3.js graph of every function, class, and call relationship, then lets you query it in plain English with an AI agent.
100% Open Source and runs in your browser.
Fun interactive science app ideas | Part 3
Played around with generating 3D biological structures and made an app to explore them interactively
UI Design
GPT Images 2
Code
Gemini 3.1 Pro
More demos ↓
There’s now a Hermes Agent Desktop App
You can finally run the self-evolving 24/7 autonomous AI agent from Nous Research inside a full native desktop UI instead of only the CLI.
Multi-agent management, persistent memory, autonomous workflows, and local infrastructure support all inside one app. Full demo: https://t.co/qbLRPjJd2y
Google just launched CodeWiki paste any repo & it auto-generates the entire documentation.
> Diagrams, architecture maps, dependency trees
> Step-by-step tutorials built automatically
> A chatbot trained on your whole codebase, ready to answer questions
40% of the code Claude writes for you is wasted. you're paying for the rewrite.
a 65-line markdown file fixes it. 120,000 developers have starred it.
the author tested it on "30 codebases over 6 weeks" and reported a mistake rate drop from 41% to either 11% or 3%
depending on whether you read the headline or the body.
the irony is that the article is right.
CLAUDE.md is the most under-leveraged file in your stack.
65 lines of behavioral rules outperform a 4,000-token preferences dump.
"be careful" is useless. testable imperatives are gold.
"be senior" doesn't work Claude already thinks it is.
the 4 rules that ship the most leverage:
/ state assumptions, never guess silently
/ minimum code, nothing speculative
/ surgical changes, don't refactor adjacent code
/ define success, loop until verified
compliance: ~80%. mistake rate: from ~40% to single digits.
no human caught the contradicting numbers in the title.
nobody had to.