๐ DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
๐น DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
๐น DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!
๐ Tech Report: https://t.co/drlDrxkYtp
๐ค Open Weights: https://t.co/T13Y8i7SDM
1/n
Glad to see my paper accepted by FSE 2026!
We proposed this agentic vulnerability detection idea early in the DeepSeek R1 era and have since enhanced its capabilities with advanced strategies. Our work has safeguarded millions of dollars in the smart contract ecosystem.
@OpenAI Short answer: LLMs cannot replace senior security researchers. They still can't generate end-to-end exploits for complex real-world on-chain incidents.
Are AI agents ready for detecting and exploiting smart contract vulnerabilities?
We re-evaluated @OpenAI's EVMbench with a contamination-free dataset of real-world hacks.
Our data shows different results. ๐งต
Paper: https://t.co/zDQ2aRdCXt
Introducing Claude Code Security, now in limited research preview.
It scans codebases for vulnerabilities and suggests targeted software patches for human review, allowing teams to find and fix issues that traditional tools often miss.
Learn more: https://t.co/n4SZ9EIklG
@PostgreSQL has long powered core @OpenAI products like ChatGPT and the API. Over the past year, our production load grew 10ร and keeps rising. Today we run a single primary with nearly 50 read replicas in production, delivering low double-digit millisecond p99 client-side latency and five-nines availability. In our latest OpenAI Engineering blog, we unpack the optimizations we made to to scale @Azure PostgreSQL to millions of queries per second for more than 800M ChatGPT users. Check out the full post here: https://t.co/VTnxhlwlat
Given the recent Balancer/Yearn exploits, imagine what on-chain miracles weโd see if Lazarus started developing an LLM for automated attacks. In addition, they should cite the A1 preprint by @lzhou1110, as it is the first work for on-chain AEG.
New on our Frontier Red Team blog: We tested whether AIs can exploit blockchain smart contracts.
In simulated testing, AI agents found $4.6M in exploits.
The research (with @MATSprogram and the Anthropic Fellows program) also developed a new benchmark: https://t.co/QpGPMqlDRG
Since y'all spammed my timeline full of #Ethereum existential crises, here's a letter I sent to EF leadership in a year and half ago ๐ฌ.
(link in next post because Twitter...)