We’re launching @JudgmentLabs today and announcing $32M in funding.
As AI agents take on more of the work that creates economic value, they generate massive amounts of production data: the clearest record of how they behave with users, software, and the real world.
Judgment builds infrastructure for improving AI agents from production data.
Chert (@Cherthq) is building the infrastructure for teams to send, receive, and automate conversations over iMessage using APIs.
They’re already working with B2C startups, customer experience teams, and sales teams to reach end users through trusted and conversational messaging.
Congrats on the launch, @GaryGao891219 & @ianyfong!
https://t.co/Ow9hfhlMDZ
The best part about being a founder is the ability to choose who you share your life’s work with. Could not have chose better and nobody more deserving :)
2 weeks ago, Brett Adcock posted a public browser agent challenge; the website contained 30 steps and had to be solved in under 5 minutes. The first thing that stood out to me was that I only got to step 8 in 5 minutes, a far cry from 30 minutes. Any agent that can complete this would surely have to be ‘superhuman’. In addition, no frontier model was able to solve this, so, in some sense, any solution also needs to 'push the frontier' on this particular challenge.
As it turns out, traditional post training (RL) was not the best solution to this problem instead a recursive DSPy policy pushed GPT OSS 120B from being unable to complete step 1 to finishing all 30 in 4:10 minutes (10x faster than Sonnet 4.6). Wrote a blog detailing this below.
.@shofoai is assembling the largest index of short-form videos.
They build complete pipelines that collect, segment, sanitize, and label videos from across social media to curate custom datasets for AI labs.
Congrats on the launch @zixihong_, @BraidenDishman, @AlexzendorMisra, and @AndreBragaML!
https://t.co/9H3qS0Qnut
Two freshmen, no money, no plan. Just a side project to find alumni.
Four months later, they went viral at Stanford, hit 40K DAUs, and got into YC.
Now they’re attempting to build the internet’s most powerful people-search engine, already capable of running 10,000 LLM calls at once on Groq.
This is the story of @cladoai and its mission to shake up the multi-billion dollar business of finding people online.
🧵👇
2/ Top agents to research anything
GRID brings together 100+ intelligence partners, spanning agents, models, data, and tools. Our ecosystem includes leading Web2 agents designed to support virtually any type of research.
@ExaAILabs: Real-time, citation-rich web search that retrieves the most relevant and credible sources.
@cladoai: A people-search and profiling tool that aggregates web and social media data to generate comprehensive research on individuals.
@askalphaxiv: Surfaces papers, preprints, and technical research with summaries.
Our Open Deep Search framework supplements this workflow with top-tier current information
gpt-oss prepares the answer based on all of the information provided above
@composio: Connects AI to 100+ SaaS tools, APIs, and workflows (e.g., Notion, Slack, Stripe, GitHub) so an agent take actions across apps.