devanshu

@StartupwithDev

1X founder. VC turned Entrepreneur. Startup Operator

Joined April 2019

59 Following

14 Followers

35 Posts

StartupwithDev retweeted

Anand Kannappan

9 days ago

Today, we’re excited to announce our $50M Series B, led by @GreenfieldVC (formerly TPG Capital), with participation from @lightspeed and @notablecap. 🚀 At @PatronusAI, we develop simulations and evals to train and improve AI. The first phase of AI was built on static benchmarks, but that era is over now. As agents are used to solve longer and longer tasks, they need to practice in dynamic, living worlds to get better. Simulations are the critical infrastructure powering this next phase. As a company, we’re behind the most influential research and products in AI evaluation, like FinanceBench, Lynx, and Percival. And things have moved at the speed of light since. ⚡ We partner with the world's leading frontier AI labs and enterprises, and our revenue has grown more than 15x over the past year. Additionally, today, we’re introducing a preview of the first Digital World Model for AI agent training and simulation: Patronus-DWM. Digital World Models are language diffusion world models that predict realistic environment behaviors and steer agent actions across digital workflows. Just as physical world models predict how objects move through space, we’re developing the equivalent for the digital world: predicting how agents act in digital workflows, then using that to scale the creation of high-quality training data for LLMs. Digital World Models help us push the frontier of ultra long horizon workflows, and unlock a new class of self-improving RL environments. This is our scalable approach to simulating all of the world’s intelligence. The round was also joined by @datadoghq, @SamsungVentures, @gokulr, @factorialcap, and a large cohort of amazing AI leaders and researchers across @AnthropicAI, @OpenAI, @GoogleDeepMind, @nvidia, @Recursive_SI, and more. ✨ It has been the ride of a lifetime. But we’re just getting started. The best is yet to come. "Do not go gentle into that good night, Rage, rage against the dying of the light" - Dylan Thomas (1954)

26

271

23

142

38K

devanshu @StartupwithDev

10 months ago

AGI isn’t a bigger next token. It’s agents that experience consequences, adapt strategies, and generalize across tasks. Build better environments → get better agents. Environments are the next step.

0

0

0

0

13

devanshu @StartupwithDev

10 months ago

The path from AI to AGI won’t be “just add parameters.” It’s richer RL environments where agents can act, get feedback, and improve. Closing the loop from prediction → decision → consequence.

7

0

0

0

23

devanshu @StartupwithDev

10 months ago

Alignment is part of it: RLHF showed aligning models to human intent via feedback works in pure language. Now put that feedback inside interactive environments, so agents learn what to do and how to behave.

0

0

0

0

12

Who to follow

@ec957ad17691438

Strategic Finance Leader. FinTech and AI enthusiast. Keen to connect with founders building.

Eren SAFALIOĞLU

@erensafalioglu

CryptoTrader🇹🇷

devanshu @StartupwithDev

10 months ago

Open-ended worlds matter: MineDojo (Minecraft) blends thousands of tasks + internet-scale knowledge, even learning rewards from video-language priors, exactly the “learn from the world” recipe.

0

0

0

0

13

devanshu @StartupwithDev

10 months ago

Bridge to real user tasks: On WebArena, GPT-4-based agents hit ~14% E2E success vs ~78% for humans. Great reality check and a north star for progress. We need better environments, tools, and credit assignment.

0

0

0

0

28

devanshu @StartupwithDev

10 months ago

Scale the worlds too: XLand shows agents getting broadly capable via open-ended play across many games, not memorized tasks. Curriculum emerges from environment design.

0

0

0

0

14

devanshu @StartupwithDev

10 months ago

Evidence we’re on the right track: • MuZero learns a world model and plans superhuman on Atari/Go/Chess/Shogi without rules encoded. Planning + learning in one agent.

0

0

0

0

12

devanshu @StartupwithDev

10 months ago

Why environments? Reasoning needs interaction. LLMs pattern-match; agents must plan, explore, and recover from errors. RL environments supply long-horizon tasks, delayed rewards, and counterfactuals. Exactly what “general intelligence” needs.

0

0

0

0

18

StartupwithDev retweeted

about 1 year ago

Fighting Hallucinations is one of the most important features for a RAG system to have! ⚔️ SUPER excited to share a bit of what we've been cooking up with our friends @PatronusAI! 🚀 > The team at Patronus has created Lynx, a custom, state-of-the-art model for Hallucination Detection! > On the Weaviate side of the coin, we have engineered the Query Agent to "cite its sources". > This recipe illustrates how you can connect the `sources` response from the Query Agent to Patronus' Lynx evaluator! The recipe is linked below, I hope this inspires your trust in responses from the Query Agent! I also really hope you will check out Patronus AI, incredible team! 🔥

1

19

10

7

2K

StartupwithDev retweeted

Anand Kannappan

over 1 year ago

Today, I’m proud to launch the first MLLM-as-a-Judge. $ pip install patronus to scale image evals Here’s why this is game changing for AI engineers.

2

51

10

20

6K

StartupwithDev retweeted

over 1 year ago

1/ Introducing Glider - the smallest model to beat GPT-4o-mini on eval tasks ⚡🚀 - Open source, open weights, open code - Explainable evaluations by nature - Trained on 183 criteria and 685 domains Try it out for free at https://t.co/ZZai84VulJ 🔥

3

82

22

42

15K

devanshu @StartupwithDev

over 2 years ago

@SAF_Health You should try @LevyOperations

0

0

0

0

7

devanshu @StartupwithDev

over 2 years ago

@TheEdPerry @limecubeco Completely automated operations is still a long way but till then you should try @LevyOperations

0

0

0

0

6

devanshu @StartupwithDev

over 2 years ago

@__tinygrad__ I would highly recommend @LevyOperations They fit in your budget and they are experts in getting the work done so you can focus on more important tasks

0

0

0

0

17

StartupwithDev retweeted

Z Nation Lab @ZNationLab

about 6 years ago

Join the industry experts in a #virtual panel discussion on “Stress Test Your P&L & Managing Liquidity”. Register now: https://t.co/j0dLftTqak Live at 11 AM, April 6th, 2020 In these times of crisis let’s get together and support each other!

ZNationLab's tweet photo. Join the industry experts in a #virtual panel discussion on “Stress Test Your P&L & Managing Liquidity”. Register now: https://t.co/j0dLftTqak

Live at 11 AM, April 6th, 2020

In these times of crisis let’s get together and support each other! https://t.co/rKjo4HLucf

0

1

1

0

0

StartupwithDev retweeted

CorpGini @corp_gini

about 6 years ago

Join the industry experts in a #virtual panel discussion on “Stress Test Your P&L & Managing Liquidity”. Register now: https://t.co/Pz0PldeQLv Live at 11 AM, April 6th, 2020 In these times of crisis let’s get together and support each other!

corp_gini's tweet photo. Join the industry experts in a #virtual panel discussion on “Stress Test Your P&L & Managing Liquidity”. Register now: https://t.co/Pz0PldeQLv

Live at 11 AM, April 6th, 2020

In these times of crisis let’s get together and support each other! https://t.co/Zu0NOQ3Mz2

0

3

3

0

0

devanshu @StartupwithDev

over 6 years ago

#Corporates worth $1Trillion from 8 sectors are looking for #innovative solutions to solve their problems. If you have a solution that can be used by such corporates, comment 'Solution' below. #artificialintelligence #realestatetech #retailtech #manufacturing #financialservices

0

2

2

0

0

devanshu @StartupwithDev

almost 7 years ago

5. This is how Jio paved way for Reliance 2.0 by getting the highest market share holders, the kinara stores online to sell and the biggest chunk of Indian population, the rural population to buy.

0

1

0

0

0

devanshu @StartupwithDev

almost 7 years ago

What will be the secret sauce for Reliance Retail (Reliance 2.0)? How big of a role will Jio play? Was Jio the foundation of a bigger picture that only Mukesh Ambani could see? #jiodhandhanadhan #reliance2.0 #relianceretail #MukeshAmbani

1

0

0

0

0

devanshu @StartupwithDev

almost 7 years ago

4. Now that every small retailer has uninterrupted internet access, they are capable of using advance softwares to optimise their work and increase revenue by selling online and also the rural population can now buy everything online.

1

0

0

0

0

Last Seen Users on Sotwe

Trends for you

Most Popular Users