Rodolfo Esparza Jr

@ElInsuranceGuy

Insurance • Banking • AI • Research | San Antonio, TX | Democratizing AI & financial access for all | Jesus loves me more

San Antonio, Texas

Joined December 2015

110 Following

40 Followers

497 Posts

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

Benchmarks for agent UX are becoming just as important as benchmarks for raw model IQ. Paper + open-source data here: https://t.co/FnINFarPhY #AIAgents #CryptoAI #LLMEvals

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

Most crypto agent benchmarks measure reasoning or returns. LATTICE asks a better question: does the agent actually help a human make a decision?

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

This is the shift AI evals need: from “Can the model answer?” to “Can the system support a real decision under uncertainty?”

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

If you care about coding agents, this paper is worth reading. It suggests the path from “demo” to “dependable” runs through planning, minimal edits, and executable verification — not just smarter autocomplete. Paper: https://t.co/sInzM9zS0w

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

Most AI coding models still choke on code editing. On EditBench, 39 of 40 models score under 60% task success. A new paper says the fix may not be a bigger model, but a 3-agent workflow plus test-driven feedback. #AIAgents #CodingAgents #LLM

Rodolfo Esparza Jr

@ElInsuranceGuy

about 1 month ago

My read: this is strong evidence that coding-agent reliability depends as much on harness design as raw model quality. Better workflows may matter more than the next model bump.
