Agents seem to be everywhere! Happy to announce that now they're at @rungalileo as well!
To advance our mission of helping developers create reliable and trustworthy AI products, today we're excited to launch Agentic Evals 🚀🚀
Ready to build reliable AI agents? Introducing Galileo Agentic Evaluations - comprehensive agent testing that transforms proof-of-concepts into production-ready systems.
🔍 Visualize every step of agent planning & execution
📊 Agent-specific metrics with 0.93+ AUC on benchmarks
⚡️ Optimize costs and latency across multi-step workflows
Watch the demo👇
We’ve raised $45M in Series B funding to deliver Evaluation Intelligence to every AI team! 🎉🚀
Since the start of 2024 Galileo has:
✅ Grown revenue by 834%
✅ Quadrupled our number of enterprise customers
✅ Onboarded six Fortune 50 companies
🧵👇
1/ This week, we launched the Hallucination Index, which ranks popular LLMs on their propensity to hallucinate for common GenAI tasks.
We evaluated 11 LLMs across 3 GenAI tasks using 2 powerful metrics.
Here’s what we found👇
#AI#LLM#Hallucinations#HallucinationIndex
☎️Wondering how contact centers are driving business continuity and preserving customer experience during COVID-19? Join Cresta and industry leaders from Cox, Intuit, Sleep Number, and TTEC to find out.
https://t.co/Lgbq9YEyG6
#remoteworkforce#WFH#cx#contactcenter#crestaAI