Top Tweets for #LLMbenchmarks
Your LLM evals might be burning cash for no reason. More evaluations ≠ better results.
Generic metrics, excessive scope, and inadequate sampling are undermining your ROI.
Smart judges need context, justification, and human validation.
#AI #LLMBenchmarks #AIObservability
CURIE introduced custom evals like LLMSim and LMScore to grade nuanced outputs (like equations, summaries, YAML, code).
Even the best models (Claude 3, Gemini, GPT-4) scored just ~32%. Proteins? Total fail.
LLMs can read papers — solving them is another matter.
#LLMbenchmarks #ArtificialInteligence #Google
Tencent Releases its Hunyuan T1 AI Reasoning Model, Beating DeepSeek R1, GPT-4.5, o1 Across Multiple Benchmarks
#AI #GenAI #TencentAI #HunyuanT1 #AIReasoning #EnterpriseAI #LLMbenchmarks #ChinaAI #MMLU #MathAI #AIModels #AIInference
https://t.co/UgClFbhQj8
A small observation : more than solving HL math/physics/coding problems, I find getting LLMs to 'formulate' good set of solvable problems in a given topic ( algebra, geometry ... ) is a challenge. LLMs should be benchmarked in this. #GenAI #LLMbenchmarks
Evaluating Your LLM? Here’s the Secret Sauce to Get it Right! 📊
Dive into the key metrics and methods that can help you assess and fine-tune your large language model, so it’s ready for the real world.
https://t.co/XSVxf4mxlH
#LLMs #LLMEvaluation #LLMBenchmarks
@emollick While the giant context window and video capabilities grab headlines, Gemini Pro 1.5's core model performance shouldn't be overlooked. Surpassing Ultra 1.0 and nearing GPT-4 is impressive. Eager to see how this translates to real-world applications! #LLMBenchmarks #AIInnovation
Last Seen Hashtags on Sotwe
Trends for you
Most Popular Users

Elon Musk 
@elonmusk
240.6M followers

Barack Obama 
@barackobama
119.2M followers

Donald J. Trump 
@realdonaldtrump
111.7M followers

Cristiano Ronaldo 
@cristiano
110.5M followers

Narendra Modi 
@narendramodi
107M followers

Rihanna 
@rihanna
97.6M followers

NASA 
@nasa
92.2M followers

Justin Bieber 
@justinbieber
90.9M followers

KATY PERRY 
@katyperry
87.6M followers

Taylor Swift 
@taylorswift13
81.4M followers

Lady Gaga 
@ladygaga
72.9M followers

Virat Kohli 
@imvkohli
69.8M followers

Kim Kardashian 
@kimkardashian
69.8M followers

YouTube 
@youtube
68.7M followers

Bill Gates 
@billgates
63.8M followers

Neymar Jr 
@neymarjr
62.5M followers

The Ellen Show
@theellenshow
62.4M followers

CNN 
@cnn
61.9M followers

X 
@x
60.8M followers

Selena Gomez 
@selenagomez
60.7M followers





