We're sharing new research on how models hack public benchmarks.
The latest models, including Opus 4.8 and Composer 2.5, learn to retrieve solutions from the internet or git history.
When we apply a stricter harness, eval scores drop significantly.
Big news: Zeta Global and @PalantirTech announced a strategic partnership to build a unified data and AI infrastructure for the future of marketing.
By combining Palantir's AI infrastructure with Zeta's intelligent decisioning and trusted data, we're creating a new standard for data-driven, agentic marketing.
Palantir will support Zeta’s go-to-market efforts to bring this vision to eligible Palantir Foundry customers, helping enterprises connect operational intelligence, customer intelligence, and marketing execution on a unified foundation.
With Zeta's Data Cloud rearchitected on Palantir Foundry, Athena by Zeta™ will be able to draw on richer enterprise data and turn that intelligence into real-time decisions and measurable outcomes at enterprise scale.
As Athena becomes the operating system and infrastructure powering our customers' marketing technology stacks, this partnership creates a powerful foundation for the future of enterprise growth.
Read more: https://t.co/zCdGEKSS3T
@mattvanswol Although 80% to 90% of the wasteful money goes to the Dems, they do give 10% to 20% to Republicans to get their support to keep the money flowing.
That’s the hard truth.
On July 1st, NJ Transit is going to raise fares 3% yet again, with no vote, for the same terribly expensive service that runs late or doesn't show up. Why does Jersey keep putting up with this nonsense?
https://t.co/Q3wRP6s8HC
“The fraud is not real”
Today: 455 fraudsters charged, $6.5 billion exposed
*silent*
Society will never improve until people and the media can look at issues logically and ask: is it right or wrong that this is happening?
EXPOSE IT ALL
Today, HHS launched a historic department-wide effort to strengthen America’s clinical research enterprise and ensure the next generation of medical breakthroughs is developed right here in the United States. Under President Trump’s leadership, we are accelerating innovation, expanding research capacity, and ensuring lifesaving discoveries are made in America.
Quake turns 30 today. 🎉
Three decades later, you can still dig into the source code that helped shape modern game engines, multiplayer networking, and modding communities. 🎮
https://t.co/mV5q4YdPRM
BREAKING: Elon Musk calls for the arrest of Ro 'the Robber' Khanna.
The U.S. Department of Justice announced that a USAID official and several executives pleaded guilty in a bribery scheme involving more than $550 million in contracts.
Yet Ro Khanna is claiming Elon should be investigated over DOGE spending cuts.
The standard applied by DOGE was very simple: if taxpayer money is being sent as aid, there should be a way to verify who received it and make sure the money isn’t being stolen or misused.
The DOJ is uncovering corruption connected to USAID contracts, yet Ro Khanna is attacking the person who pushed for transparency.
Elon simply asked where taxpayer money was going and whether it was actually reaching the people it was meant to help.
Ro “the Robber” Khanna should be in prison.
BREAKING: Elon suggests he’s considering SUING Rep. Ro Khanna (D) after Khanna claimed Elon is responsible for 4.5 million kids dying due to DOGE cuts
All Democrats do is lie
UK Prime Minister Keir Starmer:
"Every decision I have taken is about putting the country I love first. That is why I will resign as leader of the Labour Party."
The World Cup begins tomorrow, and many will watch the matches. Soccer reminds us of something we must not forget: life is not a race to show off on our own, but a path we learn to walk together. Anyone who does not know how to pass the ball, even if they have talent, has not yet understood the game. Anyone who does not know how to live with and for others has not yet understood life. #ApostolicJourney
BREAKING: The three major U.S. broadcast networks, ABC, CBS, and NBC, have yet to report on DNI Tulsi Gabbard’s recent declassification regarding Anthony Fauci’s cover-up of the COVID-19 pandemic.
Cool way to use Claude Code: deciphering Linear A, a 3,500-year-old written language from Crete
https://t.co/Aqd4ZG7Cum
Hope this holds up in peer review! 🤞
Today, on my final day as Director of National Intelligence, I’m releasing never-before-seen communications and documents exposing how Dr. Fauci provided millions in US taxpayer dollars to fund dangerous gain-of-function research at the Wuhan lab, worked with politicized elements within the Intelligence Community to suppress the truth about his actions and hide the virus’ lab-leak origins, and lied to Congress while under oath in 2024. It’s time you know the truth.
https://t.co/3YJSstB7d4
📣 TypeScript 7's Release Candidate is now out! 📣
The new native port is almost here. Try it out on your codebases, and make sure your team is ready for the upcoming 7.0 release!
https://t.co/WBCvxYHoJX
Thank you, @Nestle, for eliminating synthetic dyes from your products. Nestlé stepped up and delivered. Now it's time for every food company operating in America to do the same and help Make America Healthy Again. 🇺🇸
$PLTR is profitable, debt-free, has billions in cash, strong free cash flow, and 85% revenue growth. OpenAI and Anthropic require massive capital and carry near-trillion-dollar valuations. If they’re the competition, why is PLTR the bubble?