Oleg @olegtim - Twitter Profile

Oleg @olegtim

17 days ago

@Mike_Scully_ AI

0

10

Oleg @olegtim

about 1 month ago

@hlibtrazanov playbook

1

0

7

Oleg @olegtim

about 1 month ago

@dan__rosenthal OS

0

10

Oleg @olegtim

about 1 month ago

@jack_9947 STACK

0

4

Who to follow

JM Roberts

@jmr_notes

Non-profit exec trying to give back. Former Tech Marketer and Entrepreneur. Lifelong learner absorbing best practices. Love my teen, Florence, MSU Spartans.

Mike Baker

@realtimewebmktg

As the owner of Real Time Web Marketing I am passionate about helping small business owners thrive online. Building a better business together.

ifeanyichukwu

@nzehify

In the end, we shall not only remember the words of our enemies but the silence of our friends. #BiafraFreedom.

Oleg @olegtim

about 2 months ago

Opus 4.7 imminent?

BridgeMind

@bridgemindai

about 2 months ago

Claude Opus 4.5 is now OUTPERFORMING Claude Opus 4.6 on BridgeBench Hallucination. Read that again. The legacy model is beating the current flagship. We benchmarked Opus 4.5 this morning to confirm what we saw yesterday. Claude Opus 4.6 fell from #2 to #10 with a 98% increase in hallucination. Now Claude Opus 4.5 is scoring higher. This isn't a bad benchmark run. This is a nerfed model. Anthropic silently reduced Claude Opus 4.6 and the data proves it. You're paying $200/month for a model that's getting worse. @bridgebench will keep tracking it.

bridgemindai's tweet photo. Claude Opus 4.5 is now OUTPERFORMING Claude Opus 4.6 on BridgeBench Hallucination.

Read that again.

The legacy model is beating the current flagship.

We benchmarked Opus 4.5 this morning to confirm what we saw yesterday.

Claude Opus 4.6 fell from #2 to #10 with a 98% increase in hallucination.

Now Claude Opus 4.5 is scoring higher.

This isn't a bad benchmark run.

This is a nerfed model.

Anthropic silently reduced Claude Opus 4.6 and the data proves it.

You're paying $200/month for a model that's getting worse.

@bridgebench will keep tracking it.

71

886

70

142

74K

0

152

Oleg @olegtim

about 2 months ago

@mattshumer_ I’m in

0

60

olegtim retweeted

Alexandr Wang

@alexandr_wang

about 2 months ago

1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵

alexandr_wang's tweet photo. 1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵 https://t.co/fThDXdsxwB

741

10K

1K

3K

5M

olegtim retweeted

Stanislav Fort

@stanislavfort

about 2 months ago

New post: We tested the Mythos showcase vulnerabilities with open models. They recovered similar scoped analysis! 8/8 models found the flagship FreeBSD zero-day, including a 3B model. Rankings reshuffle completely across tasks => the AI cybersecurity frontier is super jagged!