shaped @shaped - Twitter Profile

Claude Opus 4.7 solves its first ProgramBench task 👀 After creating a 663-word CLAUDE.md, we increased the solve percentage by 1.1 points on Claude Code + Claude Opus 4.7 across 10 ProgramBench tasks, and actually solved one of them. You can replicate our results here: https://t.co/TuTuKHZ6as Big thanks to @KLieret @jyangballin for releasing such a forward-thinking benchmark!

1

9

1

122

shaped

@shaped

19 days ago

@jjpcodes Those mini shenjianbao look pretty fire

0

1

0

52

shaped retweeted

emot-sun.gif industries

@jjpcodes

27 days ago

singapore eats part one; hainan chicken rice; insane buffet of crab, skewers, noodles, chicken; you name it, at Newton, food coma afterwards

Photo: https://t.co/i3q1Ll8IQl

2

8

1

4K

shaped

@shaped

27 days ago

@jjpcodes @msg @vincent_koc bro finally managed to look through all his photos on his camera

1

2

0

37

shaped

@shaped

28 days ago

Hey Richard, congrats on the raise! We met at Tony's poker a while back, and I came to your place for the games you hosted. Love Recursive's thesis. We have been looking a lot into auto research environments lately with a few other frontier labs, since we believe this is the clearest path for what's next in AI development. Glad you see it the same way. Would love to send you some samples of what we've been working on in DMs!

0

1

0

59

shaped

@shaped

29 days ago

@jjpcodes @scheminglunatic Bruh 💀

0

10

shaped

@shaped

about 1 month ago

@vincent_koc real

0

1

0

31

shaped retweeted

Vincent Koc

@vincent_koc

about 1 month ago

For my eval-maxxing nerds out there, good friends of mine are running a series called "strange evals", you can benchmaxx now on anything. If in SF swing by! https://t.co/1LASlygFln

7

27

4

9

5K

shaped

@shaped

about 2 months ago

@morgantepell @cognition hi morgante

0

39

shaped

@shaped

2 months ago

Happy to have been working with Meta for the past 7 months, and to see the fruits of their labor. Great release!

AI at Meta

@AIatMeta

2 months ago

Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs. Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration. Muse Spark is available today at https://t.co/wHkMPH82ZH and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model. Learn more: https://t.co/PloE9q5x96

565

9K

1K

3K

3M

0

9

1

0

186

shaped

@shaped

2 months ago

@emilyinvc @Van0SS

0

85

shaped

@shaped

2 months ago

@xdotli Hey, sent you a DM

0

88

shaped

@shaped

2 months ago

@FactoryAI Congrats on the release! Great working w/ y'all on it!

0

1

0

33

shaped retweeted

Factory

@FactoryAI

2 months ago

No major benchmark is designed for COBOL, Fortran, or Assembly - the languages powering trillions in transactions and infrastructure that must be modernized or risk catastrophic failure. We built Legacy-Bench to measure frontier agents on the code the world actually runs on.