@prz_chojecki Hoping to see you benchmark Sakana on math stuff. Curious to see how it turns out. While it probably won't beat frontier reasoning, maybe it can beat frontier at optimization (repeated iteration) and long computations of novel constructions?
@WeekendInvestng Inflation peddling in the age of AI, which is a deflationary tech. If things work out, we should see the price of goods and services fall, not rise!
@alexwg If we extrapolate the complexity theory paper's premise to computational biology and materials, we get some very interesting predictions.
Primarily that there is no general polynomial-cost compiler for arbitrary phenotypes.
@rand_longevity I think they will probably figure out how to fix the jailbreak and re-release it, and then someone else will obv jailbreak it again.
The whole thing reminds me of Bane from Batman. "Theatricality and deception are powerful agents"
@AcerFur @VictorTaelin I don't think gatekeeping intelligence is a good thing. The way to defend against nefarious actors is to employ that same intelligence at scale, not to gatekeep it.
@prz_chojecki Dude I got cut out in the middle of a proof attempt. Now trying to see it through w 5.5 pro 🥲
I got one super interesting analytic NT result with it tho...polishing and reviewing it rn.
(Btw pls check your dms!)
Releasing when? Boy, am I excited for Ulam-1. Rooting for you guys to be releasing the first small LLM capable of doing genuine math research!
(I have so many questions. Is it gonna be OS? Are you guys gonna release different models for different fields like combinatorics, analytic number theory, etc.?)
@prz_chojecki Also it seems, unlike GPT Pro, fable can actually make meaningful progress even after 3-4 prompts. Doesn't tend to do that thing that GPT does.