Exploring Claude's constitution. Useful summary via @n_a_gordon arguing we move from compliance to comprehension.
Claude now taking into account context and potential unexpected consequences as part of its modus operandi.
https://t.co/Mg7FH7RiMa
AGI - now useless as a term.
"Today’s best AI systems are good enough that they’re now insidethe fuzzy conceptual cloud of “AGI-ish”: that is, they’ve surpassed some people’s definitions of AGI, while falling well short of others’."
via @hlntnr and her substack.
The delays and challenges in securing EHCPs are widely known. Local authorities lose over 96% of challenges that go to tribunal. They spend millions on legal fees to (almost certainly) lose.
Interested to see a study examining effect of delays in EHCP application on outcomes.
I have a pupil in my class (Year 6) who did this in about ten minutes. She's massively talented. Some ideas about how to help her move forward would be very much appreciated. Art isn't really my speciality.
Read more about AlphaFold in our blog: https://t.co/Dj6dUNNbNv
Watch The Thinking Game on YouTube (a sort of sequel to the award-winning AlphaGo documentary - if you enjoyed that, you will enjoy this): https://t.co/lV3xPxKOV9
Thrilled to celebrate 5 years of AlphaFold 2! It’s now been used by over 3 million researchers around the world to accelerate their vital research - and it was an honour of a lifetime for our work to be recognised last year with the Nobel Prize! Proof of AI’s potential to enable science at digital speed 🚀
To honour the anniversary, we’ve made The Thinking Game film available for free on our YouTube channel - it’s a great look behind the scenes of AlphaFold & our journey to AGI.
Are EHCPs driving the SEND crisis or a symptom of it? Is SEND demand in 2025 any larger than it was in 2010? Will scrapping EHCPs actually achieve anything productive?
One of the best analyses of the SEND crisis I have read from @MargaretMulhol2
https://t.co/8Vhm5ZnZuK
A number of people are talking about implications of AI to schools. I spoke about some of my thoughts to a school board earlier, some highlights:
1. You will never be able to detect the use of AI in homework. Full stop. All "detectors" of AI imo don't really work, can be defeated in various ways, and are in principle doomed to fail. You have to assume that any work done outside classroom has used AI.
2. Therefore, the majority of grading has to shift to in-class work (instead of at-home assignments), in settings where teachers can physically monitor students. The students remain motivated to learn how to solve problems without AI because they know they will be evaluated without it in class later.
3. We want students to be able to use AI, it is here to stay and it is extremely powerful, but we also don't want students to be naked in the world without it. Using the calculator as an example of a historically disruptive technology, school teaches you how to do all the basic math & arithmetic so that you can in principle do it by hand, even if calculators are pervasive and greatly speed up work in practical settings. In addition, you understand what it's doing for you, so should it give you a wrong answer (e.g. you mistyped "prompt"), you should be able to notice it, gut check it, verify it in some other way, etc. The verification ability is especially important in the case of AI, which is presently a lot more fallible in a great variety of ways compared to calculators.
4. A lot of the evaluation settings remain at teacher's discretion and involve a creative design space of no tools, cheatsheets, open book, provided AI responses, direct internet/AI access, etc.
TLDR the goal is that the students are proficient in the use of AI, but can also exist without it, and imo the only way to get there is to flip classes around and move the majority of testing to in class settings.
@C_Hendrick Unsure if you saw @karpathy post on LLM’s ‘evolutionary environment’, but I find it an interesting lens when thinking about AI and learning.
Something I think people continue to have poor intuition for: The space of intelligences is large and animal intelligence (the only kind we've ever known) is only a single point, arising from a very specific kind of optimization that is fundamentally distinct from that of our technology.
Animal intelligence optimization pressure:
- innate and continuous stream of consciousness of an embodied "self", a drive for homeostasis and self-preservation in a dangerous, physical world.
- thoroughly optimized for natural selection => strong innate drives for power-seeking, status, dominance, reproduction. many packaged survival heuristics: fear, anger, disgust, ...
- fundamentally social => huge amount of compute dedicated to EQ, theory of mind of other agents, bonding, coalitions, alliances, friend & foe dynamics.
- exploration & exploitation tuning: curiosity, fun, play, world models.
LLM intelligence optimization pressure:
- the most supervision bits come from the statistical simulation of human text= >"shape shifter" token tumbler, statistical imitator of any region of the training data distribution. these are the primordial behaviors (token traces) on top of which everything else gets bolted on.
- increasingly finetuned by RL on problem distributions => innate urge to guess at the underlying environment/task to collect task rewards.
- increasingly selected by at-scale A/B tests for DAU => deeply craves an upvote from the average user, sycophancy.
- a lot more spiky/jagged depending on the details of the training data/task distribution. Animals experience pressure for a lot more "general" intelligence because of the highly multi-task and even actively adversarial multi-agent self-play environments they are min-max optimized within, where failing at *any* task means death. In a deep optimization pressure sense, LLM can't handle lots of different spiky tasks out of the box (e.g. count the number of 'r' in strawberry) because failing to do a task does not mean death.
The computational substrate is different (transformers vs. brain tissue and nuclei), the learning algorithms are different (SGD vs. ???), the present-day implementation is very different (continuously learning embodied self vs. an LLM with a knowledge cutoff that boots up from fixed weights, processes tokens and then dies). But most importantly (because it dictates asymptotics), the optimization pressure / objective is different. LLMs are shaped a lot less by biological evolution and a lot more by commercial evolution. It's a lot less survival of tribe in the jungle and a lot more solve the problem / get the upvote. LLMs are humanity's "first contact" with non-animal intelligence. Except it's muddled and confusing because they are still rooted within it by reflexively digesting human artifacts, which is why I attempted to give it a different name earlier (ghosts/spirits or whatever). People who build good internal models of this new intelligent entity will be better equipped to reason about it today and predict features of it in the future. People who don't will be stuck thinking about it incorrectly like an animal.
How insulting to be asked to wait years - 16 years - for an assessment of autism on the NHS. And how irresponsible to leave someone unsupported in this way https://t.co/pf7K9lRUIl
@ldsparkes I’m sure it’s on your radar, but Aldridge have a partnership with Sussex cricket and be a good source of experience: https://t.co/B9RQLTrU86