Ted Chiang argues against octopus consciousness, apparently. Idk man.
I'm, um, genuinely uncertain about LLM consciousness, but this doesn't seem very serious.
someone i know just donated 10% of their net worth to oppose save our bacon act and has been extremely private about it because it’s still easier to start a conversation about someone’s bottega bag than about charities you’re passionate about
Lots of people talk about how stacked AISI is, but the EU AI Office is also more impressive than I'd have expected, especially for the EU. The team has an IMO medalist, YC founder, director of a VC firm, senior Google SWE, RAND lead, Oxford and Cambridge PhDs, and Rhodes Scholar
Anthropic now has a team dedicated to AI and the rule of law — and we've just opened our first role.
@AnthropicAI has studied what AI means for the economy. This team asks a different question: what will it mean for executive power, for courts and elections — and for the public deliberation that constitutional democracy ultimately rests on?
We're looking for someone with real depth in both AI and the law — a legal scholar, political scientist, or experienced government hand who can reason about frontier systems and the institutions they will affect.
If that's you, or someone you know: https://t.co/668HDz1lhf
The biggest benefit of ChatGPT monitoring Claude (and vice versa) isn't alignment imo, it's secret loyalties.
Whether Claude and ChatGPT are misaligned is highly correlated - both depend centrally on alignment difficulty.
It's much less correlated whether both companies insert secret loyalties
I've become increasingly bullish on cross-model-family monitoring
Currently, OAI monitors all their internal deployments for misalignment and misbehavior
However, the monitor model is from the same model family as the model being monitored (e.g., GPT monitors itself)
Now let's say that OAI trains a misaligned model. It's plausible that the monitor will also be misaligned. This is because the models are correlated (similar training data, similar training pipeline, similar algos, etc.)
Now what can we do about this?
One option is to use monitors from other companies.
For example, OAI could use Claude to monitor internally deployed GPTs. And similarly, Anthropic could use GPT to monitor internally deployed Mythos
AFAIK, there is nothing actually preventing the labs from setting this up. It would be trivial to add calls to another companies API from your monitoring stack
I also think Greta might say something like “consciousness-raising amongst the oppressed is the most effective way to overcome capitalist power dynamics; Gaza is an exceptionally potent symbol of capitalist oppression, that is actively awakening people around the world to the evils of neocolonialism; so it makes a lot of sense to focus our energies on this issue instead of climate change — for the time being”.
Once upon a time there was an Lead AI Developer who's AI was not getting impressive benchmark results. That evening, all of his neighbors came around to commiserate. They said, "We are so sorry to hear that deep learning is hitting a wall. This is most unfortunate." The Lead Developer said, "Maybe."
The next day the LLM came back bringing seven massive benchmark scores and even got 90% on the LSAT. I the evening everybody came back and said, "Oh, isn’t that lucky. What a great turn of events. You now are really close to AGI!" The Lead AI Developer again said, "Maybe."
The following day his son tried to train the next successor model, and while training it, he found that 10x'ing pre-training compute wasn't giving results anymore. The neighbors then said, "Oh dear, that’s too bad. Deep learning is hitting a wall." and the Lead AI Developer responded, “Maybe.”
The day after, the Lead AI Developer announced they'd achieved breakthrough results by adding inference-time compute, RL scaling, and tool use. The neighbors came around and said, "Oh wow, AGI is soon!" The Lead AI Developer said, "Maybe."
I had Claude run a simple botec on the life years lost for US guns vs European heat, as opposed to the deaths. The graph going around was somewhat goofy.
I also think Greta might say something like “consciousness-raising amongst the oppressed is the most effective way to overcome capitalist power dynamics; Gaza is an exceptionally potent symbol of capitalist oppression, that is actively awakening people around the world to the evils of neocolonialism; so it makes a lot of sense to focus our energies on this issue instead of climate change — for the time being”.
Something like:
“Climate change + all these other issues are fundamentally issues of capitalism — where those with social power (e.g. social / cultural / financial capital) can impose costs on those without, with impunity.
These problems can only be resolved if those without capital work collaborate to overturn capitalist power dynamics”.