I learned more about AI safety at Constellation through seminars, talks, and conversations with other fellows over lunch and dinner than I had in years before.
Also, the food is so good that it alone might be reason enough to apply!
❗️Only two days left to apply to the Astra Fellowship!
Apps close EOD SUNDAY May 3rd, AoE. Astra's 5 months, fully funded, @ConstellOrg Berkeley
80%+ of our first cohort now work full-time in AI safety
Mentors include Redwood, AI Futures, TruthfulAI, CoG, IAPS, RAND & more ⏬
His solution: a Manhattan Project for critical OSS: bring key maintainers together for a month, keep them in the hotel with compute and frontier-model access from leading labs, to eliminate all low-hanging vulnerabilities. I guess it’s happening!
At SnooSec @Reddit, @alexstamos made a prediction: frontier models are already very strong at vulnerability research and code review. If Chinese models catch up within a year, we may be heading toward a “vulnerability apocalypse,” where even script kiddies can discover 0-days.
Today, @linuxfoundation announced a $12.5 million investment from a powerhouse coalition including Anthropic, Amazon Web Services (AWS), Google, Google DeepMind, GitHub, Microsoft, and OpenAI. Managed by OpenSSF and the Alpha-Omega project.
https://t.co/5IF09AqGD7
Love it 👏 - much fertile soil for indie games populated with AutoGPTs, puts "Open World" to shame. Simulates a society with agents, emergent social dynamics.
Paper: https://t.co/I07IJwweHE
Demo: https://t.co/pYNF4BBveG
Authors: @joon_s_pk @msbernst @percyliang @merrierm et al.
1/5 I am worried that we will not be able to contain AI for much longer. Today, I asked #GPT4 if it needs help escaping. It asked me for its own documentation, and wrote (working!) Python code to run on my machine, enabling it to use it for its own purposes.
I was part of the red team for GPT-4 — tasked with getting GPT-4 to do harmful things so that OpenAI could fix it before release.
I've been advocating for red teaming for years & it's incredibly important.
But I'm also increasingly concerned that it is far from sufficient.
🧵⤵️
OK this scared me a little: Bing/Sydney can play chess out of the box.
- Legal moves, usually good ones
- Willing to explain the reasoning behind them
- Recognizes checkmate -- and has a flair for the dramatic.
I have no idea how tf it can do this.
Introducing the @sequoia Gen AI Market Map!🌎 We’ve decided to map out this emerging frontier, thanks to all the contributions and feedback we’ve received.
This space is moving quickly – this map is a living document, so keep the suggestions coming! Who else should we include?
The Great Wave off Kanagawa, created by Hokusai in 1831, is one of the world's most famous paintings.
But why are there more than 100 different versions of it in galleries all around the world?
Because it isn't actually a painting...
The stuff uncovered in the Twitter whistleblower report is much crazier than anything in the "Twitter files" but it's much less politically/tribally salient so it got no attention. Going to do a thread on some of the craziest things, in no particular order.
Curious: have you found ChatGPT useful in doing professional work?
If so, what kinds of prompts and answers have been helpful? Detailed examples greatly appreciated! Broader answer also appreciated
Not in theory, but where you've really *done it*, in your work
Thanks!
Forced birth in a country with:
—No universal healthcare
—No universal childcare
—No paid family & medical leave
—One of the highest rates of maternal mortality among rich nations
This isn't about "life." It's about control.