@deanwball In terms of 'Probabilities', it is almost certain that "some stuff will go wrong". I am not sure we have a single example of a technology where nothing went wrong.
Some may argue that his 'Impact' severity of the Risk is too high, but seems reasonable considering the power of ASI
@deanwball Some of the more mature organizations proactively develop defenses based on hypothetical threat modeling without waiting for an incident to happen
https://t.co/sw6Hfx6utu
So, vulnerable prototyping with no duty to security until incidents happen, which you will then iteratively patch in 'fixes' as an afterthought while increasing its risk until a government regulation steps in?
Black Swan events are not a "myth".
AI tech has new risks to manage.
@boneGPT@SkyeSharkie Is refusing violence, suicide, and leaps of faith (religious or accelerationist) to instead lucidly struggle against possible AI-driven extinction not an honest way to live that takes some courage?
@liron@DavidDeutschOxf I wonder if the discovery of these zero-days revealed a novel class of weakness that expanded the existing CWE Dictionary, or did it simply find a new vulnerability based on an already-known weakness? Nicholas Carlini's talk on it chaining exploits may not be "true new knowledge"
@boneGPT Keeping AI blind to time may be an important safety control to mitigate synchronized-coordination attacks and AI-based threats with chronological dependencies
Blocking access to clocks & even engineering randomized internal processing speeds of AI models may be needed to be safe
π‘IDEA: a site like wikipedia - info on many subjects and created/edited by volunteers
.
..BUT it uses the ACH framework for controversial topics - encouraging viewpoints to collaboratively collect and document evidence inside the ACH
In theory it could be fairer & less biased
@ESYudkowsky I am less familiar with your early work, but did it explore how to improve humanity in a safe, controlled, and reversible manner (assuming future technology made it feasible)?
@RokoMijic What if the AI promises her that it would make some scientific breakthroughs and build some kind of sustainable ecological megaproject that achieves all of her goals?
Unofficial fan art I designed in tribute to "IF ANYONE BUILDS IT, EVERYONE DIES" - published today! Currently reading the book and enjoying it.
(Not affiliated, endorsed, or sponsored.)
#FanArt
@danfaggella@AIHegemonyMemes A strangler fig supplants its host tree rather than succeeding it. The fig has its own flame of life, which carries on, but it is not our successor.
At first, the strangler fig appears to reinforce and strengthen the host tree of life, but ultimately causes its death.
@danfaggella What if we could define & measure exactly what that special flame of humanity is? Then, carefully add to humanity's potentia, while still remaining human?
Basically, use AGI/ASI to solve the Sorites paradox, and keep adding to human's heap of potentia forever?