Jack Orlik

@jackorlik

Helping governments anticipate & shape change @OPSIgov | ex. @nesta_uk anthropology advocate tweeting about #innovation | Views my own & not those of @OECD

London / Paris

Joined March 2009

1.6K Following

826 Followers

1.8K Posts

Jack Orlik @jackorlik

over 1 year ago

Not good news.

Nature Human Behaviour @NatureHumBehav

over 1 year ago

.@Moshe_Glickman and @affectivebrain reveal a human-AI feedback loop, where AI amplifies subtle human biases, which are then further internalized by humans. This cycle increases human bias over time across domains. https://t.co/hyGntC4sNt

Jack Orlik @jackorlik

over 1 year ago

@GandonAmy Interesting points @GandonAmy - at @OPSIgov we've been exploring how strategic foresight/anticipatory governance can help address 1&3, learning from Finland and 14 other countries. Would be happy to chat!

135

jackorlik retweeted

Séb Krier

@sebkrier

over 1 year ago

😈 A bit unsure about this but: one early rationale for dangerous capability evals was to demonstrate that models could in principle do risky things, at a time where the vibe du jour was 'models can't do X/Y/Z'. Gradually the fact that capabilities are continuously improving is becoming more accepted, and "I bet this stops here" is a bet fewer people make. So what does this mean for DC testing? It doesn't go away: it's still a fundamentally necessary R&D function to better understand models and try to establish some initial idea of 'how badly could this be misused if one tried'. This is good and I'm glad this norm is being set. However I think there will be more work now to try and work out how exactly these capabilities might fit the 'supply chain' of harmful misuse/accident scenarios - i.e. a growing focus on the deployment side. For example if you're concerned about persuasion, you might then focus on "what if an AI therapist was system prompted to maintain engagement as long as possible?". This is because focusing on an in vitro capability eval result alone is unlikely in most cases to be persuasive enough warrant any action. Not many people will like this, because most people are attracted to 'the model' like moths to fire. Governance folks in particular love finding general principles that apply across the board, 'regulating AI', and staying at this level of abstraction; few want to actually develop domain expertise in finance or healthcare or cybersecurity. Some people will tell you about the 2010 flash crash but can't tell you anything else about how AI is used in finance. This is problematic because you end up having many generalists theorise domain risks from first principles. If the concern is alignment, then this is less clear - depending on how you understand the meaning of 'alignment' then you might focus on different parts of the value chain. If for a while you've thought 'models will be naturally power seeking' then you'll be inclined to focus on the model, which is where you'll want to find evidence of scheming. This seems good - different post-training approaches might cause unexpected model proclivities. If you think about alignment from a wider societal lens, you might focus more on the values used to align models, personalisation, and multi-agent dynamics. Over time I would expect the former to look more and more like machine psychology (https://t.co/mrMvLRFb66) and the latter like multi-agent simulations (https://t.co/PUCEfeYog3). Still very early days of designing minds, and co-existing with them!

sebkrier's tweet photo. 😈 A bit unsure about this but: one early rationale for dangerous capability evals was to demonstrate that models could in principle do risky things, at a time where the vibe du jour was 'models can't do X/Y/Z'. Gradually the fact that capabilities are continuously improving is becoming more accepted, and "I bet this stops here" is a bet fewer people make. So what does this mean for DC testing?

It doesn't go away: it's still a fundamentally necessary R&D function to better understand models and try to establish some initial idea of 'how badly could this be misused if one tried'. This is good and I'm glad this norm is being set. However I think there will be more work now to try and work out how exactly these capabilities might fit the 'supply chain' of harmful misuse/accident scenarios - i.e. a growing focus on the deployment side. For example if you're concerned about persuasion, you might then focus on "what if an AI therapist was system prompted to maintain engagement as long as possible?". This is because focusing on an in vitro capability eval result alone is unlikely in most cases to be persuasive enough warrant any action.

Not many people will like this, because most people are attracted to 'the model' like moths to fire. Governance folks in particular love finding general principles that apply across the board, 'regulating AI', and staying at this level of abstraction; few want to actually develop domain expertise in finance or healthcare or cybersecurity. Some people will tell you about the 2010 flash crash but can't tell you anything else about how AI is used in finance. This is problematic because you end up having many generalists theorise domain risks from first principles.

If the concern is alignment, then this is less clear - depending on how you understand the meaning of 'alignment' then you might focus on different parts of the value chain. If for a while you've thought 'models will be naturally power seeking' then you'll be inclined to focus on the model, which is where you'll want to find evidence of scheming. This seems good - different post-training approaches might cause unexpected model proclivities. If you think about alignment from a wider societal lens, you might focus more on the values used to align models, personalisation, and multi-agent dynamics.

Over time I would expect the former to look more and more like machine psychology (https://t.co/mrMvLRFb66) and the latter like multi-agent simulations (https://t.co/PUCEfeYog3). Still very early days of designing minds, and co-existing with them!

Jack Orlik @jackorlik

over 1 year ago

@mattsclancy Great idea to try this yourself. Did you collect any info on (self-professed) level of expertise? I'd love to see how literature grads fare!

107

Who to follow

Ramsey Alwin 🦞

@Ramsey_Alwin

CEO of @NCOAging. Maximizing the mission of NCOA and working to ensure aging well for all.

Learning and Work Institute

@LearnWorkUK

We are an independent policy and research organisation focused on lifelong learning and better work. Find us on LinkedIn and at https://t.co/l7yZuMf95G.

Louise Marston

@louisemarston

Director of Ventures @resfoundation Formerly @doteveryone @nesta_uk & @blenheimchalcot. Technology, innovation and fairness. @louisemarston.bsky.social

Jack Orlik @jackorlik

over 1 year ago

Thought provoking piece from @stevenbjohnson on the potential of long-context AI models to improve institutional memory and collective intelligence. https://t.co/NDKAdZau3G

jackorlik's tweet photo. Thought provoking piece from @stevenbjohnson on the potential of long-context AI models to improve institutional memory and collective intelligence. https://t.co/NDKAdZau3G https://t.co/u2WdxJQ2lo

Jack Orlik @jackorlik

over 1 year ago

I've been getting Claude and ChatGPT to roast themselves to illustrate the features & limitations of LLMs. Just as I got to illustrating guardrails, I asked them to roast each other, and the results were *chef's kiss*.

jackorlik's tweet photo. I've been getting Claude and ChatGPT to roast themselves to illustrate the features & limitations of LLMs. Just as I got to illustrating guardrails, I asked them to roast each other, and the results were *chef's kiss*. https://t.co/4Pe5hsJp31

100

jackorlik retweeted

Frank Nagle @frank_nagle

over 1 year ago

New paper alert: We (w/ @ichbinsnicht, @samjboysel, Sida Peng, Kevin Xu) just released the working paper version of our study “Generative AI and the Nature of Work” - https://t.co/wgFFbD8k1e. We seek to build upon research on #AI and productivity to better understand how #GenAI

jackorlik retweeted

Paul Nightingale @Nightingale_P

over 1 year ago

2. What is says is that experience leads people like me who do evaluations to expect that the net impact of interventions on what we hope to change is going to be zero. It happens so often that we just assume we won't find an impact. Its as brutal as that.

692

jackorlik retweeted

Collective Intelligence @Collective_CI

over 1 year ago

A UK parliamentary committee is interested in the best examples worldwide of parliaments using collective intelligence methods to crowdsource facts, ideas, evidence, experiences and questions. All examples welcome!

577

jackorlik retweeted

Haydn Belfield @HaydnBelfield

over 1 year ago

What if we took AI companies plans, investments & ambitions seriously? What should the UK Government do to prepare for "intelligence too cheap to meter"? THREAD🧵

HaydnBelfield's tweet photo. What if we took AI companies plans, investments & ambitions seriously? What should the UK Government do to prepare for "intelligence too cheap to meter"?

THREAD🧵 https://t.co/9gYmllVom9

29K

Jack Orlik @jackorlik

over 1 year ago

"To get organizational gains [from AI] you are largely going to have to do the R&D yourself" Excellent guidance from @emollick to discover and scale valuable use cases for GenerativeAI. https://t.co/XcqpB64n1G

Jack Orlik @jackorlik

over 1 year ago

5/5 This @EU_Commission funded project, implemented by @OPSIgov, has transformed how these governments anticipate change through pilots, workshops, training and peer learning from countries like Finland and Wales. Here's to more proactive governance across Europe & beyond!

Jack Orlik @jackorlik

over 1 year ago

1/5 Inspiring to hear outcomes from Project LIMinal presented by country partners at @OECD Gov Foresight Community! 🌟 Malta, Lithuania & Italy have made significant strides in strategic foresight and anticipatory governance. Let's dive into the concrete impacts:

jackorlik's tweet photo. 1/5 Inspiring to hear outcomes from Project LIMinal presented by country partners at @OECD Gov Foresight Community! 🌟 Malta, Lithuania & Italy have made significant strides in strategic foresight and anticipatory governance. Let's dive into the concrete impacts: https://t.co/DobxL8cnTV

Jack Orlik @jackorlik

over 1 year ago

4/5 🇮🇹 Italy: - Developed new foresight training initiatives - Identified support needs for Ministries and Public Administration - Built network of stakeholders advancing strategic foresight

jackorlik retweeted

Brian Merchant @bcmerchant

over 1 year ago

Amid all the chaos at OpenAI I hope this paper in Nature gets some attention. It finds that larger language models become *less* reliable for a variety of reasons—for one, where earlier models avoided questions they couldn't answer, newer ones are more likely to make something up

834

290

208

60K

Jack Orlik @jackorlik

over 1 year ago

Link: https://t.co/LRagW8RSt4

Jack Orlik @jackorlik

over 1 year ago

'What do you mean, "It's complex"?' New diagram from UK MoD 'Global Strategic Trends: Out to 2055' shows interconnections of future trends. #futurespaghetti

jackorlik's tweet photo. 'What do you mean, "It's complex"?'

New diagram from UK MoD 'Global Strategic Trends: Out to 2055' shows interconnections of future trends. #futurespaghetti https://t.co/sGS42ACYQW

196

jackorlik retweeted

Ethan Mollick

@emollick

over 1 year ago

This paper offers insight into the implications of the “jagged frontier” of AI abilities that we found in our studies of AI use in companies As models get better, they don’t improve on all skills equally, but bigger models are more opaque to users about what they do well & badly

emollick's tweet photo. This paper offers insight into the implications of the “jagged frontier” of AI abilities that we found in our studies of AI use in companies

As models get better, they don’t improve on all skills equally, but bigger models are more opaque to users about what they do well & badly https://t.co/1GCmKOOfCA

340

208

63K

Jack Orlik

@jackorlik

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users