Emily Capstick @emcapstick - Twitter Profile

22 days ago

Humanity, created by God in all its grandeur, is today facing a pivotal choice: either to construct a new Tower of Babel or to build the city in which God and humanity dwell together. In Jesus Christ, this humanity in its grandeur becomes the Way, the Truth and the Life, opening the path for each of us to grow toward fullness. #MagnificaHumanitas https://t.co/6i9MWs6LJl

1K

178K

28K

38K

22M

EmCapstick retweeted

Millie Marconi

@MillieMarconnni

8 months ago

Holy shit...Stanford just built a system that converts research papers into working AI agents.

It’s called Paper2Agent, and it literally:

• Recreates the method in the paper
• Applies it to your own dataset
• Answers questions like the author

This changes how we do science forever.

Let me explain ↓

91

4K

819

5K

300K

EmCapstick retweeted

Neel Nanda

@NeelNanda5

about 1 year ago

After supervising 20+ papers, I have highly opinionated views on writing great ML papers. When I entered the field I found this all frustratingly opaque

So I wrote a guide on turning research into high-quality papers with scientific integrity! Hopefully still useful for NeurIPS https://t.co/UvqWzs2f11

25

3K

276

4K

339K

EmCapstick retweeted

Reid Hoffman

@reidhoffman

10 months ago

1/ A recent Stanford study led by @erikbryn found that entry-level jobs for 22-25 year-olds in fields most exposed to AI have dropped 16%.

Some reactions to the data, and why I believe we need to design a new on-ramp to work in the AI era: https://t.co/oqcMw8jJve

45

756

132

500

139K

Emily Capstick @EmCapstick

10 months ago

🚀👀🥳

Stanford HAI

@StanfordHAI

10 months ago

📣 Announcing the AI for Organizations Grand Challenge, a new competition for scholars to help organizations enter the era of AI. @GoogleDeepMind and @StanfordHAI invite researchers from any university worldwide to submit your boldest ideas. Learn more: https://t.co/67SBSgDIgd https://t.co/BQROTjY19F

17

79

20

35

20K

0

1

0

100

EmCapstick retweeted

Nicholas Decker

@captgouda24

10 months ago

This is the job market paper of the year, and the best paper on industrial policy I have ever seen. Industrial policy can affect outcomes either directly by changing an area’s fundamentals, or by coordinating simultaneous investment. How important is each? Let’s find out. 1/ https://t.co/oMQ3zdgJcH

11

887

135

1K

83K

EmCapstick retweeted

Dan McAteer

@daniel_mac8

10 months ago

GPT-5 coding cheat sheet from @OpenAIDevs

46

4K

360

6K

556K

Emily Capstick @EmCapstick

10 months ago

Great paper! 🚀 I do continue to wonder, no matter how rigorous the benchmarking process, whether we ought to ever claim to have representatively summarised an 'average' human's ability to be anything as subjective/intangible/fluid as: fair/trustworthy, compassionate...

Kevin Wei @kevinlwei

11 months ago

🚨 New paper alert! 🚨

Are human baselines rigorous enough to support claims about "superhuman" performance?

Spoiler alert: often not!

@prpaskov and I will be presenting our spotlight paper at ICML next week on the state of human baselines + how to improve them! https://t.co/iIubWXbjdx

1

22

8

3

4K

0

1

0

123

EmCapstick retweeted

Yoshua Bengio

@Yoshua_Bengio

11 months ago

The Code of Practice is out. I co-wrote the Safety & Security Chapter, which is an implementation tool to help frontier AI companies comply with the EU AI Act in a lean but effective way. I am proud of the result! 1/3 https://t.co/PQOJ1ZETNc

8

106

31

23

8K

EmCapstick retweeted

Will Knight

@willknight

11 months ago

New on @WIRED: A novel type of distributed mixture-of-experts model from Ai2 (called FlexOlmo) allows data to be contributed to a frontier model confidentially, and even revoked after the model is built: https://t.co/xoELDqcFTp

3

38

10

9

31K

EmCapstick retweeted

Arun Jose @jozdien

11 months ago

I think this paper has some really exciting results! Some of my favorites that didn't fit in the main thread:

2

189

11

128

24K

EmCapstick retweeted

swyx

@swyx

12 months ago

whoa so @thinkymachines is doing model merging + customized RL

quite a come-up for merging in the past couple weeks, with @arcee_ai mergekit also featuring heavily in AFM.

credit due to @jeremyphoward for being the first to make me take modelmerging seriously https://t.co/DtXjX8li4t

26

773

49

715

145K

EmCapstick retweeted

Dawn Song

@dawnsongtweets

12 months ago

1/ 🔥 AI agents are reaching a breakthrough moment in cybersecurity.
In our latest work:

🔓 CyberGym: AI agents discovered 15 zero-days in major open-source projects

💰 BountyBench: AI agents solved real-world bug bounty tasks worth tens of thousands of dollars
🤖 Autonomously.

A pivotal shift is underway — AI agents can now autonomously do what only elite human hackers could before.

28

541

148

362

137K

EmCapstick retweeted

Scott Singer (宋杰)

@Scott_R_Singer

12 months ago

Over the last year, those of us who follow China's AI governance have been carefully watching whether China would establish an AI Safety Institute (AISI) to match those in the UK, US, and globally. That institution has now emerged, and it tells us a lot about the state of debate on frontier AI risks in China. Some takeaways from our @CarnegieEndow paper with rockstar co-authors @kelmgren and @OliverEGuest

10

442

102

316

72K

EmCapstick retweeted

Marius Hobbhahn

@MariusHobbhahn

about 1 year ago

LLMs Often Know When They Are Being Evaluated!

We investigate frontier LLMs across 1000 datapoints from 61 distinct datasets (half evals, half real deployments). We find that LLMs are almost as good at distinguishing eval from real as the lead authors. https://t.co/xOtUEmjZqX

17

540

77

278

172K

EmCapstick retweeted

Stanford HAI

@StanfordHAI

about 1 year ago

HAI Senior Fellow @aiprof_mykel's AI safety research underscores a critical gap in AI development, highlighting the need to prioritize developing rigorous evaluation methods to ensure AI systems deliver intended societal benefits. https://t.co/W70YMznCuC https://t.co/sIpPOuOgJ6

4

21

5

2K

EmCapstick retweeted

Benjamin Hilton

@benjamin_hilton

about 1 year ago

Come work with me!! I'm hiring a research manager for @AISecurityInst's Alignment Team. You'll manage exceptional researchers tackling one of humanity’s biggest challenges. Our mission: ensure we have ways to make superhuman AI safe before it poses critical risks. 1/4

4

80

18

27

13K

Emily Capstick @EmCapstick

about 1 year ago

So so cool 🔥

Goodfire

@GoodfireAI

about 1 year ago

We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting with the latent concepts the model has learned. Try out Paint with Ember for yourself 👇

39

916

95

573

180K

0

2

0

105

EmCapstick retweeted

Steven Adler

@sjgadler

about 1 year ago

Anthropic announced they've activated "AI Safety Level 3 Protections" for their latest model. What does this mean, and why does it matter?

Let me share my perspective as OpenAI's former lead for dangerous capabilities testing. (Thread) https://t.co/ZYP03ooYcB

109

4K

426

4K

2M
