Alex Bowe @alexbowe - Twitter Profile

alexbowe retweeted

0xkato

@0xkato

5 days ago

LLMs explained without all that yucky math stuff https://t.co/cpgVukSbJx

24

2K

169

5K

526K

alexbowe retweeted

Noam Brown

@polynoamial

11 days ago

After AlphaGo, the skill of human Go players noticeably improved. I suspect we will see a similar pattern in math.

186

9K

974

2K

777K

alexbowe retweeted

Nick Lindquist

@nick_lindquist

24 days ago

Central Park is great, but it takes up a lot of space and isn’t utilized to its full potential. That’s why I worked with McKinsey on a plan to make it a state of the art data center, complemented by rooftop parking and nuclear power. We can still build beautiful things.

nick_lindquist's tweet photo. Central Park is great, but it takes up a lot of space and isn’t utilized to its full potential.

That’s why I worked with McKinsey on a plan to make it a state of the art data center, complemented by rooftop parking and nuclear power.

We can still build beautiful things. https://t.co/MemRKeVMsK

1K

9K

1K

289

514K

alexbowe retweeted

priyanshu.sol

@priyanshudotsol

about 1 month ago

someone wrote a 680 page interactive book on cs algorithms

99

16K

2K

17K

964K

Who to follow

ASHG

@GeneticsSociety

The American Society of Human Genetics (ASHG) is the world's largest professional membership organization for human genetics & genomics specialists.

ナカモトダイスケ

@nakamotoskywalk

YouTube登録者数147万人【ナカモトフウフ】仕事依頼は下記メールへ。 [email protected]

Erik Garrison

@erikgarrison

(pan)genomes from many points of view. Assistant Professor UTHSC Memphis

Alex Bowe

@alexbowe

about 2 months ago

@weezerOSINT @_der_blAde_ I just started learning about BYOVD etc this week and find it fascinating - would love to know some of your favourite resources.

1

2

0

495

alexbowe retweeted

ksa 🏴‍☠️

@kosa12m

about 2 months ago

Best paper I've read so far this month: All elementary functions (sin, cos, tan, exp, log, powers, roots, hyperbolic functions, π, e, and even basic arithmetic) can be generated from just one binary operator: eml(x, y) = exp(x) − ln(y) …plus the constant 1.

kosa12m's tweet photo. Best paper I've read so far this month:

All elementary functions (sin, cos, tan, exp, log, powers, roots, hyperbolic functions, π, e, and even basic arithmetic) can be generated from just one binary operator:
eml(x, y) = exp(x) − ln(y)
…plus the constant 1. https://t.co/SY7i73pN2q

91

11K

2K

7K

1M

alexbowe retweeted

shira

@shiraeis

about 2 months ago

Found a paper that suggests we may have spent years training agents to become hunters of proxy reward when the more basic thing intelligence craves is not a reward at all, but to not run out of viable futures. The paper proposes that behavior is best understood as maximizing future action-state path occupancy, which collapses mathematically into a discounted entropy objective. The agent doesn’t necessarily want to GET something, but rather is trying to keep as many meaningful trajectories alive as possible. The obvious objection is “so it just does random shit? fuck around and find out?” No, this is where it gets pretty beautiful. The agent is variable when variation is cheap and becomes surgically goal-oriented the moment an absorbing state (death, starvation, falling over, etc) gets close enough to threaten its future path space. Variability is the same drive as goal-directedness, just operating under different constraints. The demos are kinda wild: - A cartpole (classic move a cart to keep a pole from falling control task) that doesn’t merely balance but dances and swings through a huge range of angles and positions because why not? The whole point is occupying state space, and rigid balance is a voluntarily impoverished life. - A prey-predator gridworld where the mouse PLAYS with the cat, teasing it and using both clockwise and counterclockwise routes around obstacles to lure it away from the food source before slipping in to eat, using both routes roughly equally. A reward-maximizing agent would collapse to one strategy and exploit it. Here, the agent keeps its behavioral repertoire - A quadruped trained with Soft Actor-Critic and ZERO external reward that learns to walk, jump, spin, and stabilize, and then makes a beeline for food only when its internal energy drops low enough that starvation becomes a real threat The thing that hit me hardest is the comparison to empowerment and free energy principle agents. Both collapse to near-deterministic policies with almost no behavioral variability. This paper’s agents find the highest-empowerment state and exploit it. FEP agents converge to classical reward maximizers. As far as I’m aware, this is the only framework that produces agents you could describe as being “alive.” The AI implication here is that we undertrain for behavioral repertoire. Most systems hit the benchmark by collapsing onto a narrow attractor basin of good-enough trajectories. They’re competent for sure, but brittle too, with one viable plan, executed until the world shifts and leaves them with nothing. The thing I increasingly want from agents isn’t competence per se, but option-preserving competence. I want agents with the ability to keep multiple viable plans alive and switch between them without catastrophe. We’ve been so focused on teaching agents what to want that we never stopped to ask what happens if wanting isn’t the point, if the deepest drive isn’t necessarily toward anything, but away from the walls closing in. paper: https://t.co/Kn3mllmmPK

shiraeis's tweet photo. Found a paper that suggests we may have spent years training agents to become hunters of proxy reward when the more basic thing intelligence craves is not a reward at all, but to not run out of viable futures.

The paper proposes that behavior is best understood as maximizing future action-state path occupancy, which collapses mathematically into a discounted entropy objective. The agent doesn’t necessarily want to GET something, but rather is trying to keep as many meaningful trajectories alive as possible.

The obvious objection is “so it just does random shit? fuck around and find out?”

No, this is where it gets pretty beautiful. The agent is variable when variation is cheap and becomes surgically goal-oriented the moment an absorbing state (death, starvation, falling over, etc) gets close enough to threaten its future path space.

Variability is the same drive as goal-directedness, just operating under different constraints.

The demos are kinda wild:

- A cartpole (classic move a cart to keep a pole from falling control task) that doesn’t merely balance but dances and swings through a huge range of angles and positions because why not? The whole point is occupying state space, and rigid balance is a voluntarily impoverished life.

- A prey-predator gridworld where the mouse PLAYS with the cat, teasing it and using both clockwise and counterclockwise routes around obstacles to lure it away from the food source before slipping in to eat, using both routes roughly equally. A reward-maximizing agent would collapse to one strategy and exploit it. Here, the agent keeps its behavioral repertoire

- A quadruped trained with Soft Actor-Critic and ZERO external reward that learns to walk, jump, spin, and stabilize, and then makes a beeline for food only when its internal energy drops low enough that starvation becomes a real threat

The thing that hit me hardest is the comparison to empowerment and free energy principle agents. Both collapse to near-deterministic policies with almost no behavioral variability. This paper’s agents find the highest-empowerment state and exploit it. FEP agents converge to classical reward maximizers.

As far as I’m aware, this is the only framework that produces agents you could describe as being “alive.”

The AI implication here is that we undertrain for behavioral repertoire. Most systems hit the benchmark by collapsing onto a narrow attractor basin of good-enough trajectories. They’re competent for sure, but brittle too, with one viable plan, executed until the world shifts and leaves them with nothing.

The thing I increasingly want from agents isn’t competence per se, but option-preserving competence.

I want agents with the ability to keep multiple viable plans alive and switch between them without catastrophe.

We’ve been so focused on teaching agents what to want that we never stopped to ask what happens if wanting isn’t the point, if the deepest drive isn’t necessarily toward anything, but away from the walls closing in.

paper: https://t.co/Kn3mllmmPK

76

1K

131

985

71K

alexbowe retweeted

Mason Wang

@masonwang025

2 months ago

new post tomorrow!

5

886

67

624

47K

Alex Bowe

@alexbowe

3 months ago

Dumping ROMs using a microscope…

azya @OneBitOnePixel

3 months ago

McDonald's Chicken Nugget Tetris (2023) has now been dumped and emulated. This is a significant achievement for me, as the game's microcontroller is quite modern, and tackling something like this was beyond my reach until now.

77

10K

1K

2K

275K

0

1

0

1

237

Alex Bowe

@alexbowe

3 months ago

@OneBitOnePixel @LowMax You dump ROMs by taking photos of them? That is the coolest thing I’ve heard all day.

1

11

0

1

317

Alex Bowe

@alexbowe

5 months ago

@0thernet It’s a privilege to have annoyed users, especially ones who give you feedback

0

1

0

87

Alex Bowe

@alexbowe

7 months ago

@0thernet The Siri one is so true. I’m textin this mf like it’s my personal assistant.

1

2

0

143

Alex Bowe

@alexbowe

7 months ago

@0thernet @zocomputer Let's fucking zo

1

12

0

2K

Alex Bowe

@alexbowe

8 months ago

@OpenAI Please give it a Developer Console tool to automate reverse engineering REST APIs.

0

6K

Alex Bowe

@alexbowe

8 months ago

@evanjconrad You could probably just schedule tweet them at this point

0

45

Alex Bowe

@alexbowe

9 months ago

@0thernet @zocomputer Zo far Zo good!

0

2

0

118

Alex Bowe

@alexbowe

10 months ago

@crislenta @TheDeFiSalesman @zach_yadegari @naval What did he say?

0

1

38

Alex Bowe

@alexbowe

10 months ago

@MisalignedModel @vitransformer @AmanGokrani Those are handy, but not all prompts and metadata are logged in the official logs. It’s better to use a proxy or monkey-patch the API calls (I’ve been using claude-trace by @badlogicgames)

1

0

60

Alex Bowe

@alexbowe

10 months ago

@swyx Count me in! How do I sign up?

1

0

127

alexbowe retweeted

Deedy

@deedydas

10 months ago

Huge computer science result: A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs. This improves on Turing award winner Tarjan’s O(m + nlogn) with Dijkstra’s, something every Computer Science student learns in college.

deedydas's tweet photo. Huge computer science result:

A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs.

This improves on Turing award winner Tarjan’s O(m + nlogn) with Dijkstra’s, something every Computer Science student learns in college. https://t.co/a1Lfa4DyBw

238

22K

2K

12K

2M

Alex Bowe

@alexbowe

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users