Sobhan Mohammadpour @___sobhan___ - Twitter Profile

8 days ago

Simple deep RL was thought to fail in imperfect-info games like poker. A new ICLR 2026 paper shows that with proper tuning, generic methods like PPO match or beat specialized approaches like fictitious play and counterfactual regret minimization. https://t.co/RvkZ6XxR6J 1/2

1

10

5

1K

___sobhan___ retweeted

Co-op Tory 🍁

@CoopTory

5 months ago

It is becoming increasingly apparent that Carney is a modern Castlereagh.

27

683

27

173

94K

Sobhan Mohammadpour @___sobhan___

6 months ago

@msharmavikram It was really nice that the guide was a single html page.

0

3

0

948

Sobhan Mohammadpour @___sobhan___

9 months ago

Never a dull moment

0

1

0

79

Sobhan Mohammadpour @___sobhan___

11 months ago

Another day, another

0

2

0

86

___sobhan___ retweeted

Alexey Zaytsev @xl0xl0xl0

12 months ago

TIL, this is actually valid Python.

0

2

1

0

202

___sobhan___ retweeted

Chad Scherrer @ChadScherrer

over 4 years ago

UNICODE NEEDS BETTER SUBSCRIPT/SUPERSCRIPT COVERAGE https://t.co/TuE31994Fp Easy ways to help make this happen: • Retweet this • Star the repo • Show it to others

ChadScherrer's tweet photo. UNICODE NEEDS BETTER SUBSCRIPT/SUPERSCRIPT COVERAGE

https://t.co/TuE31994Fp

Easy ways to help make this happen:
• Retweet this
• Star the repo
• Show it to others https://t.co/y49edEjNod

3

49

30

3

0

Sobhan Mohammadpour @___sobhan___

about 1 year ago

@qubitium @BayesWatch & co had a blog post on the Julia blog about something somewhat similar https://t.co/YmYKRpEWWf They observed between 30x to 5x improvements

0

1

0

12

Sobhan Mohammadpour

@_sobhan_

Last Seen Users on Sotwe

Trends for you

Most Popular Users