Riya Patel @riyapee - Twitter Profile

Riya Patel

@riyapee

3 months ago

@shivanijpatel Red flag

1

2

0

2K

Riya Patel

@riyapee

3 months ago

Pinterest is the new vs code

0

13

1

0

2K

Riya Patel

@riyapee

4 months ago

@OmarHayat0 @OfficialLoganK Yeah looks like the form is only open for US schools

1

3

0

865

Riya Patel

@riyapee

4 months ago

Can Canadian students get access to a year of free Gemini pro too @OfficialLoganK

Logan Kilpatrick

@OfficialLoganK

4 months ago

Introducing Gemini 3.1 Pro, our new SOTA model across most reasoning, coding, and stem use cases!

555

7K

578

708

645K

8

83

2

12

15K

Who to follow

Tanay Kothari

@tankots

CEO at https://t.co/Q10J8b7EwN | Forbes 30 under 30 | Stanford CS + AI | Competitive programmer

Krish Mehta

@djkesu1

building @PalatialSim | robots @CILVRatNYU | se @ uwaterloo

4 months ago

@ErikKaum @puffer_ai yes! code is here https://t.co/8mewzv8y2V

1

0

57

Riya Patel

@riyapee

4 months ago

A 766K param model with RL outperforms Opus 4.6 on 8 bit games. I put 4 agents into a Pico Park emulation for 30 minutes. 500 million frames later, they’ve mastered cooperation and can consistently win the game. Play alongside my agents in the blog below! Trained with @puffer_ai

23

320

25

189

29K

Riya Patel

@riyapee

4 months ago

@dhruvbhatia0 Can the agents watch these PR videos and create a verifiable loop ?

1

0

96

Riya Patel

@riyapee

4 months ago

@morphllm 🫡

0

2

0

83

Riya Patel

@riyapee

4 months ago

@Samhanknr @puffer_ai yup! code here : https://t.co/8mewzv8y2V

1

120

Riya Patel

@riyapee

4 months ago

@Anishfishhh @puffer_ai I made the game, this wasn’t the og game so was easy to expose the game state

1

0

95

Riya Patel

@riyapee

4 months ago

@Anishfishhh @puffer_ai No not visual I don’t feed the pixel values in. I have access to game data so map all objects in screen which is the input to the cnn

1

0

283

Riya Patel

@riyapee

4 months ago

@silennai @puffer_ai yup, beat opus on cost to train and final performance

0

1

0

270

Riya Patel

@riyapee

4 months ago

@shivanijpatel @puffer_ai they do, their names are written under them

1

3

0

439

Riya Patel

@riyapee

4 months ago

@puffer_ai The model is trained with PPO as the core algorithm using actor-critic architecture. The encoder uses both a CNN for the grid input to keep the spatial information and an MLP for the self data vector.