Trevor McInroe @trevormcinroe - Twitter Profile

Pinned Tweet

Trevor McInroe @trevormcinroe

6 months ago

Introducing Terra Nova, a new comprehensive challenge environment for RL research inspired by Civilization V.

4

47

6

12

5K

Trevor McInroe @trevormcinroe

about 2 months ago

Feels like a good time to announce that I've joined Skild as a Research Scientist. Looking forward to continuing the push for RL in the real world.

Skild AI

@SkildAI

about 2 months ago

We have acquired Zebra Technologies’ robotics arm (formerly Fetch Robotics). This is what happens when orchestration meets intelligence -- a major step toward fully autonomous warehouses. More robots. More environments. One unified brain.

9

290

47

51

73K

0

10

0

182

Trevor McInroe @trevormcinroe

4 months ago

@dirtman Are you sure quantity is the best metric for comparison?

1

0

29

Trevor McInroe @trevormcinroe

4 months ago

@Sentdex Try MBPO+SAC. Then you can explore within the imagined trajectories of the model with a variety of strategies. Also can cut SAC's entropy target in half, e.g., -dim(A)/2

1

4

0

4

441
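The entropy-target suggestion in the reply above can be sketched in a few lines. This is a minimal illustration, assuming the common SAC heuristic of setting the target entropy to -dim(A); the function name `sac_target_entropy` and its `scale` parameter are hypothetical and are not taken from the tweet or from any particular library.

```python
def sac_target_entropy(action_dim: int, scale: float = 1.0) -> float:
    """Return an entropy target of -scale * dim(A) for SAC temperature tuning.

    `scale` is a hypothetical knob: 1.0 gives the common -dim(A) heuristic,
    0.5 gives the halved target -dim(A)/2 mentioned in the tweet.
    """
    if action_dim <= 0:
        raise ValueError("action_dim must be positive")
    return -scale * float(action_dim)


# Example with an assumed 6-dimensional continuous action space.
default_target = sac_target_entropy(6)            # -dim(A)
halved_target = sac_target_entropy(6, scale=0.5)  # -dim(A)/2
```

The halved value is closer to zero, so the temperature update is asked to maintain a different entropy level than the -dim(A) default; whether that helps is an empirical question for the task at hand.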

Trevor McInroe @trevormcinroe

4 months ago

@arnie_hacker This presupposes a convergence on the "correct" morphology.

0

32

Trevor McInroe @trevormcinroe

4 months ago

@alpercanbe re (2) This is harder to nail down and likely depends on audience. You're right that we usually don't write out xent loss, but most RL papers still have the obligatory paragraph defining the MDP and learning objective "\pi^{\ast} = \argmax_{\pi} \mathbb{E}_{\pi} \sum_t ..." ;)

0

112

Trevor McInroe @trevormcinroe

4 months ago

@alpercanbe It's a mixture of (1) relevancy and (2) vibes. re (1) If I am writing a paper on optimizers that improve upon Adam, I'll need to explicitly write out the maths of Adam. If the paper is on something else and I'm just using Adam, I can just cite (Kingma & Ba, 2015).

1

0

122

Trevor McInroe @trevormcinroe

5 months ago

@ID_AA_Carmack There's been some work on the disconnect between RL and other areas of ML w.r.t. NN size. Check out https://t.co/g0NDdL8SFA and https://t.co/EI5LXUH3oO

0

1

0

1

125

Trevor McInroe @trevormcinroe

5 months ago

@kevin_zakka Keep up the great work.

0

1

0

248

Trevor McInroe @trevormcinroe

6 months ago

@willccbb @TheZachMueller Return to monke

0

32

Trevor McInroe @trevormcinroe

6 months ago

@yoavgo A (usually learned) approximation of the MDP's transition function and reward function. In the POMDP case, we're likely modeling the observation function instead of the state-transition function directly.

1

3

0

349
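The definition in the reply above (a learned approximation of the transition and reward functions) can be illustrated with a toy example. This is only a sketch under strong assumptions: the environment is a made-up, noise-free linear system, the "model" is a least-squares fit, and all names (`A_true`, `B_true`, `w_true`, `step`, `model_step`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical ground-truth MDP: linear dynamics and linear reward.
A_true = np.array([[0.9, 0.1], [0.0, 0.8]])
B_true = np.array([[0.0], [0.5]])
w_true = np.array([1.0, -0.5, 0.2])


def step(state, action):
    """True transition and reward functions of the toy environment."""
    next_state = A_true @ state + B_true @ action
    reward = w_true @ np.concatenate([state, action])
    return next_state, reward


# Collect transitions from random states and actions.
states = rng.normal(size=(200, 2))
actions = rng.normal(size=(200, 1))
inputs = np.hstack([states, actions])               # shape (200, 3)
next_states = states @ A_true.T + actions @ B_true.T
rewards = inputs @ w_true

# "Learn" the model: least-squares fits for transition and reward.
dyn_weights, *_ = np.linalg.lstsq(inputs, next_states, rcond=None)
rew_weights, *_ = np.linalg.lstsq(inputs, rewards, rcond=None)


def model_step(state, action):
    """Learned approximation of the transition and reward functions."""
    x = np.concatenate([state, action])
    return x @ dyn_weights, x @ rew_weights
```

In practice the model is usually a neural network trained on noisy, partially observed data; the linear fit here only shows the two pieces being approximated.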

Trevor McInroe @trevormcinroe

6 months ago

@thegautamkamath @TmlrOrg Living in the UK these past four years has made me miss Mexican food to an extreme degree...

0

3

0

105

Trevor McInroe @trevormcinroe

6 months ago

@Stone_Tao Thanks! I've been using ManiSkill for a world-modeling project lately. Now that's cool stuff

0

2

0

92

Trevor McInroe @trevormcinroe

6 months ago

Music credit: "A Brighter Future" by All Good Folks: https://t.co/QipaQQ7GZY "Go Ahead" by Gerald Olivieri: https://t.co/T42eNalRWg

0

3

0

290

Trevor McInroe @trevormcinroe

6 months ago

On a completely unrelated note, if anyone wants to play a game of Civ, feel free to DM me!

1

4

0

347
