Sibesh Kar @sibeshkar - Twitter Profile

Pinned Tweet

over 4 years ago

To build machines that can program themselves to do any task, you need to measure how well they learn to do things they were never programmed to do.

Today we release a new benchmark that's the first attempt to do this for any real-world task. https://t.co/RfJygh1rHC

Sibesh Kar @sibeshkar

about 1 month ago

interesting how relativity means travelling to larger and larger scales of the universe will require mastering tinier and tinier scales of autonomy/self-replication

Sibesh Kar @sibeshkar

2 months ago

converse: if information doesn't need to travel, it takes no time

Sibesh Kar @sibeshkar

3 months ago

hypothesis : gravity is what happens when information has to travel, and travel takes time

sibeshkar retweeted

1517 Fund

@1517fund

5 months ago

The beauty of Lingdong Huang's λ-2D - a drawn programming language.

Sibesh Kar @sibeshkar

6 months ago

the human abstract world-modelling system never ceases to amaze https://t.co/1Rur3Uv2Z4

Sibesh Kar @sibeshkar

6 months ago

underrated tweet

(((ل()(ل() 'yoav))))👾

@yoavgo

7 months ago

i would argue that the nature of the arc challenge is that incremental improvements means you are doing it wrong. a true solution should move from not working to near 100% in one go

Sibesh Kar @sibeshkar

7 months ago

insane https://t.co/Oqx5XY6H6j

Sibesh Kar @sibeshkar

8 months ago

octopus learns how to play a custom piano! https://t.co/htNRZeWmkB

Sibesh Kar @sibeshkar

over 1 year ago

Collating a thread of biological intelligence generalising to out-of-distribution tasks from just a few samples.

1. A pro basketball player mentally composes models of the hoop and the windmill to simulate the right trajectory for the throw, in just two tries.

Massimo

@Rainmaker1973

over 1 year ago

Professional Red Bull basketball athlete Chris Matthews attempts to shoot hoops through a moving wind turbine. [📹 TheLethalShooter]

Sibesh Kar @sibeshkar

about 1 year ago

Chimpanzees model the structural/mechanical properties of materials to craft new tools for unseen challenges https://t.co/8HJpA6V3TG

Sibesh Kar @sibeshkar

9 months ago

two-strip technicolor : the shoemaker and the elves https://t.co/QuIcJ9QxsV

Sibesh Kar @sibeshkar

9 months ago

perhaps the highest bandwidth human-machine interaction in the future looks something like an LSP https://t.co/dlgnfsInrg

Sibesh Kar @sibeshkar

over 2 years ago

a thought is a program; converting a synthesized thought from a context-free grammar to a context-sensitive one is lossy transmission

Sibesh Kar @sibeshkar

9 months ago

the language server protocol is severely underrated/underutilized technology

an instrument to collaborate live with a machine via shared syntax & context in realtime

Sibesh Kar @sibeshkar

11 months ago

reiterating w.r.t GPT-5 : expecting exponential gains from log-linear trends (in play since GPT-2) is a category error https://t.co/NZibwFqrJp

Sibesh Kar @sibeshkar

over 1 year ago

a linear gain in 'intelligence' (or more accurately, benchmark performance) for an exponential increase in resources

fancy betting on a curve that has explicit diminishing returns

The correct learning regime will display the exact opposite curve

src : https://t.co/QoBViISmv4 https://t.co/n8TiwHGuf0

Sibesh Kar @sibeshkar

over 2 years ago

a deep learning network : rigid, dense, fully-connected, ordered, non-local updates, needs a lot of data/energy to learn anything new

a biological brain : flexible, sparse, local updates, messy, can adapt to a new task from a few samples and less than a few watts

Sibesh Kar @sibeshkar

12 months ago

https://t.co/9Wgz74Ktr5

per "saturate in months" prediction : LLM RL post-training follows the same log-linear laws as RL pre-training, which means similar theoretical wall (reached faster)

(in graphs : xAI 10x’d the amount of compute used on RL for only marginal perf improvement)

Sibesh Kar @sibeshkar

over 1 year ago

first principles :
> reasoning model perf is downstream of base model perf
> if base model perf plateaus, reasoning model perf is a few months from plateauing
> unless there's a way to auto-induce shorter CoTs
> unlikely - penalizing CoT length leads to memorization in practice

Sibesh Kar @sibeshkar

11 months ago

wild https://t.co/meKwIeZpCQ

Sibesh Kar @sibeshkar

11 months ago

every day we wake up and pretend the world we live in is completely normal while the thing on the left routinely wraps & dissolves itself to turn into the thing on the right