Shreyas Kapur @shreyaskapur - Twitter Profile

Pinned Tweet

about 2 years ago

My first PhD paper!🎉We learn *diffusion* models for code generation that learn to directly *edit* syntax trees of programs. The result is a system that can incrementally write code, see the execution output, and debug it. 🧵1/n

111

5K

584

3K

742K

Shreyas Kapur @shreyaskapur

about 1 year ago

@ChungMinKim Incredible work ✨ This is so cool!!

1

2

0

377

Shreyas Kapur @shreyaskapur

about 1 year ago

wow

Sergey Levine

@svlevine

about 1 year ago

π-0.5 is here, and it can generalize to new homes! Some fun experiments with my colleagues at @physical_int, introducing π-0.5 (“pi oh five”). Our new VLA can put dishes in the sink, clean up spills and do all this in homes that it was not trained in🧵👇

10

538

63

143

59K

0

3

0

791

Shreyas Kapur @shreyaskapur

about 1 year ago

@martinsit nice

0

1

0

38

Shreyas Kapur @shreyaskapur

about 1 year ago

@JeremyNguyenPhD I'm trying to figure out a good way to share this, since running GPUs is pretty expensive. Though my plans were more so to turn this from a demo to a more polished toy/game 😊

1

16

0

1

4K

Shreyas Kapur @shreyaskapur

about 1 year ago

I've been waiting 10 years to make this.

187

8K

505

4K

785K

Shreyas Kapur @shreyaskapur

about 1 year ago

@xf1280 I experimented with a bunch of image to 3D models. In this video I'm using fast3d mainly because of very low latency, though in my experiments other models like hunyuan3d and trellis gave better quality meshes.

1

14

1

10

633

Shreyas Kapur @shreyaskapur

about 1 year ago

@alexanderchen Thanks Alex! The "system" prompt I wrote specifies that the model should largely follow the user sketch, but if it's a particularly bad sketch, the model is allowed to be creative. It would be so cool to include that more as a slider for user control ✨

1

13

0

1

4K

Shreyas Kapur @shreyaskapur

about 1 year ago

@a_lidayan all credits to you haha

0

1

10K

Shreyas Kapur @shreyaskapur

about 1 year ago

@omegablitz_ yesss, that's exactly what I was going for!!

0

32

0

1

11K

Shreyas Kapur @shreyaskapur

over 1 year ago

@sergeykarayev I think @ndea may be Hobbit coded :)

0

1

36

Shreyas Kapur @shreyaskapur

over 1 year ago

I wrote up the full results on my blog, https://t.co/RSZUy2MoEP alongside example outputs from models. (2/2)

0

3

0

1

1K

Shreyas Kapur @shreyaskapur

over 1 year ago

Can LLMs do lateral thinking puzzles? I tested a bunch of language models on questions from @lateralcast and the #OnlyConnect gameshow! (1/2) 🧵

1

8

0

2

3K

shreyaskapur retweeted

Jiahai Feng @feng_jiahai

over 1 year ago

LMs can generalize to implications of facts they are finetuned on. But what mechanisms enable this, and how are these mechanisms learned in pretraining? We develop conceptual and empirical tools for studying these qns. 🧵

5

148

21

106

25K

Shreyas Kapur @shreyaskapur

over 1 year ago

Come check out my tree diffusion poster at the System 2 Reasoning at Scale workshop at NeurIPS!

Shalev

@Shalev_lif

over 1 year ago

Best poster moment at #NeurIPS2024

28

10K

713

774

377K

0

33

1

3

3K

shreyaskapur retweeted

Luke Bailey

@LukeBailey181

over 1 year ago

Can interpretability help defend LLMs? We find we can reshape activations while preserving a model’s behavior. This lets us attack latent-space defenses, from SAEs and probes to Circuit Breakers. We can attack so precisely that we make a harmfulness probe output this QR code. 🧵

11

371

82

218

59K

Shreyas Kapur @shreyaskapur

over 1 year ago

I'll be at NeurIPS, let me know if you want to catch up or chat about program synthesis, world models, neurosymbolic, search, probabilistic programming, or mourning the loss of King Da Ka.

1

14

1

1K

shreyaskapur retweeted

Tejas Kulkarni

@tejasdkulkarni

almost 2 years ago

I am currently holding my dad's cryopreserved brain tumor samples in hopes of creating a personalized vaccine for immunotherapy. However, there are some critical and time-sensitive questions in the attached post: https://t.co/1haayeNsa0 This is time-sensitive so would appreciate any DMs/RTs.

15

164

58

52

44K

Shreyas Kapur @shreyaskapur

almost 2 years ago

@EmilevanKrieken I think it has a lot of synergies with GFlowNets (which we mention in the paper) and one of our baseline methods (REPL Flow) is a mix between Ellis et al. reimagined as a GFlowNet.

1

0

150

Shreyas Kapur @shreyaskapur

almost 2 years ago

@EmilevanKrieken In our current mutation scheme, the expression can get longer or shorter at roughly the same probability, so not sure about the limiting distribution. Anecdotally we noticed that if we noise the program some number of times, the programs resemble just random programs.

0

1

0

316
