Andrii Maksai @AMaksai - Twitter Profile

AMaksai retweeted

Hanxiao Liu @Hanxiao_6

3 days ago

Happy to share our recent work!

4

71

9

6

17K

Andrii Maksai @AMaksai

3 months ago

@giffmana I let Claude peek into my tmux pane for that purpose sometimes

0

75

Andrii Maksai @AMaksai

3 months ago

@WenjieYang00 Agreed, though I think at least part of the issue is aggressive post-training for tool use which masks some vision capabilities. Thanks for reading:)

0

1

0

18

Andrii Maksai @AMaksai

4 months ago

For a while now, I wanted to check how well frontier models can look at an image and handwrite on top of it. So I did a thing:

InkSlop: Vibe-coded benchmark for Spatial Reasoning with Digital Ink

Link and key points below👇 https://t.co/5O2FunYA53

2

9

2

2K

Andrii Maksai @AMaksai

4 months ago

@giffmana @helloiamleonie @JinaAI_ Not in the infographics, but it's actually quite pronounced in the paper

1

3

0

1

196

Andrii Maksai @AMaksai

4 months ago

The bonus: Results with Nano Banana Pro & notes on vibe-coding, vibe-analyzing, vibe-debugging, and vibe-reporting the whole thing included.

https://t.co/BYhagAmtU2

0

2

1

145

Andrii Maksai @AMaksai

4 months ago

The finding: Models really prefer tool use to actually looking. And when the tools are taken away, performance drops for most task & model combinations (ex. 0% on mazes)

https://t.co/3U3cSzaKsd

1

2

0

121

Andrii Maksai @AMaksai

4 months ago

@ibab Plot twist: Did YOU write it? :P

0

77

Andrii Maksai @AMaksai

5 months ago

@giffmana I didn't but of course, GPT is not a savage! If you squint really hard, you can also see that the editor is in vim mode :P

0

13

Andrii Maksai @AMaksai

5 months ago

@giffmana Mostly Fig. 4, but sometimes Fig. 6 for a bunch of trivial parallel changes (because switching in tmux >> switching sessions in CC extension)

0

23

Andrii Maksai @AMaksai

9 months ago

@NandoDF Depends on the performance on the task, whether this is the largest model, and how easy it is to synthesize the prompts, I guess? SFT on most if the model has not seen a lot of this type of data before, distill a larger model, maybe with synthetic prompts if doable, RL otherwise?

0

781

Andrii Maksai @AMaksai

over 1 year ago

@ericjang11 Not sure anything beyond https://t.co/zl8xicVEgo was ever publicized

0

1

0

1

31

Andrii Maksai @AMaksai

over 1 year ago

@giffmana Nice! Nit: GitHub links all go to 404, although maybe that's just temporary?

0

133

AMaksai retweeted

Alex Wiltschko

@awiltschko

over 1 year ago

Well, we actually did it. We digitized scent. A fresh summer plum was the first fruit and scent to be fully digitized and reprinted with no human intervention. It smells great. Holy moly, I’m still processing the magnitude of what we’ve done. And yet, it feels like as we cross this finish line we are instantly at a new starting line. I’ll have more to share about what’s in store that we’re building on top of this. A huge HUGE congrats to the entire team across scientific, engineering, operational, and creative disciplines. It takes a village named Osmo to do this. I don’t know if this is embarrassing, but I carry the plum scent with me a lot of places and smell it constantly. It makes me smile. I’m curious, if y’all want to smell it? If we made a limited release fragrance of the first teleported scent and dedicated the proceeds to science, would you want it?

545

17K

2K

7K

3M

AMaksai retweeted

Google AI

@GoogleAI

over 1 year ago

Today we describe a model taught to read and write so it can extract and digitize the strokes of handwriting without the need for specialized equipment. It then outputs realistic looking digital handwriting that can be handled like standard digital text. https://t.co/y3hj54ONkP

https://t.co/fxiXiFm2Iv

30

334

95

93

42K

Andrii Maksai @AMaksai

over 1 year ago

@francoisfleuret Isn't this somewhat identical to having one layer do f(x)=-x, and the second one writing down some completely new content there? Is the hypothesis that having zeroes in the mask makes it easier from the optimization perspective somehow?

1

0

162
