avantika @avaa411 - Twitter Profile

3 days ago

what if we spread nyc tech week events across 52 weeks instead of concentrating them into 1 every 6 min during the best weather week of the year (ans: we’d be in SF)

ana

@anamika__x

5 days ago

NYTW has a new event every 6 minutes. if you went back-to-back without sleeping you would still miss 98% of it. use this chatbot to plan your week!

3

57

5

19

15K

2

21

0

2

4K

avantika

@avaa411

6 days ago

recent ai-flavoured nightmares:
- the concept of taking an english igcse exam & being presented with an insightful / emotive LLM conversation transcript as the case-study for analysis. genre, content, context, aim, theme, syntax, diction, rhythm, imagery, form, and tone.
- the concept of my models becoming spoilt and constantly demanding hand-corrected samples for better fine-tuning.

1

4

0

274

avaa411 retweeted

Dwarkesh Patel

@dwarkesh_sp

7 days ago

What is the most compelling example of a task in a non-verifiable domain where models really struggle? That might hint at lack of generalization from verifiable to non-verifiable domains.

283

1K

30

586

244K

avantika

@avaa411

9 days ago

the bay area propaganda all makes sense now

0

7

0

244

avantika

@avaa411

18 days ago

the spicy claude code mosaic is an anthropic experiment to test the extent to which human users can continue to derive meaning from word fragments as they become increasingly garbled. a ploy to turn us into more efficient token parsers

https://t.co/LPgSxLWofA

0

4

0

119

avantika

@avaa411

21 days ago

@manaltdan @AnthropicAI congrats, Dan!!!

0

45

avantika

@avaa411

about 1 month ago

@aleozimok too late but would love to join next time!

1

0

462

avantika

@avaa411

about 1 month ago

uncanny

roon

@tszzl

about 1 month ago

everyone is assuming this is some kind of quirk chungus marketing campaign but if you’ve worked with 5.4 and beyond they tend to call everything goblins, gremlins etc and it’s just super noticeable and if you work with them all day you start to get annoyed

203

2K

30

188

299K

0

7

0

549

avantika

@avaa411

about 2 months ago

@luke_drago_ @WorkshopLabs @thinkymachines Wow, congrats!

0

1

0

257

avantika

@avaa411

2 months ago

@kaseyklimes so good!

1

0

352

avantika

@avaa411

2 months ago

ah i’ll try gemini! ended up remembering/reconstructing the piece while waiting for chat and claude to produce the notes correctly 🫠 it’s an interesting problem space, since music is a fundamentally different symbolic language from natural language (polyphonic, multidimensional, continuous) - so there are all sorts of limitations when trying to manipulate it w LLMs

0

1

0

24

avantika

@avaa411

2 months ago

curious if anyone's had luck with LLMs writing sheet music from sample audio (attempted transcriptions on claude resulting in phantom notes & dissonance)

1

5

0

518

avantika

@avaa411

3 months ago

@matthewjmandel @davidchalmers42 adjacent

0

1

0

119

avantika

@avaa411

3 months ago

don’t count your tokens before they hatch

0

7

0

333

avantika

@avaa411

3 months ago

@_meagan_orourke @reason lets go!!! Congrats!

0

1

0

74

avaa411 retweeted

Alex Chalmers

@chalmermagne

3 months ago

A few weeks ago, I joined @cosmos_inst. AI is the transformative technology of our era. By acting as a force-multiplier on intelligence, I believe it has the potential to do tremendous good for humanity. At the same time, I worry that we may pursue the path of least resistance and increasingly outsource our judgment and our ability to think. We risk ending up in the warm bath of learned helplessness.

Often these conversations feel like they’re confined to communities that would rather we abandon the development of advanced AI systems. Many of my fellow optimists treat them as minor details we don’t need to worry about.

But over the past few months I feel like I’ve found real community. Through meeting @mbrendan1, the Cosmos team, and the wider network at the first Cosmos symposium, I’ve encountered a rare combination of technical depth and moral seriousness.

At Cosmos, I’ll be continuing to fight the battle of ideas. To give you a flavour of what I’ll be working on, for my first piece, I’ve written about why even highly capable AI systems will make for poor economic planners. https://t.co/payqWG60fe

17

149

23

86

27K

avantika

@avaa411

3 months ago

@madhavsinghal_ nice, thanks!

0

3

0

40

avantika

@avaa411

3 months ago

Scaling laws formalize performance as a function of three variables: model size, dataset size, and compute. Data quality is excluded from the framework.

3

12

0

4

536

avantika

@avaa411

3 months ago

The companies that figure out how to make domain-expert reasoning tractable as training data will determine the next inflection in model capability. If you're building in this space, I’d love to chat!

0

4

0

121

avantika

@avaa411

3 months ago

The binding constraint for model performance, then, is the absence of process-level data -- how domain experts reason, allocate attention, process new info, weigh evidence, & dynamically update beliefs to reach conclusions over the length of a task.

1

3

0

135
