Aaron Daubman @daubman - Twitter Profile

daubman retweeted

over 1 year ago

Our NeurIPS paper is published on arXiv. In this paper, we propose a new optimizer ADOPT, which converges better than Adam in both theory and practice. You can use ADOPT by just replacing one line in your code. https://t.co/6kMDGAd8QF

ishohei220's tweet photo. Our NeurIPS paper is published on arXiv.
In this paper, we propose a new optimizer ADOPT, which converges better than Adam in both theory and practice.
You can use ADOPT by just replacing one line in your code.

https://t.co/6kMDGAd8QF https://t.co/rRVI5hn9Jg

23

1K

192

1K

383K

Aaron Daubman @daubman

over 1 year ago

@bernhardsson What temperature do you read at?

1

0

175

daubman retweeted

Tom Dörr

@tom_doerr

over 1 year ago

"Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs."

tom_doerr's tweet photo. "Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs." https://t.co/1YO9N9wBay

16

5K

621

6K

385K

daubman retweeted

Milos Vukadinovic

@milos_ai

over 1 year ago

"Make sure that your model can overfit on small training set" might be the single best sanity check when building ML models. It helped me solve countless implementation errors and to better understand capacity. I first heart it from @fchollet, thanks!

17

596

56

293

49K

Who to follow

Peter Sobot

@psobot

research engineering lead @Spotify 🇨🇦🎶👨🏼‍🔬🥁🎹🎸

Ajay Kalia

@ajaymkalia

product + AI/ML. founder @getyouralt / prev personalization @spotify. who gives a damn about the profits of Tesco?

Richard Whitcomb

@rwhitcomb

Digitizing smell @osmo_labs 👃past: @nvidia, @spotify, @twitter, @bluefinlabs

daubman retweeted

Sumit @_reachsumit

over 1 year ago

Learning ID-free Item Representation with Token Crossing for Multimodal Recommendation Represents items using learnable multimodal tokens instead of IDs, achieving better recommendation performance with only 20% of parameters compared to ID-based methods https://t.co/2hIuLpcvTS

0

9

4

2

591

daubman retweeted

Jordi Cabot @JordiCabot

over 1 year ago

JordiCabot's tweet photo. https://t.co/3K0WDTkHiy

92

12K

2K

870K

Aaron Daubman @daubman

over 1 year ago

@tubstr @Frenck Is this pre-release frigate 15 or something else?

0

18

Aaron Daubman @daubman

over 1 year ago

@Frenck I updated one device managing my SDN last night and a bunch of morning routines failed this morning until plugs and switches were power cycled 🤦‍♂️

0

1

0

80

daubman retweeted

James Surowiecki

@JamesSurowiecki

over 1 year ago

Having Medicare cover in-home care for the elderly is the single most substantive (and, I think, materially beneficial) policy proposal either candidate has offered in this race, and as far as I can tell, it's gotten almost no attention from the press, or on social media.

JamesSurowiecki's tweet photo. Having Medicare cover in-home care for the elderly is the single most substantive (and, I think, materially beneficial) policy proposal either candidate has offered in this race, and as far as I can tell, it's gotten almost no attention from the press, or on social media. https://t.co/9bXSaYJoPv

480

13K

4K

251

735K

daubman retweeted

Jo Kristian Bergum

@jobergum

over 1 year ago

We have solved a long-standing issue with local (per node) IDF/significance computations. Introducing a global IDF model, plug-and-play with Vespa. Now, it is also much more relevant for Vespa streaming mode, which doesn't build an index. Read more at https://t.co/WRV1VNtaZT

1

52

6

15

2K

daubman retweeted

tomaarsen @tomaarsen

over 1 year ago

Model2Vec distills a fast model from a Sentence Transformer by passing its vocabulary through the model, reducing embedding dims via PCA and applying Zipf weighting. Inference with the resulting static embeddings are lightning-fast, e.g. 10k texts/sec: https://t.co/gRUZ83Pf2Y 🧵

4

240

47

130

11K

daubman retweeted

Sumit @_reachsumit

over 1 year ago

DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities Enhances Learned Sparse Retrieval by incorporating Wikipedia entities, addressing limitations in handling complex vocabulary and improving retrieval. 📝https://t.co/QaokpKcQpk 👨🏽‍💻https://t.co/9Yq7CTvKdC

_reachsumit's tweet photo. DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities

Enhances Learned Sparse Retrieval by incorporating Wikipedia entities, addressing limitations in handling complex vocabulary and improving retrieval.

📝https://t.co/QaokpKcQpk
👨🏽‍💻https://t.co/9Yq7CTvKdC https://t.co/fcZue0HGba

0

30

5

11

1K

daubman retweeted

Sarah Catanzaro

@sarahcat21

over 1 year ago

And the dirty secret is how much $$$ folks are spending on annotations of dubious quality.

1

15

1

0

2K

daubman retweeted

SeungHeon Doh @SeungHeon_Doh

over 1 year ago

A joint embedding model that leverages a fine-tuned LLM to handle music descriptions from various datasets has been released. In addition to semantic queries (genre, instruments, mood, theme), it also supports metadata similarity queries for music search."

SeungHeon_Doh's tweet photo. A joint embedding model that leverages a fine-tuned LLM to handle music descriptions from various datasets has been released. In addition to semantic queries (genre, instruments, mood, theme), it also supports metadata similarity queries for music search." https://t.co/d4rgs1gcG1

1

50

8

16

3K

daubman retweeted

Paul Watson @paul_c_watson

over 1 year ago

Man sits by me on train. MAN: Loads of psychopaths around here ME: Really? MAN: Loads mate ME: How'd you know? MAN: There's signs aren't there? ME: I guess? MAN: I love them (47 minutes of awkward silence.) Man leaves train, he has a bike. I realise he was saying 'cycle paths'.

2K

257K

19K

8K

6M

daubman retweeted

Sumit @_reachsumit

over 1 year ago

Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers Uses LLMs' attention patterns instead of generation, outperforming existing approaches on complex retrieval tasks while significantly reducing latency. 📝https://t.co/AISKhrONNw 👨🏽‍💻https://t.co/aGrpfF26TU

_reachsumit's tweet photo. Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers

Uses LLMs' attention patterns instead of generation, outperforming existing approaches on complex retrieval tasks while significantly reducing latency.

📝https://t.co/AISKhrONNw
👨🏽‍💻https://t.co/aGrpfF26TU https://t.co/4zPmHwWY7f

0

21

4

12

1K

daubman retweeted

merve

@mervenoyann

over 1 year ago

solving problems using LLMs that can be solved by fine-tuning BERT is a skill issue

113

3K

250

1K

641K

daubman retweeted

Gergely Orosz

@GergelyOrosz

over 1 year ago

Seriously impressive: patent troll blackmails Cloudflare claiming they are infringing on 4 BS patents in 100 use cases. Cloudflare goes to court, and proves they don't infrige on anything from the BS patent, and has ALL the patent troll's patents donated to the public!

26

3K

191

239

197K

Aaron Daubman @daubman

over 1 year ago

Me at standup: "Tim, I'm focused on the future!"

0

56

Aaron Daubman

@daubman

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users