dp @vo_d_p - Twitter Profile

vo_d_p retweeted

Saganism

@Saganismm

1 day ago

41 years ago, Carl Sagan gave a clear and elegant explanation of the science behind global warming.

411

14K

3K

9K

466K

vo_d_p retweeted

Linus ✦ Ekenstam

@LinusEkenstam

2 days ago

Anyone who tries to argue climate change with you, just pull up this. Not about no AC’s no more. This is about future generations being able to grow food in Europe or not.

314

7K

1K

2K

2M

vo_d_p retweeted

Maxime Labonne

@maximelabonne

2 days ago

Fun surprise: DeepSeek used my open-perfectblend dataset to train their new DSpark drafter

Time to promote it again! It's an open-source reproduction of "The Perfect Blend" paper.

If you ever need >1M diverse prompts in math, chat, and code, it does the job. https://t.co/eWrwoGCqSI

34

1K

120

644

49K

dp @vo_d_p

1 day ago

They have benefited so much from published research whose experiments were conducted on open models, a large part of them Chinese ones. Let the spice flow.

Derya Unutmaz, MD

@DeryaTR_

2 days ago

DeepSeek should ban Anthropic from implementing DSpark into their models! In fact, Anthropic should be banned from using any AI research from China and required to remove any non-US data they used to pre-train their models! Cry, Dario, cry!😅

51

2K

141

282

115K

0

27

vo_d_p retweeted

@Yuchenj_UW

2 days ago

DeepSeek is the GOAT. 🐳

They just published DSpark, a new speculative decoding method that boosts throughput by 51% to 400%.

They also open-sourced DeepSpec, the training framework behind it.

This is the real open AI. https://t.co/Gj4afGKviJ

102

4K

450

1K

349K

vo_d_p retweeted

Alexander Whedon

@alex_whedon

3 days ago

One of the key insights from our SubQ 1.1 Small technical report (https://t.co/Y6kBYgPXv6) was that super-long-context pre-training decreased the reliance on super-long-context post-training to enable super-long-context modeling capabilities. Million-token-plus pre-training enabled the model to extrapolate post-trained capabilities to larger-than-trained lengths. For example, we pre-trained a model with one-million-token inputs and then did post-training at or below one million tokens, and it was able to perform NIAH with high accuracy at multi-million-token lengths. Read more in our technical report!

3

55

12

30

6K

dp @vo_d_p

3 days ago

@SanderSassen You know thermodynamics. The outside gets extra heat dumped into it if you set your home temperature lower. Touch the side of your fridge and tell me I'm wrong.

0

1

0

9

vo_d_p retweeted

Alastair Mellon SDP

@MellonSdp6741

4 days ago

If you change every suburban house in London from this to this does it have an impact on the heat in the City? Asking for a friend. https://t.co/KsEgwPLMBf

324

7K

327

379

4M

vo_d_p retweeted

Alexander Whedon

@alex_whedon

4 days ago

Join Subquadratic in SF for a casual gathering this Saturday (link in comments)!

We are a foundation model company building the most compute-, memory-, and sample-efficient foundation model architectures for the next era of AI computing! We love to chat about challenging base assumptions of the industry.

We are hiring folks to work on large-scale pre- and post-training, long-context modeling, model architectures beyond attention, world models, efficient inference and training, and more.

8

51

14

6

5K

dp @vo_d_p

4 days ago

@icanvardar 45 days of holiday a year. Welcome to Europe.

0

4

dp @vo_d_p

4 days ago

@francoisfleuret Trust is a luxury good in authoritarian countries. Because the system opts for a low miss rate in spotting untrustworthy motives, its false alarm rate is high as a result. It would rather trust no one than be wrong once.

0

34

dp @vo_d_p

5 days ago

@puckrin Living in Europe here. I don't see the point of investing in AC units just to use them one week out of 52 per year. We have other ways to cope with heat too.

0

6