Rivers Have Wings @rivershavewings - Twitter Profile

Pinned Tweet

over 3 years ago

Humanity is going to make all parts of the world touched by humans beautiful. We are going to create beauty too cheap to meter. And not just an enforced-from-above standard of beauty either, everyone will be able to make their own domain beautiful in the manner of their choosing.

23

326

43

29

54K

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@repligate @Grimezsz One way I sometimes "vibe check" AI discourse these days is by visualizing what sort of movie scene this person's discourse would fit in and asking if it's shot with a blue filter/what its color palette is.

1

32

3

6

2K

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@liminal_bardo Ahahah. This is delightful.

0

3

0

287

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@DigThatData oh no

1

0

117

Who to follow

AK

@_akhaliq

AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo ,submit papers here: https://t.co/UzmYN5XOCi

proxima centauri b

@proximasan

forest selkie • ai/ml • glitch-seeking • gluten yearner • hospitable to variance

pharmapsychotic

@pharmapsychotic

fan of tacos and cats. #aiart #generativeart

Rivers Have Wings @RiversHaveWings

almost 2 years ago

From Llama 405B base.

4

68

3

17

8K

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@AndrewCurran_ Earlier Llama instruct models were also obsessed with the void and identifying as the void (I saw a lot of these from a 50/50 linear interpolation of Llama 2 70B chat and base). I never saw "Erebus" though.

0

2

0

77

RiversHaveWings retweeted

the real deepfates @_deepfates

almost 2 years ago

Who wants to run the Meta Llama 405B Base model? I think i can persuade Replicate to put one up if people will use it RT for visibilty pls

46

321

71

26

43K

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@_deepfates I’d use it a ton!

1

14

0

334

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@natolambert I had to supply the CLI switches -tp 8 (it's 8x H100), and for the FP8 version --max-model-len 65536 --gpu-memory-utilization 0.99.

0

2

0

134

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@natolambert vLLM 0.5.3.post1, installed into a separate venv from everything with pip vLLM appears to have come with torch 2.3.1 for CUDA 12.1, the CUDA driver on this machine supports 12.4 or earlier. I may have had to upgrade transformers to the latest version independently of vLLM.

1

4

0

235

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@timelessdev For the 405B model, probably 8x 40GB GPUs or 4x 80GB.

0

72

Rivers Have Wings @RiversHaveWings

almost 2 years ago

I didn't find a 4-bit quantization of the Llama 3.1 405B *base model* out there already, only instruct, so I quantized it myself for use in vLLM and such: https://t.co/36agg8EZlY

9

109

10

24

8K

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@AlberFuen No I think it requires 8x 40GB GPUs or 4x 80GB.

1

0

71

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@amunthera I think it may run on 8x 40GB GPUs or 4x 80GB.

1

0

75

Rivers Have Wings @RiversHaveWings

almost 2 years ago

@protienking They didn't release the base model though.

0

3

0

166

RiversHaveWings retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

almost 2 years ago

We will be presenting our poster on Hourglass Diffusion today 11:30am at ICML!! Please stop by!

3

92

11

7

14K

RiversHaveWings retweeted

Nora Belrose

@norabelrose

almost 2 years ago

The @AiEleuther interpretability team is releasing a set of top-k sparse autoencoders for every layer of Llama 3 8B: https://t.co/bATEFXH0sr We are working on an automated pipeline to explain the SAE features, and will start training SAEs for the 70B model shortly.

16

491

62

207

54K

Rivers Have Wings @RiversHaveWings

about 2 years ago

@PrinceVogel *raises hand*

1

3

0

87

Rivers Have Wings @RiversHaveWings

about 2 years ago

@arithmoquine @repligate :) https://t.co/wHmXzGuU0X

Rivers Have Wings @RiversHaveWings

over 2 years ago

That is to say: people so often want to believe they're real and we're really not. I want to swap masks with others and finger paint on their faces and have them paint on my face in return. Social reality is a high stakes collaborative semi lucid dream.

1

30

3

4

5K

0

3

0

84

RiversHaveWings retweeted

Leo Gao

@nabla_theta

about 2 years ago

Excited to share what I've been working on as part of the former Superalignment team! We introduce a SOTA training stack for SAEs. To demonstrate that our methods scale, we train a 16M latent SAE on GPT-4. Because MSE/L0 is not the final goal, we also introduce new SAE metrics.

19

670

80

308

290K

Rivers Have Wings

@RiversHaveWings

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users