Bhavyanshu Parasher @bvynshu - Twitter Profile

bvynshu retweeted

Thrilla the Gorilla

@ThrillaRilla369

12 months ago

A 3 PM hotel check-in and 11 AM check-out has got to be one of the biggest scams🤨

509

10K

475

82

389K

bvynshu retweeted

Massimo

@Rainmaker1973

over 1 year ago

Automatic snow chains deployment systems like the Onspot mechanism, allow vehicles to increase their traction on snow and ice with a relatively immediate activation triggered from the cab.

18

423

48

63

145K

Bhavyanshu Parasher @bvynshu

over 1 year ago

@kesarito Qq: why is IQ AI limited to wiki for crypto? Why not disrupt wikipedia (which is biased) and become the one true source for all data?

0

15

bvynshu retweeted

terminally onλine εngineer

@tekbog

over 1 year ago

i cant believe ChatGPT lost its job to AI

839

215K

24K

6K

6M

Who to follow

Harsh Shrivastava

@Harsh1952

Retired, Chief Financial officer

LA | לא

@ligiabraham

הודי מלידה 🇮🇳|🇮🇱 יהודי ממוצאו

bvynshu retweeted

over 1 year ago

# Run server ./bin/llama-server -m /path/to/model.gguf -c 64000 -t 56 --host 0.0.0.0 --port 8080 -c is the context length -t is cpu threads

6

182

7

79

54K

bvynshu retweeted

Harrison Kinsley

@Sentdex

over 1 year ago

Depending on the context window, the "used" RAM varies, but the cache in this case is nearly the full 1TB. I am finding ~64K context to be the sweet spot, the full 128K is very slow, like 1 t/s or less, but no real gains going less than 64K.

3

402

18

55

73K

bvynshu retweeted

Avichal - Electric ϟ Capital

@avichal

over 1 year ago

Ironic that we got free AI from a hedge fund and $200/month AI from a nonprofit.

642

46K

5K

3K

2M

bvynshu retweeted

Yuchen Jin

@Yuchenj_UW

over 1 year ago

"Tighten export controls on chips" is a loser's attitude. - DeepSeek can already run inference on Huawei Ascend chips - This will only push China to accelerate GPU development and create its own CUDA When did the US start fearing competition? And why oppose open-source AI?

Yuchenj_UW's tweet photo. "Tighten export controls on chips" is a loser's attitude.

- DeepSeek can already run inference on Huawei Ascend chips
- This will only push China to accelerate GPU development and create its own CUDA

When did the US start fearing competition?
And why oppose open-source AI? https://t.co/jWYeH6L3xX

346

4K

384

521

401K

bvynshu retweeted

signüll

@signulll

over 1 year ago

🤣

signulll's tweet photo. 🤣 https://t.co/7CclzdhZg4

74

2K

186

280

162K

Bhavyanshu Parasher @bvynshu

over 1 year ago

All those AI jobs no longer feel tractive anymore and overall feels over-inflated/hyped. Pull back was essential and DeepSeek made the decision easier.

0

61

bvynshu retweeted

kache

@yacineMTB

over 1 year ago

Seeing openai employees cope like this is all I need to know about them never ever making it ever. So bearish

164

15K

1K

694

507K

bvynshu retweeted

Georgi Gerganov

@ggerganov

over 1 year ago

pack it up boys, it's over

113

7K

598

3K

706K

bvynshu retweeted

Orikron 🇵🇹 骆培思

@orikron

over 1 year ago

Average American "Ph.D."

541

21K

2K

935

552K

bvynshu retweeted

rohit

@seatedro

over 1 year ago

this means nothing to me anymore

124

15K

897

2K

798K

Bhavyanshu Parasher @bvynshu

over 1 year ago

So weird that net-tools doesn't come pre-installed with ubuntu -- it's basic

0

10

Bhavyanshu Parasher @bvynshu

over 1 year ago

@KevinNaughtonJr Because we are in it together, as family 😅

0

1

0

21

Bhavyanshu Parasher @bvynshu

over 1 year ago

No point learning this the hard way and spending hours troubleshooting why the program won't start

0

9

Bhavyanshu Parasher @bvynshu

over 1 year ago

About system safety Always check file usage lsof <path> # Is the file open by a process? Always try to truncate instead of delete if unsure of its usage truncate -s 0 <path>

1

0

16

bvynshu retweeted

Q @qtnx_

over 1 year ago

after releasing a Sparse Autoencoder for llama 1B, i'm happy to announce that we've scaled up to 8 billion parameter models, having a trained a SAE for DeepSeek R1 Distill Llama 8B, and releasing it as open source.