vpj

Verified account

@vpj

Joined April 2008

249 Following

957 Followers

8.2K Posts

11 days ago

@saurabh_shah2 @baneepbanana 😂

0

1

0

0

85

vpj retweeted

30 days ago

I don't remember where I found this, but its spot on.

bhalligan's tweet photo. I don't remember where I found this, but its spot on. https://t.co/fqrusbkUqH

733

32K

7K

8K

45M

30 days ago

@ChrSzegedy 💯 😂

0

0

0

0

264

vpj retweeted

about 1 month ago

DeepInfra has raised its $107M in Series B funding 🚀 AI is moving from training to production-scale deployment, and inference is becoming the system constraint. DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik

DeepInfra's tweet photo. DeepInfra has raised its $107M in Series B funding 🚀

AI is moving from training to production-scale deployment, and inference is becoming the system constraint.

DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik

8

54

10

12

12K

Who to follow

AI4Science at @sbintuitions. Ex-UTokyo, Ex-Preferred Networks, Tokyo. Into ☕️ and 🏃🏽‍♂️. 🇱🇰 → 🇯🇵

Software Professional from Kandy,Sri Lanka. Love the game of cricket.

Nuwan Samarasekera

Founder. Ex-Google. Building TestChimp https://t.co/4UK6024VFY QA - built for the age of human + agent hybrid teams

vpj retweeted

about 1 month ago

DeepInfra is now a first-class provider in @OpenClaw. One key, every model. 🦞

2

11

4

0

2K

vpj retweeted

about 1 month ago

DeepInfra × Hugging Face DeepInfra is live on @HuggingFace Inference Providers. Run DeepSeek V4, Kimi-K2.6, GLM-5.1 and 100+ more open models straight from the Hub — same OpenAI-compatible API, same low per-token pricing, no markup. Just add :deepinfra to the model name.

DeepInfra's tweet photo. DeepInfra × Hugging Face
DeepInfra is live on @HuggingFace Inference Providers.
Run DeepSeek V4, Kimi-K2.6, GLM-5.1 and 100+ more open models straight from the Hub — same OpenAI-compatible API, same low per-token pricing, no markup.
Just add :deepinfra to the model name. https://t.co/K0hsIv3SwV

8

72

14

19

25K

vpj retweeted

about 1 month ago

The DeepSeek V4 garbled output bug in open source inference engine is fixed in SGLang. To everyone affected over the weekend, sorry for the trouble. Huge thanks to @Ant_Group for landing the fix PR. It was a cross-company, cross-timezone, sub-48-hour marathon. @ollama and @humansand surfaced it first; @nvidia, @AIatMeta, and @FireworksAI_HQ raised the same signal soon after. @deepseek_ai replied in seconds at every hour. @FireworksAI_HQ stayed up late with us until it shipped. @SemiAnalysis_ and @ollama provided the machines that made the debugging possible. The SGLang team dug in through the weekend. The real OSS is the friends we made along the way.🫶

lmsysorg's tweet photo. The DeepSeek V4 garbled output bug in open source inference engine is fixed in SGLang.
To everyone affected over the weekend, sorry for the trouble.

Huge thanks to @Ant_Group for landing the fix PR. It was a cross-company, cross-timezone, sub-48-hour marathon. @ollama and @humansand surfaced it first; @nvidia, @AIatMeta, and @FireworksAI_HQ raised the same signal soon after. @deepseek_ai replied in seconds at every hour. @FireworksAI_HQ stayed up late with us until it shipped. @SemiAnalysis_ and @ollama provided the machines that made the debugging possible. The SGLang team dug in through the weekend.

The real OSS is the friends we made along the way.🫶

16

282

27

38

80K

vpj retweeted

about 1 month ago

DeepSeek V4 is live on DeepInfra at launch 🔥 V4-Pro: 1.6T MoE / 49B active. Frontier-tier reasoning. $1.74 in · $3.48 out · $0.145 cached V4-Flash: 284B MoE / 13B active. Fast & cheap for agents, RAG, long-context extraction. $0.14 in · $0.28 out · $0.028 cached

DeepInfra's tweet photo. DeepSeek V4 is live on DeepInfra at launch 🔥

V4-Pro: 1.6T MoE / 49B active. Frontier-tier reasoning.
$1.74 in · $3.48 out · $0.145 cached

V4-Flash: 284B MoE / 13B active. Fast & cheap for agents, RAG, long-context extraction.
$0.14 in · $0.28 out · $0.028 cached https://t.co/gjxtnyr6ex

7

23

1

1

1K

vpj retweeted

about 2 months ago

Day 0. GLM-5.1 from @Zai_org is live on DeepInfra. Open source getting close to GPT-5.4 and Claude Opus 4.6. Powered by @nvidia B300 Blackwell Ultra. Early access pricing, costs will drop as we scale. $1.40 in / $4.40 out / $0.26 cached per 1M tokens ↓

DeepInfra's tweet photo. Day 0. GLM-5.1 from @Zai_org is live on DeepInfra.
Open source getting close to GPT-5.4 and Claude Opus 4.6.
Powered by @nvidia B300 Blackwell Ultra.
Early access pricing, costs will drop as we scale.
$1.40 in / $4.40 out / $0.26 cached per 1M tokens ↓ https://t.co/LGXHwkPbVf

1

9

4

1

1K

about 2 months ago

@saurabh_shah2 🙊

0

2

0

0

66

3 months ago

@jeremyberman @saurabh_shah2 @rramador @ma_tay_ 😂

0

2

0

0

134

3 months ago

@niloofar_mire @moby763canary21 @humansand 😂

0

1

0

0

82

3 months ago

@GauravML Do you upload the pdf and ask questions in chat? Or is there a unique interface for this?

1

0

0

0

39

vpj retweeted

3 months ago

Kimi K2.5 Turbo just dropped on Deep Infra 🚀 #1 by speed: 341 tokens/sec #1 by price: $0.90/1M tokens credits to @ArtificialAnlys for benchmarks

DeepInfra's tweet photo. Kimi K2.5 Turbo just dropped on Deep Infra 🚀

#1 by speed: 341 tokens/sec
#1 by price: $0.90/1M tokens

credits to @ArtificialAnlys for benchmarks https://t.co/sL8gLSh9X6

12

303

20

77

26K

vpj retweeted

3 months ago

there is still no substitute for perfectly understanding every single line of code in your codebase i fall into the trap of just skimming through ai changes to "just make sure it looks good" all the time, and it makes me lose so much time to not perfectly understand every line

158

3K

94

412

251K

vpj retweeted

3 months ago

At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: https://t.co/lThuIjQvF9 What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (https://t.co/9d485OLrSY). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.

38

1K

70

357

210K

vpj retweeted

3 months ago

you should join humans& we have great perks for example @rramador will buy you a cool dino

saurabh_shah2's tweet photo. you should join humans&
we have great perks
for example
@rramador will buy you a cool dino https://t.co/zqy5abLVKP

6

63

1

7

7K

3 months ago

@Diyi_Yang @saurabh_shah2 @rramador 😂

0

2

0

0

36

3 months ago

0

0

0

0

22

Last Seen Users on Sotwe

Trends for you

Most Popular Users