Graphsignal

about 2 months ago

@GraphsignalAI More on how Graphsignal uses dstack across on-prem and GPU cloud for development and inference: https://t.co/ssFzsc38o2

0

3

2

1

120

GraphsignalAI retweeted

about 2 months ago

New: a case study on how @GraphsignalAI uses dstack for development and inference benchmarking. Graphsignal builds tooling to profile model inference, and uses dstack across a fleet of @nvidia DGX Spark devices and @verdacloud to keep the workflow consistent across on-prem and cloud: https://t.co/fDZSrXV0wW

0

7

5

2

395

Supercharge your voice with the leading real-time AI voice changer and soundboard 🎙️ Discord: https://t.co/qcnBs6uuXz

2 months ago

@dstackai And we also rely on @dstackai for development and testing of our SDK: https://t.co/aSw2gnjEln. Can’t wait to share more in another post!

0

2

1

180

Who to follow

Voicemod

@voicemod

Jan Leike

@janleike

AI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.

Noam Brown

@polynoamial

Researching reasoning @OpenAI | Co-created Libratus/Pluribus superhuman poker AIs, CICERO Diplomacy AI, and OpenAI o-series 🍓 reasoning models

GraphsignalAI retweeted

2 months ago

autodebug by @GraphsignalAI is a closed-loop system for inference optimization. It uses @dstackai to provision GPUs and redeploy services on each pass through the loop: benchmark → read profiling telemetry → tweak config → redeploy → repeat. What's interesting here is the combination of agentic optimization and heterogeneous hardware: the system is not tuning a fixed deployment, it is continuously searching across infrastructure and configuration. There's no manual step between iterations. @dmitrimelikyan's writeup: https://t.co/uvSBN2maMJ

dstackai's tweet photo. autodebug by @GraphsignalAI is a closed-loop system for inference optimization.

It uses @dstackai to provision GPUs and redeploy services on each pass through the loop:

benchmark → read profiling telemetry → tweak config → redeploy → repeat.

What's interesting here is the combination of agentic optimization and heterogeneous hardware: the system is not tuning a fixed deployment, it is continuously searching across infrastructure and configuration.

There's no manual step between iterations.

@dmitrimelikyan's writeup: https://t.co/uvSBN2maMJ

1

8

6

2

728

2 months ago

@dstackai and here is the repo https://t.co/zcxItMwYFg, inspired by @karpathy authoresearch

0

4

2

0

282

2 months ago

autodebug: an autonomous loop that deploys an inference service, benchmarks it, reads profiling telemetry, and redeploys with a better config. Then repeats. Uses @GraphsignalAI for inference profiling, @dstackai for GPU provisioning, Claude Code as the agent. https://t.co/HHdHa6TcaM https://t.co/IsIBt9hbel

2

14

5

7

2K

GraphsignalAI retweeted

Andrey Cheptsov

@andrey_cheptsov

2 months ago

Config tuning is just the start. The same loop can optimize inference code and even custom CUDA kernels. It all depends on what tools the agent can use.

1

5

3

2

436

GraphsignalAI retweeted

2 months ago

Agent orchestration is evolving fast! Agents + orchestration + telemetry → closed-loop systems. Our friends at GraphSignal show how this unlocks continuous inference optimization in production — across heterogeneous hardware. This is where things get interesting.

0

5

3

1

253

GraphsignalAI retweeted

2 months ago

Now @GraphsignalAI integrates with dstack — add @sgl_project profiling, tracing, and GPU metrics to your inference services. pip install 'graphsignal[cu12]' + wrap with graphsignal-run. That's it. https://t.co/TEFptiG1ak

dstackai's tweet photo. Now @GraphsignalAI integrates with dstack — add @sgl_project profiling, tracing, and GPU metrics to your inference services.

pip install 'graphsignal[cu12]' + wrap with graphsignal-run. That's it.

https://t.co/TEFptiG1ak https://t.co/CnLiZtDLrL

0

6

1

444

3 months ago

New post: AI Debugging and Optimization For Production Inference https://t.co/VbZKQnTy2R Use Claude Code to debug and optimize AI systems with rich production context from Graphsignal

0

3

1

0

67

3 months ago

New post: Traditional Observability Is Blind to Inference https://t.co/Y2zS4twlBs

0

1

0

60

3 months ago

New post: vLLM production observability - from model to hardware. https://t.co/Jjun3ahbia

0

4

3

231

7 months ago

@dstackai @nvidia Thank you for the mention! We’re big fans of @dstackai

0

4

0

54

7 months ago

Fresh dev setup on #dgx_spark running as a @dstackai fleet⚡️

3

8

4

1

728

about 1 year ago

LLM API Latency Optimization Explained https://t.co/Fi7o5DB3R5

1

4

2

1

168

GraphsignalAI retweeted

Nathan Benaich

@nathanbenaich

over 1 year ago

berlin done, munich next! good crew coming too:

1

21

1

2

4K

over 2 years ago

YES, you need to see the prompts! Great article by @HamelHusain https://t.co/v2pXVzfc40

0

2

0

128

over 2 years ago

Learn how to measure and analyze LLM streaming performance using time-to-first-token metrics and traces ➡️ https://t.co/Q8drhdw6GZ

GraphsignalAI's tweet photo. Learn how to measure and analyze LLM streaming performance using time-to-first-token metrics and traces

➡️ https://t.co/Q8drhdw6GZ https://t.co/5G06CyioyO

0

3

1

0

217