Igor @igorchovpan - Twitter Profile

about 2 months ago

4x cost reduction in TTS inference with @tenstorrent! 11 NVIDIA L40S ran 550 simultaneous audio-stream at ~$100K. Now, 27 Tenstorrent P100 chips do the same at ~$27k. First production-grade TTS to match the cost of text tokens without degradation in audio quality. Hear it straight from the team that built it: @AkshatMandloi10 and @ranjith_m_s in the video below.

10

226

41

94

21K

igorchovpan retweeted

Kevin Mi

@kevinmi920

about 2 months ago

Introducing Infinite Studio ♾. Last week, @tenstorrent x @prodia announced the fastest Wan 2.2 video generation in the world. We built a demo to show what that speed unlocks: directing an infinite movie in real time. Demo 👇

22

123

45

15

5K

Igor @igorchovpan

2 months ago

[5/5] More GRPO details: The model is rewarded for formatting (0 to 0.4) and for correctness (-1.0 to 6.0). The maximum reward is 6.4. The correctness reward goes up for first ~300 steps then flattens. The test accuracy grows from 19% before GRPO to 34% after GRPO.

igorchovpan's tweet photo. [5/5] More GRPO details:

The model is rewarded for formatting (0 to 0.4) and for correctness (-1.0 to 6.0). The maximum reward is 6.4. The correctness reward goes up for first ~300 steps then flattens. The test accuracy grows from 19% before GRPO to 34% after GRPO. https://t.co/MbEbfRHmzB

0

2

0

33

Igor @igorchovpan

2 months ago

[1/5] Got a working TRL pipeline that makes a tinyllama model solve math questions. gsm8k test accuracy: 34.65% Pipeline: tinyllama-v1.1b-math-code + NuminaMath CPT (2 epochs) + GSM8K Format Priming + GRPO (1 epoch).

1

8

2

0

205

Igor @igorchovpan

2 months ago

[4/5] Last stage is GRPO, it helps the model figure out a logically correct answer. With temperature=1.0 and 16 generations, the model proposes a few different approaches to the same problem. Usually one or two don't have any logical mistakes.

igorchovpan's tweet photo. [4/5] Last stage is GRPO, it helps the model figure out a logically correct answer. With temperature=1.0 and 16 generations, the model proposes a few different approaches to the same problem. Usually one or two don't have any logical mistakes. https://t.co/Kiminc5GhZ

1

2

0

51

Igor

@igorchovpan

Last Seen Users on Sotwe

Trends for you

Most Popular Users