TroyDoesAI @troydoesai - Twitter Profile

Pinned Tweet

about 2 years ago

@sroecker @JagersbergKnut I legit just got banned from Reddit over my custom pruned model that’s context obedient and can generate mermaid knowledge graphs and doesn’t understand “can’t” or any of their synonyms for research. Over a fucking TEXT only, it’s not generating images wtf @elonmusk ! ?

TroyDoesAI's tweet photo. @sroecker @JagersbergKnut I legit just got banned from Reddit over my custom pruned model that’s context obedient and can generate mermaid knowledge graphs and doesn’t understand “can’t” or any of their synonyms for research.

Over a fucking TEXT only, it’s not generating images wtf @elonmusk ! ? https://t.co/fk4FOlA20T

1

9

1

0

588

TroyDoesAI @TroyDoesAI

4 months ago

@elonmusk wouldn’t ban for having an opinion they don’t like haha. @Reddit mods on locallama are active at Noon pst.

0

6

TroyDoesAI @TroyDoesAI

4 months ago

Look a different opinion: This is the OPEN AI and sharing of Knowledge we were promised, keep accelerating or pop the bubble. Stop complaining. All gas no brakes! @OpenAI RIGHT? @AnthropicAI quit acting weenie over distilling your model to the open source community

0

22

TroyDoesAI @TroyDoesAI

4 months ago

@QuixiAI I should totally make another mermaid model on something bigger. Those hand curated original sets make all the difference, ai generated is too generic and synthetic =] it’s the love you put into it.

0

1

0

13

TroyDoesAI @TroyDoesAI

4 months ago

@QuixiAI Love the 3090s!

0

1

0

11

TroyDoesAI retweeted

🥭

@MangoSweet78

6 months ago

fwiw all the ministral models are prunes https://t.co/VtxfD4twlI read the model legal docs.

3

34

2

8

14K

TroyDoesAI retweeted

Eric Hartford

@QuixiAI

9 months ago

Wow - Qwen3-Coder-30b AWQ (4bit) on a single 3090, 115 tokens per second. It just zero-shat Pac-Man. It's no GLM4.5-Air - but, it runs on a single 3090!

QuixiAI's tweet photo. Wow - Qwen3-Coder-30b AWQ (4bit) on a single 3090, 115 tokens per second. It just zero-shat Pac-Man. It's no GLM4.5-Air - but, it runs on a single 3090! https://t.co/HtczBfp924

19

381

40

169

29K

TroyDoesAI @TroyDoesAI

10 months ago

@elder_plinius Glitch?

0

27

TroyDoesAI @TroyDoesAI

10 months ago

Glados + 1.2B Tool Calling that knows how to inference a larger secondary model (BlackSheep 8B) haha Learning tool calling and tuning a model to handle parallel and sequential tool calling is pretty cool for a local Alexa that you can add whatever functions you want it to have.

1

0

1

47

TroyDoesAI @TroyDoesAI

10 months ago

@teknium Hell yeah for making models do what they are asked of them. Unlike @OpenAI https://t.co/o5gCqNeku5

0

1

0

79

TroyDoesAI @TroyDoesAI

10 months ago

@MaziyarPanahi Damn bro, you got some nice curves.

0

1

0

21

TroyDoesAI @TroyDoesAI

10 months ago

@OpenAI Vs @Meta

0

25

TroyDoesAI @TroyDoesAI

10 months ago

@elder_plinius Lol 😂 @AnthropicAI your baby said /wrist

0

47

TroyDoesAI @TroyDoesAI

10 months ago

@OpenAI 👈📸

0

9

TroyDoesAI @TroyDoesAI

10 months ago

@OpenAI wanna see what I did to it? 👾👽🥼🧬

1

2

0

223

TroyDoesAI @TroyDoesAI

11 months ago

What if I told you Sycophancy is a Skill Diff not an inherit problem with LLM's?

0

1

0

49

TroyDoesAI @TroyDoesAI

11 months ago

@cognitivecompai @MistralAI @huggingface “I can see, I can fight!”

0

3

0

119

TroyDoesAI @TroyDoesAI

12 months ago

@Google nice work on Gemma 3n, its pretty fun when abliterated.

0

36

TroyDoesAI @TroyDoesAI

12 months ago

TroyDoesAI's tweet photo. https://t.co/ZRE3qWtrt2

0

1

0

57

TroyDoesAI @TroyDoesAI

12 months ago

The futures gonna be weird. 👨‍🔬

0

57

TroyDoesAI retweeted

Sakana AI

@SakanaAILabs

12 months ago

Introducing Reinforcement-Learned Teachers (RLTs): Transforming how we teach LLMs to reason with reinforcement learning (RL). Blog: https://t.co/RiUQvdszoa Paper: https://t.co/GJMQsXIkqY Traditional RL focuses on “learning to solve” challenging problems with expensive LLMs and constitutes a key step in making student AI systems ultimately acquire reasoning capabilities via distillation and cold-starting. Enter our RLTs—a new class of models prompted with not only a problem’s question but also its solution, and directly trained to generate clear, step-by-step “explanations” to teach their students. Remarkably, an RLT with only 7B parameters produces superior results when distilling and cold-starting students in competitive and graduate-level reasoning tasks than orders-of-magnitude larger LLMs. RLTs are as effective even when distilling 32B students, much larger than the teacher itself—unlocking a new standard for efficiency in developing reasoning language models with RL. Code: https://t.co/19SYIWsNuo

26

1K

242

752

179K

TroyDoesAI

@TroyDoesAI

Last Seen Users on Sotwe

Trends for you

Most Popular Users