Nylan Richard

Verified account

@nylric17

Working on custom AI models for dev teams. Writing about what I learn in post-training / fine-tuning.

San Francisco, CA

Joined February 2025

624 Following

28 Followers

184 Posts

10 days ago

I always felt like it missed a reliable playbook for startups using OpenClaw/Hermes

0

0

0

0

14

13 days ago

This is so obvious that classic LLMs are not the path to AGI

0

0

0

0

7

27 days ago

FDE is very cool but shouldn’t your product be simple enough to use to not need an engineer’s implementation?

0

1

0

0

22

about 1 month ago

@MariosGeorgakis Very interesting

0

0

0

0

70

nylric17 retweeted

about 1 month ago

VLMs (Vertical Language Models) are beating top LLMs. These small 7B to 15B niche-focused models are beating SoTA models in their niche benchmarks. I post-trained a 6B dense model in 15 days and beat Sonnet 4.6 and Gemini 3 Flash. I use Codex 5.5 (Extra High) to plan the SFT dataset scope, then I use DeepSeek v4 Pro & Kimi 2.6 API to generate handwritten examples. (No synthetic, templated datasets.) Codex runs each batch through quality gates and filters out all the weak data. I was able to build a 350M-parameter dataset for just $300 using Codex as the orchestrator & DS + Kimi as the executors. I can compete with giant data labs on my own, beat their VLMs, and not break the bank. This happened only because of open-source models, as they're fighting neck-and-neck with SoTA models. If I had to start a career right now, I'd start an agency that fine-tunes SLMs (small language models) for enterprises. I'd charge them a $10k to $20k one-time fee. Use Qwen 3.5 or Gemma 4 as base models, use Codex as the brain and DeepSeek v4 + Kimi as the muscle, and post-train a strong SLM under $1000. This might feel far-fetched, but in 6 months you'll see agencies like these. Not everything requires an LLM. SLMs can achieve vertical intelligence if properly trained with 10x lower cost, no privacy issues, and full control over the model. I'll be sharing technical findings here on X on the go. If you enjoy nerdy fine-tuning stuff, stay tuned.

31

474

31

638

32K

about 1 month ago

I just lost all my conversations with Claude Cowork

0

0

0

0

17

about 2 months ago

Everybody will tell you to do it but they will never

0

0

0

0

8

about 2 months ago

@sayinshallah @chang_defi I mean your application might be reviewed by some Claude and skills

0

0

0

0

15

about 2 months ago

@rohit4verse It has been the exact same prediction since 16months

0

0

0

0

36

about 2 months ago

@zan2434 @eddiejiao_obj @drewocarr This is amazing

0

0

0

0

47

about 2 months ago

@steel_ph0enix @catgirlprostate which model were you using?

0

0

0

0

3

about 2 months ago

RLM

0

0

0

0

16

about 2 months ago

The competition OpenAI vs Anthropic is probably the one that has the most positive impact on their users

0

0

0

0

17

about 2 months ago

Is it me or Wispr Flow is very slow? after dictation It was ok all the week but since this morning not anymore

0

1

0

0

38

about 2 months ago

@emilheap @thsottiaux lets call this skill emil-brain

1

1

0

0

152

about 2 months ago

@thsottiaux That's what matters

1

2

0

0

2K

about 2 months ago

@mattshumer_ dmed

0

0

0

0

150

about 2 months ago

Claude is down. Again. Please use local models

0

0

0

0

289

about 2 months ago

@ClementDelangue Interesting. In coding? Do you have something specific in mind?

0

0

0

0

64

Last Seen Users on Sotwe

Trends for you

Most Popular Users