ShinchaBoo @ShichanBoo08 - Twitter Profile

Claude Cowork with blender is so much fun, still work in progress will post the final scene soon. Trying out if it can build basic geometry nodes scene like waves hitting a beach 🌊🏖️

thefirehacker's tweet photo. Claude Cowork with blender is so much fun, still work in progress will post the final scene soon.

Trying out if it can build basic geometry nodes scene like waves hitting a beach 🌊🏖️ https://t.co/jm2T9pUBnI

1

4

3

63

ShichanBoo08 retweeted

Fetchlens.ai @fetchlens

9 days ago

52% of MCP servers are dead within 90 days. But the median server has 6 commits — lifetime. The protocol works. The logic layer doesn't exist. Content goes stale. Tools stay isolated. Nobody monitors what fails. Full research: https://t.co/xCk7HPZbce

fetchlens's tweet photo. 52% of MCP servers are dead within 90 days.
But the median server has 6 commits — lifetime.

The protocol works. The logic layer doesn't exist.

Content goes stale. Tools stay isolated. Nobody monitors what fails.

Full research: https://t.co/xCk7HPZbce https://t.co/lUyKvLUQQa

0

4

3

64

ShichanBoo08 retweeted

FireHacker

@thefirehacker

27 days ago

This MTP pull request merge is getting more attention than many model drops. I first noticed MTP while looking at Qwen3.5-0.8B, and now llama.cpp support makes the whole thing more interesting. My current understanding is that MTP mainly improves token generation, not prompt processing. So it helps when the model is writing a lot: chat, coding, long answers, agents, synthetic data, local assistants. But if the workload is mostly huge prompt + short answer, then prompt processing is still the bottleneck. People are mentioning around 1.5x to 1.8x faster token generation in some setups. My question is: how useful is this overall in real local AI workflows? Is MTP going to matter mainly for long generation and agent loops, or will it become a default feature people expect in models?

thefirehacker's tweet photo. This MTP pull request merge is getting more attention than many model drops.

I first noticed MTP while looking at Qwen3.5-0.8B, and now llama.cpp support makes the whole thing more interesting.

My current understanding is that MTP mainly improves token generation, not prompt processing.

So it helps when the model is writing a lot:

chat, coding, long answers, agents, synthetic data, local assistants.

But if the workload is mostly huge prompt + short answer, then prompt processing is still the bottleneck.

People are mentioning around 1.5x to 1.8x faster token generation in some setups.

My question is: how useful is this overall in real local AI workflows?

Is MTP going to matter mainly for long generation and agent loops, or will it become a default feature people expect in models?

0

3

77

ShichanBoo08 retweeted

FireHacker

@thefirehacker

27 days ago

New UI Preview feature on Claude Code is really great. I gave it a screenshot and asked it to make a navbar prettier. Instead of immediately editing CSS, it first asked me to choose a direction: Refined gold pill Sparkle prefix Glow halo around text That is the part I found useful. For frontend work, “make it prettier” is not a coding instruction. It is a taste decision. Claude Code did not jump straight from prompt to diff. It stopped at the subjective layer first. The flow felt like: visual context → design options → human choice → code edit All in a single clean flow.

thefirehacker's tweet photo. New UI Preview feature on Claude Code is really great.

I gave it a screenshot and asked it to make a navbar prettier.
Instead of immediately editing CSS, it first asked me to choose a direction:
Refined gold pill
Sparkle prefix
Glow halo around text
That is the part I found useful.
For frontend work, “make it prettier” is not a coding instruction. It is a taste decision.
Claude Code did not jump straight from prompt to diff. It stopped at the subjective layer first.
The flow felt like:
visual context → design options → human choice → code edit All in a single clean flow.

0

6

4

3

55

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

This is one of the most crucial lessons in First Break AI. It teaches you how to navigate @huggingface like a pro. Not just: download model → run notebook → move on In this lesson, we go deeper. We look at how open model repos are structured, how to read model files, how config.json connects to the actual model class, and how to trace from a Hugging Face model page into the Transformers code that runs the model. We use Qwen3-0.6B as the learning model. We also look at why Markdown matters so much in AI workflows: model cards, GitHub issues, README files, Discord, Cursor, Claude Code, planning docs, and AI-assisted work. Then comes the biggest win: datasets. Working with datasets is a core AI engineering skill. I show 3 ways to analyze datasets on Hugging Face: Croissant endpoint Data Studio / browser viewer load_dataset with Python, pandas, and plots We inspect dataset structure, categories, response lengths, distribution, short examples, long examples, and how to think about dataset quality before using it for training or fine-tuning. And this sets up the next part: running Qwen3 directly in C, without treating Transformers as magic. Lesson 01: Hugging Face Beyond Upload Watch: https://t.co/GF8ZCNk5WN Free cohort: https://t.co/0H4qIVOpGj

0

10

5

3

273

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

First Break AI https://t.co/xLmTIU0rq6 Cohort: 1 May 2026 — 30 June 2026 (2 months) 3⃣ intuitions that make LLMs click: 🗿The model is a pipeline For Qwen3-0.6B: Input → text embeddings → Qwen3DecoderLayer ×28 → RMSNorm → lm_head → output From far away, it looks simple. Most of the intelligence is inside the repeated decoder layers. 🗿 LLMs generate one token at a time. They are causal autoregressive models. At inference time, the model sees the entire context so far, but it cannot see future tokens. The loop is: current context → predict next token → append token → new context → repeat So the model does not produce the full answer in one shot. It keeps extending the sequence one token at a time. Each new token becomes part of the context for the next prediction. During training, the full sequence can be passed in at once, but a causal mask prevents each token from looking ahead. 🗿 The model does not directly output one word. At every step, it outputs probabilities over the full vocabulary. For Qwen3-0.6B, that vocabulary is 151,936 possible tokens. Decoding then chooses the next token.

thefirehacker's tweet photo. First Break AI
https://t.co/xLmTIU0rq6

Cohort: 1 May 2026 — 30 June 2026 (2 months)

3⃣ intuitions that make LLMs click:

🗿The model is a pipeline
For Qwen3-0.6B:
Input → text embeddings → Qwen3DecoderLayer ×28 → RMSNorm → lm_head → output

From far away, it looks simple.
Most of the intelligence is inside the repeated decoder layers.

🗿 LLMs generate one token at a time.
They are causal autoregressive models.
At inference time, the model sees the entire context so far, but it cannot see future tokens.
The loop is:
current context → predict next token → append token → new context → repeat
So the model does not produce the full answer in one shot.

It keeps extending the sequence one token at a time. Each new token becomes part of the context for the next prediction.

During training, the full sequence can be passed in at once, but a causal mask prevents each token from looking ahead.

🗿 The model does not directly output one word.
At every step, it outputs probabilities over the full vocabulary. For Qwen3-0.6B, that vocabulary is 151,936 possible tokens. Decoding then chooses the next token.

0

3

2

63

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

“Once you start learning about stuff , the density of accessible information increases in an extremely literal sense: You are able to engage with more of the world than you were before, even though the amount of physical world around you has not changed” - Quote from “How I read ” This is a great way to put into words my reaction when I discover a new perspective. One word in Mumbai slang describes this perfectly: “Aaila.” Link to the Original post: https://t.co/S37sXaXobC

1

5

3

42

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

- Gothically gorgeous - Hauntingly beautiful

0

4

3

56

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

Only 5 hour limits are doubled which means weekly limits will hit even faster 👽🤣

0

6

4

3

79

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

Cost/Perf tradeoffs & Evals are the most requested topics for this cohort. I was not expecting these to make top 3. Real life signals are always different from my assumptions.

1

6

5

4

73

ShinchaBoo @ShichanBoo08

about 1 month ago

@thefirehacker @fetchlens How to apply for free GPU credits on FirstBreak AI?

0

1

0

20

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 1 month ago

Moved my site to a custom domain. Google traffic vanished overnight. Google had picked my old *.pages.dev URL as the canonical and was treating my real domain as a copy. 20-minute fix with Claude + 1 git command nobody talks about.

2

6

4

90

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 2 months ago

I found this feature in cursor today "Generate commit message" , very useful. don't have prompt CC or Cursor " generate commit msg for staged code"

thefirehacker's tweet photo. I found this feature in cursor today "Generate commit message" , very useful.

don't have prompt CC or Cursor " generate commit msg for staged code" https://t.co/EGPrsvFnHc

3

4

3

39

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 2 months ago

We need more real LLM training case studies. If you’ve seen or shared actual training runs, drop them here 👇 💡 We’re collecting these and turning them into structured breakdowns that are easy to learn and apply. Reading the Curves: How real LLMs learn, spike, recover, and stabilize. 👉 https://t.co/uovuNxS0le The Marin pretraining run is now part of our pre-cohort blog + office hours. During the cohort, we’ll keep collecting and publishing real case studies across open-source models and experiments. Cohort: 1 May — 30 June (2 months)

thefirehacker's tweet photo. We need more real LLM training case studies.
If you’ve seen or shared actual training runs, drop them here 👇
💡 We’re collecting these and turning them into structured breakdowns that are easy to learn and apply.

Reading the Curves:
How real LLMs learn, spike, recover, and stabilize.

👉 https://t.co/uovuNxS0le
The Marin pretraining run is now part of our pre-cohort blog + office hours.

During the cohort, we’ll keep collecting and publishing real case studies across open-source models and experiments.

Cohort: 1 May — 30 June (2 months)

1

5

2

200

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 2 months ago

First Break AI Your first break in AI — a guided journey from first commit to capstone Free, open cohort to upskill in training, inference, and AI product building. https://t.co/xLmTIU0rq6 Easy to follow Roadmap & AI Podcast guided journey are up. Weekly office hours (Friday) Join Discord Server: https://t.co/ailadSswLY

2

9

4

242

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 2 months ago

I shared a LinkedIn post this morning, https://t.co/kN6q3yGD5O showed me exactly what LinkedIn did behind the scenes: → LinkedInBot: 5 visits → robots.txt, /, image preview → All confirmed scraper category GA (Google Analytics) + Plausible showed: 0 visits. Because bots don't run JavaScript. This is why I'm building https://t.co/kN6q3yGD5O — every site has this traffic, almost no one can see it. One script tag or middleware line. WordPress, Shopify, Next.js, anything with @fetchlens Scan site for free at https://t.co/kN6q3yGD5O

thefirehacker's tweet photo. I shared a LinkedIn post this morning, https://t.co/kN6q3yGD5O showed me exactly what LinkedIn did behind the scenes:

→ LinkedInBot: 5 visits
→ robots.txt, /, image preview
→ All confirmed scraper category

GA (Google Analytics) + Plausible showed: 0 visits. Because bots don't run JavaScript.

This is why I'm building https://t.co/kN6q3yGD5O — every site has this traffic, almost no one can see it.

One script tag or middleware line. WordPress, Shopify, Next.js, anything with @fetchlens

Scan site for free at https://t.co/kN6q3yGD5O

1

8

4

3

108

ShichanBoo08 retweeted

SJ @Quantum_Pandit

about 2 months ago

Scanned my domain using this free tool — shows which AI agents are hitting your site at the server layer, before JS even loads: https://t.co/drawtkbYTU — no signup needed, takes 10 seconds.

0

4

2

39

ShinchaBoo @ShichanBoo08

about 2 months ago

@pmitu Built FetchLens because my own site was getting crawled by ChatGPT, Claude, and Perplexity — and my analytics showed zero. Now I can see all of it, in real time. https://t.co/LLOe5jegA4

0

3

0

25

ShichanBoo08 retweeted

FireHacker

@thefirehacker

about 2 months ago

Build blogs on Next js & cloudflare that can be easily consumed by AI Agents. Discover AI Agentics activity on your content using https://t.co/7vgJReMxF7 AI Agents visits for sites on Cloudflare Pages are now on https://t.co/7vgJReMxF7 dashbaord. Both Next js & Cloudflare pages are supported https://t.co/4aVNqJm3Mo is an AI guided open cohort. Learners can listen to AI podcast use transcripts and instructions to move ahead with road map. Entire AI agents activity is shown on https://t.co/vnLHBBtJe6

thefirehacker's tweet photo. Build blogs on Next js & cloudflare that can be easily consumed by AI Agents. Discover AI Agentics activity on your content using https://t.co/7vgJReMxF7

AI Agents visits for sites on Cloudflare Pages are now on https://t.co/7vgJReMxF7 dashbaord. Both Next js & Cloudflare pages are supported

https://t.co/4aVNqJm3Mo is an AI guided open cohort. Learners can listen to AI podcast use transcripts and instructions to move ahead with road map. Entire AI agents activity is shown on https://t.co/vnLHBBtJe6

0

5

3

4

60

ShinchaBoo

@ShichanBoo08

Last Seen Users on Sotwe

Trends for you

Most Popular Users