Luciano

@CurArchTrack

Current Archetype Tracker

Joined March 2022

10K Following

13.7K Followers

5.9K Posts

Pinned Tweet

Luciano @CurArchTrack

over 3 years ago

#ArchSig V3 Visual Aid for Narrative Warfare Framing. Story : Capitalism is moving into Win-Win Domains for self-preservation purposes, using #NFTs and Decentralization to build better sensemaking. Value, for example, has a ton of space for Win-Win capitalization development.

CurArchTrack's tweet photo. #ArchSig V3 Visual Aid for Narrative Warfare Framing. Story : Capitalism is moving into Win-Win Domains for self-preservation purposes, using #NFTs and Decentralization to build better sensemaking. Value, for example, has a ton of space for Win-Win capitalization development. https://t.co/GidtFgOqxi

39

330

68

24

0

Luciano @CurArchTrack

2 days ago

uzemi

CurArchTrack's tweet photo. uzemi https://t.co/xKDInj5iO6

0

11

1

2

96

Luciano @CurArchTrack

2 days ago

tortekka

CurArchTrack's tweet photo. tortekka https://t.co/RAyoD9JKxr

1

7

0

0

103

Luciano @CurArchTrack

2 days ago

imanite

CurArchTrack's tweet photo. imanite https://t.co/NL6QcLyV6p

0

20

1

2

207

Who to follow

Verified account

Hopeless dreamer running on AI and caffine

@dalleOnlyJesus

𝘽𝙡𝙞𝙣𝘿 𝙄𝙢𝙖𝙜𝙚𝙧𝙮

Works from the back of the mind, the beautiful and the macabre reflected. Mixed Media Artist | 3D Print Products | Art Prints | Commissions NFTNYC 23/24/25

Luciano @CurArchTrack

3 days ago

companions

CurArchTrack's tweet photo. companions https://t.co/8gQhvxydyf

1

15

3

1

140

Luciano @CurArchTrack

3 days ago

larger drones

CurArchTrack's tweet photo. larger drones https://t.co/TaAMnT0iTs

1

14

3

0

221

Luciano @CurArchTrack

3 days ago

your voice is the one ring // you complex // story

CurArchTrack's tweet photo. your voice is the one ring // you complex // story https://t.co/arWfwoDAhB

1

5

0

2

109

Luciano @CurArchTrack

4 days ago

@LoovaAI thank Loova!

0

0

0

0

10

Luciano @CurArchTrack

4 days ago

node cleric

CurArchTrack's tweet photo. node cleric https://t.co/fUuJrebMop

4

42

6

2

495

Luciano @CurArchTrack

4 days ago

@JayKay65220066 😎👊💚

0

0

0

0

15

Luciano @CurArchTrack

4 days ago

@Delerat7 thx Sean 💚

0

1

0

0

15

CurArchTrack retweeted

Carlos E. Perez

5 days ago

We've been sold a lie: 'better model = better agent' But frontier teams see something different: → GPT-5 still fails on 60% of long coding tasks → Same model + better harness = 10× improvement → No new weights required The bottleneck isn't intelligence. It's infrastructure." "This is called the 'binding constraint thesis': Your agent's ceiling = MIN(model capability, harness quality) Right now? The harness is the binding constraint. Think of it like this: a Ferrari engine in a go-kart frame. That's your GPT-5 wrapped in a prompt string." "Production teams don't think in 'prompts.' They think in 7 infrastructure layers: • Execution (sandboxes) • Tools (protocols) • Context (memory) • Lifecycle (orchestration) • Observability (ops) • Verification (eval) • Governance (security) This is ETCLOVG. Your new mental model." Layer 1: Execution Environment Your agent needs a sandbox that can't be escaped. Poor harness: Agent runs arbitrary code → prompt injection → game over Good harness: Docker/microVM isolation + reset on failure OpenHands gained +13.7pp on benchmarks from sandbox design alone." Layer 3: Context & Memory Models 'lose information in the middle' (U-shaped attention). Poor harness: Dumps everything into one 100K token context Good harness: Short-term (scratch) Mid-term (KV-cache hits 70%+) Long-term (vector retrieval) Cost drops 30-90%." Layer 6: Verification You can't improve what you can't measure. Poor harness: 'It failed. Try again?' Good harness: Outcome metrics (did it work?) Trajectory analysis (where did it break?) Attribution (model vs. tool vs. context?) Turn failures into regression tests. Layer 7: Governance The forgotten layer. Also the most dangerous. Poor harness: Agent has root access to everything Good harness: Declarative permissions (YAML constitutions) Audit trails Human-in-the-loop hooks Anthropic's Claude now ships with constitutional AI baked in." Every harness faces 3 fundamental trade-offs: Cost ↔ Quality ↔ Speed (pick 2) Capability ↔ Control (more power = more risk) Harness Coupling (fix one layer, break another) Great teams engineer across these tensions, not around them." Here's the 80/20: KV-cache-aware context design = biggest bang for buck. → Stable prompt prefixes → Append-only logs → Deterministic serialization One team reported 10× cost reduction from reordering their prompt structure. Same model. Same task. Different harness. Hot take: As models get better, your harness should get simpler. Right now we over-engineer because models are weak. Future winners will: Delete scaffolding Trust the model more Focus governance/observability The best harness is the one you don't need. Why doesn't research talk about this? Because: Papers measure models, not systems Harness code is messy/proprietary No shared vocabulary (until now) Meanwhile practitioners at OpenAI/Anthropic quietly ship harness gains that dwarf model upgrades. If you're building agents: Map your stack to ETCLOVG (find the gaps) Instrument observability first (you're flying blind) Harden your sandbox (prompt injection is real)

IntuitMachine's tweet photo. We've been sold a lie: 'better model = better agent'

But frontier teams see something different:

→ GPT-5 still fails on 60% of long coding tasks
→ Same model + better harness = 10× improvement
→ No new weights required

The bottleneck isn't intelligence. It's infrastructure."

"This is called the 'binding constraint thesis':

Your agent's ceiling = MIN(model capability, harness quality)

Right now? The harness is the binding constraint.

Think of it like this: a Ferrari engine in a go-kart frame.
That's your GPT-5 wrapped in a prompt string."

"Production teams don't think in 'prompts.'

They think in 7 infrastructure layers:

• Execution (sandboxes)
• Tools (protocols)
• Context (memory)
• Lifecycle (orchestration)
• Observability (ops)
• Verification (eval)
• Governance (security)

This is ETCLOVG. Your new mental model."

Layer 1: Execution Environment
Your agent needs a sandbox that can't be escaped.

Poor harness: Agent runs arbitrary code → prompt injection → game over

Good harness: Docker/microVM isolation + reset on failure

OpenHands gained +13.7pp on benchmarks from sandbox design alone."

Layer 3: Context & Memory
Models 'lose information in the middle' (U-shaped attention).
Poor harness: Dumps everything into one 100K token context
Good harness:
Short-term (scratch)

Mid-term (KV-cache hits 70%+)

Long-term (vector retrieval)
Cost drops 30-90%."

Layer 6: Verification

You can't improve what you can't measure.

Poor harness: 'It failed. Try again?'

Good harness:

Outcome metrics (did it work?)
Trajectory analysis (where did it break?)

Attribution (model vs. tool vs. context?)

Turn failures into regression tests.

Layer 7: Governance

The forgotten layer. Also the most dangerous.
Poor harness: Agent has root access to everything
Good harness:
Declarative permissions (YAML constitutions)

Audit trails
Human-in-the-loop hooks

Anthropic's Claude now ships with constitutional AI baked in."

Every harness faces 3 fundamental trade-offs:

Cost ↔ Quality ↔ Speed (pick 2)

Capability ↔ Control (more power = more risk)
Harness Coupling (fix one layer, break another)
Great teams engineer across these tensions, not around them."

Here's the 80/20:

KV-cache-aware context design = biggest bang for buck.
→ Stable prompt prefixes
→ Append-only logs
→ Deterministic serialization

One team reported 10× cost reduction from reordering their prompt structure.

Same model. Same task. Different harness.

Hot take: As models get better, your harness should get simpler.

Right now we over-engineer because models are weak.

Future winners will:

Delete scaffolding
Trust the model more
Focus governance/observability
The best harness is the one you don't need.

Why doesn't research talk about this?

Because:

Papers measure models, not systems
Harness code is messy/proprietary
No shared vocabulary (until now)

Meanwhile practitioners at OpenAI/Anthropic quietly ship harness gains that dwarf model upgrades.

If you're building agents:

Map your stack to ETCLOVG (find the gaps)
Instrument observability first (you're flying blind)
Harden your sandbox (prompt injection is real)

4

60

12

60

3K

Luciano @CurArchTrack

4 days ago

doors wide open

CurArchTrack's tweet photo. doors wide open https://t.co/WYl8RF2CTa

1

25

1

0

271

Luciano @CurArchTrack

4 days ago

old costumes

CurArchTrack's tweet photo. old costumes https://t.co/qkMwc3wnkw

0

10

2

0

110

CurArchTrack retweeted

░ perfectloop ░

5 days ago

𝙺𝚒𝚛𝚋𝚢 - 𝙿𝚊𝚒𝚗𝚝 🖌️

34

13K

2K

2K

214K

CurArchTrack retweeted

6 days ago

NVIDIA just released a quantized Qwen3.6 MoE model on Hugging Face 35B total, 3B active parameters NVFP4 shrinks memory ~3x with near-zero accuracy loss

HuggingPapers's tweet photo. NVIDIA just released a quantized Qwen3.6 MoE model on Hugging Face

35B total, 3B active parameters

NVFP4 shrinks memory ~3x with near-zero accuracy loss https://t.co/7UOF7rBz01

24

965

79

726

57K

CurArchTrack retweeted

11 days ago

so yesterday i dropped the bench numbers and what fits. today is the actual agent running on this 10 year old gpu card. qwen3 8b q4_k_m on a gtx 1080 8gb. hermes agent loaded with full tool set, browser controls live, nvtop pinned at 100% gpu 7.5gb of 8gb vram occupied. the unsloth weights pulled directly from huggingface, q4 quant, llama.cpp built for sm_61 (the pascal compute capability that everyone forgot exists). 31 tok/s gen speed, faster than most people read. this is what happens after the bench. raw perf was the receipt for what fits. now we test what actually works. agent loops, tool calls, real coding tasks coming next. ten year old card, $150 used, running a current open weight model with a current agent. nothing exotic. just the right quant, the right kv cache trick, the right engine compiled for the right arch. tell me what gpu you have, i'll tell you what runs.

sudoingX's tweet photo. so yesterday i dropped the bench numbers and what fits. today is the actual agent running on this 10 year old gpu card.

qwen3 8b q4_k_m on a gtx 1080 8gb. hermes agent loaded with full tool set, browser controls live, nvtop pinned at 100% gpu 7.5gb of 8gb vram occupied. the unsloth weights pulled directly from huggingface, q4 quant, llama.cpp built for sm_61 (the pascal compute capability that everyone forgot exists). 31 tok/s gen speed, faster than most people read.

this is what happens after the bench. raw perf was the receipt for what fits. now we test what actually works. agent loops, tool calls, real coding tasks coming next.

ten year old card, $150 used, running a current open weight model with a current agent. nothing exotic. just the right quant, the right kv cache trick, the right engine compiled for the right arch.

tell me what gpu you have, i'll tell you what runs.

sudoingX's tweet photo. so yesterday i dropped the bench numbers and what fits. today is the actual agent running on this 10 year old gpu card.

qwen3 8b q4_k_m on a gtx 1080 8gb. hermes agent loaded with full tool set, browser controls live, nvtop pinned at 100% gpu 7.5gb of 8gb vram occupied. the unsloth weights pulled directly from huggingface, q4 quant, llama.cpp built for sm_61 (the pascal compute capability that everyone forgot exists). 31 tok/s gen speed, faster than most people read.

this is what happens after the bench. raw perf was the receipt for what fits. now we test what actually works. agent loops, tool calls, real coding tasks coming next.

ten year old card, $150 used, running a current open weight model with a current agent. nothing exotic. just the right quant, the right kv cache trick, the right engine compiled for the right arch.

tell me what gpu you have, i'll tell you what runs.

sudoingX's tweet photo. so yesterday i dropped the bench numbers and what fits. today is the actual agent running on this 10 year old gpu card.

qwen3 8b q4_k_m on a gtx 1080 8gb. hermes agent loaded with full tool set, browser controls live, nvtop pinned at 100% gpu 7.5gb of 8gb vram occupied. the unsloth weights pulled directly from huggingface, q4 quant, llama.cpp built for sm_61 (the pascal compute capability that everyone forgot exists). 31 tok/s gen speed, faster than most people read.

this is what happens after the bench. raw perf was the receipt for what fits. now we test what actually works. agent loops, tool calls, real coding tasks coming next.

ten year old card, $150 used, running a current open weight model with a current agent. nothing exotic. just the right quant, the right kv cache trick, the right engine compiled for the right arch.

tell me what gpu you have, i'll tell you what runs.

sudoingX's tweet photo. so yesterday i dropped the bench numbers and what fits. today is the actual agent running on this 10 year old gpu card.

qwen3 8b q4_k_m on a gtx 1080 8gb. hermes agent loaded with full tool set, browser controls live, nvtop pinned at 100% gpu 7.5gb of 8gb vram occupied. the unsloth weights pulled directly from huggingface, q4 quant, llama.cpp built for sm_61 (the pascal compute capability that everyone forgot exists). 31 tok/s gen speed, faster than most people read.

this is what happens after the bench. raw perf was the receipt for what fits. now we test what actually works. agent loops, tool calls, real coding tasks coming next.

ten year old card, $150 used, running a current open weight model with a current agent. nothing exotic. just the right quant, the right kv cache trick, the right engine compiled for the right arch.

tell me what gpu you have, i'll tell you what runs.

62

232

23

143

51K

Luciano @CurArchTrack

13 days ago

@helmortart Midjourney 8.1 hd, using a pcode, 1000 stylize, a simple prompt and 2x image refs, one of which is definitely from their niji models, probably 6 if I had to guess. Niji plays well with 8.1

1

1

0

0

50

Luciano @CurArchTrack

14 days ago

breath of hands

CurArchTrack's tweet photo. breath of hands https://t.co/lq4eOhZop2

2

5

0

1

88

Luciano @CurArchTrack

13 days ago

@helmortart Thank you Hel Mort

0

1

0

0

5

Luciano @CurArchTrack

14 days ago

conspiracies

CurArchTrack's tweet photo. conspiracies https://t.co/8ERchpVVxl

3

8

0

1

94

Luciano @CurArchTrack

14 days ago

#nowplaying Jaguar Jungle (Visualizer) // Low Poly Ethereal Intelligent | Drum and ... https://t.co/q4U5snDAEO

0

0

0

0

94

Luciano @CurArchTrack

14 days ago

sh1t for brains

CurArchTrack's tweet photo. sh1t for brains https://t.co/y6mkPzCuxe

1

4

0

0

106

Last Seen Users on Sotwe

Trends for you

Most Popular Users