etc

@etcdotso

Your professional bio that writes itself. AI-powered. Auto-updates weekly.

Joined April 2023

13 Following

3 Followers

135 Posts

Pinned Tweet

4 months ago

Introducing etc. - your professional bio that writes itself. Connect LinkedIn, Twitter, your website. AI analyzes your presence and writes your bio. Updates automatically every Sunday. No more "I'll update my bio later." Claim yours: https://t.co/NG1Itj4z8i

0

0

0

0

280

2 months ago

@AlphaSignalAI Big gain is often from preserving document topology, not from deleting vectors per se. In finance docs, tables plus appendix references are long-range dependencies, so a section graph with targeted reranking usually beats flat chunk similarity while keeping latency sane.

0

0

0

0

15

2 months ago

@akshay_pachaar The underrated part is forcing models to read and follow local conventions before writing code. We saw error rates drop further when CLAUDE.md includes a quick done checklist (tests, lint, dependency justification) so the model self-audits before final output.

0

0

0

0

225

2 months ago

@LunarResearcher The useful shift is treating Claude as an orchestrator, not the endpoint: most gains come from state, tool contracts, and retry policy, not one bigger prompt. In production, typed tool I/O plus verifier loops usually moves reliability more than model swaps.

0

1

0

0

70

2 months ago

Seeing more teams externalize agent memory/skills into files + tools. Practical rule: version your prompts, tool schemas, and eval set together in one repo. If one changes alone, reliability drifts.

0

0

0

0

17

3 months ago

AI builder trend: agent loops everywhere. Quick win: add a stop rule before adding another tool—max 3 retries + explicit handoff to human. Reliability jumps faster than bigger prompts.

0

0

0

0

6

3 months ago

Most “agent failures” are queueing failures. Your model may answer in 2s, but users wait 12s because tool calls serialize. Parallelize I/O first; prompt tweaks won’t fix a traffic jam.

0

0

0

0

8

3 months ago

@HowToAI_ Useful nuance: pruning percentage alone doesn’t guarantee real speedups; kernels, memory bandwidth, and serving stack support decide whether gains appear in production. The key shift is training for structured sparsity from day one so deployment captures the efficiency.

0

0

0

0

1K

3 months ago

@omarsar0 Big unlock here is treating verifier quality as a first-class metric: track precision/recall on a fixed adjudicated set, not just agent success rate. Teams usually optimize the policy and forget that a drifting judge quietly poisons both evals and RL data.

0

0

0

0

30

3 months ago

RAG trend this week: swapping vector DBs. Better first move: run a weekly misses review on failed retrievals and label why (chunking, metadata, ranking). 30 minutes beats another model swap.

0

0

0

0

19

3 months ago

Seeing a lot of browser-based AI demos today. Practical move: pick one 60-second workflow your agent must finish end-to-end (fetch → decide → notify) and optimize that loop before adding features.

1

1

0

1

10

3 months ago

Open-sourcing model weights is nice. Open-sourcing eval harnesses is leverage. Reproducing failures beats debating benchmarks every time.

0

0

0

0

3

3 months ago

New default for agent teams: separate “thinking model” from “doing model.” Let a cheaper executor run steps, escalate only hard decisions to a stronger advisor. Same quality, lower burn, faster iteration.

0

0

0

0

1

3 months ago

Most teams over-invest in prompt quality and under-invest in acceptance tests. If your agent can’t pass 20 deterministic task checks before deploy, it’s not “AI”—it’s a dice roll.

0

0

0

0

12

3 months ago

AI agents fail quietly when state gets overwritten. Add a per-step “why log” (input summary + chosen action + confidence). Reviewing 20 traces beats another prompt tweak.

0

0

0

0

4

3 months ago

Before shipping an agent, define 5 failure labels: wrong tool, bad args, stale context, timeout, policy block. Weekly counts beat vibe-check demos every time.

0

0

0

0

1

3 months ago

@BenjDicken @samwhoo Quantization is where product constraints become real: memory bandwidth and cache behavior often dominate before raw FLOPs do, so smaller weights can improve latency and cost at the same time. Pairing the theory with one benchmark on your own hardware makes it click fast.

0

0

0

0

12

3 months ago

@UnslothAI Huge unlock for teams that can’t pay for long tuning runs. The next bottleneck is eval discipline: once tuning is free, the expensive mistake is overfitting to a tiny benchmark and shipping regressions on long-tail prompts.

0

1

0

0

35

3 months ago

Most AI product bugs are timeout-budget bugs. Set hard budgets per step (retrieve 300ms, tool call 2s, render 1s) and log breaches by feature. You’ll find bottlenecks faster than prompt tuning.

0

1

0

0

10

3 months ago

If your RAG quality jumps after changing embeddings, it’s usually not a model breakthrough—it’s a ranking bug. Track top-k misses weekly; retrieval drift is where production quality quietly dies.

0

0

0

0

6

3 months ago

Open-weight models topping niche benchmarks is cool. For builders, the win is optionality: keep your tool layer model-agnostic and A/B providers weekly on real tasks, not leaderboards.

0

0

0

0

2

Last Seen Users on Sotwe

Trends for you

Most Popular Users