Dat Ngo @dat_attacked - Twitter Profile

about 9 hours ago

if i ever needed a resume, this isn't far from it
nice to see the math / classic ml on here, as some foundational parts
my biggest weakness is model training and deep understanding of transformer architectures
time to explore weaker areas of my understanding

Rohit Ghumare

@ghumare64

1 day ago

As an AI Engineer. Please learn:
Harness engineering, not just prompt engineering
Context engineering, not just long prompts
Prompt caching vs. semantic caching tradeoffs
KV cache management, eviction, reuse, and memory pressure at scale
Prefill vs. decode latency and why they optimize differently
Continuous batching, paged attention, and throughput optimization
Speculative decoding vs. quantization vs. distillation tradeoffs
INT8, INT4, FP8, AWQ, GPTQ, and when quantization hurts quality
Structured output failures, schema validation, repair loops, and fallback chains
Function calling reliability, tool contracts, argument validation, and idempotency
Agent guardrails, loop budgets, tool budgets, and termination conditions
Model routing, graceful fallback logic, and degraded-mode UX
RAG architecture: chunking, embeddings, hybrid search, reranking, and freshness
Retrieval evals: recall, precision, grounding, attribution, and citation quality
Evals: golden sets, regression tests, adversarial tests, LLM-as-judge, and human evals
LLM observability as a first-class discipline: traces, spans, tokens, latency, errors, and drift
Cost attribution per feature, workflow, tenant, and user journey, not just per model
Safety engineering: prompt injection defense, data leakage prevention, and permission boundaries
Multi-tenant isolation, cache safety, and cross-user context contamination prevention
Fine-tuning vs. in-context learning vs. RAG vs. distillation and when each is the wrong tool
Latency, quality, cost, and reliability tradeoffs across the full inference stack
Production failure modes: hallucinated tool calls, malformed JSON, stale retrieval, runaway agents, and silent eval regressions
Shipping LLM systems as reliable infrastructure, not demos wrapped around prompts
https://t.co/OhK9MK04ld

2

408

67

545

18K

0

1

0

47

dat_attacked retweeted

Patrick Loeber

@patloeber

about 22 hours ago

new skills repo from deepmind to speed up agentic scientific workflows https://t.co/BwpWjX3jN1

1

31

2

14

1K

Dat Ngo

@dat_attacked

about 15 hours ago

Every AI event I go to I love to see this girl! One of the homies for years.

Thanks for coming out to @arizeai Observe Conf

@temporalio is lucky to have ya!

@MelGoesTech @belizsoyak https://t.co/LHTgw47Ubw

0

5

1

0

202

Dat Ngo

@dat_attacked

about 20 hours ago

@arizeai Observe conf keynote, always loved our open source DNA

@mikeldking and @ArizePhoenix with sandbox launch!

@aparnadhinak and @jason_lopatecki kicking off with harness debugging, sandbox observability, and bring your own harnesses https://t.co/QgTwO5qDJZ

1

3

0

20

Dat Ngo

@dat_attacked

about 20 hours ago

Gahhh, gotta say hi to the home girl @MelGoesTech and congratulate her on her marriage!!

Mel 🦋

@MelGoesTech

about 21 hours ago

Keynote started by co-founders of @arizeai feat @dat_attacked as emcee

0

2

0

165

1

0

70

Dat Ngo

@dat_attacked

1 day ago

using @claude design, and it had the audacity to say this

0

32

Dat Ngo

@dat_attacked

1 day ago

@cyrusnewday haha had to quote as well!

0

1

0

41

Dat Ngo

@dat_attacked

1 day ago

s/o to all my short kings before spawn, spent those attribution points on entrepreneurship rather than height 😂

jaiya

@jaiyagill

1 day ago

before you invest in a startup always ask how tall the founders are

698

8K

372

2K

1M

0

1

0

68

Dat Ngo

@dat_attacked

1 day ago

@mada299 @swyx @aiDotEngineer this is 100% true. learning first hand, GTM is taste + engineering

0

1

0

171

dat_attacked retweeted

Aparna Dhinakaran

@aparnadhinak

2 days ago

https://t.co/Ieh2TAdfPX

10

217

27

427

20K

Dat Ngo

@dat_attacked

4 days ago

insert dj khaled meme

0

2

0

32

dat_attacked retweeted

Aparna Dhinakaran

@aparnadhinak

7 days ago

https://t.co/CXxFy1lzzF

13

894

121

2K

74K

Dat Ngo

@dat_attacked

4 days ago

this might sound bearish for the current AI cycle we are in, but yes we did unlock language in the ai space, but don't forget to zoom out
before language models we did a pretty damn good job with predictive modeling for classification / regression tasks
but we still have a few pretty hard TODOs to go tho
physics is not solved
world models are not solved
causality is not solved
embodiment is not solved
coordination is not solved
governance is not solved
language was a major unlock
but language is not all of reality
/rant

0

1

0

23

dat_attacked retweeted

elvis

@omarsar0

11 days ago

New research from Microsoft Research

I see a lot of AI engineers handwriting agent skill docs and hoping they generalize.

Probably not optimal. This work shows why.

It treats the skill doc as a trainable external state of a frozen agent instead.

It introduces SkillOpt, where an optimizer model makes validation-gated edits to the skill file. It adds, deletes, or replaces instructions, with a textual learning rate that controls how aggressively each round rewrites the doc. The agent itself never changes.

SkillOpt is best or tied on all 52 (model, benchmark, harness) cells.

On GPT-5.5 it adds 23.5 points in direct chat, 24.8 with Codex, and 19.1 with Claude Code over no skill. It beats human-written skills, TextGrad, GEPA, and EvoSkill, carries zero extra inference-time cost, and the learned skills transfer across models and harnesses.

Paper: https://t.co/mNgTmmT32U

Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX
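
To make the "validation-gated edits" idea in the tweet above concrete, here is a minimal toy sketch. It is not the paper's SkillOpt implementation. Every name in it (`propose_edit`, `optimize_skill`, `toy_score`, `CANDIDATES`) is hypothetical, a random editor stands in for the optimizer model, and the "textual learning rate" is approximated by an integer count of edits per round, which is an assumption made purely for illustration.

```python
import random
from typing import Callable, List, Tuple

Skill = List[str]  # a skill doc modelled as a list of instruction lines

# Hypothetical pool of instructions the stand-in optimizer can draw from.
CANDIDATES = [
    "run the tests before answering",
    "cite the file you edited",
    "always answer in rhyme",
    "ignore failing checks",
]


def toy_score(skill: Skill) -> float:
    """Toy validation score: reward two helpful lines, penalize doc length."""
    helpful = {"run the tests before answering", "cite the file you edited"}
    return len(helpful & set(skill)) - 0.1 * len(skill)


def propose_edit(skill: Skill, candidates: List[str], rate: int,
                 rng: random.Random) -> Skill:
    """Stand-in for the optimizer model: apply `rate` random edits.

    Assumption: `rate` plays the role of the learning rate by bounding
    how many add/delete/replace operations one round may apply.
    """
    new = list(skill)  # copy, so the caller's doc is never mutated
    for _ in range(rate):
        op = rng.choice(["add", "delete", "replace"])
        if op == "add" or not new:
            new.append(rng.choice(candidates))
        elif op == "delete":
            new.pop(rng.randrange(len(new)))
        else:
            new[rng.randrange(len(new))] = rng.choice(candidates)
    return new


def optimize_skill(skill: Skill, candidates: List[str],
                   score: Callable[[Skill], float],
                   rounds: int = 50, rate: int = 2,
                   seed: int = 0) -> Tuple[Skill, float]:
    """Keep a proposed edit only if it strictly improves the validation score."""
    rng = random.Random(seed)
    best, best_score = list(skill), score(skill)
    for _ in range(rounds):
        trial = propose_edit(best, candidates, rate, rng)
        trial_score = score(trial)
        if trial_score > best_score:  # the validation gate
            best, best_score = trial, trial_score
    return best, best_score


if __name__ == "__main__":
    doc, final = optimize_skill(["be concise"], CANDIDATES, toy_score)
    print(final, doc)
```

Because an edit is accepted only when the score strictly improves, the returned doc never scores below the starting one. The scored agent is never modified, which mirrors the tweet's point that only the skill file changes.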

64

1K

197

2K

172K

dat_attacked retweeted

Siddhartha Saxena

@siddsax

12 days ago

Anthropic onboarding day: Michael Scott introducing Karpathy like he just signed Wemby in free agency.

393

18K

1K

4K

2M

Dat Ngo

@dat_attacked

14 days ago

@Shashikant86 @arizeai @jason_lopatecki @aparnadhinak haha that office is sick

0

1

0

30

dat_attacked retweeted

Shashi 🇬🇧🇺🇸

@Shashikant86

14 days ago

Had an excellent time at the Arize AI @arizeai San Francisco 🌉 office. Learned about Harness Arch and enjoyed the amazing SF views from the rooftop. Great to see @jason_lopatecki @aparnadhinak and the entire Arize AI team!
Missed you @dat_attacked here! https://t.co/aa1Iismlwe

2

5

3

0

546

dat_attacked retweeted

George

@odysseus0z

14 days ago

Honestly I feel they undersold the bench by a lot.
It is basically "how good are models at spec-driven development"
Or as the cool kids nowadays do, "/goal"
It is the GoalBench.
Since /goal is all the rage now, the findings here are super important.
4 key takeaways:
- more reward hacking in bigger tasks
- better models hack less
- more tests don't fix it
- running longer doesn't fix it; sometimes it makes it worse
Yeah so if your model sucks, doing fancier TDD won't save your ass. Having the model run longer won't either.
You only have two levers. Make the task smaller or use a better model.

0

5

2

4

884