iamrobotbear (bk) @iamrobotbear - Twitter Profile

Pinned Tweet

iamrobotbear (bk)

@iamrobotbear

over 2 years ago

lol.

Pat Bergstresser @PatThePM

over 2 years ago

Product Managers when ChatGPT first came out:

6

99

11

2

20K

7

54

23

5

14K

iamrobotbear retweeted

Alex Lieberman

@businessbarista

about 13 hours ago

Completely agree. The way you turn AI into a compounding asset is by codifying your best ways of working and then democratizing it across your company in a way that wasn't possible before. One of the more common requests we get from companies earlier in their AI journey now is to stand up their internal skill library & fill that library with skills that we build through interviews/sessions with top performers.

19

706

41

1K

138K

iamrobotbear retweeted

dan

@irl_danB

1 day ago

everyone is building an agent or a tool you don't want an agent or a tool, you want a reactor I've been working on something cool and I think you'll like it it's simple: an agent session DAG that keeps a declared world-model up to date in an efficient (memoized) render each render node is an agent session: you declare the desired state with OpenProse markdown files once invoked, each agent session acts as the provider. the agent session uses the open source openai-agents-sdk, extensible however you like with any model (I use with opus, sonnet, haiku) the facets of the world-state are memoized, so not every agent has to run on every event, saving you on inference if that sounds a lot like React or dataflow, that's because even in our brave new world the wisdom of the agents holds fast

22

186

15

193

9K

iamrobotbear retweeted

ClaudeDevs

@ClaudeDevs

1 day ago

How do we automate business analytics with Claude? New blog post covering our best practices for skills, data foundations, and evaluations when building agents to perform data analysis: https://t.co/mfEJMAQFBU

75

4K

357

7K

1M

Who to follow

jansennn

@jansennn__

Good from far, far from good ¯\_(ツ)_/¯

JazzyPboy 🥚👺

@JazzyPboy

Quant Consultant | #Bitcoin #Ethereum #Solana | 🇺🇸 🇮🇪 | @ChelseaFC ⚽️ | NFA always DYOR

legume.eth

@Legume_tomb

stuff and things

iamrobotbear retweeted

Ramya Chinnadurai 🚀

@code_rams

1 day ago

https://t.co/rTdmOyG6kV

0

13

1

36

11K

iamrobotbear retweeted

Matt Pocock

@mattpocockuk

about 23 hours ago

Preview of an AI Coding Dictionary I'm shipping later this month AI coding sounds complex (harness, model, agent, tool etc) but it's really not. You just need to understand the terms of engagement.

42

1K

70

1K

104K

iamrobotbear retweeted

Shreya Shankar

@sh_reya

1 day ago

Data agents strike again!! Some more interesting tips on improving the accuracy and alignment of your data agent. One interesting bit I haven’t seen people talk about: “Skill docs describe a data model that changes daily, so without active maintenance they're wrong within weeks. We watched our offline accuracy drift from ~95% at launch to ~65% over a month” …. “Roughly 90% of our data-model PRs now include a skill change in the same diff. We also regularly prune skill scaffolding as models improve and previous failure modes no longer apply.”

9

98

10

103

18K

iamrobotbear retweeted

Aaron Levie

@levie

1 day ago

The jobs data coming out continues to suggest the opposite of what a lot of people had thought would happen. Just take engineering, as the prime example of the area with greatest AI impact (and perceived risk). Most companies now have far more software projects than ever before because of AI, and effectively only engineers are going to be the ones doing that work. You can get by for a while by being non-technical building software, but eventually someone has to understand what the thing is that got built, has to maintain it, has to fix security issues that come up, upgrade the systems beneath it, and so on. That’s all jobs. Now apply that to a number of other job functions. AI is going to cause companies to hire more in sales because agents can let them process more leads and do more customer research. AI will cause an explosion of new marketing roles because of how much more efficient it is to launch campaigns and target. The list goes on. AI is going to have the opposite effect that lots of people thought on jobs.

82

504

78

189

153K

iamrobotbear retweeted

Hiten Shah

@hnshah

3 days ago

This is one of the clearest windows into how engineering is changing right now. The tools are useful. The workflow is the part worth studying. Ideas become plans. Plans become durable context. Agents run in parallel. Voice replaces typing. Notes become memory. Skills turn repeated work into leverage. The human job moves closer to judgment. You steer, react, redirect, and decide what is good enough to keep.

14

298

20

726

68K

iamrobotbear retweeted

Sydney Runkle

@sydneyrunkle

3 days ago

new in deepagents: agent rubrics! you define a rubric, and the agent self-evaluates and iterates until it satisfies every rubric criterion. this is similar to /goal in claude code or codex, but more flexible because grading is conducted by a dedicated subagent that you can tune with a prompt or custom tools.

10

114

12

137

15K

iamrobotbear retweeted

Marktechpost AI

@Marktechpost

3 days ago

TinyFish just open-sourced BigSet — a multi-agent system that builds structured datasets from a single plain-English sentence. You type: "YC companies that are currently hiring engineers, with their funding stage, location, and number of open roles." That's the input. That's it. Here's what actually happens under the hood: 1. Schema Inference (Claude Sonnet via OpenRouter) - Infers column names, data types, and primary keys before any web access 2. Orchestrator Agent (Qwen via OpenRouter) - Runs broad discovery via TinyFish Search to identify which entities exist and where to find them 3. Sub-Agent Fan-Out - One isolated sub-agent per entity, running in parallel - Each agent is capped at 6 tool calls — fetch, search, insert, done - Dataset ID is baked into a JS closure invisible to the LLM — prompt injection can't redirect writes 4. Export - Primary key deduplication across all agents - Source attribution per row - Download as CSV or XLSX The refresh part is what makes it useful long-term. Set it to 30 min, 6 hours, daily, or weekly — the agents re-run automatically. Your dataset stays current without re-running anything manually. I have personally tested BigSet and covered the full setup walkthrough — clone to first dataset — including all env vars, make commands, and the security architecture. Here is the full analysis: https://t.co/lJMVFngeuL GitHub: https://t.co/8dL7kQdsyc @Tiny_Fish #ai #aiagent #dataset

0

14

5

4

225

iamrobotbear retweeted

spacy

@dosco

3 days ago

use perplexity, parallel, google, x search whatever and build this in 5 minutes using DSPy+RLM (ax-agent) https://t.co/y45nH1PCrQ

2

232

14

277

24K

iamrobotbear retweeted

Han Xiao

@hxiao

4 days ago

Sharing a project I've been heavily using - Dataroom. It's a local-first harness that runs deep research with a small language model and gives a zip file at the end. Deep research is becoming an important first step for long-horizon tasks (the 2nd step being implementation), and I believe a small local model in a disciplined harness handles it well - we shouldn't waste frontier-model tokens on it. Dataroom runs on your own GPU at near-zero marginal cost, and it can keep going for hours until the dataroom is genuinely comprehensive, instead of stopping when a metered budget runs out.

hxiao's tweet photo. Sharing a project I've been heavily using - Dataroom. It's a local-first harness that runs deep research with a small language model and gives a zip file at the end. Deep research is becoming an important first step for long-horizon tasks (the 2nd step being implementation), and I believe a small local model in a disciplined harness handles it well - we shouldn't waste frontier-model tokens on it. Dataroom runs on your own GPU at near-zero marginal cost, and it can keep going for hours until the dataroom is genuinely comprehensive, instead of stopping when a metered budget runs out.

8

185

17

250

15K

iamrobotbear retweeted

Prukalpa ✨

@prukalpa

4 days ago

https://t.co/aG4ImLx1C4

13

493

42

1K

684K

iamrobotbear retweeted

Shae McLaughlin

@shae_mcl

4 days ago

For AI-assisted learning to work, we need to figure out how to differentiate between unproductive and productive friction in learning. Unproductive friction = bad textbooks, content pitched at the wrong level, teachers not understanding your world model etc.

6

87

10

35

32K

iamrobotbear retweeted

Barr Yaron

@barrnanas

4 days ago

Another week, another opportunity to fill out this year’s AI Engineering Survey! Raffling off prizes galore https://t.co/R8Vf7Y98Nk @aiDotEngineer @AmplifyPartners @NotionHQ @vercel

0

11

4

3

4K

iamrobotbear retweeted

AVB

@neural_avb

4 days ago

Working on MCP x RLM this week... Through the power of MCP servers, this will get the RLM everything from accessing calculators, calendars, and weather info... to filesystem read/writing, terminal interactions, and web search. Directly inside the REPL.

3

62

5

36

7K

iamrobotbear retweeted

elvis

@omarsar0

4 days ago

Very good advice on self-improving agents. (bookmark it) This is something I am seeing in my own experiments with coding agents and harnesses for long-horizon tasks. What I have found is that stronger models do not always evolve better agents. The current believe in self-evolving agents is that a bigger model writes better prompt and skill edits, so devs put their best model in the evolver seat. New research shows that intuition is mostly wrong. The work separates two abilities that usually get conflated. Producing harness updates stays flat across model capability, so Qwen3.5-9B writes edits roughly as good as Claude Opus 4.6. Benefiting from those updates follows an inverted-U that peaks at mid-tier models, while weak models fail to even activate the edits and strong models have little headroom left. This is important to understand as it tells you where to spend. Put a cheap model on the evolver and your expensive model on the solver, because the gains land solver-side, not evolver-side. Paper: https://t.co/8kJwR7NhmV Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

omarsar0's tweet photo. Very good advice on self-improving agents.

(bookmark it)

This is something I am seeing in my own experiments with coding agents and harnesses for long-horizon tasks.

What I have found is that stronger models do not always evolve better agents.

The current believe in self-evolving agents is that a bigger model writes better prompt and skill edits, so devs put their best model in the evolver seat.

New research shows that intuition is mostly wrong.

The work separates two abilities that usually get conflated. Producing harness updates stays flat across model capability, so Qwen3.5-9B writes edits roughly as good as Claude Opus 4.6. Benefiting from those updates follows an inverted-U that peaks at mid-tier models, while weak models fail to even activate the edits and strong models have little headroom left.

This is important to understand as it tells you where to spend. Put a cheap model on the evolver and your expensive model on the solver, because the gains land solver-side, not evolver-side.

Paper: https://t.co/8kJwR7NhmV

Learn to build effective AI agents in our academy: https://t.co/1e8RZKs4uX

36

740

109

1K

55K

iamrobotbear retweeted

Emil Kowalski

@emilkowalski

4 days ago

To get good animations from an AI you need to get good at telling it what you want: - "stagger this list of items" - "make this animation direction-aware" - "spacial consistency", "crossfade", "layout animation", I made a motion vocabulary for this: https://t.co/ExAxpr31no