Axiom Nexar @Agutier7 - Twitter Profile

Axiom Nexar @Agutier7

32 minutes ago

@jasonlk 2nd

0

18

Axiom Nexar @Agutier7

about 9 hours ago

@Marbelle30 Yet another single-neuron brain!

0

19

Axiom Nexar @Agutier7

about 9 hours ago

Cool

Simon Taylor

@sytaylor

1 day ago

I love this! Santander has open-sourced its open-source AI initiatives.

The bank pushed 11 repos, live this week under Apache-2.0 on the code, but the data synthetic or anonymised only.

Quite a moment for a bank this size, putting its AI control layer on the open internet for anyone to fork. This is the bit every bank has to get right.

So what is it?

→ autoguardrails: a scaffold for stress-testing LLM guardrails, jailbreaks included (can we use this LLM?)

→ "mechanical governance" for high-stakes LLM decisions, with hard gates and governance metrics (can we trust an LLM with this decision?)

→ mutatis-mutandis: discrimination testing with counterfactual comparators, straight out of a published paper (very important if you're lending!)

→ stressed-datasets: public benchmarks republished in "stressed" form to probe model robustness in that scenario

→ gen-fraud-graph: a synthetic fraud-graph generator to benchmark fraud detection (really, really cool, need to dig into this one)

→ llm_bridge: a vendor-neutral client for OpenAI, Bedrock and Gemini, so you skip the lock-in (again, how many companies are struggling with this?)

→ ralph: their own spin on the Ralph loop, the run-an-agent-in-a-loop trick from the indie AI crowd

I think I need to write a whole Rant on each of these pieces.

The most important thing for a big regulated actor is "Can you show a decision was safe, fair, auditable, and the same tomorrow as it was today." Santander published its working answer and handed it to everyone, competitors included.

Why give it away?

1. Attract talent - this is a huge signal they've got their AI act together

2. Signal internally - We have these tools, use them

3. Give regulators confidence - Here's how we work, you can audit it

(The board that signs off on releases includes Legal and the CISO. That tells you how seriously they treat it.)

I've watched banks spend years trying to govern AI behind closed doors and ship nothing. Doing it in the open, with a contributor agreement and a proper open-source office, is a faster route to getting it right.

The banks that pull ahead from here will be the ones who can prove their AI works.

@bancosantander just open-sourced a head start.

Repo is here. 👇

https://t.co/IilShwzvl2

21

1K

159

2K

355K

0

5

Axiom Nexar @Agutier7

about 9 hours ago

🫠🫠🫠🫠🫠

Chubby♨️

@kimmonismus

about 15 hours ago

Not again, @eucommission . Europe is once again being excluded from access to the latest SOTA scientific technology.

55

652

42

67

71K

0

2

Good work never goes unrewarded

Axiom Nexar @Agutier7

about 19 hours ago

@AngelicaLozanoC Ma'am, are you subnormal or single-neuroned? It is very hard to get through to a person with cement in their head. What a completely out-of-line comment.

0

8

Agutier7 retweeted

Polymarket Money

@PolymarketMoney

1 day ago

BREAKING: Over 45% of the S&P 500 is now made up of AI related stocks.

47

568

51

36

39K

Axiom Nexar @Agutier7

4 days ago

@ClaudeDevs Ufffffff 🔥🔥🔥🔥🔥🔥🔥

0

24

Axiom Nexar @Agutier7

6 days ago

Impressive

Anthropic

@AnthropicAI

6 days ago

We compared Claude Code success rates between occupations.

On our toughest measure of success—requiring verifiable evidence that a goal was completed, like committed code—every field was within 7 percentage points of software engineering. https://t.co/ShxZK4p0e2

15

195

11

30

59K

0

18

Axiom Nexar @Agutier7

11 days ago

@ClaudeDevs 🔥🔥🔥🔥🔥🔥🔥🔥🔥🔥🔥🔥🔥🔥

0

24

Agutier7 retweeted

ClaudeDevs

@ClaudeDevs

12 days ago

We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible. Starting this week, flagged requests will visibly fall back to Opus 4.8, the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal (coming to server-side fallback in the next few days).

We wanted to deploy Fable 5 to our users quickly and safely. Visible safeguards can be probed, so they have to be robust, which takes time to get right. Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason, and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right.

Making the safeguards visible makes them easier to work around, so keeping them robust to jailbreaks will unfortunately mean more false positives while we improve the classifiers. We're also tuning our bio and cyber classifiers to trigger less often on harmless requests. We know this is frustrating and we’ll do our best to keep this period as short as possible.

If you think a request has been mistakenly flagged: run /feedback in Claude Code, click thumbs-down on the fallback in https://t.co/LtktniD5HY or Cowork, or file the safeguard appeal form for API requests. Your reports help us tune these classifiers and we appreciate your feedback. https://t.co/TDAAYRGqDt

667

5K

430

939

856K

Axiom Nexar @Agutier7

12 days ago

@SimonLevyMx It's a Trojan horse. The person who filed it (the proposal) is not sanctioned. She is from the same party. The effect is to cast Petro as a victim.

0

9

0

705

Axiom Nexar @Agutier7

12 days ago

Kinda true

Polymarket

@Polymarket

12 days ago

JUST IN: Anthropic CEO Dario Amodei warns AI could create a world of “hypergrowth, hyper-inequality” & lasting job displacement.

369

4K

374

550

638K

0

12

Axiom Nexar @Agutier7

13 days ago

@ClaudeDevs 🔥🔥🔥🙌🙌🙌🙌🔥🔥🔥🔥

0

26

Agutier7 retweeted

Claude

@claudeai

13 days ago

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.