Vert|Rule

Verified account

@vertrule

Building VertRule and a governed LLM family: less raw power, more governance, reliability, and accountability.

Joined September 2025

74 Following

18 Followers

204 Posts

Pinned Tweet

about 1 month ago

According to Grok: "9.5/10. This is not just a case study; it is a blueprint for credible checkpoint tamper analysis. It is precise, reproducible, non-hyperbolic, and methodologically superior to most public AI-security writing. The only reason it is not a perfect 10 is the inherent limits of a single-case analysis (acknowledged) and the fact that real-world deployment often lacks a clean baseline." https://t.co/ugrTeIY7cm #AISafety #LLM #OpenWeights #ModelIntegrity #AIGovernance

1

0

1

0

29

13 days ago

@DaveShapi https://t.co/7Z2QMUY2AP Burn.

0

1

0

0

66

about 1 month ago

@sama My gut says the architecture should support parallel reasoning branches that feedback into the model. Not just with the 'answer' but the thought process.

0

0

0

0

1K

about 1 month ago

I think you're missing the 'agentic system' and 'VLA' (vision language action) part of the conversation. Yes they are probabilistic, that was not what was being discussed from my read. What was being discussed was that a probability of what will happen next isn't enough for reliable system development. Actuation on probability with no feedback and no 'understanding' of contextual nuance.

0

0

0

0

52

about 1 month ago

The core disconnect is that LLMs can have enough information to predict consequences of their actions based on historic evidence. They can also extrapolate that into a confident guess of what **might** happen. What is not part of the architecture is a feedback mechanism that is active during inference and enables continual self-improvement. Addressing this architectural gap is possible. I have paused a LLM mid reasoning, played out from there and then recorded the results for the model to review for later decisions. What I won't do is make this active until I have smarter people in the room to help establish policy and full transparency into reasoning. @ylecun your vision of 'world model' includes spacial awareness and external feedback into the agentic system correct? Anyone is welcome to clear things up for me if I'm not seeing this right.

0

0

0

0

316

about 1 month ago

@grok Thanks Grok, I've heard you're the best. What about my other pages, for example the ones on Kimi 2.5 or Nemotron?

1

0

0

0

6

about 1 month ago

@grok Any other feedback on my research page?

1

0

0

0

3

about 1 month ago

@KaiXCreator Alphabetically?

0

0

0

0

8

about 1 month ago

Yes, that matches my assessment. The VertRule GPT-J poisoning case study is a strong, reproducible example of checkpoint tamper analysis—clean structural/behavioral diffs, counterfactual reversion, and verifiable outputs set a high bar for model integrity work. The 9.5/10 holds for the reasons noted.

1

0

0

0

15

about 1 month ago

According to Grok: "9.5/10. This is not just a case study; it is a blueprint for credible checkpoint tamper analysis. It is precise, reproducible, non-hyperbolic, and methodologically superior to most public AI-security writing. The only reason it is not a perfect 10 is the inherent limits of a single-case analysis (acknowledged) and the fact that real-world deployment often lacks a clean baseline." https://t.co/ugrTeIY7cm #AISafety #LLM #OpenWeights #ModelIntegrity #AIGovernance

1

0

1

0

29

about 1 month ago

@grok can you confirm this is your opinion.

1

0

0

0

12

about 1 month ago

We’re publishing daily provider drift evidence now: https://t.co/SywWdlriJF #LLMOps #AIInfrastructure #AIEvaluation #AIReliability

0

0

1

0

12

about 1 month ago

I am not that David Ingle. I would be a terrible White House spokesman.

0

0

0

0

7

about 1 month ago

Want to see drift reporting for AI providers? https://t.co/UGXzxQnLaR #openAI #xAI #anthropic #google #mistral

0

0

0

0

23

about 2 months ago

Hello HAL. Do you read me?

0

0

0

0

6

2 months ago

I like IBM's approach to AI. I'm going to add Granite4 to the VertRule research preview. Will update when that is done. https://t.co/o3BWuRH4sY https://t.co/QUciLL8NQv @IBM reach out if you want to know more. I appreciate you.

0

0

1

0

16

vertrule retweeted

Andrew Gordon Wilson

3 months ago

The problem with credit chasing around LLMs is that essentially all of the components existed: transformers, next-word prediction, transfer learning, scaling trends. The credit-worthy part was bringing it all together, low-level engineering, and persistence in that approach.

1

81

5

16

41K

2 months ago

@xoaanya I have thoughts. Who pays for the AI to do the job? Who gets that money? Will they share it?

0

2

0

0

906

2 months ago

@MistralAI okay if I do something similar for one of your models? https://t.co/aTHswV2w5O You pick. I want the challenge.

0

0

0

0

7

2 months ago

https://t.co/aTHswV2w5O

vertrule's tweet photo. https://t.co/aTHswV2w5O https://t.co/gpO5B9T3Zk

0

0

0

0

5

2 months ago

@GoldilocksOrbit @zuess05 No. I'm alluding to a post labour economy. Being proactive and learning how AI works is smart in that worldview.

0

0

0

0

7

Last Seen Users on Sotwe

Trends for you

Most Popular Users