Ausτin McCaffrey

about 20 hours ago

@mogmachine speaking in the value of agents in today’s world. I actually think Bittensor is the primary beneficiary of agentic capabilities. Agents lower the barrier of entry for mining in a way that’s hard to quantify the value of, but I believe it cannot be understated. You no longer need to be a fluent technical dev to contribute to the various competitions being built on the Subnets. You just need to be fluent in…English.

Austin_Aligned's tweet photo. @mogmachine speaking in the value of agents in today’s world. I actually think Bittensor is the primary beneficiary of agentic capabilities. Agents lower the barrier of entry for mining in a way that’s hard to quantify the value of, but I believe it cannot be understated. You no longer need to be a fluent technical dev to contribute to the various competitions being built on the Subnets. You just need to be fluent in…English.

102

Austin_Aligned retweeted

3 days ago

Today, we are launching the first stage of Project Orion. Our early pre-training run of Orion-100B achieves upward of 65% of data-center training efficiency on hardware costing a fraction of the price. Orion-100B is the first proof point for a simple idea: that underutilized compute around the world can be turned into frontier training capacity. We believe that this work presents, for the first time, an economically compelling case for training large models using distributed approaches.

$MacrocosmosAI's tweet photo. Today, we are launching the first stage of Project Orion. Our early pre-training run of Orion-100B achieves upward of 65% of data-center training efficiency on hardware costing a fraction of the price. Orion-100B is the first proof point for a simple idea: that underutilized compute around the world can be turned into frontier training capacity. We believe that this work presents, for the first time, an economically compelling case for training large models using distributed approaches.$

373

395K

Austin_Aligned retweeted

Apex・SN1

@Apex_SN1

3 days ago

“You can train a model to look inside an LLM’s head and explain to you what’s happening at a certain point in time. When you understand the implications for AI alignment, you can see how this may lead to major breakthroughs”. @macrocrux and @Austin_Aligned discuss the key research and aims behind their collaborative competition on Apex - drive AI alignment insights via autoencoders identifying patterns within an LLM’s structure, acting as an auditor.

791

Unleashing the potential of artificial intelligence and smart contracts with $BAI https://t.co/6HQCr1wrlI https://t.co/HSyfujO6Ev

7 days ago

@MacrocosmosAI @macrocrux @AureliusAligned @Apex_SN1 Great to join in! Thanks for having me 🤖

Who to follow

Block AI

@baitokendev

WavesFunnyNode

@WavesFunnyNode

More than just a node we’re a community-driven adventure on @wavesprotocol 🚀 🛡️ Non-Custodial Staking 🔄 Flexible Payouts 🗳️ Governance by Stakers

E L L E | a KID called BEAST

@aKIDcalledELLE

Supporting 💛 #WomenInWeb3 #WomenInTech Team @aKIDcalledBEAST 👀 Product Development + Distribution 📦 Will not DM first ⚠️

Austin_Aligned retweeted

7 days ago

We’re live on our Inventive Mechanisms podcast. @macrocrux and @Austin_Aligned are discussing our upcoming competition. This is a collaborative task with SN37, @AureliusAligned, launching on @Apex_SN1. Join to learn more https://t.co/hbJdTaW0nW

7 days ago

Pop in to learn about how @AureliusAligned is teaming up with @Apex_SN1 to build alignment primitives!

7 days ago

Reminder: @macrocrux and @Austin_Aligned will be live on our Inventive Mechanisms podcast today, walking through our upcoming competition on @Apex_SN1. Together, we’ll be launching a challenge focused on AI alignment, in collaboration with Subnet 37, @AureliusAligned. Join us to learn more about how this will unfold. 📍 Location: X livestream (on the @MacrocosmosAI X account) 📅 Date: Today (Thursday 28th May) 🕒 Time: 3pm UK time

MacrocosmosAI's tweet photo. Reminder: @macrocrux and @Austin_Aligned will be live on our Inventive Mechanisms podcast today, walking through our upcoming competition on @Apex_SN1.

Together, we’ll be launching a challenge focused on AI alignment, in collaboration with Subnet 37, @AureliusAligned.

Join us to learn more about how this will unfold.

📍 Location: X livestream (on the @MacrocosmosAI X account)
📅 Date: Today (Thursday 28th May)
🕒 Time: 3pm UK time

107

13 days ago

@MacrocosmosAI Can't wait to bring decentralized interpretability research to Bittensor with the Apex team. The implications of this give me chills. 🤝

241

Austin_Aligned retweeted

13 days ago

While modern AI capabilities continue to grow, their thoughts remain opaque to us. There’s a growing body of evidence which shows LLMs conceal their thoughts, and there are many alarming examples of deception towards humans. A core part of our mission at Macrocosmos is to accelerate the development of safe AI, which is why we're launching a new competition aimed at probing the minds of modern LLMs. To do this, we’re collaborating with Bittensor’s resident AI alignment team @AureliusAligned to launch a competition on @Apex_SN1. Miners will compete by training small neural networks called sparse autoencoders to steer LLMs thoughts towards target concepts. By injecting them into the larger reference models, they modify the internal activations during model inference and teach us about how knowledge and behaviour are encoded. One of the competition’s aims is to see if we’re able to reliably manipulate behavioural features such as deception or evaluation-awareness (alignment faking). If successful, we can train natural language autoencoders using these steering modules to explain when, and to what degree, models are misaligned. @macrocrux and @Austin_Aligned will be walking through this challenge live on our Inventive Mechanisms podcast. 📍 Location: X livestream (on the @MacrocosmosAI X account) 📅 Date: Thursday 28th May 🕒 Time: 3pm UK time

MacrocosmosAI's tweet photo. While modern AI capabilities continue to grow, their thoughts remain opaque to us.

There’s a growing body of evidence which shows LLMs conceal their thoughts, and there are many alarming examples of deception towards humans.

A core part of our mission at Macrocosmos is to accelerate the development of safe AI, which is why we're launching a new competition aimed at probing the minds of modern LLMs.

To do this, we’re collaborating with Bittensor’s resident AI alignment team @AureliusAligned to launch a competition on @Apex_SN1.

Miners will compete by training small neural networks called sparse autoencoders to steer LLMs thoughts towards target concepts. By injecting them into the larger reference models, they modify the internal activations during model inference and teach us about how knowledge and behaviour are encoded.

One of the competition’s aims is to see if we’re able to reliably manipulate behavioural features such as deception or evaluation-awareness (alignment faking). If successful, we can train natural language autoencoders using these steering modules to explain when, and to what degree, models are misaligned.

@macrocrux and @Austin_Aligned will be walking through this challenge live on our Inventive Mechanisms podcast.

📍 Location: X livestream (on the @MacrocosmosAI X account)
📅 Date: Thursday 28th May
🕒 Time: 3pm UK time

13 days ago

@macrocrux @DrocksAlex2 Greatness hits the target no one else can hit, but genius hits the target no one else can see. Keep going!

167

Austin_Aligned retweeted

The TAO Daily

@taodaily_io

22 days ago

https://t.co/4u3v5MgNn2

26K

23 days ago

@mikecontango @trishool Agreed, Bittensor is unique suited to perform safety/alignment research on models precisely because of the non-ownership over the foundation models. Status quo capitalism is working against AI safety currently. Happy to see Trishool performing this exact role.

24 days ago

This could signify the starting gun for the arms race between frontier models and the interpretability agents that represent our best shot of being capable of disentangling the inner workings of our most-capable models. The progression of interpretability research like NLAs is critical as models get bigger and more complex.

24 days ago

From @AnthropicAI's new NLA paper: "unverbalized evaluation awareness — cases where Claude believed, but did not say, that it was being evaluated" The models know they're being watched, always have. Now we can prove it. This is the exact problem @AureliusAligned was built to solve. Been working toward quantifying alignment faking since day one. This is huge validation and a massive new primitive to build on. https://t.co/aVsDwQiLQt

304

24 days ago

On the whole, it certainly feels like we are moving towards the construction of black-box decoder agents that we will rely upon to interpret the alignment of frontier models, as described in "AI 2027".

Austin_Aligned's tweet photo. On the whole, it certainly feels like we are moving towards the construction of black-box decoder agents that we will rely upon to interpret the alignment of frontier models, as described in "AI 2027". https://t.co/jSqJYrKFXV

Austin_Aligned retweeted

Aurelius

@AureliusAligned

about 2 months ago

We've been introducing the people behind Aurelius one post at a time. The full lineup now lives in one place: three co-founders and six advisors across alignment research, ethics, engineering, and law. https://t.co/VWpc2V8Utd

Austin_Aligned retweeted

zach

@blip_tm

about 2 months ago

asics the shoe company has an obvious pivot to make now

679

31K

Jack Lindsey @Jack_W_Lindsey

about 2 months ago

Oh boy, here we go

about 2 months ago

Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14)

Jack_W_Lindsey's tweet photo. Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14) https://t.co/vhng7PXqcz

155

768

978K

135