We’re excited to announce that Trishool’s HaloGuard 1.0 𝐡𝐚𝐬 𝐚𝐜𝐡𝐢𝐞𝐯𝐞𝐝 𝐒𝐎𝐓𝐀 prompt-safety performance among open-weight guard models.
Today, we present HaloGuard 1.0, a constitutional input classifier for multilingual AI safety.
It is built as a first-layer input guard that checks user prompts before they reach a downstream LLM, agent, or application.
This is part of the safety infrastructure being built through @trishoolai , our decentralised AI red-teaming subnet on Bittensor SN23.
Full arXiv paper goes live soon.
Today, we present ResBM (https://t.co/EqfVZRerDY), a 128x activation compression technique for achieving SOTA training results in low-bandwidth, distributed communication settings for pipeline parallel training across the internet.
This technology underpins @IOTA_SN9 - our distributed training platform.
Training frontier models over the internet requires new techniques.
Today, we present ResBM, a residual encoder-decoder bottleneck architecture that enables 128x activation compression for low-bandwidth distributed pipeline parallel training.
Developed for @IOTA_SN9, we show SOTA compression without significant loss in convergence rates, increases in memory, or compute overhead.
Expect the full paper release in the next 72 hours.