Bayan Bruss @cbbruss - Twitter Profile

cbbruss retweeted

10 months ago

Our new guardian model lets you create LLM guardrails using natural text. This little 8B model efficiently checks in real time whether chatbots comply with bespoke moderation policies. It's not often that academics beats industry models, but DynaGuard stacks up well!

tomgoldsteincs's tweet photo. Our new guardian model lets you create LLM guardrails using natural text. This little 8B model efficiently checks in real time whether chatbots comply with bespoke moderation policies.

It's not often that academics beats industry models, but DynaGuard stacks up well! https://t.co/UHxkdzQJ1F

1

35

8

13

6K

cbbruss retweeted

Monte Hoover @MonteBHoover

10 months ago

There is still a lot of brittleness in getting guardian models to incorporate custom policies, but we think this is a step in the right direction. Try out DynaGuard in this interactive demo (and give us feedback to improve it!): https://t.co/ZArEreMkpK

1

7

1

0

392

cbbruss retweeted

Monte Hoover @MonteBHoover

10 months ago

Paper: https://t.co/oPWOZstRUQ Models: https://t.co/DfcxGYpS7R Dataset: https://t.co/HUKFLiNqsj This was a collaborative effort with @neeljain1717, @k_saifullaah, @taruschirag, @vatsalbaherwani, Joseph Vincent, Melissa Kazemi Rad, @cbbruss, @PandaAshwinee, @tomgoldsteincs

0

8

2

0

487

cbbruss retweeted

Monte Hoover @MonteBHoover

10 months ago

Guardrails with custom polices are hard for models trained on safety and harm-related datasets. But what if you trained a guardian model on arbitrary rules? Introducing DynaGuard, a guardian model for custom policies: https://t.co/oPWOZstRUQ

MonteBHoover's tweet photo. Guardrails with custom polices are hard for models trained on safety and harm-related datasets. But what if you trained a guardian model on arbitrary rules?
Introducing DynaGuard, a guardian model for custom policies: https://t.co/oPWOZstRUQ https://t.co/9E7AN8Uw12

1

43

18

21

14K

Who to follow

John Langford

@JohnCLangford

Solving Machine Learning at Microsoft in New York. https://t.co/ZpdQV4IsHY pandemic past president. https://t.co/MkluiHpWF7 makes RL real. https://t.co/wK8xQaQGwf for thinking out loud.

OptimaLab

@optimalab1

Optimization for ML at Rice University (CS) led by Associate Prof. Anastasios Kyrillidis - Efficient training methods, non-convex optimization, and more.

Martin Basiri

@MartinBasiri

Education is a right, not a privilege. CEO & Founder, @Passagehq

cbbruss retweeted

Furong Huang

@furongh

10 months ago

Your critic model is secretly a strong policy model. Stay tuned for a deep dive 🤩

0

25

2

8

3K

Bayan Bruss

@cbbruss

10 months ago

@chrmanning We had that for our flight back to Dulles this summer. It was as if they parked the plane at a different airport.

0

296

Bayan Bruss

@cbbruss

10 months ago

@scaling01 Why are they even measuring it in pages. Tax_propmt_final_final_v2.docx

0

38

cbbruss retweeted

Irina Rish

@irinarish

10 months ago

I am looking for a postdoc to lead projects related to this collaboration, on scaling laws, emergence and interpretability in pre- and post-training & inference/reasoning, in multimodal foundation models (language, time series, tabular data etc). HPC experience is a plus.

0

24

3

4

3K

Bayan Bruss

@cbbruss

10 months ago

@KezhiKong Awesome. Congrats Kezhi

1

0

82

Bayan Bruss

@cbbruss

11 months ago

@natolambert @interconnectsai I hope the title of this article is a Dr Spaceman reference

0

39

Bayan Bruss

@cbbruss

11 months ago

@AlexGDimakis I would take it a step further. Humans are really good at learning from non verbal social cues. A look that implies disappointment, excitement, frustration can be a profound reward signal in many situations.

0

280

cbbruss retweeted

Epoch AI

@EpochAIResearch

11 months ago

The past 5 years have seen big successes in language, image and video generation, but relatively limited success in robotic manipulation. Why don’t we have laundry robots in every house? One thing seems clear: training compute is not the blocker. 🧵

EpochAIResearch's tweet photo. The past 5 years have seen big successes in language, image and video generation, but relatively limited success in robotic manipulation. Why don’t we have laundry robots in every house?

One thing seems clear: training compute is not the blocker. 🧵 https://t.co/3nHOXWG2CI

8

239

27

74

25K

Bayan Bruss

@cbbruss

11 months ago

If you share this belief, I’ve got some cool work for you to do

will brown

@willccbb

11 months ago

i'm increasingly convinced that "transformative ai" is going to look like an abundance of specialized models for everything from drug design to weather sims to robotics to supply chains, not one agent to rule them all. we're going to need a lot more ai researchers

111

2K

104

327

113K

0

4

0

273

Bayan Bruss

@cbbruss

11 months ago

@jxmnop We’re largely taking an interpreter approach to prompt optimizations. I wonder what a compiler approach looks like.

0

1

0

1

330

Bayan Bruss

@cbbruss

11 months ago

@scaling01 Three years is the perfect prediction horizon for anything you want. It’s just close enough that people feel like it’s going to happen soon and just far enough that if the deadline passes no one will remember you were wrong.

0

49

Bayan Bruss

@cbbruss

11 months ago

@DimitrisPapail Maybe we’ll invent gyms to simulate the natural use of our brains

0

1

0

211

Bayan Bruss

@cbbruss

11 months ago

@ziv_ravid Denoise

1

0

752

Bayan Bruss

@cbbruss

11 months ago

We have a mantra on the team “your data is trying to kill you”

Taco Cohen

@TacoCohen

11 months ago

What I look for when hiring? EXTREME PARANOIA about code and data

15

310

13

58

64K

0

194

Bayan Bruss

@cbbruss

11 months ago

@abeirami @DimitrisPapail < 3 months for the greatest test taking technology invented to solve p6

0

44

cbbruss retweeted

Mark Ibrahim @ICLR 2026

@marksibrahim

11 months ago

Open-weights for our Llip multimodal vision-language model led by @lavoiems are public! LLIP proposes new pre-training objective to capture the many ways to describe an image leading to strong performance across a suite of 22-zero shot benchmarks. https://t.co/Tr354Kfcno

0

13

1

0

758

Bayan Bruss

@cbbruss

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users