SISL @sislaboratory - Twitter Profile

5 days ago

Important release for standardized AI eval reporting with SISL contributions by @AnkaReuel, @MLamparth, @aiprof_mykel. Check it out!

EvalEval Coalition @evaluatingevals

5 days ago

🚀We launch Evaluation Cards (beta): a centralized public record of AI evaluation results 🚀 Not another leaderboard. Every score comes with who ran it, the settings they used, what the benchmark tests and the other results reported for the same model, side by side. 🧵👇

SISL @SISLaboratory

11 days ago

Check out the paper: “Enhancing a Risk Model by Adding Transient Statistical Factors” Alexandros Tzikas, Emmanuel Candes, Trevor Hastie, Stephen Boyd, @aiprof_mykel, Ronald Kahn https://t.co/GoxObCwq1I

SISL @SISLaboratory

11 days ago

New work from SISL: Financial firms estimate how asset returns move together, called a risk model, to build and stress-test portfolios. We show how to keep a risk model up to date and sharper using only the return data you already have.

SISL @SISLaboratory

11 days ago

The method is an expectation-maximization algorithm controlled by just two choices, the number of added factors and a half-life weighting recent returns more heavily. On the Barra short-term US model over 870 US large-cap equities, the extended model improves out-of-sample fit.

SISLaboratory retweeted

Max Lamparth @MLamparth

13 days ago

Had a great time presenting our paper on Reward Bias Substitution as an oral at #RLEval Workshop at @CAISconf #CAIS2026 last week. Thanks to everyone who came by and asked such thoughtful questions!

SISL @SISLaboratory

18 days ago

A new must-read SISL paper for anyone working on RLHF and reward models. Check it out!

Max Lamparth @MLamparth

18 days ago

New paper: We identify a new class of reward hacking caused by mitigations, which we call reward bias substitution. We prove no standard benchmark detects it, even with oracle access to the true reward. We find it active in GRPO, in SOTA reward models, and published methods.

SISL @SISLaboratory

20 days ago

Check out Romeo Valentin's lecture on "Explainability" in AA228V/CS238V Validation of Safety Critical Systems. He goes through a variety of methods and example problems showing how to use gradients and latent representations for safety validation of AI systems. https://t.co/pSW28TssyV

SISL @SISLaboratory

25 days ago

6/6: Kiana Jafari, Paul Rust, Duncan Eddy, Robbie Fraser, @NinaVasan, Darja Djordjevic, Akanksha Dadlani, @MLamparth, Eugenia Kim, @aiprof_mykel

SISL @SISLaboratory

25 days ago

1/6: New SISL paper accepted at FAccT 2026: Learning from human feedback assumes expert judgments can be reliably aggregated into training signal. We tested this in the high-stakes mental health domain.

SISL @SISLaboratory

25 days ago

5/6: Check out the paper: "Expert Evaluation and the Limits of Human Feedback in Mental Health AI Safety Testing" 📄 Paper: https://t.co/9uWDlWCper 📂 Open-source dataset: https://t.co/CERZU4FhVD

SISL @SISLaboratory

29 days ago

YouTube: https://t.co/LlUjMG3VWm

SISL @SISLaboratory

29 days ago

Congratulations to SISLer Alexandros Tzikas for successfully defending his Ph.D. thesis! Check out the recording where he explored how a single geometric idea, projection, can unify how we learn, measure, and adapt under uncertainty in high-dimensional decision-making problems.

SISL @SISLaboratory

about 1 month ago

Check out SISLers Daniel Fein and Max Lamparth, Ph.D., discussing their recent work on debiasing language reward models, with a great presentation by Daniel now on YouTube: https://t.co/3ppuJ7PcAL Thank you, Safe AI Germany, for hosting! Paper: https://t.co/cUgpUsrAUo

SISLaboratory retweeted

Houjun Liu @houjun_liu

about 1 month ago

🚨 Your coding agent may be secretly sticking vulnerabilities into your code!! 🚨 Wouldn't you want to fix that? Hint: asking it to write secure code is not enough. (1/n)

SISL @SISLaboratory

about 2 months ago

Ever wondered what's really behind alpha vectors in POMDPs? 🤔 @Sydney_Katz's new Stanford lecture demystifies it all and introduces QMDP as a clean, offline approximation. Core stuff for anyone building AI agents 👇https://t.co/NpfCnIq5V0
