Orr Krupnik @orrkrup - Twitter Profile

orrkrup retweeted

14 days ago

VLA is 95% certain about current action. Will it 95% succeed in the task? Obviously, not necessarily. But if you’re clever, you can *calibrate* action prob. to task success. Our #ICML2026 paper formulates this + SOTA algorithms based on new connection to RL temporal differences

0

46

10

37

8K

Orr Krupnik @orrkrup

20 days ago

More generally, we’re interested in fast, targeted correction of unwanted LLM behaviors without full retraining, avoiding degradation of model quality in other areas. Check out https://t.co/YCxi8pIXJD for more exciting examples! 2/2

0

34

Orr Krupnik @orrkrup

20 days ago

Great example of the post-training behavioral adaptation work we’re doing at @hirundo_ai. In this case, the focus was resilience to prompt injection while preserving general model capabilities, all using very limited data. 1/2

Google Gemma

@googlegemma

21 days ago

Case study time! Hirundo trained Gemma 4 E4B to resist adversarial overrides while overcoming the alignment tax: - Weight-level defense based on Gemma 4 architecture - Stronger security posture than models over 100x its size - Preserves utility across benchmarks

googlegemma's tweet photo. Case study time! Hirundo trained Gemma 4 E4B to resist adversarial overrides while overcoming the alignment tax:

- Weight-level defense based on Gemma 4 architecture
- Stronger security posture than models over 100x its size
- Preserves utility across benchmarks

12

260

28

66

18K

1

6

0

1

207

Orr Krupnik @orrkrup

2 months ago

@ChenTessler More data, more compute. I’m sure it’ll solve it

0

1

0

13

Who to follow

Yonathan Efroni

@EfroniYonathan

Assistant Professor@TAU | Research@doubleAI

Tom Zahavy

@TZahavy

Building creative agents @GoogleDeepMind. AlphaProof, AlphaZero_db, PuzzleGen, Convex RL, meta gradients. Staff research scientist, discovery team

Tal Daniel

@TalDaniel8

Postdoc @ CMU Robotics Institute Ph.D. from the Technion ECE Research interests include self-supervsied learning, generative modeling, RL, robotics.

Orr Krupnik @orrkrup

4 months ago

Would love to see how much of this was human-guided and how much was fully the agent optimizing the “get PR merged” reward function.

Niels Rogge @NielsRogge

4 months ago

This is the most insane @github thread I’ve ever seen https://t.co/AQoIeaadoD

19

249

25

114

33K

0

99

Orr Krupnik @orrkrup

4 months ago

New preprint out! TL;DR: With a few calls to a lightweight, pre-trained VLM, we can select better experience for off-policy RL training, and improve both performance and sample efficiency 📈🤖🏆 Proud to have worked on this with @elad_sharony and @TomJurgenson !

Elad Sharony @elad_sharony

4 months ago

Experience replay is the backbone of off-policy RL. But here's a question: Which experiences should you replay? New paper: 𝐕𝐋𝐌-𝐆𝐮𝐢𝐝𝐞𝐝 𝐄𝐱𝐩𝐞𝐫𝐢𝐞𝐧𝐜𝐞 𝐑𝐞𝐩𝐥𝐚𝐲 Project page: https://t.co/8tmBFdMxmX 🧵

1

14

3

7

4K

1

20

1

7

2K

Orr Krupnik @orrkrup

6 months ago

עיריית חיפה חושפת את סדר העדיפויות התחבורתי שלה בפרסום תמים לכאורה. ומה השניים שלא מוזכרים בכלל? (רמז בתחתית התמונה) #פידתחבורה #חיפה

orrkrup's tweet photo. עיריית חיפה חושפת את סדר העדיפויות התחבורתי שלה בפרסום תמים לכאורה. ומה השניים שלא מוזכרים בכלל? (רמז בתחתית התמונה)
#פידתחבורה #חיפה https://t.co/2R12iP3jLE

0

100

Orr Krupnik @orrkrup

10 months ago

@AvivTamar1 How long will it be before we stop writing papers because no human reads anymore anyway?

1

0

65

orrkrup retweeted

Google Cloud @googlecloud

10 months ago

We're partnering with Tzafon, an AI R&D lab, to build next-generation agentic machine intelligence! Through our partnership, we will provide compute capacity & cloud services to train Tzafon’s new multi-agent models & develop new automation frameworks → https://t.co/gC4Zca9a4w

googlecloud's tweet photo. We're partnering with Tzafon, an AI R&D lab, to build next-generation agentic machine intelligence!

Through our partnership, we will provide compute capacity & cloud services to train Tzafon’s new multi-agent models & develop new automation frameworks → https://t.co/gC4Zca9a4w https://t.co/LrF5KU5r3D

3

48

7

5

5K

orrkrup retweeted

Tzafon @tzafon_company

11 months ago

We’re soon releasing Lightcone, which brings Light to your Mac. Lightcone will be able to automate any task on your computer. It will be powered by our very own pre-trained foundation model, built from the ground up for computer use. https://t.co/Tf1y6yRmjO

0

15

3

2K

orrkrup retweeted

Tzafon @tzafon_company

about 1 year ago

We’re hiring ML Engineers at Tzafon in SF. Join our elite team (IOI, IMO, Google alumni) working at the intersection of AI research and practical engineering. Focus areas include multi-agent RL, memory architectures, efficient sampling techniques, and Large Action Models. Send ML demos or papers to [email protected] or DM directly.

0

12

4

2

1K

Orr Krupnik @orrkrup

about 1 year ago

Thrilled to share that I'll be joining the team at @tzafon_company in a few weeks' time! Excited for the opportunity to work at the frontier of AI research, and to build awesome stuff with some great people, making an impact in the real world.

Tzafon @tzafon_company

about 1 year ago

Today we're announcing Tzafon, an applied artificial intelligence research and development firm, to the world. Our mission is to expand the frontiers of machine intelligence. Through the intersection of artificial intelligence & software engineering, we're looking to push the boundaries of what machines are capable of. We're launching WayPoint, our first open-source product—a robust, scalable solution for managing large fleets of browser instances, capable of launching up to 1,000 browsers per second and easily handling well over 10,000 browsers concurrently without issue. We've secured a $4M pre-seed funding round led by Streamlined and are rapidly expanding our elite team of IOI & IMO medalists, PhDs, and alumni from Google, Jane Street, and PayPal. Interested in joining us? Reach out at [email protected]. We're also launching a production-ready version of WayPoint—sign up below to get early access. - Waitlist: https://t.co/kQSzn6lTq5 - WayPoint Repo: https://t.co/oXLVL5qHyu - WayPoint Blog post: https://t.co/oK2WYFdz37

0

6

3

2K

3

10

1

0

823

Orr Krupnik @orrkrup

about 1 year ago

הסרטון החדש של @livable_city_il מתאר את קצה הקרחון של הבעיות שמונעות מחיפה להיות העיר שהיא יכולה להיות. הלוואי שזו הסנונית הראשונה שתפתח את השיח האורבניסטי בחיפה, ותיצור שיחות שלא עוצרות אחרי "אבל יש עליות!" https://t.co/Hmu1r13ObF

1

3

0

253

Orr Krupnik @orrkrup

about 1 year ago

@danieldan9211 הימור שלי שאלו החניות האחרונות שייתפסו שם באזור, כי צריך לחכות לקרוסלה כדי להכניס או להוציא את האוטו. מעניין לראות אם ב-11 כשייגמרו כל החניות האחרות זה עדיין יהיה ב-0 תפוסה. ... ואני לא אהיה מופתע אם זה גם עולה יותר משאר החניון

1

0

915

orrkrup retweeted

Aviv Tamar @AvivTamar1

over 1 year ago

Want to learn / teach RL? Check out new book draft: Reinforcement Learning - Foundations https://t.co/142MbSiTIQ W/ @shiemannor and @YishayMansour This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE.

AvivTamar1's tweet photo. Want to learn / teach RL?
Check out new book draft:
Reinforcement Learning - Foundations
https://t.co/142MbSiTIQ
W/ @shiemannor and @YishayMansour
This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE. https://t.co/gobVpPqS49

10

627

104

717

64K

Orr Krupnik @orrkrup

about 2 years ago

@chris_j_paxton Do you think we’re close to reaching the limit of what we can do with data and current architectures though? I feel like the tail of the distribution of scenarios (at least for general purpose / home robots) is at least as heavy as the driving one, if not more so…

1

3

0

481

Orr Krupnik @orrkrup

about 2 years ago

@AvivTamar1 If this is operational, it's a huge leap forward from their previous humanoid (H1) which doesn't seem nearly as impressive when seen in person.

1

0

76

Orr Krupnik @orrkrup

about 2 years ago

Happening now at Halle B @ #ICLR2024, stop by for a quick chat!

Zohar Rimon

@ZoharRimon

about 2 years ago

Drop by our poster to hear more about MAMBA! Happening now at halle B #113 #ICLR2024 @iclr_conf

0

15

1

689

0

12

0

297

Orr Krupnik @orrkrup

about 2 years ago

Come say Hi at #ICLR2024! I’ll be at our poster on Friday afternoon with @ZoharRimon, but also around the entire conference if anyone wants to chat 🤖🦾📈

Zohar Rimon

@ZoharRimon

about 2 years ago

Heading to Vienna for #ICLR2024✈️ Me and @orrkrup will be presenting our work - MAMBA. If you're interested in - Meta RL 📊 - Generalization in RL 🌎 - Efficient exploration 🔍 Ping me or come by our poster on Friday 16:30-18:30 in Halle B #113 https://t.co/NCLWLA5ISR @iclrconf

0

20

2

4

2K

0

12

2

0

767

Orr Krupnik

@orrkrup

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users