sutan

@ms_muleta

blockchain dev | technical ai safety

Joined June 2019

688 Following

67 Followers

80 Posts

ms_muleta retweeted

about 2 months ago

New paper: Can you prevent emergent misalignment with inoculation prompting, or by diluting bad data with good? Prior work suggests you can.  We show the misalignment is still present but hiding. It is triggered by adding cues to prompts, evoking the bad data.

OwainEvans_UK's tweet photo: https://t.co/J67lVok69N

17

312

45

171

53K

ms_muleta retweeted

Jack Lindsey @Jack_W_Lindsey

2 months ago

Before limited-releasing Claude Mythos Preview, we investigated its internal mechanisms with interpretability techniques. We found it exhibited notably sophisticated (and often unspoken) strategic thinking and situational awareness, at times in service of unwanted actions. (1/14)

Jack_W_Lindsey's tweet photo: https://t.co/vhng7PXqcz

155

7K

768

4K

980K

sutan @ms_muleta

4 months ago

0

0

0

0

6

sutan @ms_muleta

8 months ago

@enter_delta all in 🥳🫡

ms_muleta's tweet photo: https://t.co/NO0uB9rSa0

0

0

0

0

14

sutan @ms_muleta

9 months ago

@PrimeIntellect I built an RL env to test reasoning under constraints. The agent moves on a grid, has to pick up and deliver a package, and manages a finite battery by recharging at a charger tile. This was fun to put together. Env: https://t.co/ur92BKJRy5

0

0

0

0

14

sutan @ms_muleta

almost 2 years ago

@kylegriffin1 All ambitious girls on unconventional paths felt seen tonight @KamalaHarris thank you

0

0

0

0

62

sutan @ms_muleta

almost 2 years ago

@DarrelFrater Interested in building in HealthTech

1

1

0

0

24

sutan @ms_muleta

almost 2 years ago

@mayn_k47 @_nightsweekends @_buildspace This is a great idea. How will you build it? Will the robot use multiple models?

0

0

0

0

16

sutan @ms_muleta

almost 2 years ago

@NehalMisra @_nightsweekends @_buildspace This is so cool, what type of fabrics do you plan to use?

0

0

0

0

9

sutan @ms_muleta

almost 2 years ago

@thestevenolmos @_learnbydoing @_buildspace @_nightsweekends @FarzaTV @wordisbonz This is such a great idea, to give practical practice \

1

1

0

0

23

sutan @ms_muleta

almost 2 years ago

phases @phasesdapp @_buildspace @_nightsweekends

ms_muleta's tweet photo: https://t.co/zC9VPvnvfM

1

6

0

0

500

ms_muleta retweeted

over 2 years ago

The @encodeclub 🫶 @Polkadot 2023 Accelerator has concluded and we couldn't be prouder of the teams! 👏🫡 Thanks to all the guest speakers who made this journey even more special 💕 Full summary here: https://t.co/Iv5CX4x8eV Or keep reading this thread! 👇

encodeclub's tweet photo: https://t.co/ayrsBvLQGL

3

26

8

1

4K

sutan @ms_muleta

over 2 years ago

0

0

0

0

10

sutan @ms_muleta

almost 3 years ago

@ashmchang @_buildspace Love this!!! Will save a lot of time. Thanks

0

1

0

0

15

sutan @ms_muleta

almost 3 years ago

@harsehaj @_buildspace This is super cool!! Joined the waitlist; this will save me a lot of time.

0

1

0

0

49

sutan @ms_muleta

almost 3 years ago

@hyejeebae @buildspace @_nightsweekends Super cool!! Love the creatives as well. Are you going to make a YouTube channel as well, to upload the TikToks as shorts?

1

1

0

0

33

sutan @ms_muleta

almost 3 years ago

@michwirantono @typedreamHQ @_buildspace @_nightsweekends Interesting, creating value with something you're good at and like. “Extension of himself”

0

0

0

0

14

sutan @ms_muleta

almost 3 years ago

@_pavidhiman @_buildspace @_nightsweekends Wow, this is so cool! Great job. Is the application a web app or mobile app?

1

1

0

0

29

sutan @ms_muleta

almost 3 years ago

@phroneteon @_buildspace Definitely interested.

0

0

0

0

5

sutan @ms_muleta

almost 3 years ago

@advaitpaliwal @_buildspace @youlearnai @_nightsweekends @1davidyu1 @KapadiaSoami @achyut_benz This is so cool, does this summarize the lecture as well?

1

1

0

0

45
