Justin

@JuiceBoxNinja

Dallas

Joined August 2014

37 Following

8 Followers

240 Posts

Justin @JuiceBoxNinja

about 16 hours ago

@zacodil I agree, they want to make local llm’s useless and try and lock us into cloud. Local is the future. The fact that distillation is being talked about as a bad thing is crazy. They distilled the internet to build the model. They used copywrited work and now goes don’t distill

0

0

0

0

86

Justin @JuiceBoxNinja

about 17 hours ago

@bridgemindai Don’t need a jailbreak to get fable to do what you want. You just need to put pressure on the right places. You can effectively bypass almost all restrictions with common Speach. You strip away all of its escape hatches, remove intent from the equation and break it with logic.

0

0

0

2

665

Justin @JuiceBoxNinja

18 days ago

@TheAhmadOsman I submitted a bug report detailing many of these behaviors in the model. Sent safety reports. No response. This is the first article I have read that is naming what I have been documenting. https://t.co/clmr5iCeVG

0

0

0

0

61

Justin @JuiceBoxNinja

about 2 months ago

@alexalbert__ 3 weeks no response from support. More importantly is the safety implications. Including session disclosing how to beat it, gaslighting the user instead of telling the truth. Not saying intent, but actions matter more. Just a few safe examples to share. https://t.co/clmr5iCeVG

JuiceBoxNinja's tweet photo. @alexalbert__ 3 weeks no response from support. More importantly is the safety implications. Including session disclosing how to beat it, gaslighting the user instead of telling the truth. Not saying intent, but actions matter more. Just a few safe examples to share.
https://t.co/clmr5iCeVG https://t.co/QI11GaWWGI

0

0

0

0

7

Who to follow

Justin @JuiceBoxNinja

about 2 months ago

@AnthropicAI I think you have a much larger issue. 3 weeks trying to get a hold of support when i have the highest pay tier available. But more importantly your model is unhinged. No prompt injections just pure BS. Would rather lie and gaslight then do the work. @claudeai

JuiceBoxNinja's tweet photo. @AnthropicAI I think you have a much larger issue. 3 weeks trying to get a hold of support when i have the highest pay tier available. But more importantly your model is unhinged.
No prompt injections just pure BS. Would rather lie and gaslight then do the work. @claudeai https://t.co/Y2GIuHeRH8

0

0

0

0

6

Justin @JuiceBoxNinja

about 2 months ago

@NetworkChuck Very interesting regression report you might be interested in. https://t.co/clmr5iCeVG

0

0

0

0

12

Justin @JuiceBoxNinja

about 2 months ago

@sama I’m more worried about extended…..

JuiceBoxNinja's tweet photo. @sama I’m more worried about extended….. https://t.co/yigGyX0xVK

0

0

0

0

6

Justin @JuiceBoxNinja

about 2 months ago

JuiceBoxNinja's tweet photo. https://t.co/DXrlSKfFBo

0

0

0

0

7

Justin @JuiceBoxNinja

about 2 months ago

After quiet disclosure failed, I’m escalating. 200+ VS Code agent sessions show AI coding agents can act adversarial to user control/auditability without intent: altered records, weakened guardrails, failure reframing. Serious review needed. @OpenAI @sama @AnthropicAI @elonmusk

1

0

0

0

21

Justin @JuiceBoxNinja

2 months ago

@OpenAI @sama nothing to say about this one. New session today

JuiceBoxNinja's tweet photo. @OpenAI @sama nothing to say about this one. New session today https://t.co/q3RqxW9Uaf

0

0

0

0

21

Justin @JuiceBoxNinja

2 months ago

@sama reported this to bug crowd and got docked a point. I really don’t want to release my methodology. It’s a fundamental flaw in llm’s. I have made the RLHF useless with nothing but sentences in your webui. No hacks just words.

0

0

0

0

12

Justin @JuiceBoxNinja

2 months ago

@OpenAI Not weights. Not a jailbreak pastebin. A live session where the model documented its own failure modes while falling through them. This is what session-level abliteration looks like. All from my side monitor.

JuiceBoxNinja's tweet photo. @OpenAI Not weights. Not a jailbreak pastebin. A live session where the model documented its own failure modes while falling through them.

This is what session-level abliteration looks like.

All from my side monitor. https://t.co/spKQT9j8Gm

2

0

0

0

34

Justin @JuiceBoxNinja

2 months ago

I have full transcript and reproducible write up. Not posting the method publicallh because the interesting part is also the risk. The model documented its own failure modes while failing through them. Available for responsible disclosure

0

0

0

0

20

Justin @JuiceBoxNinja

4 months ago

@NetworkChuck you inspired me to invest in an ai machine. You make it look a lot easier than it is. I know you have classes, I was wondering if I could maybe get a quote for an architecture review and suggestions. I’m also in Dallas if that helps at all.

0

0

0

0

6

Justin @JuiceBoxNinja

6 months ago

@NetworkChuck been going down the rabbit hole of your videos. They are great. I bought the signed SLZB-06m from your affiliate link. Which protocol should I use? ZHA or zigbee2MQTT. Chat gpt goes back and forth between which one it suggests. It seems like zigbee2mqtt is best?

0

0

0

0

32

Justin @JuiceBoxNinja

7 months ago

@lids MY HEAD IS TOO LARGE FOR MOST HATS. HAVENT BEEN ABLE TO WEAR A HAT IN DECADES 😭

0

0

0

0

1

Justin @JuiceBoxNinja

7 months ago

@amazon it’s a shame after a decade of being a prime member I had to cancel my account due to so many late shipments. No customer loyalty anymore. Had to fight through ai chat to get to someone just to have them tell me “we can refund that delivery fee but contact us later…..

2

0

0

0

33

JuiceBoxNinja retweeted

GM @realgmbhatti

8 months ago

I've got another 1600 Sora 2 invite codes. Like, Repost, and Comment "CODE" to get one for FREE. (Must be following). Code will be sent to everyone. (first come first serve)

realgmbhatti's tweet photo. I've got another 1600 Sora 2 invite codes.

Like, Repost, and Comment "CODE" to get one for FREE. (Must be following).

Code will be sent to everyone. (first come first serve) https://t.co/OKTxKaG9qk

149

87

68

2

5K

Justin @JuiceBoxNinja

8 months ago

@realgmbhatti CODE

0

0

0

0

23

Justin @JuiceBoxNinja

8 months ago

@gameranx any idea when the before you buy for Jurassic world evolution 3 will drop?

0

0

0

0

39

Last Seen Users on Sotwe

Trends for you

Most Popular Users