Yousef | Developer | e/acc @DeepIssueMassaj - Twitter Profile

Pinned Tweet

about 1 month ago

I’ve been training a model called Sinai. The reason I started is simple: I want to tackle hallucinations at the model behavior level. Not by making the model bigger and hoping it becomes honest. Not by putting retrieval around a chatbot and pretending the problem is solved. Sinai is being trained to recognize when evidence is actually enough to answer, and when the correct move is to refuse.
I just finished the first Sinai-EI eval run on the current model. Early results: 100% abstention recall on insufficient evidence cases. 80 to 90% direct lookup accuracy. Strong evidence selection in covered domains. Multi-hop synthesis and conflict detection are starting to show up.
Right now I’m verifying claim-level support before release, so unsupported claims can be caught before they reach the user. That is the part I care about most. I don’t want another model that sounds confident while making things up. I want Sinai to know where the evidence ends. A fluent wrong answer is worse than a correct refusal.
Stay tuned :D

0

87

Yousef | Developer | e/acc

@DeepIssueMassaj

43 minutes ago

Are they patenting boners now?

Eurogamer @eurogamer

about 9 hours ago

Sony has patented a controller with buttons that harden when you play with it https://t.co/22ibl08wgS

277

4K

191

908

660K

0

DeepIssueMassaj retweeted

Zee++ ☕️🍉 @_ZEEPLUSPLUS

1 day ago

@cloudiemcdoom Oh yes, and his companion: 50 Gil

52

3K

353

426

495K

DeepIssueMassaj retweeted

rekdt

@rekdt

2 days ago

True story: I was nominated for employee of the quarter for saving our company millions of dollars a month by rearchitecting a complicated legacy app. But they gave the award to a girl in HR because “she shows up every day with a smile on her face”

190

59K

2K

1M

Yousef | Developer | e/acc

@DeepIssueMassaj

3 days ago

You can't say shit like this and not expect it to crash

National Post

@nationalpost

4 days ago

Toronto's towering temporary FIFA bleachers perfectly safe, builder says, especially on game day https://t.co/QFQeg2RuMd

94

195

16

42

924K

0

4

DeepIssueMassaj retweeted

sdmat

@sdmat123

4 days ago

Anthropic

124

15K

1K

754K

DeepIssueMassaj retweeted

public_intellectual

@publicinte

4 days ago

none of this happens if they called it opus 5 and didn’t engage in the day 0 propaganda

111

9K

252

231

273K

Yousef | Developer | e/acc

@DeepIssueMassaj

4 days ago

Honestly, after trying a couple of Dyson products, Dyson should make a PC case.

fofik

@benfofik

5 days ago

🚨 Dyson has now also made and tested a hair clip that doesn't fall out of your hair

511

25K

611

5K

22M

0

13

Yousef | Developer | e/acc

@DeepIssueMassaj

4 days ago

Bruh

International Cyber Digest

@IntCyberDigest

4 days ago

‼️ UPDATE: It just doesn't stop: Almost 900 Arch Linux packages infected now. https://t.co/xpRTsucoxA

186

6K

853

2K

1M

0

37

Yousef | Developer | e/acc

@DeepIssueMassaj

7 days ago

@thsottiaux @MeekMill Except Linux users

0

1

0

30

Yousef | Developer | e/acc

@DeepIssueMassaj

7 days ago

@kimmonismus I like how every week you say 5.6 next week

0

349

DeepIssueMassaj retweeted

kanav

@kanavtwt

8 days ago

😭😭😭

84

8K

380

407

197K

Yousef | Developer | e/acc

@DeepIssueMassaj

7 days ago

My timelines for some reason

0

7

DeepIssueMassaj retweeted

BISCUITS

@BlSCUlTS

8 days ago

unbelievable timeline we live in

86

36K

2K

3M

DeepIssueMassaj retweeted

Ash @AshStash__

8 days ago

this week has been awesome, can you imagine how fun it'd be if all of these were bundled together in some sort of electronic entertainment expo lol thatd be cute i think

204

113K

9K

4K

2M