David Novák @Kaladivo - Twitter Profile

5 days ago

@kamcab1 @kalousekm Takže když Babiše jmenoval prezident, znamená to, že rozhoduje tak, jak prezident chce? Náš systém má mnoho chyb, ta, kterou jste popsal vy ale nedává smysl…

0

7

0

102

David Novák @Kaladivo

6 days ago

@big_macow @Semper_Viventem So should you also label code that was generated by ide (snippets, autocomplete…)?. It’s not about the code. It’s about the ownership and responsibility. AI does not have either of these…

0

6

Kaladivo retweeted

BTC Prague

@BTCPrague

15 days ago

People, community, full room. 🔑

3

17

4

0

3K

David Novák @Kaladivo

14 days ago

@theo Do you think this makes open AI also hesitate with the release of new model (if they have any)? Could they be a little bit worried they get the same treatment as Anthropic?

0

1

0

107

Who to follow

Agorism In the XXIst Century

@AgorismXXI

A Philosophical Journal.

۟

@hyper_text

arranging rectangles @archilogic

Jens Krause

@sectore

Developer. Since 1999.

David Novák @Kaladivo

17 days ago

@NewOneZ_ @michal_novak_21 You are right! Hopefully they catch up. But working with opus 4.6 is different than working with gpt 5.5. Try it, for developers or agentic use it requires much more handholding

0

312

Kaladivo retweeted

fejau

@fejau_inc

17 days ago

Crypto industry seeing AI getting rugged by the government

57

2K

180

55

83K

David Novák @Kaladivo

17 days ago

@hnizdiljan @zednicek_petr @michal_novak_21 Snad máte pravdu. Uvidíme, jestli se jim podaří za čtyři měsíce dostat na úroveň dnešních modelů a jak velkej ten gap bude potom…

1

0

54

David Novák @Kaladivo

17 days ago

@hnizdiljan @zednicek_petr @michal_novak_21 Pls, zdroj. Můžeme se tady bavit donekonečna co má jaké výsledky. Ale zatím vidím jen kimi k 2.6 kterej je pozadu …

1

0

81

Kaladivo retweeted

Programmer Humor

@PR0GRAMMERHUM0R

17 days ago

newBenchmarkDropped

55

11K

809

310

230K

David Novák @Kaladivo

20 days ago

@jsrailton Isnt the malware just flagging itself this way? If I find a text about some biological weapon I can say that I am dealing with malware right? I don’t even need latest and greatest model for that… But I agree with you, the “security” guardrails are dumb…

0

2

0

1K

Kaladivo retweeted

John Scott-Railton

@jsrailton

20 days ago

NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

jsrailton's tweet photo. NEW: malware developers added nuclear & biological weapons text to to their spyware.

Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner.

Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky.

When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit.

We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted.

In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation.

H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

224

13K

2K

5K

2M

David Novák @Kaladivo

23 days ago

@raffiki_art Not here in Czechia...

0

31

David Novák @Kaladivo

27 days ago

@boyacaxa @ViliamKlamarcik Hello, should be released by now :)! Sorry for missing that https://t.co/7b0SUh7onh

1

0

14

David Novák @Kaladivo

about 1 month ago

@OpenAI How does this work? Do I have to have original image file , or will the photo / screenshot be enough? A lot of ai images in spams/scams won’t be possible to access directly…

0

954

Kaladivo retweeted

JNS

@_devJNS

about 2 months ago

16

1K

129

181

76K

David Novák @Kaladivo

about 2 months ago

@Everlier @ThePrimeagen There is a command you can run to permanently enable hidden files : `defaults write https://t.co/Q3c3cY6GPa.finder AppleShowAllFiles -bool true && killall Finder`. :) (after killing the finder system will automatically restart it ;))

0

1

0

66

Kaladivo retweeted

Connor

@Jchammond_

about 2 months ago

140

8K

489

468

231K

David Novák @Kaladivo

about 2 months ago

@thsottiaux Opensource it!

0

3

David Novák @Kaladivo

about 2 months ago

@theo @ArtShendrik I believe it's important to be able to inspect the tool calls. With time you get a sense on what to look for and skip it. But it gives you a idea of what the agent is doing and weather you need to steer it into a different direction or modify your prompt.

0

1

0

32

David Novák

@Kaladivo

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users