Hans-Peter Zorn @data_hpz - Twitter Profile

data_hpz retweeted

25 days ago

@ylecun @eladgil Born in Zurich 🇨🇭: Vision Transformer (ViT); Google SigLIP, BiT, MLP-Mixer (Google Brain); Google Lens; Google Maps; pLSA (foundation of modern NLP); Microsoft HoloLens vision/SLAM; multimodal core of Gemini (DeepMind ZH)

1

85

6

18

13K

data_hpz retweeted

Azeem Azhar

@azeem

about 1 month ago

Love a bit of exponential

3

40

16

0

5K

data_hpz retweeted

Azeem Azhar

@azeem

about 1 month ago

Weird conclusion

2

29

2

7K

data_hpz retweeted

Andrej Karpathy

@karpathy

2 months ago

- Drafted a blog post
- Used an LLM to meticulously improve the argument over 4 hours.
- Wow, feeling great, it’s so convincing!
- Fun idea let’s ask it to argue the opposite.
- LLM demolishes the entire argument and convinces me that the opposite is in fact true.
- lol

The LLMs may elicit an opinion when asked but are extremely competent in arguing almost any direction. This is actually super useful as a tool for forming your own opinions, just make sure to ask different directions and be careful with the sycophancy.

2K

31K

2K

9K

3M

data_hpz retweeted

François Chollet

@fchollet

2 months ago

ARC-AGI-3 is out now! We've designed the benchmark to evaluate agentic intelligence via interactive reasoning environments. Beating ARC-AGI-3 will be achieved when an AI system matches or exceeds human-level action efficiency on all environments, upon seeing them for the first time. We've done extensive human testing that shows 100% of these environments are solvable by humans, upon first contact, with no prior training and no instructions. Meanwhile, all frontier AI reasoning models do under 1% at this time.

236

3K

340

726

622K

Hans-Peter Zorn

@data_hpz

2 months ago

@yoavgo I have some MCP that can create quite nice images and charts for those typst-presentations: https://t.co/OcRX46jWG0

1

0

56

Hans-Peter Zorn

@data_hpz

2 months ago

@yoavgo Nobody wants this, especially if you can have Claude generate Lean typst code (or LaTeX Beamer, if you must) for presentations that you can check into git.

1

0

79

data_hpz retweeted

Simon Willison

@simonw

2 months ago

Thankfully the LiteLLM package has now been marked as "quarantined" on PyPI so attempting to install the compromised update via pip et al shouldn't work

53

883

94

89

120K

Hans-Peter Zorn

@data_hpz

2 months ago

The harness -- not the prompt -- is the product. What's your approach: prompt, context, or harness? 6/6

0

1

68

Hans-Peter Zorn

@data_hpz

2 months ago

The evolution of AI engineering in 3 years -- from hoping the LLM listens to ensuring the system enforces. 1/6

1

2

197

Hans-Peter Zorn

@data_hpz

2 months ago

The shift is about WHO is in control. Most teams are still in prompt engineering mode, trying to get the LLM to "behave." The real shift is architectural: build systems that constrain what the LLM can do. 5/6

1

0

74

Hans-Peter Zorn

@data_hpz

2 months ago

How do you handle context loss when your AI agent works with large specifications? Tulla is my experimental open-source implementation of Semantic SDD. It is research, beware. But I am open to feedback. No warranties. https://t.co/Gi0VONXW36 6/6

0

1

0

117

Hans-Peter Zorn

@data_hpz

2 months ago

Spec-Driven Development is the current trend: write a detailed spec, let the AI code. But there is a fundamental flaw. 1/6

1

3

1

3

340

Hans-Peter Zorn

@data_hpz

2 months ago

SDD: hand the construction crew a bunch of text files and hope they remember page 42. Semantic SDD: feed them verified instructions for the specific brick they're holding. 5/6

1

2

0

103
