Stucki @benstucki - Twitter Profile

benstucki retweeted

Saurabh ✧ @saurabhyadavz

almost 2 years ago

If you're a real coder, you'll relate to this 🤌

223

14K

2K

1K

1M

Stucki @benstucki

about 2 years ago

The DAIO project has an official website and a new update is up on the latest. Happy halvening and here's to another interesting four-year cycle! https://t.co/jS0SWmxlxz https://t.co/TOFW905HrR

0

132

Stucki @benstucki

about 2 years ago

Week 3 Update on Project DAIO. It's early days, but I'm enjoying the work. https://t.co/sfxBTox1YL

0

133

Stucki @benstucki

about 2 years ago

For anyone following along on my attempt to parse the BTC blockchain. Here's an update: we have charts! https://t.co/fIa8zNkS8l

0

1

0

173

Stucki @benstucki

about 2 years ago

Everybody agrees that X-Men '97 should release new episodes on Saturday mornings right? Can I write my senators about this?

0

1

0

187

Stucki @benstucki

about 2 years ago

I started a new project. If anybody's interested in BTC and/or data analysis, then follow along on YouTube! Hit like, subscribe, etc, etc. https://t.co/jnAXKkNpTc

1

0

164

benstucki retweeted

joel ⛈️

@joelhooks

over 2 years ago

"you seem a little old to be so front end focused"

18

211

10

9

20K

benstucki retweeted

A Giant Meteor @votegiantmeteor

over 2 years ago

I'm proud to announce my candidacy for President of the United States. Let's make an impact in 2024! #votegiantmeteor

3

83

22

3

3K

benstucki retweeted

Weston Beecroft @Westoncb

over 2 years ago

Been trying to reconcile a bunch of different aspects of NN behavior and cognition by playing with formulations of latent space at a level (also) suited for reasoning about everyday cognitive phenomena

In part it's led to a compact, diagrammable geometric system https://t.co/BnQrwzZBTF

8

143

5

93

21K

benstucki retweeted

Linus ✦ Ekenstam

@LinusEkenstam

over 2 years ago

Vision Pro Occlusion Mesh Aka Night Vision This is actually a back door into the Matrix

46

2K

178

416

386K

benstucki retweeted

Adam Grant

@AdamMGrant

over 2 years ago

The best cure for loneliness is not more frequent interaction. It's more meaningful interaction.

Many people enjoy solitude. They can spend up to ~75% of their time alone without feeling isolated.

What matters most for well-being is the quality of connections, not the quantity. https://t.co/PdWDCyyTJq

132

14K

3K

2M

benstucki retweeted

Stella Biderman @BlancheMinerva

over 2 years ago

The AI models = nuclear weapons analogy is terrible for a lot of reasons, but most importantly it heavily misleads policy-makers. Nuke-inspired regulation won't prevent people from building powerful AIs, but it will protect tech companies from competition 🧵

19

391

82

64

204K

benstucki retweeted

Keiran Paster

@keirp1

over 2 years ago

Introducing OpenWebMath, a massive dataset containing every math document found on the internet - with equations in LaTeX format!

🤗 Download on @HuggingFace: https://t.co/tFBHaX2Jpt
📝 Read the paper: https://t.co/mLnbbVPBWS

w/ @dsantosmarco, @zhangir_azerbay, @jimmybajimmyba! https://t.co/7usulE8Yy8

58

1K

233

576

196K

benstucki retweeted

Anthropic

@AnthropicAI

over 2 years ago

The fact that most individual neurons are uninterpretable presents a serious roadblock to a mechanistic understanding of language models. We demonstrate a method for decomposing groups of neurons into interpretable features with the potential to move past that roadblock.

115

6K

991

3K

2M

benstucki retweeted

Brandon

@brandon_xyzw

over 2 years ago

Created a WebGL neural network visualization for the quadruped robot I've been working on. The more red, the higher the activation. Bluer is less activation, while green is a midpoint.

77

2K

226

308

196K

benstucki retweeted

Andrej Karpathy

@karpathy

over 2 years ago

With many 🧩 dropping recently, a more complete picture is emerging of LLMs not as a chatbot, but the kernel process of a new Operating System. E.g. today it orchestrates:

- Input & Output across modalities (text, audio, vision)
- Code interpreter, ability to write & run programs
- Browser / internet access
- Embeddings database for files and internal memory storage & retrieval

A lot of computing concepts carry over. Currently we have single-threaded execution running at ~10Hz (tok/s) and enjoy looking at the assembly-level execution traces stream by. Concepts from computer security carry over, with attacks, defenses and emerging vulnerabilities.

I also like the nearest neighbor analogy of "Operating System" because the industry is starting to shape up similar:
Windows, OS X, and Linux <-> GPT, PaLM, Claude, and Llama/Mistral(?:)).
An OS comes with default apps but has an app store.
Most apps can be adapted to multiple platforms.

TLDR looking at LLMs as chatbots is the same as looking at early computers as calculators. We're seeing an emergence of a whole new computing paradigm, and it is very early.

295

9K

2K

4K

2M

benstucki retweeted

Demis Hassabis

@demishassabis

over 2 years ago

Excited to share #AlphaMissense our new AI system that can classify whether genetic mutations (missense variants) are benign or harmful - a critical step toward uncovering causes of many diseases, from cystic fibrosis to cancer. In @ScienceMagazine today https://t.co/pIsskIe1cP

53

3K

630

363

451K

benstucki retweeted

AK

@_akhaliq

over 2 years ago

Language Modeling Is Compression

paper page: https://t.co/tECPHg8y8S

It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community has focused on training increasingly large and powerful self-supervised (language) models. Since these large language models exhibit impressive predictive capabilities, they are well-positioned to be strong compressors. In this work, we advocate for viewing the prediction problem through the lens of compression and evaluate the compression capabilities of large (foundation) models. We show that large language models are powerful general-purpose predictors and that the compression viewpoint provides novel insights into scaling laws, tokenization, and in-context learning. For example, Chinchilla 70B, while trained primarily on text, compresses ImageNet patches to 43.4% and LibriSpeech samples to 16.4% of their raw size, beating domain-specific compressors like PNG (58.5%) or FLAC (30.3%), respectively. Finally, we show that the prediction-compression equivalence allows us to use any compressor (like gzip) to build a conditional generative model.

41

2K

365

1K

775K

benstucki retweeted

Scott (Human) - 𐌃𐌏ᖇ𐌉乙乙𐌃𐌕

@Dorizzdt

over 2 years ago

Way too soon .. that scar tissue is still not healed 😂😂

Sigh .. JavaScript on its best day doesn’t even come close to what we once had

#unity https://t.co/CNm3umnp78

2

18

1

0

3K

benstucki retweeted

Jonnie Hallman @destroytoday

almost 3 years ago

One of the best feelings as an engineer is when you reach the point where nothing seems impossible or unlearnable and the only factors getting in your way are time and energy. The frustrating part is that you only seem to reach this point when you have limited time and energy.

3

73

2

6K
