Luke Salamone

@LukeASalamone

Machine learning engineer. In the words of a wise man, "I'm nice at ping pong"

Bay Area

Joined May 2016

474 Following

278 Followers

518 Posts

Pinned Tweet

Luke Salamone @LukeASalamone

about 9 years ago

I have discovered a truly remarkable proof of P=NP which this tweet is too small to contain.

2

3

0

0

0

Luke Salamone @LukeASalamone

over 1 year ago

@y0b1byte In section 2.3.2 they said that cold started RL still had language mixing problems. They had to specifically introduce a language matching reward to mitigate this.

0

0

0

0

25

Luke Salamone @LukeASalamone

almost 2 years ago

I have never been able to generate an accurate chess board in any position. This may be an “AI hard” task: solving it probably requires a breakthrough in reasoning @GaryMarcus

https://t.co/p5uWrwfIgL

almost 2 years ago

https://t.co/6whvSBlrio

0

10

1

9

30K

10

67

8

24

94K

LukeASalamone retweeted

almost 2 years ago

Frontier models like GPT-4o (and now Claude 3.5 Sonnet) may be at the level of a "Smart High Schooler" in some respects, but they still struggle on basic tasks like tic-tac-toe. There was hope that native multimodal training would help but that hasn't been the case.

https://t.co/1iDq0DCL4Q

40

485

47

155

102K

LukeASalamone retweeted

about 2 years ago

A short post on the best architectures for real-time image and video processing. TL;DR: use convolutions with stride or pooling at the low levels, and stick self-attention circuits at higher levels, where feature vectors represent objects. PS: ready to bet that Tesla FSD uses convolutions (or perhaps more complex *local* operators) at the low levels, combined with more global circuits at higher levels (perhaps using self-attention). Transformers on low-level patch embeddings are a complete waste of electrons.

61

1K

110

986

748K

LukeASalamone retweeted

Aran Komatsuzaki

@arankomatsuzaki

about 2 years ago

Octopus v2: On-device language model for super agent

Presents a new method that empowers an on-device 2B model to outperform GPT-4 in both accuracy and latency, and decrease the context length by 95%

https://t.co/J1ikDK4ELx https://t.co/Kcu6QRdJTk

10

242

57

186

67K

LukeASalamone retweeted

Robert Komaniecki @Komaniecki_R

about 2 years ago

The Trautonium. Invented early 1930s. Just listen to this thing.

492

34K

6K

12K

2M

LukeASalamone retweeted

over 2 years ago

Google presents Genie: Generative Interactive Environments

We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie can be considered a foundation world model. It is comprised of a spatiotemporal video tokenizer, an autoregressive dynamics model, and a simple and scalable latent action model. Genie enables users to act in the generated environments on a frame-by-frame basis despite training without any ground-truth action labels or other domain-specific requirements typically found in the world model literature. Further the resulting learned latent action space facilitates training agents to imitate behaviors from unseen videos, opening the path for training generalist agents of the future.

75

2K

500

1K

684K

LukeASalamone retweeted

over 2 years ago

Google Deepmind presents Grandmaster-Level Chess Without Search

paper page: https://t.co/qwpbAb9DL7

largest model reaches a Lichess blitz Elo of 2895 against humans, and successfully solves a series of challenging chess puzzles, without any domain-specific tweaks or explicit search algorithms. We also show that our model outperforms AlphaZero's policy and value networks (without MCTS) and GPT-3.5-turbo-instruct. A systematic investigation of model and dataset size shows that strong chess performance only arises at sufficient scale. To validate our results, we perform an extensive series of ablations of design choices and hyperparameters.

35

1K

256

640

266K

LukeASalamone retweeted

𝕭𝖏ø𝖗𝖓 𝕾𝖙𝖆𝖆𝖑

@_nonfigurativ_

over 2 years ago

Entangled #fxhash

2K

64K

9K

13K

10M

LukeASalamone retweeted

over 2 years ago

Three logicians walk into a bar. The bartender asks: 'Does everyone want a drink?' The first logician says: 'I don't know.' The second logician says: 'I don't know.' The third logician says: 'Yes.'

40

7K

638

424

509K

LukeASalamone retweeted

over 2 years ago

The SHA256 for this sentence begins with: one, eight, two, a, seven, c and nine.

83

2K

292

312

419K
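
The self-referential SHA256 claim above can be checked mechanically. The following is a minimal sketch, not the tweet author's method: the helper names `spoken_to_hex` and `sha256_starts_with` are hypothetical, and the sketch assumes the sentence is hashed as UTF-8 text with no trailing newline.

```python
import hashlib

# Spoken names as used in the sentence, mapped to hex digits.
SPOKEN = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
    "a": "a", "b": "b", "c": "c", "d": "d", "e": "e", "f": "f",
}

def spoken_to_hex(words):
    """Turn a list such as ["one", "eight", "a"] into the hex string "18a"."""
    return "".join(SPOKEN[w] for w in words)

def sha256_starts_with(sentence, words):
    """Return True if SHA-256 of the UTF-8 sentence begins with the spoken digits."""
    digest = hashlib.sha256(sentence.encode("utf-8")).hexdigest()
    return digest.startswith(spoken_to_hex(words))

if __name__ == "__main__":
    claimed = ["one", "eight", "two", "a", "seven", "c", "nine"]
    sentence = "The SHA256 for this sentence begins with: one, eight, two, a, seven, c and nine."
    # The result depends on the exact bytes of the sentence.
    print(sha256_starts_with(sentence, claimed))
```

The check is sensitive to every byte, so a copy of the sentence that passed through a scraper (different punctuation or whitespace) may not reproduce the claimed prefix.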

LukeASalamone retweeted

Luke Gessler @LukeGessler

almost 3 years ago

this paper's nuts. for sentence classification on out-of-domain datasets, all neural (Transformer or not) approaches lose to good old kNN on representations generated by.... gzip https://t.co/6eZiXlJxOX

https://t.co/sF9kd1FzI4

121

5K

814

2K

3M
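
The gzip idea in the tweet above can be illustrated with a compression-based distance fed into kNN. This is a rough sketch under stated assumptions, not the paper's code: it uses the normalized compression distance with gzip as the compressor, joins texts with a space, and the names `clen`, `ncd`, and `knn_classify` are hypothetical.

```python
import gzip
from collections import Counter

def clen(text):
    """Length in bytes of the gzip-compressed UTF-8 text."""
    return len(gzip.compress(text.encode("utf-8")))

def ncd(a, b):
    """Normalized compression distance: small when a and b share structure."""
    ca, cb = clen(a), clen(b)
    cab = clen(a + " " + b)
    return (cab - min(ca, cb)) / max(ca, cb)

def knn_classify(query, train, k=1):
    """Label the query by majority vote among its k nearest training texts.

    train is a list of (text, label) pairs.
    """
    nearest = sorted(train, key=lambda pair: ncd(query, pair[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

There is nothing to train here: the only "model" is the compressor, which is what makes the reported result surprising.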

LukeASalamone retweeted

over 3 years ago

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models abs: https://t.co/G6568wgwky project page: https://t.co/L1jLVcPTdz

5

342

71

97

80K

Luke Salamone @LukeASalamone

over 3 years ago

@VictorButoi The biggest red flag is claiming to measure perplexity without access to the model logits. It’s borderline fraudulent.

0

0

0

0

63
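
To make the point about logits in the reply above concrete: perplexity is the exponential of the average negative log-probability that the model assigns to each observed token, so it needs the model's per-token distribution. A minimal sketch, assuming the logits are available as plain Python lists (the names `log_softmax` and `perplexity` are illustrative, not from any particular library):

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over one vocabulary-sized list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def perplexity(step_logits, target_ids):
    """exp of the mean negative log-probability of the observed tokens.

    step_logits[i] holds the model's logits for position i and
    target_ids[i] is the token that actually appeared there.
    """
    if not target_ids or len(step_logits) != len(target_ids):
        raise ValueError("need one logit vector per target token")
    total = 0.0
    for logits, target in zip(step_logits, target_ids):
        total += log_softmax(logits)[target]
    return math.exp(-total / len(target_ids))
```

Without the logits (or equivalent per-token probabilities), the sum inside `perplexity` cannot be computed, only approximated by some proxy.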

Luke Salamone @LukeASalamone

over 3 years ago

@anmarasovic @soldni For my blog search, I tokenize searchable text into trigrams before ranking with BM25. It’s snappy because it’s all happening in the browser and gives intuitive results. https://t.co/J32RPzTzd4

0

1

0

0

134
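
The search approach described in the reply above (character trigrams ranked with BM25) can be sketched as follows. This is an illustration under assumptions, not the blog's actual code, which per the tweet runs in the browser: the function names, the `k1` and `b` defaults, and the particular IDF variant are choices made here.

```python
import math
from collections import Counter

def trigrams(text):
    """Split text into overlapping, lowercased character trigrams."""
    s = text.lower()
    return [s[i:i + 3] for i in range(len(s) - 2)]

def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Return (doc_index, score) pairs sorted by BM25 score, best first."""
    if not docs:
        return []
    doc_tokens = [trigrams(d) for d in docs]
    n_docs = len(docs)
    # Fall back to 1.0 so very short documents do not cause a division by zero.
    avg_len = (sum(len(t) for t in doc_tokens) / n_docs) or 1.0
    # Document frequency: how many documents contain each trigram.
    df = Counter(g for toks in doc_tokens for g in set(toks))
    query_grams = set(trigrams(query))
    scores = []
    for idx, toks in enumerate(doc_tokens):
        tf = Counter(toks)
        score = 0.0
        for g in query_grams:
            if g not in tf:
                continue
            idf = math.log(1 + (n_docs - df[g] + 0.5) / (df[g] + 0.5))
            norm = tf[g] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[g] * (k1 + 1) / norm
        scores.append((idx, score))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)
```

Because matching happens on trigrams rather than whole words, a partial query such as a word prefix still overlaps with the documents that contain the full word.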

LukeASalamone retweeted

Mosquito Capital @MosquitoCapital

over 3 years ago

I've seen a lot of people asking "why does everyone think Twitter is doomed?" As an SRE and sysadmin with 10+ years of industry experience, I wanted to write up a few scenarios that are real threats to the integrity of the bird site over the coming weeks.

1K

56K

14K

11K

0

LukeASalamone retweeted

almost 4 years ago

It's here–the deepest, sharpest infrared view of the universe to date: Webb's First Deep Field.

Previewed by @POTUS on July 11, it shows galaxies once invisible to us. The full set of @NASAWebb's first full-color images & data will be revealed July 12: https://t.co/63zxpNDi4I https://t.co/zAr7YoFZ8C

8K

533K

123K

8K

0

LukeASalamone retweeted

Armin Ronacher ⇌

almost 5 years ago

I don't want to say anything but that's not the right license Mr Copilot.

63

4K

1K

215

0

LukeASalamone retweeted

about 4 years ago

DALLE-2 has a secret language.
"Apoploe vesrreaitais" means birds.
"Contarra ccetnxniams luryca tanniounons" means bugs or pests.

The prompt: "Apoploe vesrreaitais eating Contarra ccetnxniams luryca tanniounons" gives images of birds eating bugs.

A thread (1/n)🧵 https://t.co/VzWfsCFnZo

184

8K

2K

1K

0

LukeASalamone retweeted

about 4 years ago

Freakishly good. Better than many humans.

https://t.co/rOYAwFRUc7

35

1K

148

131

0
