Chris Mayfield @abandonrules - Twitter Profile

Pinned Tweet

Chris Mayfield @abandonrules

about 6 years ago

Wonder what this is?

1

2

0

Chris Mayfield @abandonrules

6 months ago

Christmas Ornament Has Hidden Compartment, Clever Design via @hackaday https://t.co/lAwaRGMDfR

0

11

abandonrules retweeted

Sundar Pichai

@sundarpichai

about 1 year ago

What a finish! Gemini 2.5 Pro just completed Pokémon Blue! Special thanks to @TheCodeOfJoel for creating and running the livestream, and to everyone who cheered Gem on along the way.

207

6K

833

742

1M

Chris Mayfield @abandonrules

over 1 year ago

A Programming Language For Building NES Games via @hackaday https://t.co/nwEQBlIE4P

0

14

Who to follow

Chris C Games

@CChungGames

Designer/Developer of Lanterns | Spell Smashers | MLP: Adventures in Equestria DBG | MLP: Festival of Lanterns

ZugZug art

@zugzugart

3D Art Production, Online Courses, Mentorship, YouTube Tutorials, and Stylized 3D Art Community

Jason D. Kingsley

@JasonDKingsley

Introvert, graphic designer, amateur game designer. Trying to follow Jesus. Here to retweet art and talk rulebooks. I play with blue.

abandonrules retweeted

Thread Reader App

@threadreaderapp

over 1 year ago

@jiayi_pirate Your thread is creating a buzz! #TopUnroll https://t.co/p0GZkF7Tpk 🙏🏼@vankous for 🥇unroll

0

2

4

3K

abandonrules retweeted

Jiayi Pan @jiayi_pirate

over 1 year ago

We reproduced DeepSeek R1-Zero in the CountDown game, and it just works Through RL, the 3B base LM develops self-verification and search abilities all on its own You can experience the Ahah moment yourself for < $30 Code: https://t.co/UcGKN2SVGj Here's what we learned 🧵

jiayi_pirate's tweet photo. We reproduced DeepSeek R1-Zero in the CountDown game, and it just works

Through RL, the 3B base LM develops self-verification and search abilities all on its own

You can experience the Ahah moment yourself for < $30
Code: https://t.co/UcGKN2SVGj

Here's what we learned 🧵

192

6K

1K

6K

2M

Chris Mayfield @abandonrules

over 1 year ago

This QR Code Leads To Two Websites, But How? via @hackaday https://t.co/QpRtXOcmq4

0

17

Chris Mayfield @abandonrules

over 1 year ago

3D-Printed RC Car Focuses On Performance Fundamentals via @hackaday https://t.co/ti7x0qggHk

0

16

Chris Mayfield @abandonrules

over 1 year ago

USB-C For Hackers: Reusing Cables via @hackaday https://t.co/xHgxmYxVh5

0

6

Chris Mayfield @abandonrules

almost 2 years ago

@stickermule @rickyberwick So where is the story? I need details

0

46

abandonrules retweeted

Sticker Mule

@stickermule

almost 2 years ago

Our last CEO f'd things up. I'm fixing them. Introducing white label hot sauce. Go fucking buy it now. https://t.co/dnBsPvbK8f Ricky Berwick (@rickyberwick) CEO, Sticker Mule P.S. 20 people who repost this win $500. Follow us so we can DM you.

806

2K

4K

163

327K

Chris Mayfield @abandonrules

almost 2 years ago

@AmabelHolland This article brought me here. https://t.co/ER6a1jpNOz

0

9

abandonrules retweeted

Jimmy Fallon

@jimmyfallon

almost 2 years ago

Wedding RSVPs should have options for “Yes,” “No,” and “I feel obligated.”

102

1K

194

9

245K

Chris Mayfield @abandonrules

about 2 years ago

New Part Day: A Hackable Smart Ring via @hackaday https://t.co/y2tjFLKAyp

0

27

abandonrules retweeted

elvis

@omarsar0

about 2 years ago

Nemotron-4 340B is a huge release by NVIDIA! Why? The Nemotron-4 340B instruct model lets you generate high-quality data and then the reward model (also released) can filter out data on several attributes. It's often not enough just to have a good instruct model for generating high-quality synthetic data. You also need to filter that generated data on a set of requirements. It's common to use the same model to grade the outputs (LLM-as-Judge) but the reward model (Reward-Model-as-Judge), which is often not released, is ideal for this. Also, check out in the paper how Nemotron-4-340B compares with GPT-4-1106-preview when evaluated by humans on several tasks. Seems that GPT-4 is a lot better at rewrite and extraction but Nemotron-4-340B compared nicely on other tasks. Multi-turn chat has a high score! Something else that caught my attention is that the majority of preference data used for alignment are synthetic. The results show that Nemotron-4 340B is a strong model. Check out those MMLU, GSM8K, and Arena Hard numbers. GPT-4 is still ahead but the gap keeps closing. What's Meta's move now? Are we going to see a similar type of release? Could be a huge moment for the field and open LLMs. Nice release from NVIDIA. They even released a preference dataset (link in the replies). Well done to the team!