Patrick Fitzgerald @patrickf_ca - Twitter Profile

Patrick Fitzgerald @patrickf_ca

20 days ago

@bdepaepe63 @mzjacobson @mzjacobson already answered this twice. The post is about energy use, not materials.

0

1

0

11

patrickf_ca retweeted

Richard Sutton

@RichardSSutton

22 days ago

The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.

136

7K

976

3K

573K

patrickf_ca retweeted

Sapient Intelligence @Sapient_Int

21 days ago

Introducing HRM-Text. An ultra-lean 1B-parameter reasoning language model designed to deliver strong general performance with a fraction of the data, compute, and infrastructure. Trained on just 40B structured tokens, HRM-Text achieves competitive performance while using ~1/1000 of the training data of comparable models. The kicker? The full model trains in roughly one day on a $1,000 budget. This opens the door to a new generation of AI that is powerful, accessible, and radically easier to adapt. Theories and research concepts once deemed too expensive to test are officially back in the game. Sapient Intelligence invites you to help us shape a new paradigm for general intelligence.

160

3K

268

2K

507K

patrickf_ca retweeted

Andrej Karpathy

@karpathy

29 days ago

This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc. More generally, imo audio is the human-preferred input to AIs but vision (images/animations/video) is the preferred output from them. Around a ~third of our brains are a massively parallel processor dedicated to vision, it is the 10-lane superhighway of information into brain. As AI improves, I think we'll see a progression that takes advantage: 1) raw text (hard/effortful to read) 2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default 3) HTML (still procedural with underlying code, but a lot more flexibility on the graphics, layout, even interactivity) <-- early but forming new good default ...4,5,6,... n) interactive neural videos/simulations Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive videos generated directly by a diffusion neural net. Many open questions as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral https://t.co/z21CP5iQfu There are also improvements necessary and pending at the input. Audio nor text nor video alone are not enough, e.g. I feel a need to point/gesture to things on the screen, similar to all the things you would do with a person physically next to you and your computer screen. TLDR The input/output mind meld between humans and AIs is ongoing and there is a lot of work to do and significant progress to be made, way before jumping all the way into neuralink-esque BCIs and all that. For what's worth exploring at the current stage, hot tip try ask for HTML.

1K

19K

2K

21K

4M

Who to follow

#Bitcoin and #Crypto Investor since 2017. Follow for crypto news & advice. Mindset is everything! 🦁

Chapmania

@Wiggles_X

Finding balance amid the turmoil!

patrickf_ca retweeted

Andrej Karpathy

@karpathy

about 1 month ago

This is the the quote I've been citing a lot recently.

846

47K

4K

11K

3M

Patrick Fitzgerald @patrickf_ca

about 2 months ago

@allenanalysis It wouldn't end the war unless it passed the House and Senate with a two-thirds supermajority. Otherwise the president can veto this.

0

2

Patrick Fitzgerald @patrickf_ca

2 months ago

@ravihanda #LinkedInLunatics

0

27

Patrick Fitzgerald @patrickf_ca

2 months ago

@nrehiew_ They forked @karpathy's autoresearch repo.

0

6

0

1K

patrickf_ca retweeted

Andrej Karpathy

@karpathy

6 months ago

nanoGPT - the first LLM to train and inference in space 🥹. It begins.

319

11K

854

1K

1M

patrickf_ca retweeted

Nic Cruz Patane

@niccruzpatane

10 months ago

This is the greatest argument I’ve heard for why electric vehicles are better than gasoline-powered cars.

2K

31K

6K

15K

3M

Patrick Fitzgerald @patrickf_ca

over 1 year ago

Unfortunately DeepSeek isn't usable unless it's run locally to avoid censorship.

prayingforexits 🏴‍☠️

@mrexits

over 1 year ago

Ladies and gentlemen, we got em

205

13K

1K

2K

1M

0

2

0

53

patrickf_ca retweeted

Victor Shi

@Victorshi2020

over 1 year ago

Bill Burr NAILED it about the LA fires on Jimmy Kimmel last night: “I think they did a great job, unlike the internet.” “Mismanaged…like some person on the internet knows how to manage the worst fire in LA, sitting there in his underwear.” Watch this.

2K

39K

6K

4K

4M

patrickf_ca retweeted

Don Pettit

@astro_Pettit

over 1 year ago

Flying over aurora; intensely green.

628

28K

5K

2K

3M

patrickf_ca retweeted

NASA Webb Telescope

@NASAWebb

over 1 year ago

Ring around the galaxy… Here’s Webb’s stunning new mid-infrared image of M104. This bright core of the galaxy is dim in this view, revealing a smooth inner disk as well as details of how the clumpy gas in the outer ring is distributed. https://t.co/wQSE9xGTXX

NASAWebb's tweet photo. Ring around the galaxy… Here’s Webb’s stunning new mid-infrared image of M104.

This bright core of the galaxy is dim in this view, revealing a smooth inner disk as well as details of how the clumpy gas in the outer ring is distributed. https://t.co/wQSE9xGTXX https://t.co/p7zNl4YSwD

54

3K

585

211

455K

patrickf_ca retweeted

☈ Chris Jackson ☈

@ChrisJacksonSC

over 1 year ago

Surge Cam 2 is in place in the main traffic circle on St. Armands Key, Florida at 11’ MSL. Video & Weather data now live streaming. Check the link below. https://t.co/1hRB6QXCAi #FLwx #Milton

ChrisJacksonSC's tweet photo. Surge Cam 2 is in place in the main traffic circle on St. Armands Key, Florida at 11’ MSL. Video & Weather data now live streaming. Check the link below.

https://t.co/1hRB6QXCAi

#FLwx
#Milton https://t.co/a1SumXbH7s

112

7K

1K

5K

3M

patrickf_ca retweeted

Philipp Schmid

@_philschmid

over 1 year ago

Can @AnthropicAI Claude 3.5 sonnet outperform @OpenAI o1 in reasoning? Combining Dynamic Chain of Thoughts, reflection, and verbal reinforcement, existing LLMs like Claude 3.5 Sonnet can be prompted to increase test-time compute and match reasoning strong models like OpenAI o1. 👀 TL;DR: 🧠 Combines Dynamic Chain of thoughts + reflection + verbal reinforcement prompting 📊 Benchmarked against tough academic tests (JEE Advanced, UPSC, IMO, Putnam) 🏆 Claude 3.5 Sonnet outperformes GPT-4 and matched O1 models 🔍 LLMs can create internal simulations and take 50+ reasoning steps for complex problems 📚 Works for smaller, open models like Llama 3.1 8B +10% (Llama 3.1 8B 33/48 vs GPT-4o 36/48) ❌ Didn’t benchmark like MMLU, MMLU pro, or GPQA due to computing and budget constraints 📈 High token usage - Claude Sonnet 3.5 used around 1 million tokens for just 7 questions

_philschmid's tweet photo. Can @AnthropicAI Claude 3.5 sonnet outperform @OpenAI o1 in reasoning? Combining Dynamic Chain of Thoughts, reflection, and verbal reinforcement, existing LLMs like Claude 3.5 Sonnet can be prompted to increase test-time compute and match reasoning strong models like OpenAI o1. 👀

TL;DR:
🧠 Combines Dynamic Chain of thoughts + reflection + verbal reinforcement prompting
📊 Benchmarked against tough academic tests (JEE Advanced, UPSC, IMO, Putnam)
🏆 Claude 3.5 Sonnet outperformes GPT-4 and matched O1 models
🔍 LLMs can create internal simulations and take 50+ reasoning steps for complex problems
📚 Works for smaller, open models like Llama 3.1 8B +10% (Llama 3.1 8B 33/48 vs GPT-4o 36/48)
❌ Didn’t benchmark like MMLU, MMLU pro, or GPQA due to computing and budget constraints
📈 High token usage - Claude Sonnet 3.5 used around 1 million tokens for just 7 questions

51

2K

271

2K

1M

patrickf_ca retweeted

Simon Willison

@simonw

over 1 year ago

Let’s see if we can crowdsource a robust definition of “agent” (with respect to AI and LLMs) that fits in a <=280 character tweet Reply to this with your best attempt, then scroll through the replies and fave the ones that makes sense to you

254

575

58

501

185K

patrickf_ca retweeted

martin_casado

@martin_casado

almost 2 years ago

OK, here is my best guess on the state of LLMs: - The scale increase between gpt-3 and gpt-4 was 100x - Doing that for the next model is going to be very hard - We're nearly out of general language tokens. So let's say we can 2x that. And perhaps get more proprietary tokens and get to 3-4x. And do a lot of data cleaning and get to 6-7x. - A 100x training run also requires a Gigawatt datacenter which we don't have yet - Synthetic data is great, but it's not clear how that can be used for general language. I suspect this is why both OAI and Anthropic are focusing on math and code which can be improved via various "synthetic" compute methods (simulated data, or recursive self improvement of some sort) - In the meantime, there is focus on getting more learnings from the same data. Perhaps there is a breakthrough there but I've not heard of it - Planning can be pushed to inference in some domains (e.g. coding) which we're starting to hear about. But again, not clear how much this buys. - Moronic policies like SB 1047 are threatening to slow all this down. So tl;dr I don't see where the 100x jump will come from for general language reasoning. This is why we're seeing a focus on math and code. I'm glad teams are working hard at new algorithmic unlocks. (btw, this is pure speculation, would love to know where I'm wrong!)

129

1K

114

934

538K

patrickf_ca retweeted

Andrej Karpathy

@karpathy

almost 2 years ago

Programming is changing so fast... I'm trying VS Code Cursor + Sonnet 3.5 instead of GitHub Copilot again and I think it's now a net win. Just empirically, over the last few days most of my "programming" is now writing English (prompting and then reviewing and editing the generated diffs), and doing a bit of "half-coding" where you write the first chunk of the code you'd like, maybe comment it a bit so the LLM knows what the plan is, and then tab tab tab through completions. Sometimes you get a 100-line diff to your code that nails it, which could have taken 10+ minutes before. I still don't think I got sufficiently used to all the features. It's a bit like learning to code all over again but I basically can't imagine going back to "unassisted" coding at this point, which was the only possibility just ~3 years ago.

514

18K

2K

11K

3M

Patrick Fitzgerald @patrickf_ca

almost 2 years ago

"What is the ultimate quantification of success? For me, it’s not how much time you spend doing what you love. It’s how little time you spend doing what you hate." - @Casey. Weekly wisdom from @Schwarzenegger's daily newsletter.

0

21

Patrick Fitzgerald

@patrickf_ca

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users