Andrew Bonello

@AndrewBonello

AI & Software. Comedy, Voice, Acting & Improv. Film Analysis & Reviews @FilmTagger_com. @Google @DWAnimation @Cinesite @StHughsCollege alum. Love #Golang.

Joined June 2009

3.4K Following

2.5K Followers

21K Posts

AndrewBonello retweeted

Patrick McKenzie

@patio11

over 7 years ago

This essay is one of the most readable criticisms of US housing finance and policy I’ve ever seen: https://t.co/Ac26PUmE2d

301

Andrew Bonello @AndrewBonello

about 1 year ago

Need help developing your cpp cplusplus backend application ? https://t.co/vTKQGNlsri

AndrewBonello retweeted

Andrew Bonello @AndrewBonello

about 1 year ago

Do you need a comedic or humorous audio voiceover? Then look no further! https://t.co/n7lWlS7HX6

Andrew Bonello @AndrewBonello

about 1 year ago

Do you need a comedic or humorous audio voiceover? Then look no further! https://t.co/n7lWlS7HX6

Who to follow

Solomon Walker

@solomonwalker

Artist, Entrepreneur, Photographer. CEO @ MUSEUM of DIGITAL FINE ARTS. #AI #ComputerArt #DigitalArt #ART #Photography #Crypto #Tech #NFT #AIArt #NFTArt #Design

Chris Saraceno

@ChrisBSaraceno

Vice President/Partner Kelly Automotive Group/Best Selling Author Of TheoryOf5 #TheoryOf5

Fearless Business Boss

@fearlessbizboss

Empowering online service providers to grow a business that fits your LIFESTYLE, not squeeze a life around your business. Let's GROW Your Business!

AndrewBonello retweeted

FilmTagger.com @FilmTagger_com

about 1 year ago

Need to record a hilarious comedy Donald Trump voiceover? https://t.co/xjptcuha0O

AndrewBonello retweeted

Lita 조원경

@litacho

over 4 years ago

@jeanqasaur @hrlomax I never heard of the term developer influencers until now.

AndrewBonello retweeted

✨ Jean Yang ✨ @jeanqasaur

about 3 years ago

@polikarn @thatplguy But! You may (correctly) say: people don't like to write specifications OR types. One of my other favorite papers is by Ras Bodik (again), about using machine learning to automatically mine specifications from program behavior. https://t.co/MHioeEKpHk 6/

AndrewBonello retweeted

François Chollet

@fchollet

over 1 year ago

I'm joining forces with @mikeknoop to start Ndea (@ndeainc), a new AI lab. Our focus: deep learning-guided program synthesis. We're betting on a different path to build AI capable of true invention, adaptation, and innovation.

fchollet's tweet photo. I'm joining forces with @mikeknoop to start Ndea (@ndeainc), a new AI lab.

Our focus: deep learning-guided program synthesis. We're betting on a different path to build AI capable of true invention, adaptation, and innovation. https://t.co/QbjHXVJ9If

126

221

613

193K

Andrew Bonello @AndrewBonello

over 1 year ago

Isaac Asimov asks "How Do People Get New Ideas?" ... https://t.co/ZWaAEHmtYK

AndrewBonello retweeted

Burny - Effective Curiosity

@burny_tech

about 2 years ago

The Kalman filter is a widely used algorithm for estimating the hidden states of a dynamic system from a series of noisy measurements. It works by recursively predicting the system's state using a dynamic model, and then updating this prediction with new measurement data. Some key points about the Kalman filter: - It is an optimal estimator for linear systems with Gaussian noise, minimizing the mean squared error of the estimated state. [2] - It consists of two main steps: prediction and update. In the prediction step, it estimates the current state based on the previous state and the system dynamics. In the update step, it incorporates a new measurement to correct the prediction. [3] - It accounts for both process noise (uncertainty in the system dynamics) and measurement noise (errors in the sensor data). [1] - It requires a mathematical model of the system dynamics (state transition matrix) and the measurement process (measurement matrix). [3] - The filter is recursive, meaning it only needs the current measurement and the previous state estimate to compute the new state estimate, without requiring storage of the entire measurement history. [2] - It has found widespread applications in areas like navigation, object tracking, signal processing, and control systems due to its effectiveness and computational efficiency. [2] The Kalman filter provides an elegant and powerful solution for state estimation problems involving noisy sensor data and uncertain system dynamics, making it a fundamental tool in many engineering and scientific fields. [1][2][3]

burny_tech's tweet photo. The Kalman filter is a widely used algorithm for estimating the hidden states of a dynamic system from a series of noisy measurements. It works by recursively predicting the system's state using a dynamic model, and then updating this prediction with new measurement data. Some key points about the Kalman filter:

- It is an optimal estimator for linear systems with Gaussian noise, minimizing the mean squared error of the estimated state. [2]

- It consists of two main steps: prediction and update. In the prediction step, it estimates the current state based on the previous state and the system dynamics. In the update step, it incorporates a new measurement to correct the prediction. [3]

- It accounts for both process noise (uncertainty in the system dynamics) and measurement noise (errors in the sensor data). [1]

- It requires a mathematical model of the system dynamics (state transition matrix) and the measurement process (measurement matrix). [3]

- The filter is recursive, meaning it only needs the current measurement and the previous state estimate to compute the new state estimate, without requiring storage of the entire measurement history. [2]

- It has found widespread applications in areas like navigation, object tracking, signal processing, and control systems due to its effectiveness and computational efficiency. [2]

The Kalman filter provides an elegant and powerful solution for state estimation problems involving noisy sensor data and uncertain system dynamics, making it a fundamental tool in many engineering and scientific fields. [1][2][3]

698

870

131K

AndrewBonello retweeted

Gary Marcus

@GaryMarcus

over 1 year ago

The problem with this popular tweet is that DeepSeek is NOT actually a capability improvement per se. It’s an efficiency improvement; those two are not the same.

331

36K

AndrewBonello retweeted

kasra

@kasratweets

almost 2 years ago

here are my notes on @fchollet's neat explanation of the differences between deep learning and program synthesis, and the advantages and disadvantages of each, and how they'd fit together to build AGI. in deep learning, your underlying model is a differentiable curve; in program synthesis, your model is a discrete graph of operators – you’re picking from a set of operators and structuring that into a program. this has implications for the amount of compute and data needed for each: - in deep learning your learning engine is gradient descent, which is very compute efficient – you have a very informative feedback signal about where the solution is. but it's very data inefficient — you need a dense sampling of the data distribution. - in program synthesis, your learning engine is combinatorial search. this is extremely data efficient (I believe because the problem space is inherently more constrained?), but it’s extremely compute inefficient (because the search space is massive). how does this apply to AGI? deep learning is great for system 1 thinking; discrete program search is great for system 2 thinking. AGI will likely require a combination of both approaches. Chollet expects that an AGI system would have an outer program that does program synthesis and it will use deep learning to assist it.

kasratweets's tweet photo. here are my notes on @fchollet's neat explanation of the differences between deep learning and program synthesis, and the advantages and disadvantages of each, and how they'd fit together to build AGI.

in deep learning, your underlying model is a differentiable curve; in program synthesis, your model is a discrete graph of operators – you’re picking from a set of operators and structuring that into a program.

this has implications for the amount of compute and data needed for each:
- in deep learning your learning engine is gradient descent, which is very compute efficient – you have a very informative feedback signal about where the solution is. but it's very data inefficient — you need a dense sampling of the data distribution.
- in program synthesis, your learning engine is combinatorial search. this is extremely data efficient (I believe because the problem space is inherently more constrained?), but it’s extremely compute inefficient (because the search space is massive).

how does this apply to AGI? deep learning is great for system 1 thinking; discrete program search is great for system 2 thinking. AGI will likely require a combination of both approaches. Chollet expects that an AGI system would have an outer program that does program synthesis and it will use deep learning to assist it.

809

115

839

113K

AndrewBonello retweeted

Frans Zdyb @FZdyb

almost 4 years ago

Why AI needs to ease up on scaling and learn how to code: https://t.co/Ei2Dv5QyfM @GaryMarcus @fchollet @yudapearl

AndrewBonello retweeted

François Chollet

@fchollet

almost 2 years ago

I believe that program synthesis will solve reasoning. And I believe that deep learning will solve program synthesis (by guiding a discrete program search process). But I don't think you can go all that far with just prompting a LLM to generate end-to-end Python programs (even with a verification step and many samples). That won't scale to very long programs.

828

442

170K

AndrewBonello retweeted

Andrej Karpathy

@karpathy

over 1 year ago

We have to take the LLMs to school. When you open any textbook, you'll see three major types of information: 1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent to pretraining, where the model is reading the internet and accumulating background knowledge. 2. Worked problems with solutions. These are concrete examples of how an expert solves problems. They are demonstrations to be imitated. This is equivalent to supervised finetuning, where the model is finetuning on "ideal responses" for an Assistant, written by humans. 3. Practice problems. These are prompts to the student, usually without the solution, but always with the final answer. There are usually many, many of these at the end of each chapter. They are prompting the student to learn by trial & error - they have to try a bunch of stuff to get to the right answer. This is equivalent to reinforcement learning. We've subjected LLMs to a ton of 1 and 2, but 3 is a nascent, emerging frontier. When we're creating datasets for LLMs, it's no different from writing textbooks for them, with these 3 types of data. They have to read, and they have to practice.

karpathy's tweet photo. We have to take the LLMs to school.

When you open any textbook, you'll see three major types of information:

1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent to pretraining, where the model is reading the internet and accumulating background knowledge.

2. Worked problems with solutions. These are concrete examples of how an expert solves problems. They are demonstrations to be imitated. This is equivalent to supervised finetuning, where the model is finetuning on "ideal responses" for an Assistant, written by humans.

3. Practice problems. These are prompts to the student, usually without the solution, but always with the final answer. There are usually many, many of these at the end of each chapter. They are prompting the student to learn by trial & error - they have to try a bunch of stuff to get to the right answer. This is equivalent to reinforcement learning.

We've subjected LLMs to a ton of 1 and 2, but 3 is a nascent, emerging frontier. When we're creating datasets for LLMs, it's no different from writing textbooks for them, with these 3 types of data. They have to read, and they have to practice.

379

12K

696K

AndrewBonello retweeted

Mark - RIGHT WITH JESUS

@newstem61

over 1 year ago

This might help https://t.co/T9cz0Wjaow Also this https://t.co/U17J9esBhP Researchers don't actually know. I'm feeding a number of research papers on the topic into notebooklm, to see if it can understand how it behaves. I have a feeling a user interaction with say Grok, doesn't have the same weight of trained data. So may not force it into new thought processes. Maybe to avoid brainwashing?

153

AndrewBonello retweeted

@nrehiew_

over 1 year ago

How to train a State-of-the-art reasoner. Let's talk about the DeepSeek-R1 paper and how DeepSeek trained a model that is at frontier Sonnet/o1 level.

nrehiew_'s tweet photo. How to train a State-of-the-art reasoner.

Let's talk about the DeepSeek-R1 paper and how DeepSeek trained a model that is at frontier Sonnet/o1 level. https://t.co/YKtoDXotpi

251

290K

AndrewBonello retweeted

Michael Nielsen @michael_nielsen

almost 5 years ago

This is fascinating: Rich Sutton on the "bitter lesson" of AI research: https://t.co/LW5TOGTIKw

556

211

AndrewBonello retweeted

evanthebouncy @evanthebouncy

over 3 years ago

want to get into program synthesis but don't know how to started? I wrote a minimalist intro to modern program synthesis that can help you -- from problem formulation to generating code by fine-tuning llm on huggingface. https://t.co/tzPHmzoQh3

132

AndrewBonello retweeted

Andrej Karpathy

@karpathy

over 1 year ago

@ID_AA_Carmack The question is will top AIs get better at gui faster than all apps add text. I think I have a guess

146

114K

Andrew Bonello

@AndrewBonello

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users