Error.PDF @errorpdf - Twitter Profile

Pinned Tweet

Error.PDF @ErrorPdf

over 2 years ago

👇🧠

AK

@_akhaliq

over 2 years ago

ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent paper page: https://t.co/1hfFhbaCGo Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is non-differentiable. To address these deficiencies, we define a ReAct-style LLM agent with the ability to reason and act upon external knowledge. We further refine the agent through a ReST-like method that iteratively trains on previous trajectories, employing growing-batch reinforcement learning with AI feedback for continuous self-improvement and self-distillation. Starting from a prompted large model and after just two iterations of the algorithm, we can produce a fine-tuned small model that achieves comparable performance on challenging compositional question-answering benchmarks with two orders of magnitude fewer parameters.

_akhaliq's tweet photo. ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

paper page: https://t.co/1hfFhbaCGo

Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is non-differentiable. To address these deficiencies, we define a ReAct-style LLM agent with the ability to reason and act upon external knowledge. We further refine the agent through a ReST-like method that iteratively trains on previous trajectories, employing growing-batch reinforcement learning with AI feedback for continuous self-improvement and self-distillation. Starting from a prompted large model and after just two iterations of the algorithm, we can produce a fine-tuned small model that achieves comparable performance on challenging compositional question-answering benchmarks with two orders of magnitude fewer parameters.

6

513

118

396

174K

0

193

ErrorPdf retweeted

Google DeepMind @GoogleDeepMind

11 months ago

Our new AI model AlphaEarth Foundations is mapping the planet in astonishing detail. 🌏🔍 Scientists will now be able to track the impact of deforestation, monitoring crop health, and more – significantly faster, thanks to our new datasets. 🧵

214

4K

643

1K

853K

ErrorPdf retweeted

Epoch AI

@EpochAIResearch

12 months ago

Introducing FrontierMath Tier 4: a benchmark of extremely challenging research-level math problems, designed to test the limits of AI’s reasoning capabilities.

EpochAIResearch's tweet photo. Introducing FrontierMath Tier 4: a benchmark of extremely challenging research-level math problems, designed to test the limits of AI’s reasoning capabilities. https://t.co/ImY6hFDLGQ

17

563

62

115

86K

Error.PDF @ErrorPdf

about 1 year ago

@GaryMarcus @DotCSV Gary Marcus salúdame soy tu fan.😇

0

1

0

352

Who to follow

Diogo Fernandes

@dioogfernands

machine learning enthusiast

Ezekiel Winger

@EzekielWinger

Stupidity is the currency of youth

foxhu

@foxhu007

AI developer. Love pytorch and golang.A Google lover.

ErrorPdf retweeted

DeepSeek

@deepseek_ai

over 1 year ago

🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at https://t.co/v1TFy7LHNy today! 🐋 1/n

deepseek_ai's tweet photo. 🚀 DeepSeek-R1 is here!

⚡ Performance on par with OpenAI-o1
📖 Fully open-source model & technical report
🏆 MIT licensed: Distill & commercialize freely!

🌐 Website & API are live now! Try DeepThink at https://t.co/v1TFy7LHNy today!

🐋 1/n https://t.co/7BlpWAPu6y

2K

35K

7K

10K

13M

ErrorPdf retweeted

Alex Rives

@alexrives

over 1 year ago

We're thrilled to present ESM3 in @ScienceMagazine. ESM3 is a generative language model that reasons over the three fundamental properties of proteins: sequence, structure, and function. Today we're making ESM3 available free to researchers worldwide via the public beta of an API for biological intelligence. Trained with over a trillion teraflops of compute, this is the first time a model of this scale has been trained for biology, pushing the frontier of AI for biological discovery and engineering. ESM3 learns to represent the immense complexity of protein biology, learning from billions of natural proteins. From this training it developed the capability to design proteins, responding to complex prompts combining atomic level details and high level instructions to generate new proteins. ESM3 can explore protein space far beyond natural evolution. We prompted ESM3 to generate a fluorescent protein at a far distance from any known fluorescent proteins, searching an unknown region of protein space, to discover a new fluorescent protein. We estimate this is equivalent to simulating five hundred million years of evolution.

24

850

239

387

227K

ErrorPdf retweeted

Sakana AI

@SakanaAILabs

over 1 year ago

We’re excited to introduce Transformer², a machine learning system that dynamically adjusts its weights for various tasks! https://t.co/bnOA7PWHsC Adaptation is a remarkable natural phenomenon, like how the octopus can blend in with its environment, or how the brain rewires itself after injury. We believe our new system paves the way for a new generation of adaptive AI models, modifying their own weights and architecture to adapt to the nature of the tasks they encounter, embodying living intelligence capable of continuous change and lifelong learning. Paper: https://t.co/KQb8CsruuF GitHub: https://t.co/1a6q7H5WgJ

93

2K

460

1K

462K

ErrorPdf retweeted

Sakana AI

@SakanaAILabs

over 1 year ago

Introducing ASAL: Automating the Search for Artificial Life with Foundation Models https://t.co/4FMqZ98CSb Artificial Life (ALife) research holds key insights that can transform and accelerate progress in AI. By speeding up ALife discovery with AI, we accelerate our understanding of emergence, evolution, and intelligence–core principles that can inspire the next generation of AI systems! We proudly collaborated with MIT, OpenAI, Swiss AI Lab IDSIA, and Ken Stanley on this exciting project. Full Paper (Website): https://t.co/0cF28Swid6 Full Paper (arxiv): https://t.co/NnOkez0V8r Code: https://t.co/BlZnGJK4g8 In this work, we propose a new algorithm called Automated Search for Artificial Life (“ASAL”) to automate the discovery of artificial life using vision-language foundation models. Instead of tediously hand-designing every tiny rule of an Alife simulation, simply describe the space of simulations to search over, and ASAL will automatically discover the most interesting and open-ended artificial lifeforms! Because of the generality of foundation models, ASAL can discover new lifeforms across a diverse range of seminal ALife simulations, including Boids, Particle Life, Game of Life, Lenia, and Neural Cellular Automata. ASAL even discovered novel cellular automata rules that are more open-ended and expressive than the original Conway’s Game of Life. We believe this new paradigm may reignite ALife research by overcoming the bottleneck of manually designed simulations, thus advancing beyond the limits of human ingenuity.

75

3K

624

2K

750K

ErrorPdf retweeted

BlinkDL

@BlinkDL_AI

over 1 year ago

RWKV-7-World 0.1B (L12-D768) trained w/ ctx4k perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. RWKV is all you need. https://t.co/ANmttaBgYy #RWKV

BlinkDL_AI's tweet photo. RWKV-7-World 0.1B (L12-D768) trained w/ ctx4k perfectly solves NIAH ctx16k 🤯 100% RNN and attention-free. RWKV is all you need. https://t.co/ANmttaBgYy #RWKV https://t.co/6rNpgmncmE

1

79

9

8

9K

ErrorPdf retweeted

Haider.

@haider1

over 1 year ago

🚨 NVIDIA Introduces Jetson Nano Super > compact AI computer capable of 70-T operations per second > designed for robotics, it supports advanced models, including LLMs, and costs $249

663

23K

3K

7K

3M

ErrorPdf retweeted

Alex Rives

@alexrives

over 1 year ago

Introducing ESM Cambrian. Unsupervised learning can invert biology at scale to reveal the hidden structure of the natural world. We’ve scaled up compute and data to train a new generation of protein language models. ESM C defines a new state of the art for protein representation learning.

26

632

159

243

207K

ErrorPdf retweeted

Google DeepMind @GoogleDeepMind

over 1 year ago

Today in @Nature, we’re presenting GenCast: our new AI weather model which gives us the probabilities of different weather conditions up to 15 days ahead with state-of-the-art accuracy. ☁️⚡ Here’s how the technology works. 🧵https://t.co/PWCNWbQnlU

GoogleDeepMind's tweet photo. Today in @Nature, we’re presenting GenCast: our new AI weather model which gives us the probabilities of different weather conditions up to 15 days ahead with state-of-the-art accuracy. ☁️⚡

Here’s how the technology works. 🧵https://t.co/PWCNWbQnlU https://t.co/6DTrmn64Jq

120

5K

898

2K

2M

ErrorPdf retweeted

Google AI

@GoogleAI

over 1 year ago

Training video understanding models on longer contexts is computationally intensive. To address this, we present a novel approach that reduces the computational load while also improving the quality of the learned representations. More at: https://t.co/56Vj3kOzOl

GoogleAI's tweet photo. Training video understanding models on longer contexts is computationally intensive. To address this, we present a novel approach that reduces the computational load while also improving the quality of the learned representations. More at: https://t.co/56Vj3kOzOl https://t.co/f0PtWAVG7f

11

324

84

80

31K

ErrorPdf retweeted

Google DeepMind @GoogleDeepMind

over 1 year ago

Introducing Genie 2: our AI model that can create an endless variety of playable 3D worlds - all from a single image. 🖼️ These types of large-scale foundation world models could enable future agents to be trained and evaluated in an endless number of virtual environments. → https://t.co/1dzB2BUlWo

340

6K

1K

2K

1M

ErrorPdf retweeted

Gabriel Peyré

@gabrielpeyre

over 1 year ago

Oldies but goldies: R. Keys, Cubic convolution interpolation for digital image processing, 1980. Introduces bicubic interpolation, the most frequently used image interpolation method. https://t.co/EINuEBHMxO

0

585

94

269

38K

ErrorPdf retweeted

Gabriel Peyré

@gabrielpeyre

over 1 year ago

Gradient descent on particles’ positions (Lagrangian) is equivalent to an advection linear PDE on the density of particles (Eulerian). https://t.co/lHYMjNg6P1

7

975

154

382

41K

ErrorPdf retweeted

Google DeepMind @GoogleDeepMind

over 1 year ago

Introducing AlphaQubit: our AI-based system that can more accurately identify errors inside quantum computers. 🖥️⚡ This research is a joint venture with @GoogleQuantumAI, published today in @Nature → https://t.co/AtbWuddxxe

GoogleDeepMind's tweet photo. Introducing AlphaQubit: our AI-based system that can more accurately identify errors inside quantum computers. 🖥️⚡

This research is a joint venture with @GoogleQuantumAI, published today in @Nature → https://t.co/AtbWuddxxe

37

2K

382

421

190K

ErrorPdf retweeted

Gabriele Corso @GabriCorso

over 1 year ago

Thrilled to announce Boltz-1, the first open-source and commercially available model to achieve AlphaFold3-level accuracy on biomolecular structure prediction! An exciting collaboration with @jeremyWohlwend, @pas_saro and an amazing team at MIT and Genesis Therapeutics. A thread!

42

2K

409

619

405K

ErrorPdf retweeted

Johannes Brandstetter @jo_brandstetter

over 1 year ago

Super hyped to share NeuralDEM -- the first real-time simulation of industrial particulate flows. NeuralDEM replaces Discrete Element Method (DEM) routines and coupled (CFD-DEM) multiphysics simulations. 🧵 📜: https://t.co/JH4PDpth5g 🖥️: https://t.co/VEsawzd9IV

7

389

91

155

41K

ErrorPdf retweeted

BlinkDL

@BlinkDL_AI

over 1 year ago

New RWKV CoT demo: 4M params to solve 15-puzzle 🔥 https://t.co/OgvZrUfntO #RWKV #RNN

4

112

23

28

8K

ErrorPdf retweeted

Epoch AI

@EpochAIResearch

over 1 year ago

1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.

EpochAIResearch's tweet photo. 1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%. https://t.co/sNVEB6SvyJ

53

2K

383

843

1M

Error.PDF

@ErrorPdf

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users