3ali @alielfilali01 - Twitter Profile

Pinned Tweet

3ali @alielfilali01

about 4 years ago

Every Learning process is a Search process

1

3

0

1

0

3ali @alielfilali01

8 months ago

Long waited space 🤩

Nathan

@nathanhabib1011

8 months ago

🧩 tasks are now modular — each lives in its own file. “suites” are going away → easier contributions, faster iteration. explore all tasks available in lighteval here: https://t.co/3fPGau70MP

1

3

0

162

0

1

0

64

alielfilali01 retweeted

Thinking Machines

@thinkymachines

10 months ago

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly. The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains. https://t.co/lrJioBmpbT

thinkymachines's tweet photo. Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to prompt engineering. Here we share what we are working on and connect with the research community frequently and openly.

The name Connectionism is a throwback to an earlier era of AI; it was the name of the subfield in the 1980s that studied neural networks and their similarity to biological brains.

https://t.co/lrJioBmpbT

230

8K

1K

5K

3M

alielfilali01 retweeted

Mira Murati

@miramurati

10 months ago

A big part of our mission at Thinking Machines is to improve people’s scientific understanding of AI and work with the broader research community. Introducing Connectionism today to share some of our scientific insights.

177

5K

394

1K

607K

Who to follow

Najib Chowdhury

@NajibChy

alielfilali01 retweeted

Rohan Paul

@rohanpaul_ai

10 months ago

Fei-Fei Li (@drfeifei) on limitations of LLMs. "There's no language out there in nature. You don't go out in nature and there's words written in the sky for you.. There is a 3D world that follows laws of physics." Language is purely generated signal.

246

4K

668

2K

2M

alielfilali01 retweeted

François Charton

@f_charton

10 months ago

@francoisfleuret Scaling up old ideas, with 10x the compute and a fancy acronym

1

11

1

0

1K

alielfilali01 retweeted

François Chollet

@fchollet

10 months ago

We were able to reproduce the strong findings of the HRM paper on ARC-AGI-1. Further, we ran a series of ablation experiments to get to the bottom of what's behind it. Key findings: 1. The HRM model architecture itself (the centerpiece of the paper) is not an important factor. 2. The outer refinement loop (barely mentioned in the paper) is the main driver of performance. 3. Cross-task transfer learning is not very helpful. What matters is training on the tasks you will test on. 4. You can use much fewer data augmentations, especially at inference time. Finding 2 & 3 mean that this approach is a case of *zero-pretraining test-time training*, similar to the recently published "ARC-AGI without pretraining" paper by Liao et al.

45

3K

294

2K

368K

alielfilali01 retweeted

Imane Momayiz @imomayiz

11 months ago

One perk of working on @AtlasIA projects: we get to confirm big-lab findings with limited community budget💪 We finetuned Qwen2.5-VL at two scales to find the sweet spot for LR × batch size and saw patterns validating DeepSeek’s scaling laws 📈 (https://t.co/KhsHPzeWs4).

imomayiz's tweet photo. One perk of working on @AtlasIA projects: we get to confirm big-lab findings with limited community budget💪
We finetuned Qwen2.5-VL at two scales to find the sweet spot for LR × batch size and saw patterns validating DeepSeek’s scaling laws 📈
(https://t.co/KhsHPzeWs4). https://t.co/xTIvVL4CRn

1

19

5

1

1K

3ali @alielfilali01

10 months ago

@AnassAb01 @Omar_H_ On a different note, i just don't understand why some people enjoy being as*holes! You could've just shared the blody link man!

0

16

3ali @alielfilali01

10 months ago

@AnassAb01 @Omar_H_ maybe this is the "report" you want: https://t.co/1suFJYMK31 Indeed the information mentioned by detafour is WRONG ! Maybe they misunderstood the 21st slide (which is the exact opposite of what they mentioned) Nevertheless, we are still not at the top of our game yet !!!

2

0

32

3ali @alielfilali01

10 months ago

@AnassAb01 @Omar_H_ Also, i guess it's worth to mention that generally most the funding we have is internal (local VCs), while south africa and egypt lead given the British and GCC VCs respectively. Not justifying falling behind here, but maybe one of the reasons!

0

18

alielfilali01 retweeted

EvalEval Coalition @evaluatingevals

11 months ago

🚨 New blog: The AI Evaluation Chart Crisis 📝 From misleading bar heights to missing error bars, recent model launches have sparked debate on AI evals. In our new blogpost, we dig into what’s broken, why it matters and how they should be presented 👇 https://t.co/5KnVw8a2mf

0

19

8

4

1K

3ali @alielfilali01

11 months ago

We went from cherry picking benchmarks to cherry picking models... Wondering why 4.1 opus *with* thinking is not there 👀

Elon Musk

@elonmusk

11 months ago

Grok wins hands-down at coding. It wasn’t close.

5K

50K

4K

5K

21M

0

1

0

65

alielfilali01 retweeted

Google DeepMind @GoogleDeepMind

11 months ago

We have a long history of using games to measure progress in AI. 🎮 That’s why we’re helping unveil the @Kaggle Game Arena: an open-source platform where models go head-to-head in complex games to help us gauge their capabilities. 🧵

164

2K

182

351

209K

alielfilali01 retweeted

EvalEval Coalition @evaluatingevals

11 months ago

🚨 AI Evals Crisis: Officially kicking off the Eval Science Workstream 🚨 We’re building a shared scientific foundation for evaluating AI systems, one that’s rigorous, open, and grounded in real-world & cross-disciplinary best practices👇 (1/2) https://t.co/AQdEKtJS3l

1

16

7

0

2K

alielfilali01 retweeted

Paul Bertrand

@pbertrand_dev

about 1 year ago

@levelsio @huggingface does this look like the face of someone thats worried about money

9

1K

14

8

35K

alielfilali01 retweeted

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

about 1 year ago

ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models "We present the ChemPile, an open dataset containing over 75 billion tokens of curated chemical data, specifically built for training and evaluating general-purpose models in the chemical sciences."

iScienceLuvr's tweet photo. ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models

"We present the ChemPile, an open dataset containing over 75 billion tokens of curated chemical data, specifically built for training and evaluating general-purpose models in the chemical sciences." https://t.co/39RuYJ2kTV

2

139

24

82

12K

alielfilali01 retweeted

Daniel van Strien

@vanstriendaniel

about 1 year ago

Just released: A Parquet-converted version of the Newspaper Navigator dataset on @huggingface! 📰3M+ visual annotations from historic US newspapers from @ChronAmLOC 🗂️ Bounding boxes, OCR, metadata + IIIF crop URLs 📸 Covers photos, cartoons, comics, maps & more

vanstriendaniel's tweet photo. Just released: A Parquet-converted version of the Newspaper Navigator dataset on @huggingface!

📰3M+ visual annotations from historic US newspapers from @ChronAmLOC
🗂️ Bounding boxes, OCR, metadata + IIIF crop URLs
📸 Covers photos, cartoons, comics, maps & more https://t.co/JctkK3zRvx

1

9

2

436

alielfilali01 retweeted

merve

@mervenoyann

about 1 year ago

NVIDIA released new vision reasoning model for robotics: Cosmos-Reason1-7B 🤖 > first reasoning model for robotics 😱 > based on Qwen 2.5-VL-7B, use with @huggingface transformers or vLLM 🤗 > comes with SFT & alignment dataset and a new benchmark 👏

mervenoyann's tweet photo. NVIDIA released new vision reasoning model for robotics: Cosmos-Reason1-7B 🤖

> first reasoning model for robotics 😱
> based on Qwen 2.5-VL-7B, use with @huggingface transformers or vLLM 🤗
> comes with SFT & alignment dataset and a new benchmark 👏 https://t.co/mG0Kzi0glF

6

383

56

189

29K

alielfilali01 retweeted

Irem Ergün

@irombie

about 1 year ago

I'm excited to share our new pre-print ShiQ: Bringing back Bellman to LLMs! https://t.co/yWMT6M0nuT In this work, we propose a new, Q-learning inspired RL algorithm for finetuning LLMs 🎉 (1/n)

11

220

37

135

26K

alielfilali01 retweeted

Melanie Mitchell @MelMitchell1

about 1 year ago

I reviewed "These Strange New Minds: How AI Learned to Talk and What It Means" by Chris Summerfield. ⬇️

6

119

18

60

16K

3ali

@alielfilali01

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users