Mohamed Kafsi @mou7 - Twitter Profile

mou7 retweeted

@iScienceLuvr

10 days ago

Language Models Need Sleep

"Transformer-based large language models are increasingly used for long-horizon tasks; however, their attention mechanism scales poorly with context length. To handle this, we study a sleep-like consolidation mechanism in which a model periodically converts recent context into persistent fast weights before clearing its key-value cache."

"increasing sleep duration N for our models improves performance, with the largest gains on examples that require deeper reasoning."

32

909

146

714

66K

Mohamed Kafsi @mou7

about 1 month ago

Caveat: this works when I already have domain knowledge. For new territory, I flip the first step: AI maps what is possible, then I generate within it, then critique as usual.

0

16

Mohamed Kafsi @mou7

about 1 month ago

If my goal is brain activation and learning, this is my flow: (1) think and write first, without any #AI augmentation; (2) use AI as an adversarial critic, prompting it to attack my argument; (3) iterate; and (4) come back days later and reproduce the argument unaided.

1

0

26

mou7 retweeted

Organizermemes

@OrganizerMemes

3 months ago

I love this video

153

20K

2K

10K

791K

mou7 retweeted

a16z @a16z

3 months ago

Engineers and salespeople think in completely different ways.

- Engineers: "If you ask them a question, a hundred percent of them will try and think of what is the correct answer to that question."
- Salespeople: "If you're a salesperson, your first thought isn't: what's the answer? It's: why are you asking me that question?"

"And so if you have an engineer talking to a good sales guy, it's going to upset them. Because they're often not gonna answer the question."

"The guys who are good at the job get rejected, because you don't like them. And then the people who are terrible at it, those are the ones that ended up getting hired."

"These CEOs just wanna take a guy who failed the engineering test, put a clean shirt on him and make him the head of sales."

@bhorowitz with @bhalligan

46

1K

99

933

173K

mou7 retweeted

Jeff Dean

@JeffDean

3 months ago

We've been working on the Waxal dataset project since 2021, aiming to enhance the amount of data available for African languages. This public speech dataset initially covers 27 Sub-Saharan African languages spoken by over 100 million speakers across more than 26 countries. 🌍

70

2K

269

415

194K

mou7 retweeted

aditya

@adxtyahq

3 months ago

“16 million exchanges?!” “This is called open source”

125

8K

755

1K

695K

mou7 retweeted

Kylan O'Connor

@kylancodes

4 months ago

Vibe coders be like 👨‍💻

183

10K

975

3K

862K

mou7 retweeted

Laura Ruis @LauraRuis

4 months ago

My PhD thesis is out 🥳🎓

How do LLMs, trained on trillions of tokens, reason?
Can they generalise beyond their training data or are they constrained by what they've seen before?

My takeaway: they can generalise beyond training in interesting ways, showing genuine reasoning https://t.co/qOEkWegeTM

94

2K

237

1K

106K

mou7 retweeted

Tuana

@tuanacelik

4 months ago

MCP connects agents to live systems: databases, APIs, external services. It's designed for runtime tool access. But the moment you need to teach your agent how to approach a problem domain, you need something else.

Skills aren't about (just) accessing data. They're about embedding knowledge into your agent's reasoning. When your agent needs to understand "here's the right sequence for debugging a data pipeline" or "this is how you validate and process complex documents," skills allow you to bake that knowledge into how the agent thinks.

There's then also the whole matter of how they work fundamentally: MCP tools rely on an external connection and API calls. Skills are local.

The issue isn't choosing between them. It's understanding that they kiiinda serve different purposes. MCP extends your agent's capabilities at runtime. Skills shape how your agent reasons about problems.

I wrote all about this with @itsclelia in our latest blog: https://t.co/ve19aULZwL

11

56

8

63

5K

mou7 retweeted

Brian Chesky

@bchesky

5 months ago

.@Ahmad_Al_Dahle is joining as Airbnb's new CTO. I’m often asked about our AI strategy. We believe pairing great design with frontier technology will help us improve the way people experience travel. Excited to build!

160

1K

68

94

1M

Mohamed Kafsi @mou7

9 months ago

@mortenjust Great! Did you open source your code? Would love to test and extend it.

1

0

44

Mohamed Kafsi @mou7

11 months ago

@maximelabonne Privacy indeed; for certain use cases, you don't want the data to leave the device. I expect the next generation of operating systems to provide a local LLM that you can use.

0

14

Mohamed Kafsi @mou7

about 1 year ago

Tomorrow #Nexthink is coming (back) to #EPFL! We’ll be sharing how we’re building #AI agents to transform the IT world. Seats are limited — register here https://t.co/Ql9vrqurQD @NexthinkNews @ICepfl @EPFL

0

73

mou7 retweeted

@levelsio

over 1 year ago

🇪🇺 EU subsidies are such a massive waste

In Portugal for example, I noticed most construction companies from window frames, glass railing, door, balcony doors etc. have a sign in the footer of their website that say they received EU funding

You can search their EU funding ID and the reason for funding is always some silly bs like

"Project title: Research on how to improve glass railings
Project description: Ensure the company’s competitiveness by strengthening its internal competencies to be competitive in the demanding external market, investing in innovation in management processes, distribution, logistics, and work organization practices, as well as in relationships with the external environment"

And they always get about €250,000 to €500,000

I even have family who received these subsidies, they got one because they were running a company in a "disadvantaged neighborhood" and had to write a 10 page document and instantly got €250,000, they laughed hysterically cause they definitely didn't need that money, but why not get it if you can, right? Free money!

The EU spends hundreds of billions of euros every year from European taxpayer's money on these subsidies that just go nowhere

192

2K

142

390

266K

Mohamed Kafsi @mou7

over 1 year ago

Congrats @anniehartley_ and hope to see you again in the next edition of @AMLDAfrica @EPFL_en

EPFL Computer and Communication Sciences

@ICepfl

over 1 year ago

Congratulations to @anniehartley_ and her promotion to Adjunct Professor! 🥳 📣We also look forward to welcoming Dr Samy Bengio, Director for AI Research at @Apple, joining us as Adjunct Professor in the School of Computer and Communication Sciences. 👉https://t.co/Tc5IiXfJ71

0

27

1

1K

0

1

0

46

Mohamed Kafsi @mou7

over 1 year ago

#OpenAI is opening a new office in Zurich, recruiting 3 engineers from #Google #deepmind — right in the city with Google’s 2nd largest office (~5K employees). The #AI talent war is heating up, but this is great news for the #AI community & #Switzerland’s ecosystem! 🚀

0

1

0

112

mou7 retweeted