Shibl Mourad

@shibl

DeepMind Canada Engineering Lead. Interested in philosophy, cocktails, ramen, machine learning, manga and computer science, not necessarily in this order.

Montreal

Joined August 2008

866 Following

1.2K Followers

3.7K Posts

shibl retweeted

@RichardSSutton

about 1 year ago

The PhD thesis of my _first_ PhD student, Doina Precup, is at-long-last available in digital form.
Title: Temporal Abstraction in Reinforcement Learning
Url: https://t.co/d1JJNtv407
Abstract: Decision making usually involves choosing among different courses of action over a broad range of time scales. For instance, a person planning a trip to a distant location makes high-level decisions regarding what means of transportation to use, but also chooses low-level actions, such as the movements for getting into a car. The problem of picking an appropriate time scale for reasoning and learning has been explored in artificial intelligence, control theory and robotics. In this dissertation we develop a framework that allows novel solutions to this problem, in the context of Markov Decision Processes (MDPs) and reinforcement learning. In this dissertation, we present a general framework for prediction, control and learning at multiple temporal scales. In this framework, temporally extended actions are represented by a way of behaving (a policy) together with a termination condition. An action represented in this way is called an _option_. Options can be easily incorporated in MDPs, allowing an agent to use existing controllers, heuristics for picking actions, or learned courses of action. The effects of behaving according to an option can be predicted using multi-time models, learned by interacting with the environment. In this dissertation we develop multi-time models, and we illustrate the way in which they can be used to produce plans of behavior very quickly, using classical dynamic programming or reinforcement learning techniques. The most interesting feature of our framework is that it allows an agent to work simultaneously with high-level and low-level temporal representations. The interplay of these levels can be exploited in order to learn and plan more efficiently and more accurately. We develop new algorithms that take advantage of this structure to improve the quality of plans, and to learn in parallel about the effects of many different options.
Where now: Doina is a professor of computer science at McGill University and head of the Montreal office of Google DeepMind.
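The abstract's definition of an option (a policy together with a termination condition) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the thesis: the `Option` class, the `run_option` helper, the toy corridor environment, and the simplification that termination is a deterministic predicate on the state.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

State = int
Action = int


@dataclass
class Option:
    """A temporally extended action: a way of behaving plus a termination condition."""
    name: str
    policy: Callable[[State], Action]          # which primitive action to take in a state
    should_terminate: Callable[[State], bool]  # assumed deterministic for simplicity


def run_option(
    option: Option,
    state: State,
    step: Callable[[State, Action], Tuple[State, float]],
    max_steps: int = 100,
) -> Tuple[State, float, int]:
    """Follow the option's policy until it terminates.

    Returns (final_state, total_reward, steps_taken).
    """
    total_reward = 0.0
    steps = 0
    while steps < max_steps and not option.should_terminate(state):
        action = option.policy(state)
        state, reward = step(state, action)
        total_reward += reward
        steps += 1
    return state, total_reward, steps


# Hypothetical toy environment: a corridor with states 0..10.
# Action +1 moves right, -1 moves left; every move costs 1.
def corridor_step(state: State, action: Action) -> Tuple[State, float]:
    next_state = min(10, max(0, state + action))
    return next_state, -1.0


# An option that keeps walking right until it reaches state 5.
walk_to_door = Option(
    name="walk_to_door",
    policy=lambda s: 1,
    should_terminate=lambda s: s >= 5,
)

final_state, total_reward, steps = run_option(walk_to_door, 0, corridor_step)
print(final_state, total_reward, steps)  # prints: 5 -5.0 5
```

Each call to `run_option` yields one sample of where an option ends up, how much reward it collects, and how long it runs, which is roughly the kind of experience from which a model of an option's effects could be estimated.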

5

492

48

206

34K

Shibl Mourad @shibl

21 days ago

@nectarios You are an inspiration for us all, @nectarios. Congrats on the silver.

1

1

0

0

25

shibl retweeted

David Papineau @davidpapineau

26 days ago

@Philip_Goff WHY is there something rather than nothing? Stop moaning — if there was nothing you’d still be complaining.

9

97

12

6

11K

Shibl Mourad @shibl

about 1 month ago

@iam_elias1 Imagine an economy composed of Alice and Bob. Post AI layoff, 0 employment. They wake up and both want cereal and milk. Option 1: it is readily available. Who needs a job! Option 2: it is not available. Alice will find a way to produce cereal and Bob milk. Full employment.

0

0

0

0

26

shibl retweeted

5 months ago

Hegel.

46

449

30

35

26K

Shibl Mourad @shibl

6 months ago

@jm_alexia I upgraded my rubber duck to an LLM. But now I feel like the 🦆

0

0

0

0

381

Shibl Mourad @shibl

6 months ago

@fchollet https://t.co/3a45RpeXDf

0

1

0

0

41

Shibl Mourad @shibl

6 months ago

@Tayyarji_return @DrMahmoudHafez3 Islam was born in the Arabian Peninsula, but Muslims are children of the land: just as they once became Zoroastrians and Christians, some of them embraced Islam.

0

0

0

0

19

Shibl Mourad @shibl

6 months ago

Note that a lot of the innovations in the previous LLM updates were not about scaling: thinking, multimodal, memory, ...

0

0

0

0

103

Shibl Mourad @shibl

6 months ago

The industry is not "obsessed" with making LLMs bigger; it is obsessed with delivering value to users so that they choose its products. If you doubt this, please try using your favorite LLM from 2 generations ago.

@JonhernandezIA

6 months ago

📁 Yann LeCun says that scaling models will not get us to human intelligence. He explains that the industry remains obsessed with making LLMs bigger, but that this path is fundamentally broken. It does not matter how many parameters we add or how many clusters we build, because the models only imitate language patterns. Human intelligence does not emerge from size, it emerges from understanding the world.

209

1K

292

379

156K

2

0

0

0

218

shibl retweeted

6 months ago

AI is not rocket science. It's way harder.

42

285

19

21

12K

Shibl Mourad @shibl

6 months ago

@karpathy Would be great to have practical examples of the difference in response quality.

0

1

0

0

37

Shibl Mourad @shibl

6 months ago

@pmddomingos Or it learns to solve it faster and from a different starting point.

0

0

0

0

94

Shibl Mourad @shibl

6 months ago

@pmddomingos Only if the problem is binary.

0

0

0

0

100

Shibl Mourad @shibl

6 months ago

@ViralOrTrying @fchollet Kolmogorov complexity isn't an absolute number but depends on the UTM used. The difference between k_utm1 and k_utm2 is bounded by a constant, but that constant could be large.
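The bound referred to in this reply is the invariance theorem. As a sketch, writing $K_U(x)$ for the length of the shortest program that outputs $x$ on a universal Turing machine $U$ (this notation is assumed here and is not taken from the tweet):

```latex
\[
  \forall x:\quad \bigl| K_{U_1}(x) - K_{U_2}(x) \bigr| \;\le\; c_{U_1,U_2}
\]
```

The constant $c_{U_1,U_2}$ depends only on the two machines, not on $x$; informally it is about the length of an interpreter for one machine written for the other, which is why it can be large in practice.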

0

0

0

0

26

Shibl Mourad @shibl

6 months ago

@fchollet If it has merely cached the data, then what explains generalization from small datasets with large models? This indicates that large models + regularization are doing some sort of soft compression.

0

0

0

0

46

Shibl Mourad @shibl

6 months ago

https://t.co/fJMOkkG69y Best comment about LLMs from Scott Aaronson: "The more I play with things like the O1 model the more grateful I am I have tenure."

0

1

0

0

163

Shibl Mourad @shibl

6 months ago

@BarrAlexandra 💯

0

1

0

0

155

Shibl Mourad @shibl

6 months ago

@ItsKieranDrew The cancer of the cancer

0

0

0

0

10

Shibl Mourad @shibl

6 months ago

@fantfant5 @qaisailan Science and philosophy are radically different activities. Philosophy inquires into the meaning and nature of things, while science provides models that can be tested and used in practice. Philosophy offers peace of mind and a sense of meaning in life, while science offers electricity, airplanes, mobile phones ...

0

0

0

2

133

Shibl Mourad @shibl

6 months ago

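@fantfant5 @qaisailan It is not the same theory. Einstein provided equations that allow the positions of the planets to be determined with a precision exceeding what the instruments of his time could measure. Ibn Taymiyyah repeated the views of the Greeks and the Mu'tazila on the creation of time and its relativity. This is no slight, because the same can be said of Plato, Augustine and Ibn Rushd, who are among the most important thinkers in history.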

2

0

0

1

155
