Hareer AL-Namassi @HGL737 - Twitter Profile

HGL737 retweeted

10 days ago

I am a big fan of Jianlin Su's blog because it always starts from first principles in mathematics, rather than "ML tricks", to approach a typical ML problem (eg. training-free MoE load balancing). Here is me trying to "reinvent" one such blog which provides an elegant alternative to compute Muon, by filling in all the derivations that the blog skips for a less math-savvy audience (besides being entirely in Mandarin). The goal of the blog is to find a way to compute a essential component of Muon, ie. the left and right singular value matrices U and V for the gradient G, **individually**. In the standard form, Muon really just needs their product UV^T, hence the standard way to compute it via computing a low-rank polynomial of G many times ("Newton-Schulz"). But there are more variants of Muon to control the properties of model updates if we can get both individually, hence the blog's proposal to revisit some fundamental linear algebra techniques for the computation. The methodological takeaway from the blog's thought process is that there are three components to breaking down a ML problem: (1) how to be able to compute something (power iteration), (2) how to compute it fast (cholesky decomposition), and (3) how to compute it accurately given finite floating points (repeated orthogonalization). The goal of reading inspiring blogs like this is, in Feynman's term, to be able to "reinvent" them at any time to grasp the fundamental approach of doing similar work. Original blog: https://t.co/5ksKPICpMW

HeMuyu0327's tweet photo. I am a big fan of Jianlin Su's blog because it always starts from first principles in mathematics, rather than "ML tricks", to approach a typical ML problem (eg. training-free MoE load balancing).

Here is me trying to "reinvent" one such blog which provides an elegant alternative to compute Muon, by filling in all the derivations that the blog skips for a less math-savvy audience (besides being entirely in Mandarin).

The goal of the blog is to find a way to compute a essential component of Muon, ie. the left and right singular value matrices U and V for the gradient G, **individually**. In the standard form, Muon really just needs their product UV^T, hence the standard way to compute it via computing a low-rank polynomial of G many times ("Newton-Schulz"). But there are more variants of Muon to control the properties of model updates if we can get both individually, hence the blog's proposal to revisit some fundamental linear algebra techniques for the computation.

The methodological takeaway from the blog's thought process is that there are three components to breaking down a ML problem: (1) how to be able to compute something (power iteration), (2) how to compute it fast (cholesky decomposition), and (3) how to compute it accurately given finite floating points (repeated orthogonalization). The goal of reading inspiring blogs like this is, in Feynman's term, to be able to "reinvent" them at any time to grasp the fundamental approach of doing similar work.

Original blog: https://t.co/5ksKPICpMW

10

2K

142

2K

76K

HGL737 retweeted

Jenan Al-Namassi @jenanfn

about 2 months ago

لأول مرة🤩!! يجتمع شغف طب الأسرة مع الخبرة العملية في لقاء تفاعلي مميز يسر مجتمع طب الأسرة ونادي المهتمين بطب الأسرة وVistaMed في تقديم ورشة مميزة مع د. سارة البكري: 🌟The Remarkable Family Physician خطوة أقرب لتفكير سريري أعمق، تواصل أفضل، وعيادة أكثر كفاءة. المقاعد محدودة‼️

jenanfn's tweet photo. لأول مرة🤩!!
يجتمع شغف طب الأسرة مع الخبرة العملية في لقاء تفاعلي مميز يسر مجتمع طب الأسرة ونادي المهتمين بطب الأسرة وVistaMed في تقديم ورشة مميزة مع د. سارة البكري:

🌟The Remarkable Family Physician

خطوة أقرب لتفكير سريري أعمق، تواصل أفضل، وعيادة أكثر كفاءة.

المقاعد محدودة‼️ https://t.co/diXBmsx6AG

1

9

4

11

2K

HGL737 retweeted

SDAIA @SDAIA_SA

5 months ago

المملكة تنشئ أكبر مركز بيانات حكومي في العالم مصنف Tier IV كأعلى تصنيف، تحقيقاً لتطلعات سمو ولي العهد – حفظه الله – في بناء اقتصاد وطني قائم على البيانات والذكاء الاصطناعي ضمن إطار تحقيق مستهدفات #رؤية_السعودية_2030 #مركز_هيكساجون #سدايا

SDAIA_SA's tweet photo. المملكة تنشئ أكبر مركز بيانات حكومي في العالم مصنف Tier IV كأعلى تصنيف، تحقيقاً لتطلعات سمو ولي العهد – حفظه الله – في بناء اقتصاد وطني قائم على البيانات والذكاء الاصطناعي ضمن إطار تحقيق مستهدفات #رؤية_السعودية_2030
#مركز_هيكساجون
#سدايا https://t.co/wkpzs2uppT

35

536

298

45

202K

HGL737 retweeted

Dan Kornas

@DanKornas

6 months ago

UC Berkeley offers two free courses on LLM agents, one at the foundational level and one at the advanced level, taught by leading researchers and practitioners from DeepMind, Meta, and top universities. Together, they cover essentially everything you need to understand and build agents, drawing from some of the best resources available today.

DanKornas's tweet photo. UC Berkeley offers two free courses on LLM agents, one at the foundational level and one at the advanced level, taught by leading researchers and practitioners from DeepMind, Meta, and top universities.

Together, they cover essentially everything you need to understand and build agents, drawing from some of the best resources available today.

19

1K

230

2K

84K

Who to follow

Abeer Almalki

@1AbeerAlmalki

Statistics Lecturer @UOfjeddah | PhD Researcher @UniversityLeeds

“ I don’t get lucky, I make my own luck”

HGL737 retweeted

حصوص @kyuhaiyv

6 months ago

and that was my LAST uni exam EVERRR!!!!!!

0

3

2

4

1K

Hareer AL-Namassi @HGL737

6 months ago

from the Digital Heritage Conference… where technology meets history #التراث_الرقمي

0

3

0

148

Hareer AL-Namassi @HGL737

6 months ago

@Fouz_almu مُستحَقّه لك فعلاً يا فوز 💝

1

0

284

HGL737 retweeted

Tech with Mak

@techNmak

6 months ago

I have created this illustration to help you visualize the Docker Workflow 👇 Let's understand the terms using analogy - 👉 Dockerfile - Think of a Dockerfile as a recipe or a set of instructions. You start by creating a Dockerfile that lists all the ingredients (software and configurations) needed for your application. 👉 Docker Image - Using the Dockerfile as your recipe, you "cook" or "build" a Docker Image. This image is like a frozen snapshot of your application, containing everything it needs to run. 👉 Docker Container - Once you have your Docker Image, you can "serve" it by creating a Docker Container. The container is like a real, running instance of your application, and it can be started, stopped, and even duplicated as needed. You can run any number of containers from an Image. Follow @techNmak for regular insights.

techNmak's tweet photo. I have created this illustration to help you visualize the Docker Workflow 👇

Let's understand the terms using analogy -

👉 Dockerfile
- Think of a Dockerfile as a recipe or a set of instructions.

You start by creating a Dockerfile that lists all the ingredients (software and configurations) needed for your application.

👉 Docker Image
- Using the Dockerfile as your recipe, you "cook" or "build" a Docker Image.

This image is like a frozen snapshot of your application, containing everything it needs to run.

👉 Docker Container
- Once you have your Docker Image, you can "serve" it by creating a Docker Container.

The container is like a real, running instance of your application, and it can be started, stopped, and even duplicated as needed.

You can run any number of containers from an Image.

Follow @techNmak for regular insights.

54

190

37

114

7K

HGL737 retweeted

SDAIA @SDAIA_SA

6 months ago

أهم المشاريع المنجزة في البيانات والذكاء الاصطناعي لعام 2025م في الخدمات الرقمية. #ميزانية_السعودية2026

0

37

21

11

8K

HGL737 retweeted

Unsloth AI

@UnslothAI

6 months ago

You can now train Mistral Ministral 3 with reinforcement learning in our free notebook! You'll GRPO the model to solve sudoku autonomously. Learn about our new reward functions, RL environment & reward hacking. Blog: https://t.co/SLIamT6Dx7 Notebook: https://t.co/oj0lZ0fIhx

UnslothAI's tweet photo. You can now train Mistral Ministral 3 with reinforcement learning in our free notebook!

You'll GRPO the model to solve sudoku autonomously.

Learn about our new reward functions, RL environment & reward hacking.

Blog: https://t.co/SLIamT6Dx7
Notebook: https://t.co/oj0lZ0fIhx

15

873

130

594

41K

Hareer AL-Namassi @HGL737

6 months ago

GP1 done ☑️ GP2 next ➡️

0

1

0

63

HGL737 retweeted

Tech with Mak

@techNmak

6 months ago

RAG Developer Stack -------- Follow @techNmak for more insights.

8

398

83

232

12K

HGL737 retweeted

SDAIA @SDAIA_SA

6 months ago

المملكة الثالثة عالمياً بعد الولايات المتحدة الأمريكية وجمهورية الصين الشعبية في نماذج الذكاء الاصطناعي الرائدة .. وفقاً لمؤشر ستانفورد 2025م.

SDAIA_SA's tweet photo. المملكة الثالثة عالمياً بعد الولايات المتحدة الأمريكية وجمهورية الصين الشعبية في نماذج الذكاء الاصطناعي الرائدة .. وفقاً لمؤشر ستانفورد 2025م. https://t.co/Dy7K8QSgEA

32

827

436

83

103K

Hareer AL-Namassi @HGL737

7 months ago

@alqaabba تستاهلين الفوز أميرة ، أفضل من يمثلنا

0

3K

HGL737 retweeted

Ghaida @Ghaaiiiddaaa

7 months ago

للمهتمين باكتساب مهارات عملية في الذكاء الاصطناعي لا تضيعون الفرصة!

2

10

1

3

4K

HGL737 retweeted

رايـه @Rayah_m7

7 months ago

نسعد بانضمامكم وحضوركم! لنمكن الحاضر بأهم تقنيات الذكاء الاصطناعي حتى نبني مستقبل زاهر

0

8

2

0

2K

HGL737 retweeted

AI Hub @AIHub_X

7 months ago

يسعدنا إطلاق سلسلة ورش AI Peers وهي سلسلة ورش تطبيقية تُقدَّم من الطالبات المتخصصات في الذكاء الاصطناعي، وتهدف إلى تعزيز تبادل المعرفة والخبرة في مجالات الذكاء الاصطناعي. تستهل السلسلة أولى ورشها بعنوان "Advanced AI Concepts and Applications" للتسجيل: https://t.co/V7ocj3Wm03

AIHub_X's tweet photo. يسعدنا إطلاق سلسلة ورش AI Peers وهي سلسلة ورش تطبيقية تُقدَّم من الطالبات المتخصصات في الذكاء الاصطناعي، وتهدف إلى تعزيز تبادل المعرفة والخبرة في مجالات الذكاء الاصطناعي.

تستهل السلسلة أولى ورشها بعنوان "Advanced AI Concepts and Applications"

للتسجيل:
https://t.co/V7ocj3Wm03 https://t.co/dHyTW1rBRb

0

19

5

6

8K

HGL737 retweeted

مغفرة

@1A_610

7 months ago

5

4K

788

568

175K

HGL737 retweeted

ℏεsam

@Hesamation

7 months ago

llm.c by hand is next level. i wonder sometimes how the brain of people who master things in unbelievably deep levels comprehend subjects differently.

31

3K

302

3K

190K

HGL737 retweeted

Python Developer

@PythonDvz

7 months ago

Layers of AI

5

978

228

690

71K

Hareer AL-Namassi

@HGL737

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users