angkyw @angkywilliam - Twitter Profile

4 months ago

Fine-tuning just got a whole lot easier. Serverless SFT is now in public preview on W&B! Managed infrastructure (powered by @CoreWeave) that auto-scales to your training workloads. No cluster setup. No idle GPU costs.

5

173

21

35

251K

angkyw @angkywilliam

9 months ago

@Yuchenj_UW The EO is targeting H1B recipient from outside the States, which mostly is IT consultant. It doesn’t affect and may even boost H1B chance of US college grad.

0

37

angkyw @angkywilliam

10 months ago

@EgeErdil2 Well, it does say without thinking

0

1

0

206

angkywilliam retweeted

Jack Morris

@jxmnop

12 months ago

happy birthday to the USA, the greatest country, and the origin of the following innovations: - Transformers - Pre-training (web-scale next-token prediction) - RLHF - RLVR - RL - GPUs - TPUs - PyTorch - word2vec - reasoning models - GANs - diffusion models - VLMs - self-driving cars 🇺🇸

82

2K

76

310

179K

Who to follow

∿spencer.

@_ontologic

vice president of @conceptcountry // cohost of @ai_rebels // programmer, poet, poster

paola

@braudcroissant

devil works hard but this account owner works harder ✊🏽

over 1 year ago

@jxmnop Interesting take. What's the tradeoff between SGLang and vLLM?

0

72

angkyw @angkywilliam

over 1 year ago

@LBacaj Lol, should start a small bet as a composer.

0

1

0

49

angkyw @angkywilliam

over 1 year ago

@dvassallo Amazon likely build their internal cursor, dogfood internally and release it to public to compete with cursor.

0

32

angkyw @angkywilliam

over 1 year ago

The steam engine moment for intelligence is coming fast.

0

18

angkyw @angkywilliam

over 1 year ago

Starting to see why top AI labs believe ASI is inevitable. Blending imitation learning (SFT) at different checkpoints with exploration learning (RL) can uncovers new solutions to existing problems and also tackle entirely new unsolved problems.

0

18

angkyw @angkywilliam

over 1 year ago

@goldstein_aa @jxmnop @AlexIrpan @sea_snell Training RL from scratch is hard. DeepSeek's approach builds on a strong base model. Similar to how college helps build one knowledge base before applying it to solve real-world problems.

0

73

angkywilliam retweeted

Ross Taylor

@rosstaylor90

over 1 year ago

“Wait that can’t be right” in the wild Thank you internet anons for your service to LLM reasoning. We found you through RL eventually 🫡

rosstaylor90's tweet photo. “Wait that can’t be right” in the wild

Thank you internet anons for your service to LLM reasoning. We found you through RL eventually 🫡 https://t.co/G6MdasmlpL

5

164

16

36

18K

angkyw @angkywilliam

over 1 year ago

Agent for workflow is a deterministic task similar to math and coding with verifiable output.

0

18

angkyw @angkywilliam

over 1 year ago

"Reasoning" model trade compute for data efficiency

0

8

angkyw @angkywilliam

over 1 year ago

A naive approach in training "reasoning" model 1. FineTune instruct model with chain of thought 2. Use best of N to find chain of thought that give the right answer 3. Re fineTune the model with chain of thought that yields the right answer

1

0

28

angkyw @angkywilliam

over 1 year ago

@ironcarbs Seattle 😎

0

2

0

142

angkyw @angkywilliam

over 1 year ago

@volokuleshov @brandondamos The FIM part help me understand how cursor is being trained under the hood.

1

2

0

34

angkyw @angkywilliam

over 1 year ago

@volokuleshov @brandondamos I wish I could attend the lecture! I am working on creating customize chat template serializer for Llama, Mistral and Qwen model.

0

22

angkyw @angkywilliam

almost 2 years ago

@HaramiParindey I know this place, Roosevelt island!

0

165

angkyw @angkywilliam

almost 2 years ago

Have not found class on dataset impact to model performance

0

1

0

21

angkyw @angkywilliam

almost 2 years ago

DNN: Learning Algorithm + Dataset + System/Scale Learning Algorithm: Architecture + Loss function + Optimizer (Gradient Descent) Core classes: - Architecture: https://t.co/pCZpDJNjS9 - Loss function: https://t.co/YbyHjadNWI - System: https://t.co/VPK7hfjznv

1

0

35

angkyw

@angkywilliam

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users