박해선 @hsunpark - Twitter Profile

over 2 years ago

ML & DL Book Roadmap! :)

1

8

4

453

3 days ago

@rasbt Congrats! Can't wait for my copy! :-)

0

19

8 months ago

@aureliengeron 🤣🤣🤣

0

1

0

108

8 months ago

Excited to finally get my copy of @aureliengeron's Hands-On Machine Learning with Scikit-Learn and PyTorch! 😀

1

36

0

13

2K

9 months ago

The Korean translations of @burkov's <The Hundred-Page ML Book> and <The Hundred-Page LM Book> have just been released together! It was such a fun and rewarding experience to translate them. Big thanks to Andriy for these great books and to @insightbook for their careful work!

hsunpark's tweet photo. https://t.co/b6sTZTATz6

0

5

2

1

4K

10 months ago

The Korean translation of @rasbt's <Build a Large Language Model (From Scratch)> is now available! 📖✨ I learned so much while translating this book. It offers a clear, hands-on journey into how LLMs are built and how they work. :-) https://t.co/iA8CDtMai1

hsunpark's tweet photo. https://t.co/D9VmVVs1Xg

1

16

1

7

8K

11 months ago

@insight_impact_ I'll work hard to make even better books! Let's go~

0

1

359

about 1 year ago

A bit late, but I’d like to share that the Korean translation of <Hands-On LLM> by @JayAlammar and @MaartenGr has been published. Working on the translation was truly enjoyable and filled with surprising discoveries! 😊

hsunpark's tweet photo. https://t.co/1a6rkcAxMQ

0

4

1

4

5K

over 1 year ago

@rasbt Thank you. I really enjoyed this book! Also, I will start translating the LLM from scratch book soon. :-)

1

0

28

over 1 year ago

The Korean edition of Machine Learning Q & AI by @rasbt is now available! This book provides clear and insightful answers to key questions in machine learning and AI. :) Check it out here 👉 https://t.co/WNGE9uJGWa

hsunpark's tweet photo. https://t.co/pA1pA2ivtr

1

4

0

685

hsunpark retweeted

over 2 years ago

This paper https://t.co/zYNgPt1zmu is the complete recipe for pretraining a modern LLM from scratch, with all the details, source code, and source data. The follow-up paper will also provide the details of instruct-finetuning using Open Instruct https://t.co/5TITyUnWrX.

0

90

11

132

8K

hsunpark retweeted

Google AI

@GoogleAI

over 2 years ago

Introducing MobileDiffusion, a novel approach with the potential for rapid (sub-second) text-to-image generation on-device. An efficient latent diffusion model with a comparably small model size, it is well suited for mobile deployment. Learn more → https://t.co/JPmK7iR6T8

GoogleAI's tweet photo. https://t.co/MZfErTxR3S

29

752

199

172

134K

hsunpark retweeted

Sebastian Raschka

@rasbt

over 2 years ago

It's been an exciting week: the 'Machine Learning Q and A' book with @nostarch has been shipped to the printer and is now available for preorder!

If you've been searching for a resource following an introductory machine learning course, this might be the one. I'm covering 30 concepts that were slightly out of scope for the previous books and courses I've taught, and I've compiled them here in a concise question-and-answer format (including exercises).

I believe it will also serve as a useful companion for preparing for machine learning interviews.

The topics were selected from the entire breadth of machine learning subfields: Neural networks and deep learning, computer vision, natural language processing, production and deployment, and performance evaluation.

Here are just a few examples:

- Managing the various sources of randomness in neural network training.
- Differentiating between encoder and decoder architectures in large language models.
- Reducing overfitting through data and model modifications.
- Constructing confidence intervals for classifiers and optimizing models with limited labeled data.
- Choosing between different multi-GPU training paradigms and various types of generative AI models.
- Understanding performance metrics for natural language processing.
- Making sense of the inductive biases in vision transformers.
- And many more.

Note that this is not a coding book. However, I also have a supplementary GitHub repository with hands-on code examples for those chapters where it makes sense.

66

524

97

379

67K

hsunpark retweeted

over 2 years ago

In most jurisdictions, copyright works the following way. Copyright belongs to the author by default. Authors aren't required to claim it explicitly. They just have it.

Authors can explicitly provide third parties a certain right. For example, the author can allow the use of their content in certain use cases, including non-commercial and commercial use cases. For this, various licenses exist, such as Creative Commons.

Now, imagine you scraped the web to train your LLM. Most of the data you have in your training dataset doesn't come with a license. This means that by default you don't have a right to reproduce the content of your training data.

LLMs and diffusion models are known to be able to reproduce their training data verbatim, fully or in part. In the absence of a permissive license, this is a clear violation of copyright.

Currently, there's no reliable way of restricting the ability of LLMs to reproduce their training data, and it seems unlikely that it will be invented anytime soon.

3

6

1

2K

hsunpark retweeted

over 2 years ago

A trained LLM (and almost any ML model) is a mathematical formula. In most jurisdictions, a mathematical formula cannot be subject to copyright. As a consequence, it doesn't matter what license was used when the model weights were put online. You can take the formula, modify it for your business needs, and use it, including any commercial context.

62

201

22

105

101K

hsunpark retweeted

over 2 years ago

The Apache 2.0 licensed Mixtral beats proprietary GPT-3.5 Turbo, Gemini Pro, and the newest Claude 2.1. It would take just careful fine-tuning to reach GPT-4 level of performance. 2024 will be awesome!

burkov's tweet photo. https://t.co/DKAXnDvaPo

11

342

51

75

35K

hsunpark retweeted

Hugging Face

@huggingface

over 2 years ago

Hugging Face 🫶 @GoogleColab
With the latest release of huggingface_hub, you don't need to manually log in anymore. Create a secret once and share it with every notebook you run. 🤗
pip install --upgrade huggingface_hub
Check it out!👇

4

552

105

160

48K

over 2 years ago

@ManningBooks @rasbt Congrats @rasbt! :)

0

15

over 2 years ago

@rasbt Here it is! https://t.co/BBqJ6NBpaQ

0

1

0

72