nakosung

5 months ago

@Saemin4655 @ArtificialAnlys Nope. It had been trained from scratch.

1

3

0

2

200

17facet retweeted

5 months ago

Naver, a South Korean internet giant, has just launched HyperCLOVA X SEED Think, a 32B open weights reasoning model that scores 44 on the Artificial Analysis Intelligence Index. This model is one of the strongest South Korean models, and outperforms EXAONE 4.0 32B, a previous Korean model leader Key benchmarking takeaways: ➤ Strength in Agentic Tool Use: HyperCLOVA X SEED Think scores 87% on τ²-Bench Telecom, demonstrating strong performance on agentic tool-use workflows. HyperCLOVA X SEED Think currently ranks among the frontier models in τ²-Bench Telecom, scoring similarly in this category to Gemini 3 Pro Preview ➤ Low token usage: HyperCLOVA X SEED Think demonstrates low token usage relative to other models in the same intelligence tier, using only ~39M reasoning tokens across the Artificial Analysis Intelligence suite. Compared to other Korean models like Motif-2-12.7B (190M reasoning tokens) and Exaone 4.0 32B (96M reasoning tokens), HyperCLOVA X SEED Think sees a clear advantage in token usage which could have latency and cost advantages for at-scale deployment ➤ Korean Language Advantage: HyperCLOVA X SEED Think scores 82% on Global MMLU Lite multilingual index for Korean, roughly in line with leading open-weights models such as gpt-oss-120b in the language category. This highlights the model’s potential usefulness in a primarily Korean language environment ➤ Open weights: HyperCLOVA X SEED Think is open weights and is 32B parameters. This continues the recent trend of newer Korean model labs open sourcing their models in an increasingly competitive AI race See below for further analysis

ArtificialAnlys's tweet photo. Naver, a South Korean internet giant, has just launched HyperCLOVA X SEED Think, a 32B open weights reasoning model that scores 44 on the Artificial Analysis Intelligence Index. This model is one of the strongest South Korean models, and outperforms EXAONE 4.0 32B, a previous Korean model leader

Key benchmarking takeaways:

➤ Strength in Agentic Tool Use: HyperCLOVA X SEED Think scores 87% on τ²-Bench Telecom, demonstrating strong performance on agentic tool-use workflows. HyperCLOVA X SEED Think currently ranks among the frontier models in τ²-Bench Telecom, scoring similarly in this category to Gemini 3 Pro Preview
➤ Low token usage: HyperCLOVA X SEED Think demonstrates low token usage relative to other models in the same intelligence tier, using only ~39M reasoning tokens across the Artificial Analysis Intelligence suite. Compared to other Korean models like Motif-2-12.7B (190M reasoning tokens) and Exaone 4.0 32B (96M reasoning tokens), HyperCLOVA X SEED Think sees a clear advantage in token usage which could have latency and cost advantages for at-scale deployment
➤ Korean Language Advantage: HyperCLOVA X SEED Think scores 82% on Global MMLU Lite multilingual index for Korean, roughly in line with leading open-weights models such as gpt-oss-120b in the language category. This highlights the model’s potential usefulness in a primarily Korean language environment
➤ Open weights: HyperCLOVA X SEED Think is open weights and is 32B parameters. This continues the recent trend of newer Korean model labs open sourcing their models in an increasingly competitive AI race

See below for further analysis

14

324

64

99

194K

17facet retweeted

Research Scientist @nvidia. ex: PhD @UMassCS; Intern @MSFTResearch, @MetaAI, @AdobeResearch. Opinions are my own and not the views of my employer.

5 months ago

HyperCLOVA X SEED Think demonstrates particular strength in agentic tool-use , scoring 95% on τ²-Bench Telecom. This places it among the best models in the agentic tool-use category

ArtificialAnlys's tweet photo. HyperCLOVA X SEED Think demonstrates particular strength in agentic tool-use , scoring 95% on τ²-Bench Telecom. This places it among the best models in the agentic tool-use category https://t.co/o1pVYxSMgb

1

27

7

4

4K

Who to follow

Simeng Sun

@simeng_ssun

hyunji amy lee

@hyunji_amy_lee

postdoc @unc_ai_group w/ @mohitban47. PhD @kaist_ai. Previously: @allen_ai @Adobe.

Jerry Tworek

@MillionInt

CEO and co-founder of Core Automation former VP of RL @ OpenAI : reasoning models, o3, o1, GPT4, ChatGPT, Codex, RL for robots cautious AI optimist

17facet retweeted

5 months ago

HyperCLOVA X SEED Think is one of the least token-intensive models for its intelligence, generating only ~39M reasoning tokens across the Artificial Analysis Intelligence suite - this has latency and cost implications

ArtificialAnlys's tweet photo. HyperCLOVA X SEED Think is one of the least token-intensive models for its intelligence, generating only ~39M reasoning tokens across the Artificial Analysis Intelligence suite - this has latency and cost implications https://t.co/kJW66i3b70

1

22

4

3

4K

17facet retweeted

5 months ago

HyperCLOVA X SEED Think scores -52 on the AA-Omniscience Index, driven primarily by a relatively high hallucination rate. However, it does lead in performance among existing Korean Models in this category

ArtificialAnlys's tweet photo. HyperCLOVA X SEED Think scores -52 on the AA-Omniscience Index, driven primarily by a relatively high hallucination rate. However, it does lead in performance among existing Korean Models in this category https://t.co/urNwmb90QX

1

19

4

3

3K

17facet retweeted

Kyunghyun Cho

@kchonyc

5 months ago

the first phase (only 5 months!) of the Korea's Sovereign AI Foundation Model project is closing out soon, and we are gradually starting to see open-weight models from the participating teams. the first up is Naver's hyperclova X models; HyperCLOVAX-SEED-Omni-8B and HyperCLOVAX-SEED-Think-32B. super excited to see models from the rest of the teams over the next few days! links to the model weights and descriptions below!

kchonyc's tweet photo. the first phase (only 5 months!) of the Korea's Sovereign AI Foundation Model project is closing out soon, and we are gradually starting to see open-weight models from the participating teams. the first up is Naver's hyperclova X models;
HyperCLOVAX-SEED-Omni-8B and HyperCLOVAX-SEED-Think-32B.

super excited to see models from the rest of the teams over the next few days!

links to the model weights and descriptions below!

7

65

22

20

35K

17facet retweeted

Kyunghyun Cho

@kchonyc

5 months ago

HyperCLOVAX-SEED-Think-32B: https://t.co/KuSZYK3WsM HyperCLOVAX-SEED-Omni-8B: https://t.co/4GZE8hFR0N

1

3

2K

11 months ago

@damianplayer AGENTS

0

1

0

14

over 1 year ago

@ai_for_success If you're Korean, you'd know these characters are real Korean actors and this is probably a scene from an aired drama.

1

2

0

72

over 1 year ago

@virattt ChatGPT's knowledge cutoff should be before the backtest to avoid forward-looking issues.

1

5

0

4K

over 2 years ago

@hayas1357 https://t.co/JqRWM3eajD CLOVA X

0

97

17facet retweeted

Nat Friedman

@natfriedman

about 3 years ago

Five months later

45

3K

244

265

696K

17facet retweeted

hardmaru

@hardmaru

over 3 years ago

This is what life is like at a Generative AI startup.

64

3K

376

142

347K

17facet retweeted

Sergey Karayev

@sergeykarayev

over 3 years ago

AI copilots for creative activities (coding, writing, drawing) exist and are awesome. Bing Chat, @perplexity_ai, @YouSearchEngine are copilots for "search" which is more of a consuming activity. Are there any AI copilots for other consuming, e.g. reading, watching, listening?

14

81

6

47

40K

over 3 years ago

@c_valenzuelab K pop artists perform at the Internet Space Station.

0

91

over 3 years ago

@julien_c Not so large LM? NSLM

0

36

over 3 years ago

@nixcraft Me. I loved overlays.

0

37

over 3 years ago

@karpathy The power-of-two rule was very common in the realm of computer graphics. I wonder what old wisdom we don't know about.

0

2

0

722