Shaltiel @sshmidman - Twitter Profile

Shaltiel @SShmidman

about 1 month ago

@maximelabonne Will do! Either way, feel free to DM me and I'll share what we have already :)

0

1

0

11

Shaltiel @SShmidman

about 1 month ago

@maximelabonne Nice! We're going to release our training datasets in the coming weeks, including human curated preference data - would be happy to share more with you ahead of time

1

0

74

Shaltiel @SShmidman

about 2 months ago

@levelsio Missed opportunity to call the GDPR Cookie Consent Dismisser "Cookie Monster"

0

79

Shaltiel @SShmidman

2 months ago

@maximelabonne @neural_avb @liquidai @aiDotEngineer @maximelabonne can you DM me? We're pretraining a new model (~105B) and would love to chat / collab 🙏

0

16

Shaltiel @SShmidman

2 months ago

@maximelabonne @neural_avb @liquidai @aiDotEngineer Would really appreciate a link as well when it's uploaded :)

1

0

33

Shaltiel @SShmidman

2 months ago

@Teknium DM'd!

0

1

0

10

Shaltiel @SShmidman

3 months ago

@digitalix The Spark boasts 1 petaflop of sparse NVFP4, or 512 teraflops dense. Can you find any configuration, with any framework, where you can reach that while training a model?

0

2

0

71

Shaltiel @SShmidman

3 months ago

@ThePrimeagen I personally found Windsurf's autocomplete much more intuitive - agree with everything else :)

0

47

Shaltiel @SShmidman

3 months ago

@yoavgo We've got our project going on, if you want to hear more :)

0

1

0

60

Shaltiel @SShmidman

4 months ago

@elbeyoglu @elbeyoglu I'm getting some weird results for our website - just getting some metadata with no content. For example, https://t.co/HNkdG62uVn

0

16

Shaltiel @SShmidman

4 months ago

@elder_plinius @viemccoy Curious to hear about that as well

0

1

0

17

SShmidman retweeted

NVIDIA AI Developer

@NVIDIAAIDev

4 months ago

How To Adapt AI for Low-Resource Languages with NVIDIA Nemotron https://t.co/mSEF5llUtE

0

32

3

4

2K

Shaltiel @SShmidman

5 months ago

@yuvalmarton @yoavgo @rtsarfaty @yuvalpi @shmidman You can get API access to the site I linked

0

1

0

39

Shaltiel @SShmidman

5 months ago

@yuvalmarton @yoavgo @rtsarfaty @yuvalpi @shmidman There's also an open model, but it's more prone to hallucinations: https://t.co/wQWgcDQkZK https://t.co/hLMrTZkDsr

1

0

56

Shaltiel @SShmidman

5 months ago

@yuvalmarton @yoavgo @rtsarfaty @yuvalpi @shmidman https://t.co/jMA1AxWW6K

0

1

0

29

Shaltiel @SShmidman

6 months ago

We've come a long way from the initial announcement of the Sovereign LLM project - excited to showcase our results. We've written up a tutorial sharing how you can train your own Sovereign LLM using Nvidia Nemotron, NeMo Framework, and DGX Cloud Lepton: https://t.co/1Y65PdMOdp Great work together with @nvidia

Shaltiel @SShmidman

6 months ago

Beyond excited to announce the official release of: 🚀 Dicta-LM 3.0: Advancing The Frontier of Hebrew Sovereign LLMs 🔥 Dicta-LM 3.0 is a powerful open-weight collection of LLMs with full Hebrew support. View the full announcement here: https://t.co/FxPr1j6yBI 🧵

1

3

1

0

479

0

2

0

256

Shaltiel @SShmidman

6 months ago

The models were trained on a cluster of over 100 H200 GPUs, on NVIDIA DGX Cloud Lepton. All training was done using the NVIDIA NeMo Framework and the NVIDIA NeMo-RL library. We are extremely grateful to NVIDIA and their technical teams, who made this all possible!

0

1

0

139

Shaltiel @SShmidman

6 months ago

📄 The technical report is available here: https://t.co/ramBGf1Rh4

1

0

87
