BigScience Research Workshop @bigsciencew - Twitter Profile

Pinned Tweet

BigScience Research Workshop @BigscienceW

almost 4 years ago

BLOOM is here. The largest open-access multilingual language model ever. Read more about it or get it at https://t.co/mE013I62In https://t.co/KrBRVklXLf

BigScience Research Workshop @BigscienceW

7 months ago

🫶🌸

Obvious

@obv_ious

8 months ago

Our design for Jean Zay’s new covering translates the gradient descent, the mathematical heart of how AI learns, into color and form. We actually studied the landscape loss of the BLOOM model from @BigscienceW, which was trained on Jean Zay, to create the artwork.

BigscienceW retweeted

Stas Bekman

@StasBekman

11 months ago

This is the tech that Tunji Ruwase and I first started working on during @BigscienceW to deal with cluster resizes during BLOOM-176B training and then Sam Ade Jacobs, Lev Kurilenko and Masahiro Tanaka brought it to the finish line, improving the code, and publishing a paper and presentation at USENIX ATC 2025. See Minja's post below for links to paper, code, etc.

BigscienceW retweeted

Jeff Boudier 🤗

@jeffboudier

12 months ago

4 years ago we were on the brink of AI becoming proprietary and centralized, when OpenAI kept GPT3 closed and VCs started dumping money on researchers. From fully open science, to fully closed, in a matter of months. It was scary, and 1,000+ leading researchers and scientists banded together to show the world that it was possible to do the same work in the open, and build an ecosystem that benefits everyone. That was the @BigscienceW BLOOM project, and it put us back on track to open science, starting with forward-thinking organizations like @Meta releasing OPT. Look at us now. Open models have not only caught up, they're state of the art now. Not just LLMs, but models for document AI, speech to text, text to speech, generating images and more. We're closing in on 2 million open weight models on @huggingface. Thanks for the reminder @Thom_Wolf .

BigScience Research Workshop @BigscienceW

almost 2 years ago

🌸❤️

Matthias Gallé @mgalle

almost 2 years ago

Packing for a weekend I found this.
It is hard to believe that @BigScienceLLM really happened. The first time I heard of the idea my take was "this is going to be fun... but not going to work"

Kudos to @Thom_Wolf for the vision https://t.co/0HkPFGK7Pz

BigscienceW retweeted

clem 🤗

@ClementDelangue

almost 2 years ago

Doesn't get enough credit but IMO paved the way for open-source LLMs!

BigscienceW retweeted

Oxford Internet Institute @oiioxford

almost 2 years ago

DPhil candidate @cailean_osborne shares reflections on the @OpenSourceOrg co-design process to define #opensourceAI and recommends next steps, including improving model safety and supporting more grassroots initiatives like @BigscienceW.

BigscienceW retweeted

Stas Bekman

@StasBekman

almost 2 years ago

The Universal Checkpointing paper is out! https://t.co/rAZ91sOA7K If you remember the @BigscienceW BLOOM-176B training, Tunji Ruwase and I co-invented this technology for Megatron-Deepspeed in order to enable to quickly scale up and down node topology while continuing training. Since then @MSFTDeepSpeed continued improving on that and it has now been fully integrated into Deepspeed. The blog post is here: https://t.co/39V5lSjOVh

BigscienceW retweeted

Omar Sanseviero

@osanseviero

over 2 years ago

The top 15 most-liked organizations on @huggingface
1. @StabilityAI 20k likes
2. @AIatMeta 20k
3. @runwayml 11k
4. CompVis 10k
5. @thukeg 7k
6. @BigscienceW 7k
7. @TIIuae 7k
8. @Microsoft 6.5k
9. @GoogleAI 6k
10. @OpenAI 4k
11. @BigCodeProject 4k
12. @MosaicML 4k
13. @UKPLab 3k
14. @AiEleuther 3k
15. @salesforce 3k
https://t.co/TWRABWAP2r

BigscienceW retweeted

Yacine Jernite @YJernite

over 2 years ago

I respect the caution, but also need to stress that efforts that pursue transparency as an operational value in service of actual inclusion and accountability do exist - see for example the writing on this very topic by @BigscienceW, including its ethical charter. 1/3

BigscienceW retweeted

Sasha Luccioni, PhD 🦋🌎✨🤗 @SashaMTL

almost 3 years ago

Never thought I'd see the day I'd have a publication in JMLR 🥹 So happy that the BLOOM carbon footprint paper has finally found a home at such an incredible venue! Thank you @shakir_za for being such a great editor, it warms my heart to see your name on this paper 💚

BigscienceW retweeted

MMitchell

@mmitchell_ai

almost 3 years ago · Shoreline

If you wanted to see the fun panel/Q&A we did with Londoners on AI, you can check out the recording here! My preso at the start is also on Open Science, representing @huggingface & @BigscienceW.

BigscienceW retweeted

BigCode @BigCodeProject

about 3 years ago

Introducing: 💫StarCoder

StarCoder is a 15B LLM for code with 8k context and trained only on permissive data in 80+ programming languages. It can be prompted to reach 40% pass@1 on HumanEval and act as a Tech Assistant.

Try it here: https://t.co/4XJ0tn4K1m

Release thread🧵

BigscienceW retweeted

BigCode @BigCodeProject

about 3 years ago

Join us tomorrow, Wednesday 22nd (6:30 PM - 8:00PM CET) at the @mozillafestival Science Fair to learn more about our work in the open and responsible development of large language models (LLMs) for code. https://t.co/YTpBBzDe8c #Mozfest

BigscienceW retweeted

Giada Pistilli @GiadaPistilli

about 3 years ago

As you already know, I am very proud of the collective work that enabled the development of @BigscienceW's ethical charter. Today I am even more proud to announce that it's part of @OECDinnovation's catalog to promote Trustworthy AI: such a milestone! https://t.co/C9A0rhgAO2

BigscienceW retweeted

Aran Komatsuzaki

@arankomatsuzaki

about 3 years ago

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Documents the data creation and curation efforts of ROOTS corpus, a 1.6TB dataset used to train BLOOM

Releases a large initial subset of the corpus

data: https://t.co/8uHADRFuJl
abs: https://t.co/KWwt6TaxQx

BigscienceW retweeted

Anna Rogers @annargrs

over 3 years ago

Worried about benchmark data contamination? Studying LLM memorization or attribution? @BigscienceW BLOOM 🌸 now has exact & fuzzy search over full training data! with @olapiktus🏆 @christopher Paulo Villegas @HugoLaurencon @ggdupont @SashaMTL @YJernite https://t.co/rKE3BmMfKq /1

BigscienceW retweeted

Yong Zheng-Xin

@yong_zhengxin

over 3 years ago

(Repost for corrected Arxiv)
🧐What’s the best way to quickly adapt large multilingual language models to new languages?

We present our new paper from @BigscienceW 🌸:
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

📜 https://t.co/MLUSsdXt29

[1/9]

BigscienceW retweeted

Max Ryabinin

@m_ryabinin

over 3 years ago

Petals, a system for easy decentralized inference and adaptation of 100B+ LLMs, is now online!

🌸Generate text with BLOOM-176B using Colab or a desktop GPU
🔌Fine-tune large models for your tasks
👥Help others by contributing your GPUs or host a new swarm
https://t.co/5AJ37q7DUX

BigscienceW retweeted

clem 🤗

@ClementDelangue

over 3 years ago

The Bloom paper is out. Looks like it's doing worse than current GPT3 API in zero-shot generation tasks in English but better than other open-source LLMs & better than all in zs multi-lingual (which was the main goal). Proud of the work from the community! https://t.co/NMHIzi1F79

BigScience Research Workshop @BigscienceW

over 3 years ago

Big day today with two papers out! BLOOM carbon footprint at https://t.co/BcATl2gNFx, new models BLOOMZ and mt0 at https://t.co/WF6Nm7QnOS
