Clifton Poth

3 months ago

Took Claude up for a spin on the weekend and started a quick open-source self-hosted re-implementation of Thinking Machines' Tinker API: https://t.co/AJmLBV2uqx


6 months ago

been having fun training two new friends @cohere over the last few months: one nimble and quick-witted, one mighty and wise - but both better than me at finding what you're looking for

A central repository for pre-trained adapter modules in transformers! Active maintainers: @clifapt @h_sterz @LeonEnglaender @timo_imhof @PfeiffJo

6 months ago

It’s available in two versions to meet your company’s specific search needs: Fast and Pro.



clifapt retweeted

Microsoft Azure @Azure

6 months ago

Introducing Cohere Rerank 4.0 in Microsoft Foundry — a major upgrade to how enterprises search, ground, and reason with their data.


clifapt retweeted

Nick Frosst

@nickfrosst

6 months ago

when i say i am excited about boring AI this is what i mean. Rerankers won't make you believe the end of the world is coming, but god damn are they useful. Cohere just released the best reranker in the world. again.


clifapt retweeted

6 months ago

Introducing our latest breakthrough in AI search and retrieval: Rerank 4! It’s the most advanced set of reranking models on the market, with best-in-class performance across search relevance, speed, deployment flexibility, multilingual support, and domain-specific understanding.



clifapt retweeted

AdapterHub @AdapterHub

about 1 year ago

🚀Adapters v1.2 is out!🚀 We've made Adapters incredibly flexible: Add adapter support to ANY Transformer architecture with minimal code! We used this to add 8 new models out-of-the-box, incl. ModernBERT, Gemma3 & Qwen3! Explore this +2 new adapter methods in this thread👇(1/5)



clifapt retweeted

Nils Reimers

@Nils_Reimers

about 1 year ago

𝐂𝐨𝐡𝐞𝐫𝐞 𝐄𝐦𝐛𝐞𝐝 𝐯𝟒 - 𝐒𝐭𝐚𝐭𝐞-𝐨𝐟-𝐭𝐡𝐞-𝐚𝐫𝐭 𝐭𝐞𝐱𝐭 & 𝐢𝐦𝐚𝐠𝐞 𝐫𝐞𝐭𝐫𝐢𝐞𝐯𝐚𝐥 Today we are releasing Embed v4, unlocking so many cool new features for retrieval. 🇺🇳 100+ languages 🖼️ Text & Image capabilities 📜 128k context length


clifapt retweeted

Nick Frosst

@nickfrosst

about 1 year ago

Today we are releasing Embed 4 – the new SOTA foundation for agentic enterprise search and retrieval applications! https://t.co/v9xqs7LSC6 Check out the blog for similarly visually satisfying graphs :)



clifapt retweeted

about 1 year ago

We’re excited to introduce our newest state-of-the-art model: Command A! Command A provides enterprises maximum performance across agentic tasks with minimal compute requirements.


clifapt retweeted

AdapterHub @AdapterHub

over 1 year ago

🎁 A new update of the Adapters library is out! Check out all the novelties, changes & fixes here: https://t.co/muMqhP0XzA


over 1 year ago

Check out what we've been building recently! ⬇️

over 1 year ago

Introducing our latest AI search model: Rerank 3.5! Rerank 3.5 delivers state-of-the-art performance with improved reasoning and multilingual capabilities to precisely search complex enterprise data like long documents, emails, tables, and code. https://t.co/UGxclqGIPY


clifapt retweeted

Aidan Gomez

@aidangomez

over 1 year ago

Your search can see now. We're excited to release fully multimodal embeddings for folks to start building with!


over 1 year ago

Modular Transformers def is one of the coolest and most unexpected changes in the Transformers backbone recently! Hoping to see more modularity & inheritance in the future

Lysandre

@LysandreJik

over 1 year ago

Transformers v4.45 was just released, and it introduces a change I would not have expected: Modularity in Modeling Files.

Transformers has always been strict about its single-file policy: a model must be defined in a single file rather than through layers of abstraction.

So, what changed, and why are we seemingly moving away from the concept that made transformers what it is today, with 250+ model architectures across many modalities?

We respond to an issue that affects both contributors and maintainers: contributing a model to transformers is long and tedious. It often results in PRs spanning across 20+ files, with thousands of lines of code.

We wanted a solution to remove that constraint from contributors, therefore significantly enabling model additions from model authors and community members.

Still, the single-file policy is at the core of Transformers: controversial to some due to the constraints it brings with it, we know for a fact that it enabled:

- Researchers to experiment and tweak the modeling files
- Students to go through the code without jumping from abstraction to abstraction,
- Community members to contribute models without first needing to understand the rest of the overwhelmingly large package.

Therefore, we've worked on "Modular Transformers," an approach to designing modeling files in a modular way while maintaining the single-file policy.

Contributing a model to Transformers can now be done by subclassing other models, inheriting all their attributes, methods, and forward definitions.

The tool we contribute enables unraveling that inheritance into a single file. The RoBERTa "Modular" modeling file above defines the base and masked LM models.

This is then unraveled in a 1700+ single-file model definition, which can be inspected, debugged, tweaked, and adapted.

The model definition spans ~30 lines of code: only the differences are now explicit.

This is particularly important in the wake of LLMs, with each released model being only slightly different in terms of architecture; most of the difference lying in the data for the pretrained checkpoints.

While the "Modular" and "Single-file" model definitions serve different purposes, they should both result in the exact same code execution. We aim for no magic, no hidden behavior: define a code path, a property, a method in the modular file, and you'll see it reflected in the single file.

With this now merged, we can start seeing model contributions coming in at 215 LoC for the modular file; being unraveled to several files, the single-file definition standing at 1300+ LoC.

Now, please come and help us break it! It's experimental and brittle, but it should drastically lower the barrier of entry for model contribution. Come and contribute your model to make it accessible to the community at large 🙌
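The idea in the tweet above (define a model by subclassing, then unravel the inheritance into one standalone definition that behaves identically) can be illustrated with a toy sketch. This is not the Transformers tool, which works on source files and generates a new modeling file; here the flattening is done at runtime by copying attributes, and every name (`BertLikeModel`, `RobertaLikeModel`, `flatten`) is hypothetical and chosen only for illustration.

```python
class BertLikeModel:
    """Hypothetical stand-in for an existing, fully written-out model."""
    hidden_size = 8

    def embed(self, token_ids):
        return [float(t) for t in token_ids]

    def forward(self, token_ids):
        return sum(self.embed(token_ids))


class RobertaLikeModel(BertLikeModel):
    """'Modular' definition: only the difference from the parent is explicit."""
    padding_idx = 1

    def embed(self, token_ids):
        return [float(t + self.padding_idx) for t in token_ids]


def flatten(cls):
    """Build a class with no parents (other than object) that carries every
    attribute the original reaches through inheritance.

    Assumption: methods do not call super(); a source-level tool would have
    to rewrite such calls, which this runtime toy does not attempt.
    """
    namespace = {}
    # Walk from the most basic class to the most derived one so that
    # overrides in subclasses replace the inherited versions.
    for klass in reversed(cls.__mro__[:-1]):  # [:-1] skips `object`
        for name, value in vars(klass).items():
            if name not in ("__dict__", "__weakref__"):
                namespace[name] = value
    return type(cls.__name__ + "Flat", (), namespace)


FlatModel = flatten(RobertaLikeModel)
modular_out = RobertaLikeModel().forward([1, 2, 3])
flat_out = FlatModel().forward([1, 2, 3])
print(FlatModel.__bases__, modular_out, flat_out)
```

The flattened class has `object` as its only base yet returns the same result as the subclassed version, which mirrors the tweet's claim that the modular and single-file definitions should execute the same code.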



clifapt retweeted

UKP Lab @UKPLab

over 1 year ago

@emnlpmeeting @IGurevych @TUDarmstadt @ProLOEWE @ATHENECenter @emergen_CITY @CS_TUDarmstadt @FraunhoferSIT @Hessian_AI @jtonglet @Furkansahinuc @thy2512 @yufanghou @IBMResearch @AmazonScience @Koby_Loby @XiaoL558286 @ndaheim_ @dmacjam @mrinmayasachan @trumancfy @xinranz3 @TongChen0 @soshsihao @hongming110 @koeppl_lab @CarnegieMellon @uwcse @AndreasWaldis @anne_lauscher @DS_research_HH @hslu @summetix @HaritzPuerto @mtutek @somakaditya @XiaodanZhu2048 @IITKgp @IngenuityLabs @QueensECE @QianRuan_ @ilokuznetsov One more paper is accepted to the Findings of #EMNLP2024: »M2QA: Multi-domain Multilingual Question Answering« by @LeonEnglaender, @h_sterz, @clifapt, @PfeiffJo, @ilokuznetsov, @IGurevych 📰https://t.co/A23KymqkaD 💻https://t.co/a6KvXFIWh5 (11/🧵)#EMNLP2024

