Nir Ratner

over 1 year ago

Today we launched Jamba 1.6, the best open model for private enterprise deployment. AI21’s Jamba outperforms Cohere, Mistral and Llama on key benchmarks, including Arena Hard, and rivals leading closed models while maintaining unmatched speed and quality. Now available on AI21’s Studio and @Hugging Face. Learn more: https://t.co/LZD7IXKqZe


NirRatner retweeted

Research Scientist, working on Gemini @GoogleAI PhD from @TelAvivUni

almost 2 years ago

📄Jamba-1.5 whitepaper is out! The whitepaper details the architecture, training schemes, novelties and in-depth evaluations of our new long context hybrid SSM-Transformer models - Jamba-1.5-Large and Jamba-1.5-Mini. Arxiv: https://t.co/bZpWQcbHSa Here are some highlights and insights from the paper 👇1/7



NirRatner retweeted

almost 2 years ago

We released the #Jamba 1.5 open model family:

- 256K #contextwindow
- Up to 2.5X faster on #longcontext in its size class
- Native support for structured JSON output, function calling, digesting doc objects & generating citations

https://t.co/tebBJW09c5

#AI #LLM #AI21Jamba

NirRatner retweeted

about 2 years ago

Introducing Jamba, our groundbreaking SSM-Transformer open model! As the first production-grade model based on Mamba architecture, Jamba achieves an unprecedented 3X throughput and fits 140K context on a single GPU.

🥂Meet Jamba https://t.co/f2XZFOQbxh

🔨Build on @huggingface https://t.co/WGOjJMiOoE

NirRatner retweeted

Dor Muhlgay @dormuhlg

over 2 years ago

#NLProc I am happy to share I will be presenting our paper “Generating Benchmarks for Factuality Evaluation of Language Models” at #EACL2024! Check out our updated version on arxiv, introducing a new benchmark: Expert-FACTOR (based on ExpertQA) 🚀 Paper, Datasets & Code: ⬇️⬇️


NirRatner retweeted

John Spencer

@SpencerGuard

over 2 years ago

Hamas raped and mutilated women on 7 October https://t.co/z2F9SEjT7P


NirRatner retweeted

Alex Plitsas 🇺🇸

@alexplitsas

over 2 years ago

🧵 I just witnessed ~45 minutes of footage of the October 7th terrorist attack at the @AtlanticCouncil courtesy of @IsraelinUSA along with colleagues from think tanks across the ideological spectrum. What I saw was worse than I’ve ever seen. Pure evil. ***Trigger Warning***


over 2 years ago

@zehavoc @broseph_stalin That's a blunt lie: https://t.co/wl4OojkktH


over 2 years ago

@zehavoc @snarwani Nobody; he is part of Channel 14, sort of the "Newsmax" super-right-wing channel in Israel.


over 2 years ago

@jastorj No, it will be quadratic in the number of task tokens plus the number of tokens in a single window, but not quadratic in the sum of tokens in all windows.
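
A rough way to see this scaling claim is to count attended query-key pairs. The sketch below is an illustration, not the paper's code: it assumes each context token attends only within its own window, that task tokens attend to every context token and to each other, and it ignores causal masking. The function names and the example sizes are hypothetical.

```python
def full_attention_pairs(n_windows: int, window_len: int, task_len: int) -> int:
    """Query-key pairs when all windows are concatenated into one context."""
    total_tokens = n_windows * window_len + task_len
    return total_tokens * total_tokens


def pcw_attention_pairs(n_windows: int, window_len: int, task_len: int) -> int:
    """Query-key pairs under the assumed parallel-window attention pattern."""
    # Context tokens only see tokens in their own window.
    within_windows = n_windows * window_len * window_len
    # Task tokens see every context token plus the other task tokens.
    task_to_all = task_len * (n_windows * window_len + task_len)
    return within_windows + task_to_all


if __name__ == "__main__":
    for n in (3, 6):
        print(n, pcw_attention_pairs(n, 1000, 100), full_attention_pairs(n, 1000, 100))
```

With 3 windows of 1,000 tokens and 100 task tokens, this counts 3,310,000 pairs versus 9,610,000 for a single concatenated context. Under these assumptions the parallel-window count grows linearly as windows are added, while the concatenated count grows quadratically.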


over 3 years ago

#NLProc Is the context window of your LLM too small for you? Do you want to add in-context examples but can’t? Parallel Context Windows increase any LLM’s context *without further training*! 🚨 Paper from @AI21Labs "Parallel Context Windows Improve In-Context Learning" 🧵


almost 3 years ago

@boknilev @janundnik Ha, I am not aware of anyone doing that. Kudos @janundnik, nice idea.


almost 3 years ago

@boknilev @janundnik @boknilev Many reported results for multiple choices of N (https://t.co/IbiHe9NwuQ, for example), but I can't recall any paper specifically focusing on those plots. I suspect that Min, S. did one of those plots in one of her papers, but I can't recall which one.


NirRatner retweeted

AK

@_akhaliq

almost 3 years ago

Generating Benchmarks for Factuality Evaluation of Language Models

paper page: https://t.co/WpgdJBL99C

Before deploying a language model (LM) within a given domain, it is important to measure its tendency to generate factually incorrect information in that domain. Existing factual generation evaluation methods focus on facts sampled from the LM itself, and thus do not control the set of evaluated facts and might under-represent rare and unlikely facts. We propose FACTOR: Factual Assessment via Corpus TransfORmation, a scalable approach for evaluating LM factuality. FACTOR automatically transforms a factual corpus of interest into a benchmark evaluating an LM's propensity to generate true facts from the corpus vs. similar but incorrect statements. We use our framework to create two benchmarks: Wiki-FACTOR and News-FACTOR. We show that: (i) our benchmark scores increase with model size and improve when the LM is augmented with retrieval; (ii) benchmark score correlates with perplexity, but the two metrics do not always agree on model ranking; and (iii) when perplexity and benchmark score disagree, the latter better reflects factuality in open-ended generation, as measured by human annotators.
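
The abstract describes scoring an LM on true facts versus similar but incorrect statements. One plausible way to turn that into a number is sketched below. It assumes (this is not stated in the tweet) that each example has one true completion and several false variants, and that the model counts as correct when it gives the true completion the highest log-probability. The class name, function names, and scores are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ScoredExample:
    """Model log-probabilities for one benchmark item (hypothetical layout)."""
    true_logprob: float          # score of the factual completion
    false_logprobs: List[float]  # scores of the similar but incorrect variants


def is_correct(example: ScoredExample) -> bool:
    # Assumed rule: the true completion must strictly beat every false variant.
    return all(example.true_logprob > s for s in example.false_logprobs)


def benchmark_accuracy(examples: List[ScoredExample]) -> float:
    """Fraction of items where the true completion is ranked first."""
    if not examples:
        raise ValueError("need at least one example")
    return sum(is_correct(e) for e in examples) / len(examples)


if __name__ == "__main__":
    made_up_scores = [
        ScoredExample(-1.0, [-2.0, -3.0]),  # true completion ranked first
        ScoredExample(-4.0, [-3.5, -5.0]),  # a false variant ranked first
    ]
    print(benchmark_accuracy(made_up_scores))
```

In practice the log-probabilities would come from running the LM on each candidate completion; that step is left out here because it depends on the model and library in use.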

NirRatner retweeted

Dor Muhlgay @dormuhlg

almost 3 years ago

#NLProc New paper! “Generating Benchmarks for Factuality Evaluation of Language Models” From @AI21Labs Evaluate an LM’s tendency to generate true facts from your knowledge-intensive corpus! Paper: https://t.co/xDm2f2tRne Code & Data (soon): https://t.co/QyToxP4n2S 🧵⬇️


almost 3 years ago

Will be presenting a poster today at 11:00 at #ACL2023NLP. Come and say hello! 👾


almost 3 years ago

Do you want to process long texts with LLaMA models, but can't due to their context length? This one is for you! We have implemented PCW for LLaMA, enabling larger contexts!! Link: https://t.co/r0KRAWJYjQ


about 3 years ago

A nice TL;DR we did for the original preprint: https://t.co/KLM5T7zIX8


about 3 years ago

LLMs can attend to way more text than their original context window -- Accepted to ACL 2023 main conference 🥳🥳 "Parallel Context Windows for Large Language Models" Paper: https://t.co/yDUFTRAsON Code: https://t.co/r0KRAWJYjQ #ACL2023 #ACL2023NLP #NLProc
