Nolano.ai @nolanoOrg - Twitter Profile

10 months ago

🚀 New Paper: Scaling Laws and Efficient Inference for Ternary Language Models. Thrilled to share that our work was presented at ACL 2025! We explore ternary LMs (TriLMs), studying their scaling laws and efficiency compared to traditional FloatLMs. 🧵 1/6

imtejas13's tweet photo. 🚀 New Paper: Scaling Laws and Efficient Inference for Ternary Language Models.

Thrilled to share that our work was presented at ACL 2025! We explore ternary LMs (TriLMs), studying their scaling laws and efficiency compared to traditional FloatLMs. 🧵

1/6 https://t.co/gkABlEwMjY

1

16

3

6

4K

Nolano.ai @NolanoOrg

over 1 year ago

Our work has been accepted as a Spotlight (top 5.1%) at ICLR 2025.

Tejas Vaidhya

@imtejas13

over 1 year ago

🎉 Thrilled to share that our paper "Surprising effectiveness of pretraining ternary language models at scale" earned a spotlight at #ICLR2024! We dive into Ternary Language Models (TriLMs), systematically studying their training feasibility and scaling laws against FloatLMs. 1/5

imtejas13's tweet photo. 🎉 Thrilled to share that our paper "Surprising effectiveness of pretraining ternary language models at scale" earned a spotlight at #ICLR2024! We dive into Ternary Language Models (TriLMs), systematically studying their training feasibility and scaling laws against FloatLMs.

1/5

3

62

12

17

12K

0

4

0

544

Nolano.ai @NolanoOrg

almost 2 years ago

Read our in-depth blog post for more insights: https://t.co/hfwSX6SDZ5

0

10

0

1

612

Nolano.ai @NolanoOrg

almost 2 years ago

🚀 SpectraSuite of Ternary and FP16 LLMs 🚀 We’re thrilled to release the Spectra Suite of open ternary (TriLMs) and FP16 (FloatLMs) language models from 99M to 3.9B parameters. At billion+ parameter scale, TriLMs upto 10x smaller can match the performance of FloatLMs. 1/5

NolanoOrg's tweet photo. 🚀 SpectraSuite of Ternary and FP16 LLMs 🚀

We’re thrilled to release the Spectra Suite of open ternary (TriLMs) and FP16 (FloatLMs) language models from 99M to 3.9B parameters. At billion+ parameter scale, TriLMs upto 10x smaller can match the performance of FloatLMs.

1/5 https://t.co/nhtNRLV5Ma

3

40

14

18

30K

Who to follow

LLM Efficiency @NVIDIA - views have always been only my own 🥇🥈 @ Flunkyball Polish Championships

almost 2 years ago

Read our ArXiv Preprint https://t.co/LzK6m6RCQM 5/5

1

10

0

1

703

NolanoOrg retweeted

Irina Rish

@irinarish

over 2 years ago

A little Xmas present 4 you!🎁🎄🎉 Excited for the first release of our open-source Robin vision-language models built by the team at @irinarish’s @cercaai lab @ @UMontreal as part of our INCITE project https://t.co/vJSt6wVEzZ. Blog/models/code: https://t.co/KnaUMFf5RD 🧵

13

307

61

129

53K

Nolano.ai @NolanoOrg

over 2 years ago

Help Guide Us What Open Source Model Should We Train Next! https://t.co/lOh9gqxaUm

0

5

0

953

Nolano.ai @NolanoOrg

over 2 years ago

We are pleased to introduce Hi-NOLIN, the best performing 9B Hindi-English Bilingual LLM. Blog: https://t.co/RdzHBJ4yTb

6

156

22

45

32K

Nolano.ai @NolanoOrg

over 2 years ago

At 60% training completion, it is already outperforming BLOOM and Pythia 9B across most Hindi, English and Coding benchmarks and closes the gap to LLaMa on Coding and English tasks.

NolanoOrg's tweet photo. At 60% training completion, it is already outperforming BLOOM and Pythia 9B across most Hindi, English and Coding benchmarks and closes the gap to LLaMa on Coding and English tasks. https://t.co/CgEFp4HO9U

1

10

0

1K

Nolano.ai @NolanoOrg

over 2 years ago

8/ LoRD provides a novel approach to LLM compression, maintaining full differentiability and trainability of parameters. It is efficient, compatible with existing methods, and holds immense potential for advancements in the field of monolingual code generation.

1

7

0

697

Nolano.ai @NolanoOrg

over 2 years ago

1/ Introducing LoRD: Low-Rank Decomposition of Monolingual Code LLMs for one-shot compression. Paper: https://t.co/spiiwFlm7s

1

45

6

18

11K

Nolano.ai @NolanoOrg

over 2 years ago

7/ Our findings suggest that LoRD is a promising new paradigm for compressing LLMs, offering significant reductions in model parameters without sacrificing model quality or differentiability, and enabling faster inference on modern hardware.

1

5

0

826

Nolano.ai

@NolanoOrg

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users