Reitshare Investment

@reitshare

Joined February 2017

691 Following

44 Followers

578 Posts

Reitshare Investment @reitshare

3 months ago

@TheHumanoidHub In fact, this is already reality; there is no need to wait for the future. Figure 02 is already deployed IN PRODUCTION doing human work. Of course it is not perfect or at human level, but progress is now exponential. Decades is too pessimistic.

0

1

0

0

298

Reitshare Investment @reitshare

5 months ago

@RobertoReis @tarcisiogdf It's over

0

0

0

0

114

Reitshare Investment @reitshare

6 months ago

@Teknium @Enscion25 The alternative is to solve real-world problems and train toward that goal. Robotics could contribute a lot here by collecting data and feeding back the results, so the models can be retrained and checked for success.

0

0

0

0

9

Reitshare Investment @reitshare

7 months ago

@gui_zanin_ But it grows more, right? How much is a company worth that grows its profit at this rate?

0

0

0

0

103

Reitshare Investment @reitshare

7 months ago

@Civixplorer Where is Nicolas Maduro of Venezuela?

0

2

0

0

1K

reitshare retweeted

8 months ago

🏗️ Hardware Memory bandwidth is becoming the choke point slowing down GenAI.

During 2018–2022, transformer model size grew ~410× every 2 years, while memory per accelerator grew only about 2× every 2 years.

And that mismatch shoves us into a “Memory-Wall”.

The “memory wall” is creating all the challenges in the datacenter and for edge AI applications.

In the datacenter, current technologies are primarily trying to solve this problem by applying more GPU compute power. And that's why HBM capacity and bandwidth scaling, KV offload, and prefill-decode disaggregation are central to accelerator roadmaps.

Still, at the edge, quite frankly, there are no good solutions.

🚫 Bandwidth is now the bottleneck (not just capacity).

Even when you can somehow fit the weights, the chips can’t feed data fast enough from memory to the compute units.

Over the last ~20 years, peak compute rose ~60,000×, but DRAM bandwidth only ~100× and interconnect bandwidth ~30×. Result: the processor sits idle waiting for data, the classic “memory wall.”

This hits decoder-style LLM inference the hardest.

That is because decoder-style LLMs generate 1 token at a time, so each step reuses the same weights but must stream a growing KV cache from memory. That makes the arithmetic intensity low, since you move a lot of bytes per token relative to FLOPs.

As the context grows, the KV cache grows linearly with sequence length and layer count, so every new token has to read more KV tensors; hence the KV cache quickly dominates bytes moved.

And that's why so much recent research focuses on reducing or reorganizing KV movement rather than adding FLOPs.

Training often needs 3–4× more memory than just the parameters because you must hold parameters, gradients, optimizer states and activations.

Hence we have this huge bandwidth gap: moving weights, activations, and KV cache around chips/GPUs is slower than the raw compute can consume.

Together, these dominate runtime and cost for modern LLMs.

rohanpaul_ai's tweet photo.
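To put rough numbers on the KV-cache argument in the retweeted thread, here is a minimal back-of-the-envelope sketch. It is not from the original post. The model shape (32 layers, 32 heads, head dimension 128, 7 billion parameters) is a hypothetical illustration, and the estimate assumes fp16 values, batch size 1, standard multi-head attention, and that every weight and every cached K/V value is read from memory once per generated token.

```python
def kv_cache_bytes(seq_len, n_layers, n_heads, head_dim, bytes_per_elem=2, batch=1):
    """Bytes held by the KV cache: one K and one V tensor per layer."""
    return 2 * batch * seq_len * n_layers * n_heads * head_dim * bytes_per_elem


def decode_intensity(n_params, seq_len, n_layers, n_heads, head_dim, bytes_per_elem=2):
    """Approximate FLOPs per byte moved when generating one token at batch size 1."""
    # Bytes moved: all weights once, plus the whole KV cache once.
    weight_bytes = n_params * bytes_per_elem
    kv_bytes = kv_cache_bytes(seq_len, n_layers, n_heads, head_dim, bytes_per_elem)
    # FLOPs: one multiply-add per weight, plus QK^T and attention-times-V per cached position.
    weight_flops = 2 * n_params
    attn_flops = 4 * seq_len * n_layers * n_heads * head_dim
    return (weight_flops + attn_flops) / (weight_bytes + kv_bytes)


# Hypothetical model shape, chosen only for illustration.
CFG = dict(n_layers=32, n_heads=32, head_dim=128)
N_PARAMS = 7_000_000_000

for seq_len in (1024, 4096, 16384):
    gib = kv_cache_bytes(seq_len, **CFG) / 2**30
    ai = decode_intensity(N_PARAMS, seq_len, **CFG)
    print(f"context={seq_len:>6}  KV cache={gib:5.1f} GiB  ~{ai:.2f} FLOPs/byte")
```

Under these assumptions the cache grows linearly with context length, and the estimated intensity stays at about 1 FLOP per byte, far below the ratio of peak compute to memory bandwidth on current accelerators. Larger batches raise the intensity of the weight reads but not of the per-sequence KV reads.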

20

456

65

333

82K

Reitshare Investment @reitshare

10 months ago

@RobertoReis @tarcisiogdf @ratinho_jr @RomeuZema Yeah, I think the system has already chosen Ratinho Junior over Tarcísio: https://t.co/apz1e0bCV3

0

0

0

0

19

Reitshare Investment @reitshare

11 months ago

@ZattarRafael Another 72 hours

0

1

0

0

553

Reitshare Investment @reitshare

about 1 year ago

@apples_jimmy @sama With a lot of garlic hehehe

0

0

0

0

75

Reitshare Investment @reitshare

about 1 year ago

@ai_for_success They are not buying the code; they are buying the customer base, which is by far the most difficult part.

0

0

0

0

8

Reitshare Investment @reitshare

about 1 year ago

@pedroaccorsi_ Can you give a real example? Which other cash movements do you use? For debt, do you consider the total change in short-term (current) and long-term (non-current) debt? Do you include debentures too?

0

1

0

0

12

Reitshare Investment @reitshare

about 1 year ago

@Samuelsworld He cannot run for re-election, and it will be practically impossible for him to change that

0

0

0

0

158

Reitshare Investment @reitshare

about 1 year ago

@bubaz08 @Hoodville_ @grok I just use "transform this picture and persons in a studio guibli style"

0

0

0

0

98

Reitshare Investment @reitshare

about 1 year ago

@ferdona_jpeg @Hoodville_ @grok I just use "transform this picture and persons in a studio guibli style"

0

0

0

0

173

Reitshare Investment @reitshare

about 1 year ago

@gui_zanin_ You are wrong. China took the consumers' side into account. Read here: https://t.co/4YiDC7r6N8

0

0

0

0

7

Reitshare Investment @reitshare

over 1 year ago

@OpenAI Time to release GPT 4.5

0

0

0

0

19

Reitshare Investment @reitshare

over 1 year ago

@Samuelsworld Everything changed when Kassab started criticizing Lula. PT without PSD's support in 2026? If so, that alone resolves the future fiscal outlook

0

0

0

0

52

Reitshare Investment @reitshare

over 1 year ago

@arankomatsuzaki o3 is really o2; they just skipped the number because the telecom O2 had already registered the name

0

0

0

0

22

Reitshare Investment @reitshare

over 1 year ago

@femisapien_z @lf_br_it @elonmusk He already built it; that's news from last year

0

0

0

0

48

Reitshare Investment @reitshare

over 1 year ago

@nfalan777 @rafaelgloves The source code is not available; only the parameter weights are open. You can run it anywhere, but you do not have the source code

0

0

0

0

6
