Florian Zaruba @be4web - Twitter Profile

be4web retweeted

about 1 year ago

There is an alternate reality where Cray took their vector supercomputers, ditched FP64 calculations, and went with one FP32 pipe and a BF16 tensor core pipe. The same instruction set, memory architecture, and vector registers would have made a sweet deep learning machine, in many ways nicer than SIMT CUDA programming on GPUs. A Y-MP class machine like that could have delivered the AlexNet and DQN moments two decades earlier. Even doing everything in FP64 with no architectural changes, a Cray-1 would have been the best machine in the world for neural networks. If @geoffreyhinton had access to one for early research, the case could have been made for the architectural modifications to 10x the performance.

ID_AA_Carmack's tweet photo. There is an alternate reality where Cray took their vector supercomputers, ditched FP64 calculations, and went with one FP32 pipe and a BF16 tensor core pipe. The same instruction set, memory architecture, and vector registers would have made a sweet deep learning machine, in many ways nicer than SIMT CUDA programming on GPUs. A Y-MP class machine like that could have delivered the AlexNet and DQN moments two decades earlier.

Even doing everything in FP64 with no architectural changes, a Cray-1 would have been the best machine in the world for neural networks. If @geoffreyhinton had access to one for early research, the case could have been made for the architectural modifications to 10x the performance.

192

3K

279

565

275K

be4web retweeted

RISC-V International

@risc_v

about 3 years ago

🌟We have had an amazing time at #RISCVSummitEurope so far! Watch #RISCV Ambassador @FlorianWoh's #recap video, highlighting a few of the incredible Summit announcements, sessions, and activities. We can't wait to see everyone for another packed day tomorrow! #RISCVeverywhere

1

17

6

1

4K

Florian Zaruba @be4web

about 3 years ago

@pulp_platform @suehtamacv Many congratulations @suehtamacv! 🎊

0

1

0

94

Florian Zaruba @be4web

about 3 years ago

@mark_mcgookin @pulp_platform @lucasteske Doom for sure! Crysis is the new benchmark 😅 Occamy should be a new target for the unreal engine

0

2

0

33

Who to follow

OpenHW Foundation

@OpenHWFdn

A non-profit, global organisation where hardware and software designers collaborate in the development of open source cores, related IP, tools and software.

Luca Benini

@LucaBeniniZhFe

Ferrara, Palo Alto, Ferrara, Zurich... Boh

PULP Platform

@pulp_platform

A joint effort of @ETH_en, University of Bologna @Unibo + partners for Parallel Ultra-Low Power computing. Boldly designing open hardware since '13.

be4web retweeted

PULP Platform @pulp_platform

about 3 years ago

In order to put to rest incorrect information that recently appeared on social media and several more prominent websites, we have published a summary of our project Occamy: https://t.co/GMps7YYIcB

pulp_platform's tweet photo. In order to put to rest incorrect information that recently appeared on social media and several more prominent websites, we have published a summary of our project Occamy: https://t.co/GMps7YYIcB https://t.co/Zv4Pms4ARE

1

89

26

6

9K

be4web retweeted

Flo @FlorianWoh

about 3 years ago

Cool @AxeleraAI booth at #ew23 looks great. #AI made in Europe And they use #RISCV in some chips Greetings to the #RISCVambassador @be4web

FlorianWoh's tweet photo. Cool @AxeleraAI booth at #ew23 looks great.
#AI made in Europe
And they use #RISCV in some chips
Greetings to the #RISCVambassador @be4web https://t.co/enZCAz3jYC

0

9

4

0

581

be4web retweeted

Axelera AI

@AxeleraAI

over 3 years ago

We’re thrilled to share that we have closed our $27million Series A and are ready to launch our AI acceleration platform in early 2023! https://t.co/5PGK6KfUjj #deeptech #ai #ml #edgeai

AxeleraAI's tweet photo. We’re thrilled to share that we have closed our $27million Series A and are ready to launch our AI acceleration platform in early 2023! https://t.co/5PGK6KfUjj

#deeptech #ai #ml #edgeai https://t.co/DF2LbzivyY

0

16

10

1

0

Florian Zaruba @be4web

almost 4 years ago

@pulp_platform That’s amazing 😍! Looking at the etron homepage I think you are on the path for the smallest Linux-booting RISC-V system. Looking forward to first alive terminal pictures!

0

13

0

Florian Zaruba @be4web

almost 4 years ago

Wow, congratulations to the entire team, this is such an incredible achievement!

PULP Platform @pulp_platform

almost 4 years ago

Here is Occamy: 216 Snitch cores, an HBM controller in GF12LPP, designed as a chiplet. Enough said 😇🦉. This work wouldn't be possible without the generous support of @GlobalFoundries and @rambusinc. https://t.co/mdsCEhAD8y

pulp_platform's tweet photo. Here is Occamy: 216 Snitch cores, an HBM controller in GF12LPP, designed as a chiplet. Enough said 😇🦉. This work wouldn't be possible without the generous support of @GlobalFoundries and @rambusinc. https://t.co/mdsCEhAD8y https://t.co/QGLGKSqs6L

0

36

4

1

0

8

0

Florian Zaruba @be4web

almost 4 years ago

@pulp_platform Looking forward to the floorplan and die-shot on https://t.co/rCoY7USLGo 😍!

0

3

0

Florian Zaruba @be4web

almost 4 years ago

@pulp_platform @mazzergio @ampereproject The new template also fits the NVIDIA colors very well! Superb job 👌

0

4

0

Florian Zaruba @be4web

almost 4 years ago

Obviously proud!

Andreas Schilling 🇺🇦 @aschilling

almost 4 years ago

Found another chip that is manufactured in Intel 4. This is a 8-core RISC-V (RV64GC) CPU with compute near LLC called Vela. - 64 kB SRAM for each core - 512 kB shared LLC - the silicon is just 1,92 mm² (1,939 x 0,991 mm)

aschilling's tweet photo. Found another chip that is manufactured in Intel 4. This is a 8-core RISC-V (RV64GC) CPU with compute near LLC called Vela.

- 64 kB SRAM for each core
- 512 kB shared LLC
- the silicon is just 1,92 mm² (1,939 x 0,991 mm) https://t.co/nWhkWKV2Nc

5

150

25

13

0

1

6

0

be4web retweeted

Enjoy Digital

@enjoy_digital

about 4 years ago

Hello (ex) @pulp_platform's Ariane/@openhwgroup's CVA6! Thanks to Massimiliano.G's LiteX port, it's possible to regain some freedom: - No longer restricted to Xilinx/Genesys2/MIG. - New peripherals to play with :) - 100% open-source SoC! - ... Try it: litex_sim --cpu-type=cva6

enjoy_digital's tweet photo. Hello (ex) @pulp_platform's Ariane/@openhwgroup's CVA6!

Thanks to Massimiliano.G's LiteX port, it's possible to regain some freedom:
- No longer restricted to Xilinx/Genesys2/MIG.
- New peripherals to play with :)
- 100% open-source SoC!
- ...
Try it: litex_sim --cpu-type=cva6 https://t.co/AhoeTfRU7R

2

33

4

3

0

Florian Zaruba @be4web

about 4 years ago

@bilalzafar @LucaBeniniZhFe @pulp_platform @niwist And I would make sure from the beginning that large state holding elements (e.g., predictors) could be implemented as SRAMs, so can cope with at least one cycle latency

0

3

0

Florian Zaruba @be4web

about 4 years ago

@bilalzafar @LucaBeniniZhFe @pulp_platform @niwist I would group functional units into fixed latency (alu, fp, etc.) and variable latency (lsu, div). For the fixed latency you can be more lean and clever about your implementation because you know exactly when the result will be produced. While in Ariane everything is handshaked

1

2

0

Florian Zaruba @be4web

about 4 years ago

@bilalzafar @LucaBeniniZhFe @pulp_platform @niwist It’s only purpose is to flush the different stages 😉 but that might be the pain point of @LucaBeniniZhFe 😅

2

0

Florian Zaruba @be4web

over 4 years ago · Zurich

@pulp_platform I ❤️ the logo!

0

3

0

be4web retweeted

Luca Benini @LucaBeniniZhFe

over 4 years ago

This is great! That's what I am talking about when I push for open hardware!! Avalanche effect 🚀😎

0

11

4

0

Florian Zaruba @be4web

over 4 years ago · Reichenau

@_O_N_LANG__ Hey! As of now we manage everything via GitHub (issues). We are happy to help out there! We don’t (yet) have any instant messaging for the public 😕

0

Florian Zaruba

@be4web

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users