Tim Sears @csTimSears - Twitter Profile

almost 2 years ago

Happy to share that the new Haskell Meetup will be hosted at @GroqInc in their Mountain View HQ, this Friday 5:30pm https://t.co/R1aNWEBJ2K @csTimSears will share how Groq uses Haskell and @avi_press (founder @scarf_oss) will talk about his experience on running a startup with haskell

0

12

5

1

3K

csTimSears retweeted

Groq Inc

@GroqInc

almost 2 years ago

With over 284K+ developers using GroqCloud™, the Groq Speed Read focuses on developers and highlights new features, great apps, and what’s coming up. Sign up for the Speed Read here so you don't miss out - https://t.co/jxLNpyd2mm -

1

22

5

0

4K

Tim Sears @csTimSears

about 2 years ago

@YankeeWRLD @adaquano Backward cap, nose to nose, kicking dirt on his shoes and home plate.

0

14

csTimSears retweeted

Jay Scambler

@JayScambler

over 2 years ago

Who wants to go all in on an LPU cluster with me?

5

28

2

2K

Who to follow

Tikhon Jelvis

@tikhonjelvis

I like programming languages. A lot. Tools, types and functions.

Chris Smith

@cdsmithus

Software developer and amateur (but published!) ring theorist. Volunteer K-12 math/CS teacher. Haskell enthusiast. He/him.

Remy Goldschmidt

@taktoa1

Chip architect at @MatXComputing. https://t.co/PHcIJReCEj https://t.co/7E7yOtVEZe

csTimSears retweeted

Matt Shumer

@mattshumer_

over 2 years ago

I’m sorry to all the engineers at @GroqInc I should’ve waited till tomorrow to post about you… Hopefully the servers stay online till the morning!

9

168

2

21

44K

csTimSears retweeted

@levelsio

over 2 years ago

Try https://t.co/tfPlxzUDkZ now Hyperfast LLM running on custom built GPUs Answers in miliseconds, not seconds How? 🤯

157

2K

195

2K

602K

csTimSears retweeted

Matt Shumer

@mattshumer_

over 2 years ago

The first public demo using Groq: a lightning-fast AI Answers Engine. It writes factual, cited answers with hundreds of words in less than a second. More than 3/4 of the time is spent searching, not generating! The LLM runs in a fraction of a second. https://t.co/PifDmCRpkY

28

376

66

190

127K

csTimSears retweeted

Groq Inc

@GroqInc

over 2 years ago

@Uncensored_AI @levelsio Good guess, but nope. Completely original hardware / software solution built from the ground up. We say it's "sand to sky" because it's our custom silicon GroqChip, an LPU provided as a system via our cloud solution. We haven't even started with tricks used by others yet. :D

0

3

1

0

147

Tim Sears @csTimSears

over 2 years ago

@TaliaGold @GroqInc We did that. It’s fun. You can talk to mixtral running on Groq in minutes with zero code over at https://t.co/1QDyN63Iuz.

2

39

7

26

10K

csTimSears retweeted

diicell

@0xdiicell

over 2 years ago

oi, m8. @GroqInc fastest resp alive

0

8

2

1

2K

csTimSears retweeted

Dhruv Sahu @DhruvSahu98

over 2 years ago

@chamath @GroqInc I tried this with summarize book text with cross questioning and summary. It's faster and more accurate than big techs llms.

0

6

3

0

2K

csTimSears retweeted

Matt Shumer

@mattshumer_

over 2 years ago

Wild tech you have to try: https://t.co/IddQqtQnvV They are serving Mixtral at nearly 500 tok/s. Answers are pretty much instantaneous. Opens up new use-cases, and completely changes the UX possibilities of existing ones.

66

1K

154

1K

421K

csTimSears retweeted

Jay Scambler

@JayScambler

over 2 years ago

The response time from groq feels like magic!

0

9

1

0

819

csTimSears retweeted

Jay Scambler

@JayScambler

over 2 years ago

Groq is serving the fastest responses I've ever seen. We're talking almost 500 T/s! I did some research on how they're able to do it. Turns out they developed their own hardware that utilize LPUs instead of GPUs. Here's the skinny: Groq created a novel processing unit known as the Tensor Streaming Processor (TSP) which they categorize as a Linear Processor Unit (LPU). Unlike traditional GPUs that are parallel processors with hundreds of cores designed for graphics rendering, LPUs are architected to deliver deterministic performance for AI computations. The LPU's architecture is a departure from the SIMD (Single Instruction, Multiple Data) model used by GPUs and favor a more streamlined approach that eliminate the need for complex scheduling hardware. This design allows every clock cycle to be utilized effectively, ensuring consistent latency and throughput. For developers, this means that performance can be precisely predicted and optimized which is critical in real-time AI applications. Energy efficiency is another area where LPUs shine. By reducing the overhead of managing multiple threads and avoiding the underutilization of cores, LPUs can deliver more computations per watt. Groq's innovative chip design allows multiple TSPs to be linked together without the traditional bottlenecks found in GPU clusters making them extremely scalable. This enables linear scaling of performance as more LPUs are added simplifying the hardware requirements for large-scale AI models and making it easier for developers to scale their applications without rearchitecting their systems. So what does this all mean? LPUs could provide a massive improvement compared to GPUs for serving AI applications in the future! If anything it will be great to have alternative high performing hardware since A100s and H100s are so in demand