Cactus @cactuscompute - Twitter Profile

Cactus

@cactuscompute

2 days ago

@mike_drip @Winterrose @aquavoice @WisprFlow Yes that's us! Run SOTA transcription across Mac & iOS with Cactus 🌵

0

35

cactuscompute retweeted

Better Stack

@BetterStackHQ

17 days ago

Testing Cactus, a low-latency AI inference engine that runs a 1.2B model directly on an iPhone 12 Pro NPU using zero-copy memory mapping and a hybrid cloud router. See the full performance breakdown.

1

6

1

2

564

cactuscompute retweeted

Henry Ndubuaku

@Henry_Ndubuaku

about 1 month ago

Gemma 4 on Cactus can run real-time language, vision and speech on a mobile device...with multiple apps running in the background. Cactus routing algorithms give Gemma the capacity to forward tasks to frontier models like Gemini and Claude when confused. Typical on-device AI demos are too structured, but this is probably the closest we've come to a Jarvis-like AI assistant. @googlegemma @GeminiApp @cactuscompute
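The post does not say how the routing decides that the local model is "confused". As a rough illustration only, one common pattern is a confidence threshold: answer on-device when a score is high enough, otherwise forward the prompt to a cloud model. Everything named below (`LocalResult`, `route`, the `confidence` field, the 0.7 threshold, and both stand-in models) is hypothetical and is not the Cactus API.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class LocalResult:
    text: str
    confidence: float  # hypothetical score in [0, 1]; higher means more certain


def route(
    prompt: str,
    local_model: Callable[[str], LocalResult],
    cloud_model: Callable[[str], str],
    threshold: float = 0.7,  # assumed cut-off, not a documented value
) -> tuple[str, str]:
    """Return (backend, answer), falling back to the cloud on low confidence."""
    result = local_model(prompt)
    if result.confidence >= threshold:
        return "local", result.text
    return "cloud", cloud_model(prompt)


def tiny_local_model(prompt: str) -> LocalResult:
    # Stand-in model: treats short prompts as easy and long prompts as confusing.
    confidence = 0.9 if len(prompt.split()) <= 5 else 0.3
    return LocalResult(text=f"[local] {prompt}", confidence=confidence)


def frontier_model(prompt: str) -> str:
    # Stand-in for a call to a hosted frontier model.
    return f"[cloud] {prompt}"


if __name__ == "__main__":
    for p in ("turn on the lights",
              "summarise this long contract and flag every unusual clause"):
        print(route(p, tiny_local_model, frontier_model))
```

A real router would need a meaningful confidence signal; the word-count rule here exists only to make the sketch runnable.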

1

30

5

19

3K

Cactus

@cactuscompute

about 1 month ago

Proud to power on-device AI for Pebble – the product that pioneered the entire smartwatch category – and is now innovating on new wearable form factors.

Eric Migicovsky

@ericmigi

about 1 month ago

Haven't posted much about @Pebble Index 01 recently (been...uh a bit in the weeds on PT2 shipping). But lots of progress has been made!

Still need to order one? You can do that here 😉 https://t.co/XQ2wdxQj0O

We're in PVT2! Yes, 2 :)
The first Production Verification Test (PVT) at end of March exposed two main problems 1) possible ESD damage during assembly was blowing out the BLE amp on some units, 2) a change we made to the mic waterproof membrane caused some units to fail audio testing. We fixed both with modifications to the assembly line. Things seem to be on a good track now with PVT2!

When will my Index 01 ship?
Our goal is to manufacture the first 2k units by mid-May and start shipping them out. It will take us roughly 3 months to manufacture and ship out all pre-orders, meaning we should be finished by July. As always, these are estimates. Delays may happen!

Brushed Silver
We are switching from offering a polished silver to brushed silver finish! It looks great - photo below, with lots more to come. Head to https://t.co/YDqVttizId to switch colours.

Alpha testers
Thanks to some brave souls who've been using early versions of Index 01 and reporting bugs, we've crushed a ton of software issues over the last 3 months! Performance and reliability are starting to be really smooth on iOS and Android.

Software improvements
All open source, of course! https://t.co/9B5NYEoEG9
- We added fully encrypted cloud backup (optional)
- Local speech-to-text is working well, thanks to @cactuscompute + parakeet-tdt-0.6b-v3!
- You can optionally route audio or transcriptions to a webhook - enabling you to pipe recordings directly from your Index 01 to an agent like OpenClaw
- Our lead engineer also added Home Assistant support (because she wanted it herself!!).
- Added Beeper (text directly from Index 01!), music control (Android only), Notion, Apple Reminders and Google Tasks integration
- Bring your own MCP server

Still working on a lot of stuff - dramatically improved UI, more reminder app integrations (Todoist, https://t.co/cAsOmBXAo0), and more.

Tell us - what are you excited to do with your Index 01? Do you have a favorite reminder/todo/notes app that you would like to use with it?

27

164

12

19

23K

0

3

2

773

cactuscompute retweeted

Henry Ndubuaku

@Henry_Ndubuaku

about 1 month ago

@cactuscompute x @GoogleDeepMind x @ycombinator host the Gemma 4 Voice Agents Hackathon.

1

16

7

3

4K

cactuscompute retweeted

PowerSync

@powersync_

3 months ago

The PowerSync AI Hackathon starts today. Bring your favorite AI ideas to life and compete for over $8k in prizes, including bonus prizes from our partners @supabase @neondatabase @mastra @tan_stack @cactuscompute. Let the hacking begin!

https://t.co/DknMOc6Fy0

1

7

1

2

660

cactuscompute retweeted

PowerSync

@powersync_

3 months ago

We now also have @cactuscompute on board as a partner in the hackathon! Cactus' on-device AI with cloud fallback pairs well with PowerSync. They are sponsoring a prize of a month of Cactus Hybrid inference for the best submission using Cactus 🌵🎉 https://t.co/hTNT8MK82Z

0

2

0

2K

Cactus

@cactuscompute

6 months ago

@sobedominik @sunglassesface Cactus Chat is a great choice :)

0

12

cactuscompute retweeted

Google Open Source

@GoogleOSS

6 months ago

From generalist to expert! See how @cactuscompute used #Tunix for Supervised Fine-Tuning on the Gemma 3 1B model, boosting its tool-calling capabilities from 28% to 35%. All on the free tier of Google Colab. #AI #SFT #Gemma https://t.co/dMTJ6HfMuM

https://t.co/gDElb0eAqI

0

30

2

11

6K

cactuscompute retweeted

Samuel Donkor

@SAMADON_

6 months ago

@cactuscompute @nothing @huggingface Excited to share that our team placed 2nd at the Cactus (YC S25) x Nothing x Hugging Face Mobile AI Hackathon. We were up against teams from MIT, Stanford, and builders from around the world. Grateful to have had the chance to build and compete alongside so many talented people.

0

3

1

0

421

cactuscompute retweeted

Henry Ndubuaku

@Henry_Ndubuaku

6 months ago

1.6B INT8 VLM by @liquidai on Cactus (YC S25) never exceeds 231MB of peak memory usage at any context size.

1. Cactus is aggressively optimised to run on budget devices with minimal resources, which keeps it efficient, puts negligible pressure on your phone, and passes your OS safety mechanisms.
2. Notice how 1.6B INT8 on CPU reaches 95 toks/sec on Apple M4 Pro, faster than your eyes can process. Our INT4 will almost 2x the speed when merged. Expect up to 180 toks/sec decode speed.
3. The prefill speed reaches 513 toks/sec. Our NPU kernels will 5-11x that once merged. Expect up to 2500-5500 toks/sec. The time to first token for a large-context prompt will be less than 1 sec.
4. LFM2-1.2B-INT8 in the Cactus compressed format takes only 722MB. This means that with INT4 it will shrink to 350MB. Almost half as much as GGUF, ONNX, ExecuTorch, LiteRT, etc.
5. Once done, we will start recommending 1B models to our users, because your grandma's phone will run them.

Stay tuned! https://t.co/VYqUtezy75
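The projections in the post above can be sanity-checked with a few lines of arithmetic. This is only a back-of-envelope sketch: the input numbers are the ones quoted in the post, while the scaling rules (file size halves when bit width halves, NPU prefill is a flat 5x to 11x multiplier) and all function names are simplifying assumptions made here, not measurements or Cactus code.

```python
# Figures quoted in the post.
INT8_DECODE_TOKS = 95     # decode speed on Apple M4 Pro CPU, tokens/sec
INT8_PREFILL_TOKS = 513   # prefill speed, tokens/sec
INT8_SIZE_MB = 722        # LFM2-1.2B-INT8 in the Cactus format


def int4_size_mb(int8_size_mb: float) -> float:
    """Assume weights dominate the file, so half the bits means half the size."""
    return int8_size_mb / 2


def npu_prefill_range(prefill_toks: float, low: float = 5, high: float = 11) -> tuple[float, float]:
    """Apply the post's 5-11x multiplier as a flat scale factor (an assumption)."""
    return prefill_toks * low, prefill_toks * high


def time_to_first_token(prompt_tokens: int, prefill_toks: float) -> float:
    """Seconds spent on prefill alone, ignoring load time and other overhead."""
    return prompt_tokens / prefill_toks


if __name__ == "__main__":
    print("INT4 size estimate (MB):", int4_size_mb(INT8_SIZE_MB))
    print("NPU prefill range (toks/sec):", npu_prefill_range(INT8_PREFILL_TOKS))
    print("Decode at exactly 2x (toks/sec):", INT8_DECODE_TOKS * 2)
    low, _ = npu_prefill_range(INT8_PREFILL_TOKS)
    print("TTFT for a 2000-token prompt (s):", time_to_first_token(2000, low))
```

Under these assumptions the estimates land close to the post's rounded figures (roughly 350MB and 2500-5500 toks/sec), and a prompt of about 2000 tokens would prefill in under a second even at the low end of the projected range.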

7

154

13

119

37K

cactuscompute retweeted

Jakub Mroz

@jakmroo

6 months ago

We just shipped the Cactus React Native SDK 🌵 - the fastest and most efficient on-device AI inference engine for React Native. ⚡️ Lightweight, insanely fast, and built for mobile devices from the ground up. 🚀

1

3

2

1

574

Cactus

@cactuscompute

6 months ago

@SelimBenayat @tigran3rd @nothing @huggingface @tigran3rd not in SF. Our community at Stanford is away for Thanksgiving. SF will be online-only

1

2

0

56

cactuscompute retweeted

Sélim

@SelimBenayat

6 months ago

Hackathon alert! London, SF, Boston. This Friday! 👀 @nothing is teaming up with @cactuscompute and @huggingface to hack on redefining on-device AI experiences! Come build something memorable, meet the teams, and ship in 24 hours! Signups are wild so far 🔥

9

200

20

16

48K

Cactus

@cactuscompute

6 months ago

@_iamEtornam thanks for building with us, Etornam! 🫶🏼🌵

0

1

0

18

Cactus

@cactuscompute

6 months ago

Cactus React Native v1 is live! Deploy AI on-device with text inference, tool calling, embeddings and more – powered by the fastest edge inference engine 🌵 Our React Native bindings run on @margelo_com's Nitro Modules, yielding the fastest mobile inference we've seen so far.

1

3

2

1

377

