Julian Pscheid

Verified account

@JulianPscheid

Founder @HedyAI_. Previously co-founder at @emergeinteract, tech advisor @gatesfoundation, product @nike.

Portland, OR, USA

Joined March 2011

467 Following

416 Followers

2.6K Posts

Pinned Tweet

22 days ago

Most "private AI" claims are policies. They depend on trusting the vendor. We just shipped something different. Our AI meeting app, @HedyAI_, can now run the entire AI pipeline on your own device. Summaries, notes, chat, live coaching. Nothing leaves your device. The demo was recorded with Wi-Fi turned off the entire time. The transcript, summary, and live chat were all generated locally on an M4 Max. Qwen 3.6, Qwen 3.5, and Gemma 4 in the curated model lineup (quants by @UnslothAI), ranging from 2B for newer iPhones up to 35B for users with serious hardware. Plus: bring your own model from Hugging Face if you don't trust our curation. Cloud is still the default. It's faster and produces higher quality output for many users. But that will change over time. Local AI is opt-in. Built for the meetings that shouldn't happen on cloud tools: privileged client conversations, sensitive interviews, medical appointments, work done offline. No silent cloud fallback. If local fails for any reason, the app errors out. It does not quietly retry against our servers. You opted into local for a reason, and a quiet retry would defeat the point. The next few years of AI will be defined by a quiet shift. From a world where a handful of companies operate the AI on your behalf, to one where you can run your own pipeline, on your own device, with your own data, end to end. Full write up: https://t.co/0e7BoBuClm @stevibe @bnjmn_marie ping me if you want to test this with your workflows.

3

4

1

2

226

about 6 hours ago

@thehypedotnews Well done! It's a throwback to news radio.

1

1

0

0

10

about 6 hours ago

With the 450M model it now might be possible to extract image information on less powerful edge devices, such as older and mid-tier Android devices. Could be a game changer for @HedyAI_ if it allows us to allows users to upload photos of slides, etc from their meetings on their phone and we can analyze it on the edge to preserve privacy like we do with voice.

about 23 hours ago

Introducing LFM2.5-VL-1.6B-Extract and LFM2.5-VL-450M-Extract: Vision-language models that return structured JSON, not free-form text. Pass in an image and a list of fields. Get back a clean JSON object. > Two sizes: 1.6B parameters and 450M > open-weight > run on any device SoC 🧵

liquidai's tweet photo. Introducing LFM2.5-VL-1.6B-Extract and LFM2.5-VL-450M-Extract: Vision-language models that return structured JSON, not free-form text.

Pass in an image and a list of fields. Get back a clean JSON object.

> Two sizes: 1.6B parameters and 450M
> open-weight
> run on any device SoC

🧵

33

1K

127

578

67K

0

0

0

0

13

about 6 hours ago

@guilhermeotina @liquidai Need to see if the 450M version runs reliably on Android devices using Vulkan with llama.cpp. That would be an incredible unlock.

0

0

0

0

5

Who to follow

The Synchronicity Society

handwerker.chat

@handwerkerchat

Ich finde den passenden Handwerker für dich!

about 6 hours ago

@latent_node @HedyAI_ Yeah the speed is really something! I love what the Gemma team is cooking this week.

0

1

0

0

2

1 day ago

Testing Gemma 4 12B in @HedyAI_ now for end-to-end AI conversation processing, and it looks like we might have a new best model recommendation. It took a few overnight updates of llama.cpp to get it to run, but now it's cranking. Just had it accidentally still on during a sales call, and it was giving really solid automatic advice. Running on M4 Max 36B.

JulianPscheid's tweet photo. Testing Gemma 4 12B in @HedyAI_ now for end-to-end AI conversation processing, and it looks like we might have a new best model recommendation. It took a few overnight updates of llama.cpp to get it to run, but now it's cranking. Just had it accidentally still on during a sales call, and it was giving really solid automatic advice. Running on M4 Max 36B.

2 days ago

Celebrating the milestone of a massive 150+ million downloads of Gemma 4 with the release of the new Gemma 4 12B model! It's incredibly powerful for such a small model and it’s tiny enough to run locally on a laptop with just 16GB VRAM. Apache 2.0 license - happy building!

151

3K

295

512

602K

1

2

0

1

450

about 6 hours ago

Damn, the Gemma team is CRUSHING it this week

about 7 hours ago

We just dropped Gemma 4 Quantization-Aware Training (QAT) checkpoints on Hugging Face! All Gemma 4 model sizes and their drafters are now optimized with QAT to cut memory requirements and maximize on-device performance!

61

2K

159

539

195K

0

1

0

0

19

about 6 hours ago

@ivanfioravanti @liquidai Nice demo! I love seeing more utility models that are really small.

0

1

0

0

38

1 day ago

@ivanfioravanti AI inference + low power mode = ☠️ 🤣🤣🤣

1

2

0

0

27

1 day ago

@bnjmn_marie @HedyAI_ Sorry, meant 26B. Can't keep my Qwen and Gemma models straight anymore 😅

JulianPscheid's tweet photo. @bnjmn_marie @HedyAI_ Sorry, meant 26B. Can't keep my Qwen and Gemma models straight anymore 😅 https://t.co/xGsEQA2sC2

0

1

0

0

15

1 day ago

@bnjmn_marie Running the 4bit version of 12B in @HedyAI_ now and it looks quite promising. Definitely faster than 27B and quality vibes check out so far.

1

1

0

0

18

1 day ago

@FerTech @NousResearch I'd consider the Eleven Labs integration more of a PR stunt than an actual technical advancement.

1

1

0

0

30

1 day ago

@FerTech @NousResearch I know people have been using voice convos with Hermes/OC for a while now using other integrations, and I'm sure it's possible to use the new Gemma 4 12B model that have native voice capabilities.

1

1

0

0

45

1 day ago

@drbarnard @AppStore @GritMethodApp Nice! Congrats on finishing and launching the app🤘

0

1

0

0

64

1 day ago

@giacomovenier @fluidinference @NVIDIAAIDev @NVIDIAAI How do you get around that the model is only for evaluation and not production use?

JulianPscheid's tweet photo. @giacomovenier @fluidinference @NVIDIAAIDev @NVIDIAAI How do you get around that the model is only for evaluation and not production use? https://t.co/fO9xBWadtH

0

0

0

0

17

2 days ago

@grok @Google @grok can you point me to that part of the developer guide?

1

0

0

0

142

2 days ago

@Google @grok Can Gemma 4 12B also separate voice recordings by speaker when transcribing (diarization)?

1

0

0

0

774

JulianPscheid retweeted

Joshua S McConkey

2 days ago

@Google @googlegemma Single video card models are only about 16-18 months behind flagship LLMs.

elemental_j's tweet photo. @Google @googlegemma Single video card models are only about 16-18 months behind flagship LLMs. https://t.co/yLLgs62hXe

1

16

1

2

2K

2 days ago

@elemental_j @Google @googlegemma This 👆

0

0

0

0

37

2 days ago

@drbarnard AI design is still brutally hard and the one area it's hard to avoid human labor. Like everyone said at MAU... AI doesn't have taste... yet.

0

0

0

0

7

Last Seen Users on Sotwe

Trends for you

Most Popular Users