Congrats to all #SwiftStudentChallenge winners and applicants! It's such a privilege to have the opportunity to experience dub dub live as a 15 year old. My app playground is an app for deafblind individuals. The app translate text to morse code to haptic feedback.
The fact that memory stocks are crashing because of Googleโs Turboquant is a pretty good indicator of how many clueless people this market is filled with. Itโs like saying Aramco should crash because Toyota came out with a next-generation hybrid engine.
According to benchmarks Qwen3.5 4B is as good as GPT 4o.
GPT 4o came out ~2 years ago (May 2024).
Qwen 3.5 4B runs easily on modern mobile devices.
So the gap between frontier intelligence in a datacenter and running a model of equal quality on your iPhone could be 2-3 years. (Probably closer to 3 assuming Qwen3.5 4B is more benchmaxxed than 4o)
I don't expect the trend of increasing intelligence-per-watt to change. So in 2-3 years it's plausible we will be running GPT 5.x quality models on an iPhone. Pretty wild.
The new Qwen 3.5 by @Alibaba_Qwen running on-device on iPhone 14. Qwen 3.5 performance is on par with gpt-5 on some benchmarks! The 2B 6-bit model we have here is running on MLX, which is specially designed for Apple Silicon.
@pocketllm
@franjohn21@blakeandersonw the problem with the local model is that it is strongly moderated and safety guardrails will be triggered for the most basic prompts