anyone that found external benchmarks on apple's afm 3 (new on-device models)?
great unlock for the local movement to have models out of the box on mac
https://t.co/qHixEzoOKN
not surprised by this trend (esp for coding), cache protects us well, but i wonder if we will see input compression being applied more widely to reduce baseline consumption
for some use cases where context is less formal and precise, think conversations or meeting transcripts this seems interesting https://t.co/f0UvU3g6LG