@Jason______A 10 year old luggage that I can replace the silicone wheels. They offered “lifetime warranty” and when I went to the shop they said it’s not produced anymore and can’t help. Anything equivalent is 3 times the price. Zipper broke and persuaded a tailor to fix it.
@BowTiedAsset @FrenchOG3 And I’m sure women wait too for the one, instead of supplying what they need from the free market of dating apps. I can’t believe the solved the problem for everybody by selecting for men’s scarcity and desperation.
@sudoingX subjectively the tradeoff for me is better than larger context with lower kv cache, keeps doing a good job after the first and second compaction, still not satisfactory quality with q4 kv cache and larger context. Most tasks I give it fit within 128k context.
@sudoingX decided for a similar setup on laptop 5090:60k context q4km, q8 cache and doing the compaction on a laptop 4090 with nemotron 4b at f16. I get slower speeds 20-30tok/s and large contexts take 5-9 min to be processed
@mr_james_c Another case of something brilliant a century ago, but let’s not do it anymore. This is not managed decline it’s managing the place into decline.