ARM often wins on efficiency in many products, but I don't think that comes from the ISA alone.
Curious what folks think about x86 SBCs like the LattePanda when discussing x86 vs ARM efficiency
@hiarun02 Wait, you're joking, right? I remember at least a year ago when I burned through the max conversation length in 2 turns with free-tier Claude because I was testing it with a custom parallel mergesort asymptotic analysis task, but I had no idea that it was still this bad?
@gpuwaster @SzymonOzog_ Sorry, I don't understand; if "inline PTX is all I need", doesn't that essentially imply that I write *all* my device code in PTX?
@jimkxa Interesting point. I was just reminded of Intel's Larrabee, though: is that a case where "simpler" looked appealing architecturally, but in practice the complexity just reappeared elsewhere?
@techbromemes *Mentally flashes back to when I ported a version of a custom CUDA C/C++ mergesort from Linux to Windows and got nearly a 2x "mysterious" performance boost... until I later changed my `long`-declared indices to `long long`*
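Context for the `long` vs `long long` anecdote above: 64-bit Linux uses the LP64 data model, where `long` is 64 bits, while 64-bit Windows uses LLP64, where `long` is 32 bits. The same source can therefore silently change index width between platforms, which is a plausible (not confirmed) source of the "mysterious" speedup. A minimal C++ sketch of checking this at compile time; the helper names `bit_width_of`, `fits_in`, and `index_t` are hypothetical and not from the original code:

```cpp
#include <cassert>
#include <climits>
#include <cstddef>
#include <cstdint>
#include <limits>

// Width in bits of an integer type on the current platform.
template <typename T>
constexpr std::size_t bit_width_of() {
    return sizeof(T) * CHAR_BIT;
}

// True when an element count n can be indexed with type T without overflow.
template <typename T>
constexpr bool fits_in(unsigned long long n) {
    return n <= static_cast<unsigned long long>(std::numeric_limits<T>::max());
}

// Hypothetical index type: a fixed-width alias keeps the index width
// identical on LP64 (Linux) and LLP64 (Windows) builds.
using index_t = std::int64_t;

// `long long` is guaranteed to be at least 64 bits; `long` is only
// guaranteed to be at least 32 bits, so this is the portable invariant.
static_assert(bit_width_of<long long>() >= 64, "long long must be >= 64 bits");
static_assert(bit_width_of<index_t>() == 64, "index_t must be exactly 64 bits");
```

Using a fixed-width type such as `std::int64_t` (or deliberately `std::int32_t` when the array is known to be small) makes the index width an explicit choice rather than a side effect of the target platform.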
DLSS 5 isn't just about image quality; it's about control + ergonomics.
- If it integrates cleanly ➡️ RTX moment
- If it fights artist intent ➡️ NV1 quadratics moment
Curious how this plays out #DLSS
@tlbflush I heard, though, that the software stack for these devices wasn't very optimized, especially when it came to compiler-based auto-vectorization