@FelixCLC_@illuhad @ProjectPhysX PVC should "just work" with >4GB allocations, no special steps required. If it doesn't, please file a bug or otherwise let us know - thanks!
@FelixCLC_@illuhad @ProjectPhysX Note, the maximum allocation size is the same for both our OpenCL and Level Zero drivers. The reason why >4GB allocations work through the DPC++ runtime is because it explicitly relaxes the allocation limits.
@illuhad@FelixCLC_ Our current client GPUs report a 4GB limit because it lets us use 32-bit addressing arithmetic more aggressively, which improves performance. It's not something that's required by the spec.
@illuhad@FelixCLC_ This matches my understanding as well. A 16GB card would need to support at least 1GB allocations, and could report up to 16GB (no upper limit). Only less-than-4GB cards could have a less-than-1GB allocation limit.
The call for submissions for the 11th International Workshop on #OpenCL and #SYCL closes on Friday Jan 20, 2023. Papers, tech talks and posters are all welcome. Join us in Cambridge, UK on April 18-20. https://t.co/hQ75ungIjZ @openclapi@thekhronosgroup@SYCLstd
@ProjectPhysX @FluidX3D@IntelGraphics I've filed an internal issue regarding the incorrect queries. Should be an easy fix. Feel free to file an issue on the compute-runtime GitHub also if you'd like. Thanks for raising these issues!
@ProjectPhysX @psychocoderHPC@FelixCLC_@SquashBionic@TheMalcore@AMD Following up: Linux should be much better with the latest drivers. I see 15+GB reported on my 16GB GPU. Windows still needs some work though and I don't have an ETA at this time.
If you need help debugging and tracing your command buffer applications, the OpenCL Intercept Layer has full support for both the base command buffer extension and the mutable dispatch extension. https://t.co/jFxEM4AmoH #OpenCL
Want to experiment with the new command buffer mutable dispatch extension right away? My command buffer emulation layer now supports this extension and should work with most OpenCL implementations. https://t.co/M4m3LVSGyr #OpenCL
I also recently added command buffer emulation layer support for out-of-order command buffers for OpenCL implementations that support out-of-order queues. The same repo has sample applications that demonstrate how to use both new features. #OpenCL
There's room for better GL/VK perf (benchmarks next week), but as a developer card, the A380 at $139 is a bargain... @openclapi 3.0 on open-source stack, @IntelSoftware oneAPI ecosystem for testing and porting work is great, SYCL & rest of software ecosystem around it.
Epic road trip complete! 5000 miles, 34 nights (32 in a tent), 9 national parks, one flat tire 😥, countless memories. Pic from yesterday at Crater Lake.