ICYMI: OCCT 14.2.4 has been released!
#AMD #Intel #Linux #LinuxGaming #NVIDIA #OpenCL #PCGaming #Radeon #Vulkan
https://store.steampowered.com/news/app/3515100/view/499455091736051936
Short write-up on running Hashcat 7 (or older) with OpenCL on CPUs and/or using the Nouveau FOSS driver for NVIDIA cards.
www.infosecworrier.dk/blog/2025/08/opencl/
All the good stuff is from @tychotithonus's original post. The rest is just me standing on his shoulders.
Turns out the #OpenCL __builtin_amdgcn_sdot4 intrinsic for dp4a on AMD #GPUs is only supported up to RDNA2. RDNA3+ needs a different intrinsic, __builtin_amdgcn_sudot4.
My OpenCL-Benchmark now supports both: https://github.com/ProjectPhysX/OpenCL-Benchmark/blob/master/src/kernel.cpp#L6-L20
https://github.com/llvm/llvm-project/blob/c1968fee972859dfd03a7e698422e18a5bc1d478/llvm/include/llvm/IR/IntrinsicsAMDGPU.td#L3213
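For readers who have not met dp4a before, here is a rough Python reference model of what such an instruction computes: a dot product of four signed 8-bit lanes packed into two 32-bit words, added to a 32-bit accumulator. The function names are hypothetical, the little-endian lane order and the wrap-around on overflow are assumptions, and any saturation behaviour of the hardware is ignored. This is not the kernel code from OpenCL-Benchmark, only a sketch of the arithmetic.

```python
import struct

def pack_int8x4(lanes):
    """Pack four signed 8-bit values into one 32-bit word (little-endian lanes, assumed)."""
    return struct.unpack("<I", struct.pack("<4b", *lanes))[0]

def unpack_int8x4(word):
    """Split a 32-bit word back into four signed 8-bit lanes."""
    return struct.unpack("<4b", struct.pack("<I", word & 0xFFFFFFFF))

def dp4a_reference(a, b, c):
    """Return c + dot(a_lanes, b_lanes), wrapped to a signed 32-bit integer (assumed)."""
    total = c + sum(x * y for x, y in zip(unpack_int8x4(a), unpack_int8x4(b)))
    total &= 0xFFFFFFFF  # keep the low 32 bits
    return total - (1 << 32) if total >= (1 << 31) else total

a = pack_int8x4([1, 2, 3, 4])
b = pack_int8x4([5, 6, 7, 8])
print(dp4a_reference(a, b, 10))  # 1*5 + 2*6 + 3*7 + 4*8 + 10
```

A model like this can be handy for checking the output of a kernel that switches between the two intrinsics, since both code paths should agree with it for signed inputs.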
ICYMI: Update Notes for OCCT 14.2.3
#AMD #Intel #Linux #LinuxGaming #NVIDIA #OpenCL #PCGaming #Radeon #Vulkan
https://store.steampowered.com/news/app/3515100/view/499453445510529075
ICYMI: OCCT - Update notes for 14.2.1
#AMD #Intel #Linux #LinuxGaming #NVIDIA #OpenCL #PCGaming #Radeon #Vulkan
https://store.steampowered.com/news/app/3515100/view/532103833458835534
ICYMI: OCCT - Update notes for version 14.2
#AMD #Intel #Linux #LinuxGaming #NVIDIA #OpenCL #PCGaming #Radeon #Vulkan
https://store.steampowered.com/news/app/3515100/view/499451286721140341
The OpenCL v3.0.19 maintenance update has been released with bug fixes and clarifications. It adds two new extensions: cl_khr_spirv_queries, which simplifies querying the SPIR-V capabilities of a device, and cl_khr_external_memory_android_hardware_buffer, which allows more efficient interoperation with other APIs on Android devices. In addition, the cl_khr_kernel_clock extension for sampling a clock within a kernel has been finalized and is no longer experimental.
Khronos #OpenCL Registry: https://registry.khronos.org/OpenCL/
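To see whether a given device exposes these extensions, one option is to look them up in the device's extension list, which OpenCL reports as a space-separated string (CL_DEVICE_EXTENSIONS). The sketch below only does the string handling in Python; the helper name and the example string are made up for illustration, and fetching the real string from a device is left to whatever OpenCL binding is in use.

```python
# Extension names taken from the v3.0.19 announcement above.
ANNOUNCED_EXTENSIONS = (
    "cl_khr_spirv_queries",
    "cl_khr_external_memory_android_hardware_buffer",
    "cl_khr_kernel_clock",
)

def check_extensions(extension_string, wanted=ANNOUNCED_EXTENSIONS):
    """Map each wanted extension name to True/False for a space-separated list."""
    available = set(extension_string.split())
    return {name: name in available for name in wanted}

# Hypothetical extension string, not taken from any real device report.
example = "cl_khr_fp16 cl_khr_kernel_clock cl_khr_spirv_queries"
for name, present in check_extensions(example).items():
    print(f"{name}: {'yes' if present else 'no'}")
```

Splitting on whitespace and comparing whole names avoids false positives from substring matches between similarly named extensions.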
@GuettisKnippse
Is #OpenCL enabled under Settings/Processing?
Finally I can "SLI" AMD+Intel+Nvidia #GPUs at home! I simulated this crow in flight at 680M grid cells in 36GB VRAM, pooled together from
- #AMD Radeon RX 7700 XT 12GB (RDNA3)
- #Intel Arc B580 12GB (Battlemage)
- #Nvidia Titan Xp 12GB (Pascal)
My #FluidX3D #CFD software can pool the VRAM of any combination of any GPUs together via #OpenCL.
#Krähenliebe #birds #crow
https://www.youtube.com/watch?v=1z5-ddsmAag
As usual, getting something like cross-platform GPU compute working is a mess because everyone likes to do their own thing and reinvent the wheel.
I would like something that is [modern] macOS and FreeBSD compatible, but it doesn't look like that's possible since Apple deprecated OpenCL.
(Also, could Apple have picked a less searchable term for their new GPU framework?)
It's again looking like the best way to be cross platform is to use JS+browser.
Or am I missing some library?
Is it possible to run AMD+Intel+Nvidia #GPUs in the same PC? Yes!
Got this RDNA3 chonker for free from an 11 bit studios contest! It completes my 36GB VRAM RGB SLI abomination setup:
- #AMD Radeon RX 7700 XT 12GB
- #Intel Arc B580 12GB
- #Nvidia Titan Xp 12GB
The drivers all work together in #Linux Ubuntu 24.04.2. Backbone is an ASUS ProArt Z790 with i7-13700K and 64GB, PCIe 4.0 x8/x8 + 3.0 x4 - plenty of interconnect bandwidth.
Finally I can develop and test #OpenCL on all major platforms!
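To put rough numbers on the PCIe 4.0 x8/x8 + 3.0 x4 layout, here is a back-of-the-envelope Python sketch of the theoretical per-direction link rates. It uses the standard per-lane transfer rates (8 GT/s for PCIe 3.0, 16 GT/s for 4.0) and 128b/130b line encoding, and ignores packet and protocol overhead, so real throughput will be lower. The function name is hypothetical, and which GPU sits in which slot is not assumed.

```python
# Raw per-lane transfer rate in GT/s for the two generations mentioned above.
GT_PER_S = {"3.0": 8.0, "4.0": 16.0}
ENCODING_EFFICIENCY = 128 / 130  # both generations use 128b/130b encoding

def pcie_gb_per_s(generation, lanes):
    """Theoretical one-direction bandwidth in GB/s, before protocol overhead."""
    return GT_PER_S[generation] * ENCODING_EFFICIENCY / 8 * lanes

for generation, lanes in (("4.0", 8), ("4.0", 8), ("3.0", 4)):
    print(f"PCIe {generation} x{lanes}: {pcie_gb_per_s(generation, lanes):.2f} GB/s")
```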
My #IWOCL 2025 Keynote presentation is online!
Scaling up #FluidX3D #CFD beyond 100 Billion cells on a single computer - a story about the true cross-compatibility of #OpenCL
https://www.youtube.com/watch?v=Sb3ibfoOi0c&list=PLA-vfTt7YHI2HEFrpzPhhQ8PhiztKhHU8&index=1
Slides: https://www.iwocl.org/wp-content/uploads/iwocl-2025-moritz-lehmann-keynote.pdf
I just uploaded the 5000th #OpenCL hardware report to @sascha's gpuinfo.org database! And guess what #GPU I reserved the spot for: #Intel Arc B580 #Battlemage
https://opencl.gpuinfo.org/displayreport.php?id=5000
I have contributed 4.2% (211) of all entries.
What an honor to start the #IWOCL conference with my keynote talk! Nowhere else do you get to talk to so many #OpenCL and #SYCL experts in one room! I shared some updates on my #FluidX3D #CFD solver and how I optimized it at the smallest level of a single grid cell to scale it up on the largest #Intel #Xeon6 #HPC systems, which provide more memory capacity than any #GPU server.
I made this #FluidX3D #CFD simulation run on a frankenstein zoo of AMD + Nvidia + Intel #GPUs!
https://www.youtube.com/watch?v=_8Ed8ET9gBU
The ultimate SLI abomination setup:
- 1x Nvidia A100 40GB
- 1x Nvidia Tesla P100 16GB
- 2x Nvidia A2 15GB
- 3x AMD Instinct MI50
- 1x Intel Arc A770 16GB
I split the 2.5B cells into 9 domains of 15GB each - the A100 takes 2 domains, the other GPUs 1 domain each. The GPUs communicate over PCIe via #OpenCL.
Huge thanks to Tobias Ribizel from TUM for the hardware!
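The domain split described above can be sanity-checked with a few lines of Python. This is only arithmetic on the numbers in the post, not how FluidX3D itself assigns domains. It assumes decimal gigabytes, that each domain fills exactly 15GB, and that the rounded "2.5B" is exact, so the resulting bytes-per-cell figure is approximate. The "#1/#2/#3" labels are just there to tell identical cards apart.

```python
TOTAL_CELLS = 2.5e9    # "2.5B cells" from the post (rounded)
DOMAIN_BYTES = 15e9    # "15GB" per domain, assuming decimal gigabytes

# Domains per GPU as described: the A100 takes 2, every other GPU takes 1.
domains_per_gpu = {
    "Nvidia A100 40GB": 2,
    "Nvidia Tesla P100 16GB": 1,
    "Nvidia A2 15GB #1": 1,
    "Nvidia A2 15GB #2": 1,
    "AMD Instinct MI50 #1": 1,
    "AMD Instinct MI50 #2": 1,
    "AMD Instinct MI50 #3": 1,
    "Intel Arc A770 16GB": 1,
}

total_domains = sum(domains_per_gpu.values())
cells_per_domain = TOTAL_CELLS / total_domains
bytes_per_cell = DOMAIN_BYTES / cells_per_domain

print(f"GPUs: {len(domains_per_gpu)}, domains: {total_domains}")
print(f"cells per domain: {cells_per_domain / 1e6:.0f}M")
print(f"approx. memory per cell: {bytes_per_cell:.0f} bytes")
```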