#gpu

12 posts, 12 participants, 0 posts today

Is #ROCm getting better? Yes.

Will you still use #CUDA? Yes.

youtube.com/watch?v=wCBLMXgk3N

What #AMD should focus on is bringing stable ROCm support to all of their SKUs on all platforms. Currently that isn't possible, which is frustrating given that their cards have more memory than #RTX cards at the same price.

#AI #LLM #OLlama

#FluidX3D is now 3 years out in the wild! This fun hobby project has turned into the largest #CFD software on #Github, with ⭐4.6k Stars. Community feedback has been overwhelming. 🖖🤯

Since last year I've enabled use of the full multi-TB memory capacity on CPUs/iGPUs, added #GPU-accelerated force/torque summation, optimized VTK export, and fixed lots of bugs, keeping the # of known bugs at 0.

More updates are planned, as well as a roadmap to commercial use. Let's keep growing!

youtube.com/watch?v=6enTZjO9fbQ

📢 Would you like to discover our most advanced research on the environmental impacts of AI?

💥 That day of sharing is Hubblo Day. This first edition is dedicated to the question of generative AI: from compute components through to transformation within organizations.

📝 Tickets are available here (100 places available, pay what you want): my.weezevent.com/hubblo-day

📍Practical info:...

#ia #iagen #genai

🌘 How to run GPT OSS 120B at 500+ tokens per second on NVIDIA GPUs
➤ How Baseten achieved outstanding GPT OSS 120B performance on launch day through experimentation, debugging, and benchmarking.
baseten.co/blog/sota-performan
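This article details how the Baseten team achieved state-of-the-art latency and throughput of more than 500 tokens per second for the GPT OSS 120B model on NVIDIA GPUs. The author walks through the whole process, from first inference and fixing compatibility bugs to optimizing the model configuration, highlights the use of techniques such as TensorRT-LLM, GPU coordination, KV cache routing, and speculative decoding, and shows how the team delivered the best possible performance to customers on launch day.
+ It is really impressive to see the Baseten team achieve such remarkable performance in so short a time. In particular, their in-depth work on TensorRT-LLM and GPU coordination has set a mark for the industry…
#AI model performance #GPU optimization #LargeLanguageModels #OpenAI #Baseten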

Baseten: How we run GPT OSS 120B at 500+ tokens per second on NVIDIA GPUs | Baseten Blog. How we optimized GPT OSS 120B for state-of-the-art latency and throughput on launch day.

The #Samsung Galaxy Z Fold 7 is powered by the Qualcomm Snapdragon 8 Elite for Galaxy #chipset. This is a special, overclocked version of Qualcomm's latest flagship processor, custom-tuned to deliver enhanced performance specifically for Samsung's devices. It features a faster #CPU, #GPU, and #NPU, providing superior speed for multitasking, gaming, and on-device #Samsung features like those found in Galaxy AI.
Source: faxty.com/samsung-galaxy-z-fol

#NVIDIA #Engineer Now Co-Maintainer Of "NOVA" #OpenSource Rust #GPU Driver
For the past year and a half, the NOVA open-source NVIDIA GPU effort has been led by #RedHat engineer #DaniloKrummrich, the main NOVA-Core driver maintainer for #kernel code that continues to be built out piece by piece for the mainline tree.
Now joining him as a co-maintainer is NVIDIA engineer #AlexandreCourbot. His name may ring a bell, as he started out in 2011 working on Tegra graphics driver support for #Linux.
phoronix.com/news/NOVA-Core-Co

www.phoronix.com: NVIDIA Engineer Now Co-Maintainer Of "NOVA" Open-Source Rust GPU Driver. The NOVA-Core driver, the basis for a modern, Rust-written open-source NVIDIA GPU driver for the upstream Linux kernel and eventual successor to the reverse-engineered Nouveau DRM driver, has a new co-maintainer.