Featured Posts
Bridging the Last Mile: Deploying Hummingbird-XT for Efficient Video Generation on AMD Consumer-Grade Platforms
Learn how to use the Hummingbird-XT and Hummingbird-XTX models to generate videos. Explore the video diffusion model acceleration solution, including the DiT distillation method and a lightweight VAE model.
Scaling AI Inference Performance with vLLM on AMD Instinct MI355X GPUs
Explore how the MI355X performs against the B200 in vLLM benchmarks across DeepSeek-R1, GPT-OSS-120B, Qwen3-235B, and Llama-3.3-70B.
AMD Enterprise AI Suite: Open Infrastructure for Production AI
Explore an open, GPU-optimized platform to build, deploy, and scale enterprise AI workloads on AMD Instinct with production-ready performance.
ROCm 7.9 Technology Preview: ROCm Core SDK and TheRock Build System
Get an introduction to the ROCm Core SDK and learn how to install and build ROCm components easily using TheRock.
Applying Compute Partitioning for Workloads on MI300X GPUs
Learn how to boost MI300X performance using GPU compute partitioning for parallel workloads like GROMACS and REINVENT.
Reimagining GPU Allocation in Kubernetes: Introducing the AMD GPU DRA Driver
Explore how the AMD GPU DRA Driver brings declarative, attribute-aware GPU scheduling to Kubernetes, and learn how to request and manage GPUs natively.
Installing AMD HIP-Enabled GROMACS on HPC Systems: A LUMI Supercomputer Case Study
Learn how to build and install AMD HIP-enabled GROMACS on HPC systems, using the LUMI supercomputer as a case study.
Athena-PRM: Enhancing Multimodal Reasoning with Data-Efficient Process Reward Models
Learn how to use a data-efficient Process Reward Model to enhance the reasoning ability of large language and multimodal models.
Accelerating llama.cpp on AMD Instinct MI300X
Learn more about the performance of llama.cpp on AMD Instinct platforms.
Democratizing AI Compute with AMD Using SkyPilot
Learn how SkyPilot integrates with the AMD open AI stack to enable seamless multi-cloud deployment and simplify NVIDIA-to-AMD GPU migration.
Continuing the Momentum: Refining ROCm For The Next Wave Of AI and HPC
ROCm 7.1 builds on 7.0’s AI and HPC advances with faster performance, stronger reliability, and streamlined tools for developers and system builders.
ROCm 7.0: An AI-Ready Powerhouse for Performance, Efficiency, and Productivity
Discover how ROCm 7.0 integrates AI across every layer, combining hardware enablement, frameworks, model support, and a suite of optimized tools.
Accelerating IBM Granite 4.0 with FP8 using AMD Quark on MI300/MI355 GPUs
Learn how AMD Instinct MI355 Series GPUs deliver competitive Granite 4.0 inference with faster TTFT, lower latency, and strong throughput.
Using Gradient Boosting Libraries on MI300X for Financial Risk Prediction
This blog shows how to run LightGBM and ThunderGBM GPU-accelerated training on AMD Instinct MI300X GPUs with ROCm for finance-focused workloads.
High-Resolution Weather Forecasting with StormCast on AMD Instinct GPU Accelerators
A showcase of how to run high-resolution weather prediction models such as StormCast on AMD Instinct hardware.
Breaking the Accuracy-Speed Barrier: How MXFP4/6 Quantization Revolutionizes Image and Video Generation
Explore how MXFP4/6, supported by AMD Instinct™ MI350 series GPUs, achieves BF16-comparable image and video generation quality.
Introducing the AMD Network Operator v1.0.0: Simplifying High-Performance Networking for AMD Platforms
Introducing the AMD Network Operator, which automates high-performance AI NIC networking in Kubernetes for AI and HPC workloads.
Accelerating Multimodal Inference in vLLM: The One-Line Optimization for Large Multimodal Models
Learn how to optimize multimodal model inference with batch-level data parallelism for vision encoders in vLLM, achieving up to 45% throughput gains on AMD MI300X.
GEAK HIP: Expanding GEAK for HIP Code Optimization
Explore the GEAK framework's AI-driven HIP code optimization for improved performance on AMD GPUs, including speedup examples and benefits for AI workloads.
Getting Started with AMD AI Workbench: Deploying and Managing AI Workloads
Learn how to deploy and manage AI workloads with AMD AI Workbench, a low-code interface that lets developers manage AI inference deployments.
Stay informed
- Subscribe to our RSS feed (requires an RSS reader, available as a browser plugin)
- Sign up for the ROCm newsletter
- View our blog statistics
- View the ROCm Developer Hub
- Report an issue or request a feature
- We are eager to learn from our community! If you would like to contribute to the ROCm Blogs, please submit your technical blog for review on our GitHub. To get started, see our GitHub user guide.