GPUPerfAPI - AMD GPUOpen

The AMD GPU Performance API (AMD GPUPerfAPI, or GPA) is a powerful library, providing access to GPU Performance Counters. It can help analyze the performance and execution characteristics of applications using an AMD Radeon™ GPU.

AMD GPUPerfAPI is used by the AMD Radeon™ GPU Profiler, as well as several third-party tools including Microsoft PIX on Windows and RenderDoc.

Download the latest version - v4.2

This release includes the following features and fixes:

  • Support for additional AMD RDNA™ 4 architecture hardware (AMD Radeon™ RX 9060 GPUs).
  • Support for additional AMD RDNA 3.5 architecture-based APUs (AMD Ryzen™ AI 5 330 Processor with AMD Radeon 820M Graphics).
  • DirectX 11: GPUPerfAPIDXGetAMDDeviceInfo-x64.dll has been removed; use Windows Graphics Settings to force selection of the GPU instead.
  • GpaGetDeviceMaxWaveSlots() reliability issues have been fixed for DX12.
  • GpaGetDeviceMaxVgprs() entry point has been added.
  • On Windows, access to HW counters is now synchronized with an IPC mutex to prevent multiple applications from accessing the HW counters at the same time.
  • Fixed edge case in parsing SPM data for AMD Radeon RX 6000 and RX 5000 Series graphics hardware.
  • Fixed bug where the DX12 GPA implementation held onto strong references of D3D objects.
  • Removed problematic counters affecting AMD Radeon RX 6000 Series hardware (VsGsVALUBusy, VsGsVALUBusyCycles, VsGsSALUBusy, VsGsSALUBusyCycles).
  • Fixed cacheFlushOnCounterCollection being set inconsistently by GPA, which affected discrete counter output.
  • Fixed GL2C* counters issue affecting AMD Radeon RX 9000 Series graphics hardware.
  • Fixed WriteSize and FetchSize counters on AMD Radeon RX 9000 Series graphics hardware, which were not accounting for reads/writes of all sizes.
  • Fixed build/test issue preventing users from passing skipvulkan and skipopengl to the build on Windows®.
  • Fixed issue causing release builds to take far longer than they should during the linker step on Windows.
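
The IPC mutex item above is about mutual exclusion between processes: only one application at a time may touch the HW counters. As a rough illustration of that idea only, the sketch below uses a lock file created atomically with `os.open`. This is a swapped-in, portable stand-in written in Python, not the Windows mutex that GPA uses, and `CounterLock` and the lock file name are hypothetical.

```python
import os
import tempfile


class CounterLock:
    """Cross-process exclusive lock backed by a lock file (illustrative only)."""

    def __init__(self, name: str = "hw_counters_sketch.lock"):
        # Hypothetical lock location; any path visible to all processes works.
        self.path = os.path.join(tempfile.gettempdir(), name)
        self._fd = None

    def try_acquire(self) -> bool:
        """Return True if this process now owns the lock, False otherwise."""
        try:
            # O_CREAT | O_EXCL fails atomically if the file already exists,
            # so at most one caller can succeed at a time.
            self._fd = os.open(self.path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
            return True
        except FileExistsError:
            return False

    def release(self) -> None:
        """Give up the lock so another process can acquire it."""
        if self._fd is not None:
            os.close(self._fd)
            os.remove(self.path)
            self._fd = None
```

A second caller that fails `try_acquire()` would typically wait and retry, or report that another tool is already profiling. Unlike an OS-level mutex, a lock file is not released automatically if the owning process crashes, which is one reason a real mutex is the better fit for this job.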

Features

  • Provides a standard API for accessing GPU Performance counters for both graphics and compute workloads across multiple GPU APIs.
  • Supports Vulkan®, DirectX® 12, DirectX® 11, and OpenGL®.
  • Supports all recent AMD RDNA™ architecture-based graphics cards and APUs based on Graphics IP version 10 and newer.
  • Supports both Windows® and Linux®.
  • Provides derived “public” counters based on raw HW counters.
  • Provides access to some raw hardware counters. See Raw Hardware Counters for more information.
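
To make the distinction between raw and derived counters concrete, here is a minimal sketch of how a derived value could be computed from two raw cycle counts. It is not GPA's actual formula or API: `RawSample`, its fields, and `busy_percentage` are hypothetical names chosen for illustration, written in Python for brevity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RawSample:
    """Raw values read for one sample (hypothetical field names)."""
    busy_cycles: int   # cycles during which the unit was doing work
    total_cycles: int  # total GPU cycles elapsed during the sample


def busy_percentage(sample: RawSample) -> float:
    """Derive a 0-100 utilization figure from two raw cycle counts."""
    if sample.total_cycles <= 0:
        # Nothing elapsed: report idle rather than divide by zero.
        return 0.0
    # Clamp to 100 so small counter skew cannot produce an impossible value.
    return min(100.0, 100.0 * sample.busy_cycles / sample.total_cycles)
```

For example, `busy_percentage(RawSample(busy_cycles=250, total_cycles=1000))` yields `25.0`. The point of a derived counter is that the caller gets a directly interpretable figure without needing to know which raw counters feed it.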

Requirements

Supported GPUs

  • AMD Radeon™ AI PRO R9700 Graphics
  • AMD Radeon™ RX 9000 Series Graphics
  • AMD Radeon™ RX 7000 Series Graphics
  • AMD Radeon™ RX 6000 Series Graphics
  • AMD Radeon™ RX 5000 Series Graphics
  • AMD Ryzen™ AI Max 300 Series Processors with Radeon™ 8000S Series Graphics
  • AMD Ryzen™ AI 300 Series Processors with Radeon™ 800M Series Graphics
  • AMD Ryzen™ 8000 Series Processors with Radeon™ 700M Series Graphics
  • AMD Ryzen™ 7000 Series Processors with Radeon™ 700M Series Graphics

Supported graphics APIs

  • DirectX® 12
  • Vulkan®
  • DirectX® 11
  • OpenGL®

Supported OSs

  • Windows® 10
  • Windows® 11
  • Linux® – Ubuntu 24.04 LTS

Version history

Version 4.2 (November 2025)

Version 4.1 (June 2025)

Version 4.0 (March 2025)

Version 3.17 (September 2024)

Version 3.16 (July 2024)

Version 3.15 (December 2023)

Version 3.14 (September 2023)

Version 3.13 (June 2023)

Version 3.12 (December 2022)

Version 3.11.1 (July 2022)

Version 3.11 (April 2022)

Version 3.10 (January 2022)

Version 3.9 (July 2021)

Version 3.8 (April 2021)

Version 3.7 (November 2020)

Version 3.6 (May 2020)

Version 3.5 (December 2019)

Version 3.4 (July 2019)

Version 3.3 (December 2018)

Version 3.2 (August 2018)

Version 3.1 (June 2018)

Version 3.0 (March 2018)
