You copied the Doc URL to your clipboard.

Instruction caches

The shader core instruction cache is a performance-impacting area that is often overlooked. Due to the number of threads running concurrently, it is a critically important part to be aware of.


You must understand the following concepts:

  • Instruction caches.
  • Early-zs testing.

How to optimize the use of instruction caches

Try using the following optimizations:

  • Use shorter shaders with many threads over long shaders with few threads. A shorter program is more likely to be hot in the cache.
  • Use shaders that do not have control-flow divergence. Divergence can reduce temporal locality and increase cache pressure.

Things to avoid when optimizing your use of instruction caches

Arm recommends that you:

How to debug instruction cache-related performance issues

Try the following debugging steps: