Hi,
On Ivy Bridge there are the following counters:
- CYCLE_ACTIVITY.CYCLES_L1D_PENDING
- CYCLE_ACTIVITY.CYCLES_L2_PENDING
- CYCLE_ACTIVITY.CYCLES_LDM_PENDING
but no CYCLE_ACTIVITY.CYCLES_LLC_PENDING. I have performed some profiling and my results suggest you cannot just subtract the first two counters from the third counter, to get the LLC value. There are three counters for the number of times there is a cache miss, but I want to know the effect of stalling.
How can I measure the number of CPU cycles stalled due to LLC cache load misses?