Hi, I am trying to measure the number of FLOPS my application achieves on Haswell-EP and Broadwell-EP with VTune 2017 in order to figure out how well I utilize the CPU. I found several resources on Internet according to which I should sample certain events like for example SIMD_FP_256.PACKED_DOUBLE. But it seems that this sort of event is supported only on certain CPU architectures. Is there a resource which lists all events I am supposed to sample on Haswell-EP and Broadwell-EP in order to be able to count all FLOPS that the CPU executes?
Thread Topic:
Question