Hi all,
We are now trying to evaluate this tool for our products.
Our main interest in VTune is whether it can profile applications without adding any overhead.
The problem with profiling is that the profiling code itself generally adds overhead:
- Extra cycles are spent on accounting. In tight loops, it is not uncommon for the profiling code to take more time than the code being profiled. This seriously distorts the measurements and can make the results very confusing.
- The profiling code may also disturb CPU pipelining and branch prediction, caching, context switches (between threads) and JIT compilation. Again, this can skew the profiling results significantly.
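To make the first point concrete, here is a minimal Python sketch of the effect we mean. It has nothing to do with VTune itself: the function names (`work`, `work_instrumented`, `wall_ns`) are made up for illustration, and the "profiler" is just a pair of timer reads around each loop iteration.

```python
import time


def work(n):
    """Tight loop with no instrumentation."""
    total = 0
    for i in range(n):
        total += i * i
    return total


def work_instrumented(n):
    """Same loop, but every iteration is wrapped in two timer reads."""
    total = 0
    measured_ns = 0  # time the probes attribute to the loop body
    for i in range(n):
        start = time.perf_counter_ns()
        total += i * i
        measured_ns += time.perf_counter_ns() - start
    return total, measured_ns


def wall_ns(fn, *args):
    """Run fn(*args) and return (result, elapsed wall-clock nanoseconds)."""
    start = time.perf_counter_ns()
    result = fn(*args)
    return result, time.perf_counter_ns() - start


if __name__ == "__main__":
    n = 200_000
    plain_result, plain_ns = wall_ns(work, n)
    (inst_result, inner_ns), inst_ns = wall_ns(work_instrumented, n)
    assert plain_result == inst_result  # instrumentation must not change the answer
    print(f"uninstrumented run:       {plain_ns} ns")
    print(f"instrumented run:         {inst_ns} ns")
    print(f"time reported by probes:  {inner_ns} ns")
```

The exact numbers depend on the machine, so none are quoted here. The thing to compare is the wall-clock time of the instrumented run against the uninstrumented one, and how the time reported by the probes relates to either.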
We are hoping that VTune can help with some of these issues by relying more on CPU counters, something that will hopefully allow the code to run at full speed without interruption.
Could you give us your comments on this concern?
Thanks in advance,
Kim.