Intermittent segfault when finalizing results
I encounter an intermittent segfault when finalizing results from a hotspot analysis. I identified that the segfault only occurs if ~/.flexlmborrow exists as a result of borrowing a floating license...
View ArticleIntel VTune Amplifier XE 2016 error (Cannot get global tsc range)
Hello,I tried to profile a MPI program with Vtune by typing the following command.mpirun -n 8 -machinefile ~/machinefile amplxe-cl -r reportfile -collect-with runsa -start-paused -knob...
View ArticleResults from INST_RETIRED.ANY and INST_RETIRED.ANY_P
Hi,Does anyone know this problem?I have achieved the results, but I do not know how is going on with it, because the results seems like not correct, for example, I have tested the SPEC CIN2006...
View ArticleDoes new Vtune include “LOAD_DISPATCH.ANY” event?
Hi,Does anyone knows the new vtune 2016 have the event “LOAD_DISPATCH.ANY”? I have not found it in the event list while I wanted to add it. I think my cpu support that, and I attached the cpu...
View ArticleAn assertion failure on VTune Amplifier XE 2017
Hi there, Please help. The VTune Amplifier XE 2017(Windows 8.1 64-bit, standalone mode) crashes when finalizing results. I attaсhed files from ReportPath directory to this topic.Best...
View Articlewhy I can not find the event which I used in vtune 2011?
Hi,Does anyone knows the event list in vtune 2017? I did not find a event counter, which I found it in vtune 2011, but in 2017 it do not have, certainly, machines are different. So, can I still use the...
View ArticleHow to detect massive memory copy ?
Hi! I have a problem in my java program: it uses many off-heap operations, and I suspect that in some cases copying is too big by size of blocks or by rate. So program constantly slow or become even...
View ArticleDoes anyone knows why L1D cache miss rate is high?
Hi,I am testing i7-6700 with Vtune 2016 with SPEC 2006. From the definitions, I believe the following should satisfy:MEM_INST_RETIRED.ALL_LOADS = MEM_LOAD_RETIRED.L1_HIT +...
View ArticleVTune Finalizing data and Executing actions 12% and stuck with threaded...
Hi,I am running a serial program which is taking 2min 50sec (with vtune options, its taking some more seconds). The same program with threaded and vectorized, normal execution is taking 2min but adding...
View ArticleMulti-threaded performance issues
I have a multi-threaded programs. We use an own implementation of the thread pool. First, the load of the project is enough. compred to single thread, the program of two threads is more faster.When we...
View ArticleHow to extract sample values frome vtune timeline?
Hellois it possible to extract sample values from the vtune timeline?I know the vtune supports timeline analysis for BW or uncore event counts. This means the vtune internally collects event counts per...
View ArticleFLOPS measurement on Haswell-EP and Broadwell-EP
Hi, I am trying to measure the number of FLOPS my application achieves on Haswell-EP and Broadwell-EP with VTune 2017 in order to figure out how well I utilize the CPU. I found several resources on...
View ArticleProblem using -gtool option on Vtune.
Hi,I need to collect data from MPI 2 applications, with dynamic process creation. The way I found is getting data for each process separately using -gtool option, but for any MPI application I get the...
View Article"Cannot create output file" for Advanced Hotspots
Hi all!I can't run Advanced Hotspots analysis in Intel VTune Amplifier XE 2016:ERROR: Options errorError: Cannot create output file -- The file directory does not exist, or the directory is not...
View ArticleCPU Load and Memory Bound Diagrams
I am trying to speedup my program via parallelization. I am unable to get speedup factor more than 3 on 8-cores Core i7 CPU. I suspect I am facing DRAM access bottleneck. Concurrency Analysis provides...
View ArticleDoes i7-6700 has uops_retired.any?
Hi,Does anyone knows i7-6700 has uops_retired.any? if do not have, does anyone knows how can I get the counts from any other counters instead of uops_retired.any, like using summation of some...
View ArticleHow to get Wait Time and Wait Count from OpenMP applications?
Hi,I'm trying to get wait time and wait count data from some openmp applications, but it return always zero when using locksandwaits or N/A using the concurrency option.The compiler I'm using is gcc...
View ArticleInstallation issues with Parallel Studio XE 2017
Hi All, I couldn't find a proper forum to post issues during installation, so I am posting it here. I am trying to install Intel Parallel Studio XE 2017 on OpenSUSE LEAP 42.1. The installation script...
View ArticleDoes VTune Amp support ChromeOS
Am trying to do some garbage collection analysis for Octane2.0 JS Benchmark on Intel platform.Interested in doing Vtune analysis on Chrome device and see hotspots. Does VTUNE AMp support Chrome OS ?If...
View ArticleVtune and VMware Fusion 8.5
I've just tried vtune 2016 running on Ubuntu 16.04 on VMWare Fusion 8.5.When attempting to do a General Exploration; I get an error saying ERROR: Platform does not support sampling with interrupts -...
View Article