Quantcast
Channel: Intel® VTune™ Profiler (Intel® VTune™ Amplifier)
Viewing all 1347 articles
Browse latest View live

Intel Vtune Ampilfier memory analysis issue

$
0
0

Hi All,

Running code in CRAY XC 40 machine.

When I'm doing memory access getting error at run time

amplxe: Error: This analysis type requires either an access to system-wide monitoring in the Linux perf subsystem or installation of the VTune Amplifier drivers (see the "Sampling Drivers" help topic for further details). Please set the /proc/sys/kernel/perf_event_paranoid value to 0 or less to continue without installing the drivers.

If I change the value in the /proc/sys/kernel/perf_event_paranoid to 0 or less it will effect performance of others who are running jobs in the nodes?

When I'm doing hotsopts analysis it is not giving any error.

Please give me suggestions

 

 

 

TCE Level: 

TCE Open Date: 

Wednesday, March 11, 2020 - 23:08

Assertion failed in tpss_deepbind_notify_on_pthread_loaded

$
0
0

Dear Intel VTune Support Team,

I am learning to use vtune_profiler_2020.0.0.605129 on Arch Linux (kernel 5.3.13) and the CPU based analyses work on my machine.

But I have not managed to run per-program GPU analyses. (System wide GPU profiling seems to work)

E.g. when issuing the following command to profile the program glxspheres64

TPSS_DEBUG=1 /opt/intel/vtune_profiler_2020.0.0.605129/bin64/vtune -collect graphics-rendering -app-working-dir /usr/bin -- /usr/bin/env MESA_GLSL_CACHE_DISABLE=true /usr/bin/glxspheres64

I get the following output:

log4cplus:ERROR Unable to open file: ./tpss-2020.03.14-10h16m40s.405792.log
vtune: Warning: The option to analyze all processes running on the system is enabled for this analysis type by default.
vtune: Warning: Ftrace 'igfx-preempt' events cannot be collected on this platform.
vtune: Warning: To enable hardware event-base sampling, VTune Profiler has disabled the NMI watchdog timer. The watchdog timer will be re-enabled after collection completes.
vtune: Collection started. To stop the collection, either press CTRL-C or enter from another console window: vtune -r /home/christianl/intel/amplxe/projects/test/r006gr -command stop.
strace: Process 405798 attached
strace: Process 405798 detached
strace: Process 405798 attached
Polygons in scene: 62464 (61 spheres * 1024 polys/spheres)
vcs/tpss2/tpss/src/tpss/runtime/linux/exe/tpss_deepbind.c:237 tpss_deepbind_notify_on_pthread_loaded: Assertion '((tpss_pthread_key_create_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_key_create)]))->trampoline)) != ((void *)0) && ((tpss_pthread_setspecific_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_setspecific)]))->trampoline)) != ((void *)0) && ((tpss_pthread_getspecific_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_getspecific)]))->trampoline)) != ((void *)0) && ((tpss_pthread_self_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_self)]))->trampoline)) != ((void *)0) && ((tpss_pthread_getattr_np_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_getattr_np)]))->trampoline)) != ((void *)0) && ((tpss_pthread_attr_getstack_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_attr_getstack)]))->trampoline)) != ((void *)0) && ((tpss_pthread_attr_getstacksize_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_attr_getstacksize)]))->trampoline)) != ((void *)0) && ((tpss_pthread_attr_setstack_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_attr_setstack)]))->trampoline)) != ((void *)0) && ((tpss_pthread_attr_setstacksize_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_attr_setstacksize)]))->trampoline)) != ((void *)0) && ((tpss_pthread_attr_destroy_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi_pthread_attr_destroy)]))->trampoline)) != ((void *)0) && ((tpss__pthread_cleanup_push_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi__pthread_cleanup_push)]))->trampoline)) != ((void *)0) && ((tpss__pthread_cleanup_pop_call_t)(((((tpss_probe_t*)g_tpss_probes_table) + g_tpss_pt_id[(tpss_pi__pthread_cleanup_pop)]))->trampoline)) != ((void *)0)' failed.

strace: Process 405798 detached
vtune: Collection stopped.
... (output continues) ...

I get similar tpss_deepbind_notify_on_pthread_loaded assertions for other rendering applications.

I am using the Mesa graphics driver:

OpenGL vendor string: Intel Open Source Technology Center
OpenGL renderer string: Mesa DRI Intel(R) UHD Graphics 620 (Kabylake GT2) 

Do you have any advice how to resolve this issue?

I have attached the resulting analysis file for you.
The same error results without TPSS_DEBUG=1 and without MESA_GLSL_CACHE_DISABLE=true.

regards,
Christian

AttachmentSize
Downloadapplication/zipr006gr.zip8.48 MB

TCE Level: 

TCE Open Date: 

Saturday, March 14, 2020 - 02:22

VTune Profiler 2020 crash on Mac OS Mojave 10.14.6

$
0
0

VTune is crashing on my Mac OS Mojave laptop. I ran the experiment on a different node, and copied the profiles over to my laptop. It does not crash right away, and I see the profiles are processed in the tabs, however, while switching around tabs, VTune crashes with the attached report. 

TCE Level: 

TCE Open Date: 

Monday, March 16, 2020 - 14:24

Memory Access Collection: VTune 2020 sep driver generates messages about Linux Kernel

$
0
0

I am running a workload on MySQL and collecting memory access data on Oracle Linux 7.7 my kernel version is 4.14.35.1902. I start the VTune 2020 collection by loading the sep driver and after 50 secs i.e at end of the run I am receiving these messages :

Message from syslogd.. 

kernel:Uhhuh. NMI received for unknown reason 20 on CPU 84. 

 Message from syslogd.. 

kernel:Do you have a strange power saving mode enabled? 

 Message from syslogd 

kernel:Dazed and confused, but trying to continue 

I also tried driverless collection ..it generates the .perf files it goes through fine without any messages. So I moved back to VTune 2019 and when I do the data collection with sep driver I do not see any messages generated by  kernel. 

Let me know if I am missing anything in my setup . I don't see these messages affecting my performance but not sure if this is expected !

Thank You 

Ketki Haridas 

Intel Corporation 

 

TCE Level: 

TCE Open Date: 

Monday, March 16, 2020 - 14:32

Human_pose_estimation_demo(2D) not working with vTune

$
0
0

We followed the steps for running VTune as given in the following link on Ubuntu 16.04 :
https://software.intel.com/en-us/get-started-with-vtune-linux-os

We are able to get results for human_pose_estimation_3d but we are getting error while running human_pose_estimation_demo (2D). The screenshot is attached.

For human_pose_estimation_3D,
In vtune gui, for Application field we have given /home/amit/human_pose_estimation_3d_demo/human_pose_estimation_3d_demo.py
In Application parameter field,
-m /path/to/Human_pose_python/human-pose-estimation-3d-0001.xml  -i /path/to/TestVideo.mp4 -d CPU

For human_pose_estimation_demo (2D),
Application: 
/home/amit/omz_demos_build/intel64/Release/human_pose_estimation_demo

Application parameters:

-m /opt/intel/openvino_2020.1.023/deployment_tools/tools/model_downloader/intel/human-pose-estimation-0001/FP32/human-pose-estimation-0001.xml -i /home/amit/TestVideo.mp4 -d CPU

After running human_pose_estimation_demo (2D) , we are getting error as shown in screenshot attached.

 

Specifications:

vTune version : Intel(R) VTune(TM) Profiler 2020 (build 605129)

GCC version : 5.4.0

Openvino: 2020.1.023

AttachmentSize
Downloadimage/pngvtune_error.png22.2 KB

TCE Level: 

TCE Open Date: 

Monday, March 16, 2020 - 23:55

Plans to support Ryzen CPUs

$
0
0

Do you have ANY plans to support AMD CPUs with hardware profiling? 

If not, I'll be evaluating other tools like ARM Forge.

 

TCE Level: 

TCE Open Date: 

Wednesday, March 18, 2020 - 04:28

Failing to build drivers with "Linux kernels prior to 2.6.x are unsupported"

$
0
0

Hi

I'm trying to use and install VTune on two of my Debian servers, and it worked perfectly for one but fails for the others.
The driver install fails  when trying to compile the drivers with the following message:

"************ Building socperf3.ko driver ************

ERROR: Linux kernels prior to 2.6.x are unsupported ... exiting.

ERROR: Failed to build socperf3.ko driver"

Complete log in attachement.

I'm running Debian 10. "uname -r" returns "5.4.0-0.bpo.2-amd64".
It seems to me the error is probably just the script failing to find something in the local sources.

I've run "apt-get install linux-headers-5.4.0-0.bpo.2-amd64" and it's normally installed.

Any idea what more steps can I take?

Thanks!

 

AttachmentSize
Downloadtext/plainoutput.txt17.4 KB

TCE Level: 

TCE Open Date: 

Wednesday, March 18, 2020 - 05:24

VTune not launching application Ubuntu 16.04

$
0
0

Hello, 

When I try to launch my binary with vtune for hotspots analysis, it successfully "collects data" but does not actually launch the application. The summary shows zero calls and zero threads launched. 

I followed all the instructions to set up vtune from here: https://software.intel.com/en-us/vtune-install-guide-linux (I am on Ubuntu 16.04)

I know that my application runs from the terminal. When I pass the exact same application into the "Application" field and the same parameters into the "Application parameters" field. I encounter no errors; the application simply does not run. Nothing appears in the terminal from which I launched vtune-gui.

Any ideas on how to fix this? 

TCE Level: 

TCE Open Date: 

Friday, March 20, 2020 - 01:40

sepdk build fails on Fedora 31. Asm macros not defined

$
0
0

Hi,

I was trying to build the kernel driver on Fedora 31 (I know it's not officially supported yet but I was going try my luck...) with vtune 2020.0.0.605129 and I received an error about invalid characters in mnemonics in the sepdk/src/sys64.S file. Upon closer inspection, it appears to be macros that have not been substituted. I don't know where to dig after that because they are defined in linux/linkage.h and I don't see why it's not being substituted... They're marked deprecated but they're still there.

 

Error messages:

make[1]: Entering directory '/usr/src/kernels/5.5.10-200.fc31.x86_64'
  AS [M]  /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.o

/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S: Assembler messages:
/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S:47: Error: invalid character '(' in mnemonic
/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S:50: Error: invalid character '(' in mnemonic
/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S:82: Error: invalid character '(' in mnemonic
/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S:84: Error: invalid character '(' in mnemonic

Contents of sys64.S:

> sed -n -e 47p -e 50p -e 82p -e 84p sys64.S

ENTRY(SYS_Get_GDT_Base)
ENDPROC(SYS_Get_GDT_Base)
ENTRY(SYS_IO_Delay)
ENDPROC(SYS_IO_Delay)

Offending make command:

make[1]: Entering directory '/usr/src/kernels/5.5.10-200.fc31.x86_64'
...

set -e;  echo '  AS [M]  /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.o'; gcc -Wp,-MD,/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/.sys64.o.d  -nostdinc -isystem /usr/lib/gcc/x86_64-redhat-linux/9/include -I./arch/x86/include -I./arch/x86/include/generated  -I./include -I./arch/x86/include/uapi -I./arch/x86/include/generated/uapi -I./include/uapi -I./include/generated/uapi -include ./include/linux/kconfig.h -D__KERNEL__ -D__ASSEMBLY__ -fno-PIE -m64 -DCONFIG_AS_CFI=1 -DCONFIG_AS_CFI_SIGNAL_FRAME=1 -DCONFIG_AS_CFI_SECTIONS=1 -DCONFIG_AS_SSSE3=1 -DCONFIG_AS_AVX=1 -DCONFIG_AS_AVX2=1 -DCONFIG_AS_AVX512=1 -DCONFIG_AS_SHA1_NI=1 -DCONFIG_AS_SHA256_NI=1 -Wa,-gdwarf-2 -DCC_USING_FENTRY  -DMODULE  -c -o /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.o /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S; scripts/basic/fixdep /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/.sys64.o.d /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.o 'gcc -Wp,-MD,/opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/.sys64.o.d  -nostdinc -isystem /usr/lib/gcc/x86_64-redhat-linux/9/include -I./arch/x86/include -I./arch/x86/include/generated  -I./include -I./arch/x86/include/uapi -I./arch/x86/include/generated/uapi -I./include/uapi -I./include/generated/uapi -include ./include/linux/kconfig.h -D__KERNEL__ -D__ASSEMBLY__ -fno-PIE -m64 -DCONFIG_AS_CFI=1 -DCONFIG_AS_CFI_SIGNAL_FRAME=1 -DCONFIG_AS_CFI_SECTIONS=1 -DCONFIG_AS_SSSE3=1 -DCONFIG_AS_AVX=1 -DCONFIG_AS_AVX2=1 -DCONFIG_AS_AVX512=1 -DCONFIG_AS_SHA1_NI=1 -DCONFIG_AS_SHA256_NI=1 -Wa,-gdwarf-2 -DCC_USING_FENTRY  -DMODULE  -c -o /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.o /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/sys64.S'> /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/.sys64.o.cmd; rm -f /opt/intel/vtune_profiler_2020.0.0.605129/sepdk/src/.sys64.o.d

Thanks in advance for your help!

TCE Level: 

TCE Open Date: 

Wednesday, March 25, 2020 - 00:26

Kernel driver build fails on Fedora 30. Asm macros not substituted.

How to use Intel® VTune™ Profiler in a kubernetes Environment

$
0
0

Hi All,

I have a Kubernetes environment. I want to use Intel® VTune™ Profiler to determine bottlenecks, inefficient code, long execution times, etc or any other useful info about the containers( containers have python applications in it) . 

I need some documentation or wiki on how to get started with setting up Intel® VTune™ in a Kubernetes environment (it can be standalone or container-based installation of VTune) and how to run it

Thanks

Krishna Venkata

Analyse python file from embedded python

$
0
0

Hi, I would like have a python analysis for Python files executed from the C python API, like in this sample code adapted from https://docs.python.org/3/extending/embedding.html :

#define PY_SSIZE_T_CLEAN
#include <Python.h>

int
main(int argc, char *argv[])
{
	printf("before\n");
	wchar_t *program = Py_DecodeLocale(argv[0], NULL);
	if (program == NULL) {
		fprintf(stderr, "Fatal error: cannot decode argv[0]\n");
		exit(1);
	}
	Py_SetProgramName(program); 
	Py_Initialize();
	auto* f = fopen("f.py", "rb");
	auto r = PyRun_SimpleFile(f, "f.py");
	if (Py_FinalizeEx() < 0) {
		exit(120);
	}
	PyMem_RawFree(program);
	return 0;
}

When I use Hostspots analysis with user mode sampling and managed mode set to any of (auto, native, mixed), I only get the C/C++ functions. 

python37_d.dll ! PyEval_EvalFrameDefault - ceval.c
python37_d.dll ! PyEval_EvalCodeWithName + 0xaf3 - ceval.c:3930
python37_d.dll ! PyEval_EvalCodeEx + 0x95 - ceval.c:3959
python37_d.dll ! PyEval_EvalCode + 0x2d - ceval.c:524
python37_d.dll ! run_mod + 0x69 - pythonrun.c:1035
python37_d.dll ! PyRun_FileExFlags + 0x111 - pythonrun.c:988
python37_d.dll ! PyRun_SimpleFileExFlags + 0x4df - pythonrun.c:429
embed1.exe ! [embed1.exe] + 0x11a2f - [unknown source file]

It is possible to have the python file/stack analysis, and how ? Thank you

I use vtune 2020-60519 with Visual Studio 2017 on Windows.

VTune counting cache hit/miss wrong?

$
0
0

Hi!

I am using VTune to measure the different levels of cache hits and misses (Load). I assumed L2_MISS = L3_HIT + L3_MISS (similarly for L1 and L2) but this does not seem to satisfy from the output below?

Config : Intel Core i3-5005u + Windows 10

CPU
    Name:    Intel(R) Core(TM) Processor code named Broadwell
    Frequency:    2.0 GHz
    Logical CPU Count:    4

Elapsed Time:    60.004s
    CPU Time:    25.576s
    CPI Rate:    1.641
    Total Thread Count:    4
    Paused Time:    0s

 

Hardware Events
    Hardware Event Type    Hardware Event Count    Hardware Event Sample Count    Events Per Sample
    BACLEARS.ANY    223,106,693    97    100003
    BR_MISP_RETIRED.ALL_BRANCHES_PS    64,401,449    7    400009
    CPU_CLK_UNHALTED.ONE_THREAD_ACTIVE    1,497,344,919    651    100003
    CPU_CLK_UNHALTED.REF_TSC    51,034,000,000    25,517    2000000
    CPU_CLK_UNHALTED.REF_XCLK    2,645,079,350    1,150    100003
    CPU_CLK_UNHALTED.THREAD    51,314,000,000    25,657    2000000
    CPU_CLK_UNHALTED.THREAD_P    47,242,070,863    1,027    2000003
    CYCLE_ACTIVITY.STALLS_L1D_MISS    13,616,020,424    296    2000003
    CYCLE_ACTIVITY.STALLS_L2_MISS    10,350,015,525    225    2000003
    CYCLE_ACTIVITY.STALLS_MEM_ANY    20,332,030,498    442    2000003
    CYCLE_ACTIVITY.STALLS_TOTAL    29,992,044,988    652    2000003
    INST_RETIRED.ANY    31,262,000,000    15,631    2000000
    INST_RETIRED.PREC_DIST    30,130,045,195    655    2000003
    INST_RETIRED.X87    0    0    2000003
    INT_MISC.RECOVERY_CYCLES    276,000,414    6    2000003
    ITLB_MISSES.STLB_HIT    50,601,518    22    100003
    ITLB_MISSES.WALK_COMPLETED    85,102,553    37    100003
    ITLB_MISSES.WALK_DURATION    2,884,286,526    1,254    100003
    L1D.REPLACEMENT    1,518,002,277    33    2000003
    L1D_PEND_MISS.FB_FULL    46,000,069    1    2000003
    L1D_PEND_MISS.PENDING    33,810,050,715    735    2000003
    L2_RQSTS.RFO_HIT    55,200,828    12    200003
    LD_BLOCKS.NO_SR    0    0    100003
    LD_BLOCKS.STORE_FORWARD    39,101,173    17    100003
    LD_BLOCKS_PARTIAL.ADDRESS_ALIAS    71,302,139    31    100003
    LSD.CYCLES_4_UOPS    138,000,207    3    2000003
    LSD.CYCLES_ACTIVE    92,000,138    2    2000003
    LSD.UOPS    506,000,759    11    2000003
    MACHINE_CLEARS.COUNT    2,300,069    1    100003
    MEM_LOAD_UOPS_L3_HIT_RETIRED.XSNP_HITM_PS    27,154,927    59    20011
    MEM_LOAD_UOPS_L3_HIT_RETIRED.XSNP_HIT_PS    10,585,819    23    20011
    MEM_LOAD_UOPS_L3_HIT_RETIRED.XSNP_MISS_PS    5,523,036    12    20011
    MEM_LOAD_UOPS_RETIRED.HIT_LFB_PS    565,816,974    246    100003
    MEM_LOAD_UOPS_RETIRED.L1_HIT_PS    6,716,010,074    146    2000003
    MEM_LOAD_UOPS_RETIRED.L1_MISS_PS    761,322,839    331    100003
    MEM_LOAD_UOPS_RETIRED.L2_HIT_PS    434,713,041    189    100003
    MEM_LOAD_UOPS_RETIRED.L2_MISS_PS    332,489,587    289    50021
    MEM_LOAD_UOPS_RETIRED.L3_HIT_PS    287,620,750    250    50021
    MEM_LOAD_UOPS_RETIRED.L3_MISS    9,200,644    4    100007
    MEM_LOAD_UOPS_RETIRED.L3_MISS_PS    6,900,483    3    100007
    MEM_UOPS_RETIRED.ALL_STORES_PS    5,888,008,832    128    2000003
    MEM_UOPS_RETIRED.LOCK_LOADS_PS    262,218,354    114    100007
    MEM_UOPS_RETIRED.SPLIT_LOADS_PS    4,600,138    2    100003
    MEM_UOPS_RETIRED.SPLIT_STORES_PS    0    0    100003
    MEM_UOPS_RETIRED.STLB_MISS_LOADS_PS    108,103,243    47    100003
    MEM_UOPS_RETIRED.STLB_MISS_STORES_PS    2,300,069    1    100003
 

Any help regarding this would be appreciated.

Thanks!

VTune backend crashes

$
0
0

VTune 2020 Profile crashes collecting the analysis..

Code compiled with VS2017 - 15.8.9

Do you have a  later version I could try

Problem signature:
  Problem Event Name:    BEX64
  Application Name:    vtune-backend.exe
  Application Version:    0.0.0.0
  Application Timestamp:    5ddce9e2
  Fault Module Name:    amplxe_msdia140.dll
  Fault Module Version:    14.10.25017.0
  Fault Module Timestamp:    58a64084
  Exception Offset:    000000000009e819
  Exception Code:    c0000417
  Exception Data:    0000000000000000
  OS Version:    6.1.7601.2.1.0.18.10
  Locale ID:    1033
  Additional Information 1:    2356
  Additional Information 2:    2356dca811460826fcbf797f7d9cab81
  Additional Information 3:    de98
  Additional Information 4:    de98d53fa8c257268087b149a2719e54

Read our privacy statement online:
  http://go.microsoft.com/fwlink/?linkid=104288&clcid=0x0409

If the online privacy statement is not available, please read our privacy statement offline:
  C:\Windows\system32\en-US\erofflps.txt
 

Issue with vtune drivers being group permissioned

$
0
0

Hi,

To facilitate usage of vtune among a couple of people, I have changed the group ownership of vtune drivers (/dev/{pax,socperf,sep5}) from vtune to some other group (mygroup). This was done via : insmod-sep -g mygroup when loading the driver

On RHEL 7.6 dev host, I'm having issue where if this `mygroup` is not the primary group of the user, then vtune-gui and vtune is unable to run properly and query the HW sampling drivers. 

This does not seem to be an issue on RHEL 6.x. 

straces seem inconclusive yet why vtune driver being group permissioned (perm 660 and ownership of root:mygroup) and user having group membership (primary_group, mygroup) does't work. 

If I make the /dev/{pax,socperf,sep5} be root:primary_group, then it works.But making it root:mygroup fails on RHEL 7.x only

Any reason why? Something to do with sssd settings? Why is vtune not able to work with such a scenario? Any help please?

 

Thanks,


Recursive Filter In by Selection

$
0
0

Hi,

When profiling my code with VTune, I often need to find out when a given function gets executed (start elapsed time -> end elapsed time). To do so, I usually do the following:

  1. I go to the top-down tree view.
  2. I select a range in the elapsed time graph at the bottom, and "Filter In by Selection"
  3. I observe the new tree view, and check if the CPU time for that function is close to 100%. If it is not, I need to move my range around -> go back to step 2 and iterate.

This is a rather tedious process. In the top-down tree, there is an option "Filter In by Selection", which almost does what I want. It shows the function in the elapsed time graph, but it does not show its callees.

Basically, what I am looking for is an option in the top-down tree that would be "Filter In by Selection (Recursive)". Is there anything like that in VTune?

Thank you for your help,

Joachim

./insmod-sep: line 261: socwatch_exists: command not found

$
0
0

Hi,
i have installed vtune 2020 on RHEL 7.6  , and while checking for sep driver load status i get following error - 

[root@node1 ]# source /home/user/I2020u0/parallel_studio_xe_2020.0.088/psxevars.sh intel64
Intel(R) Parallel Studio XE 2020 for Linux*
Copyright (C) 2009-2019 Intel Corporation. All rights reserved.
[root@node1 ]# cd  /home/user/I2020u0/vtune_profiler_2020.0.0.605129/sepdk/src/
[root@node1 ]# ./insmod-sep -q
pax driver is loaded and owned by group "vtune" with file permissions "660".
socperf3 driver is loaded and owned by group "vtune" with file permissions "660".
sep5 driver is loaded and owned by group "vtune" with file permissions "660".
socwatch driver is not correctly loaded.
./insmod-sep: line 261: socwatch_exists: command not found
vtsspp driver is loaded and owned by group "vtune" with file permissions "660".
./insmod-sep: line 268: [: too many arguments
[root@node1 ]#

I also have intel 2019 u5 , and it the same command (insmod-sep) worked fine on this system.

Please let me know if more information is required from my end to fix this.

 

 

Assertion 'Cannot write magic record to trace' failed.

$
0
0

Hi,
I am trying vtune 2020u0 on rhel 7.6 in intel 8280.
For testing the setup i used APS, and it ran fine and generated the results without any issue.

Then i tried out the hpc-performance analysis on using amplxe-cl command as - 

time mpirun -np $SLURM_NPROCS -ppn $SLURM_NTASKS_PER_NODE amplxe-cl -collect hpc-performance -data-limit 0 -result-dir result_hpcperf -- ${INSTALL_ROOT}/wrf.exe

 the run has finished , but is seems that the data gathering command has experienced some issue - 

WRF: SUCCESS COMPLETE wrf
vcs/collectunits1/tmu/src/tmu.c:437 write_trace: Assertion 'Cannot write magic record to trace' failed.

Abort trap signal
Image              PC                Routine            Line        Source
wrf.exe            00000000030C8DDB  for__signal_handl     Unknown  Unknown
libpthread-2.17.s  00002AAAACFB25D0  Unknown               Unknown  Unknown
libc-2.17.so       00002AAAAD8E7207  gsignal               Unknown  Unknown
libc-2.17.so       00002AAAAD8E88F8  abort                 Unknown  Unknown
libittnotify_coll  00002AAAAACE2D88  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACE33AB  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACE34F7  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACE4C9B  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACD7B81  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACD79A3  Unknown               Unknown  Unknown
libittnotify_coll  00002AAAAACD77F6  Unknown               Unknown  Unknown
ld-2.17.so         00002AAAAAABAFCA  Unknown               Unknown  Unknown
libc-2.17.so       00002AAAAD8EAB69  Unknown               Unknown  Unknown
libc-2.17.so       00002AAAAD8EABB7  Unknown               Unknown  Unknown
libc-2.17.so       00002AAAAD8D33DC  __libc_start_main     Unknown  Unknown
wrf.exe            0000000000415169  Unknown               Unknown  Unknown

 

around 11 hours have elapsed and i still see amplxe-cl process running (top command)
 

   PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND
 42453 root      20   0  626072  53584  32180 S   6.2  0.0   0:29.60 amplxe-cl
111683 root      20   0  164668   2616   1556 R   6.2  0.0   0:00.01 top
     1 root      20   0   56060   8328   2620 S   0.0  0.0   1:04.08 systemd
     2 root      20   0       0      0      0 S   0.0  0.0   0:00.12 kthreadd
     3 root      20   0       0      0      0 S   0.0  0.0   0:00.06 ksoftirqd/0

 

and i can see a 17mb directory was also created (result_hpcperf.node1).
I am not sure if i will face issues with other collection/analysis types
Though i tried out the amplxe self checker script, and log seems to indicate that the setup is fine.

Please let me know if i can provide more information from my end to fix this issue. 

VTune Profiler 2020 : Profiling Remote Target Inside Docker With Host and Remote system as Windows

$
0
0

Objective : To profile remote running process using Hardware Event Based Sampling Hotspot analysis type inside a docker with windows image (windowsserver:ltsc2019) from Windows Host.

Version of vtune tool used : Vtune 2020 update1

Version of Docker Engine :  19.03.8 (Docker Desktop)

Host OS : Version 1909 Windows Pro

I have done the following successfully : 

1) Passwordless SSH connection to the remote system (docker) using empty password config (not through key based)

2) Installed VTune standalone profiler with command line support in the remote target system. This includes the installation of the driver also.

Problem : I get the following error 

C:\Users\hariv\.ssh>vtune --target-system=ssh:User03@localhost[:2222] -collect hotspots -knob collection-type:hw-events -- /matrix/matrix.exe
vtune: Using target: ssh:User03@localhost[:2222]
vtune: Error: Please, check that the command '/tmp/vtune_profiler_2020.1.0.607630/bin32/amplxe-runss -V' is run successfully on the target.
vtune: Error: VTune cannot detect remote machine configuration.
vtune: Error: Please, check that the command '/tmp/vtune_profiler_2020.1.0.607630/bin32/amplxe-runss -V' is run successfully on the target.
vtune: Error: VTune cannot detect remote machine configuration.

 

The amplxe-runss works fine when i try to run in the target system but it is present in a different directory inside /bin64

Attached files : 1) Included the screenshot of the error

2) Dockerfile used to build the windows image which includes SSH configuration and VTune installation.

3) The script to install the SSH in the remote target system

I am unable to change the default directory even by using the flag -target-install-dir . I am unable to find a solution after this to make my remote profiling work here. Any sort of Help is appreciated. Thank you.

AttachmentSize
Downloadimage/pngCapture.PNG19.35 KB
Downloadapplication/rarDockerfile.rar426 bytes

Mac VTune downloads lead to a 404 page

$
0
0

Hi,

Is there a known problem with Mac downloads at the moment?  I've been trying the download the Mac interface for Vtune and all of the versions I've tried lead to a 404 page rather than a download.  I was able to download the Linux version successfully.

Thanks

Viewing all 1347 articles
Browse latest View live