Quantcast
Channel: Intel® VTune™ Profiler (Intel® VTune™ Amplifier)
Viewing all articles
Browse latest Browse all 1347

VTune GPU Hotspot (processor graphics HW events) and OpenCL case studies? sample code?

$
0
0

 

Hi, 

I am teaching a Master's class this quarter at Univ. of Washington EE dept on Applied GPU computing.

We are using Intel OpenCL SDK, VTune, running on Skylake (i5-6500T CPU w/ HD Graphics 530).

I have been receiving excellent help from the Intel OpenCL group (Jeffrey Mcallister, Robert Ioffe, Ben Ashaugh, Michael Chu, and others) on lots of OpenCL details.  

I am planning to use VTune to provide additional in-depth analysis of OpenCL kernel performance and optimization, but I was hoping to find some tutorials and/or samples specifically showing GPU Hotspots and OpenCL capabilities (We're using the latest VTune Amplifier XE 2017).  I've read Julia Fedorova's VTune article from 2013 on "Getting Started with OpenCL...", but that is now several years old (used Intel Iris Graphics), and doesn't really provide a tutorial, just a good discussion.

Specifically, I'd like to use VTune to conduct STREAM-like [McCalpin, 1995] studies of memory performance/bandwidth and cache utilization, in order to create Roofline-type performance model analysis plots of compute-/memory-bound kernel behaviors for various arithmetic intensities.

So two initial questions:

1) Are there any other tutorials/code samples/reference specifications for GPU Hotspot analysis types, with Processor Graphics Hardware Events (Compute Basic, Compute Extended, Full Compute (?)), and OpenCL tracing? Perhaps a real world case-study?

2) I need to understand better the metrics available for the various blocks in the Graphics/Architecture plot (Sampler (L1,L2), L3 (SLM), GTI, LLC, DRAM), and how these map to OpenCL memory-model objects (device local memory, device global memory, device constant memory).

Thank you, Colin

 

 

 


Viewing all articles
Browse latest Browse all 1347

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>