20 Mar
2011
20 Mar
'11
13:34
On 03/19/2011 02:21 PM, Per Jessen wrote:
I'm trying to optimize some assembler code for which I'm measuring runtime over a <slice of data>. Is there any easy (in C or assembler ) access to counters for pipeline stalls and cache misses available? At some point I'd also like to look at context switches, although I'm not sure how I might affect those.
Valgrind can give you some of what you want. I've also used Intel's
Vtune. The Linux Trace Toolkit may also be useful. But, I think for
things like pipeline stalls you would need one of the profiling tools.
--
Jerry Feldman