20 Mar
2011
20 Mar
'11
13:34
On 03/19/2011 02:21 PM, Per Jessen wrote:
I'm trying to optimize some assembler code for which I'm measuring runtime over a <slice of data>. Is there any easy (in C or assembler ) access to counters for pipeline stalls and cache misses available? At some point I'd also like to look at context switches, although I'm not sure how I might affect those.
Valgrind can give you some of what you want. I've also used Intel's Vtune. The Linux Trace Toolkit may also be useful. But, I think for things like pipeline stalls you would need one of the profiling tools. -- Jerry Feldman <gaf@blu.org> Boston Linux and Unix PGP key id: 537C5846 PGP Key fingerprint: 3D1B 8377 A3C0 A5F2 ECBB CA3B 4607 4319 537C 5846