I'm running sysbench on a Power7 server - running the latest RHEL 6.3 release (with the CPU utilization fix installed).
When I first ran the test
sysbench --test=cpu --cpu-max-prime=200000 run
it took quite a while to finish, but then I realized it's only running a single thread. 790 seconds.
So I specified 16 threads (I'm running on a 16-core Power7 server), and it obviously went much faster - 50 seconds..
sysbench --num-threads=16 --test=cpu --cpu-max-prime=200000 run
I would like to see if I can optimize and tune the code to run even faster on Power7. Is there a standard approach for understanding what this code is doing? Is the code even optimized for Power systems? Any ideas?