RDTSCP - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: 0f 01 f9 rdtscp
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 32.25
Reference cycles: 28.55
UOPS_EXECUTED.THREAD: 20.0
RETIRE_SLOTS: 20.0
UOPS_MITE: 0.0
UOPS_MS: 20.0
UOPS_PORT_0: 4.0
UOPS_PORT_1: 6.0
UOPS_PORT_2: 0.0
UOPS_PORT_3: 0.0
UOPS_PORT_4: 0.0
UOPS_PORT_5: 2.0
UOPS_PORT_6: 8.0
UOPS_PORT_7: 0.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.0
With unroll_count=500, no inner loop, and 1 NOP
Code:
0: 0f 01 f9 rdtscp 3: 90 nop
Show nanoBench command
Results:
Instructions retired: 2.0
Core cycles: 32.06
Reference cycles: 28.46
UOPS_EXECUTED.THREAD: 20.0
RETIRE_SLOTS: 21.0
UOPS_MITE: 1.0
UOPS_MS: 20.0
UOPS_PORT_0: 4.0
UOPS_PORT_1: 6.0
UOPS_PORT_2: 0.0
UOPS_PORT_3: 0.0
UOPS_PORT_4: 0.0
UOPS_PORT_5: 2.75
UOPS_PORT_6: 7.25
UOPS_PORT_7: 0.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0
With loop_count=1000 and unroll_count=10
Code:
0: 0f 01 f9 rdtscp
Show nanoBench command
Results:
Instructions retired: 1.2
Core cycles: 32.06
Reference cycles: 28.49
UOPS_EXECUTED.THREAD: 20.1
RETIRE_SLOTS: 20.1
UOPS_MITE: 0.1
UOPS_MS: 20.0
UOPS_PORT_0: 4.0
UOPS_PORT_1: 6.0
UOPS_PORT_2: 0.0
UOPS_PORT_3: 0.0
UOPS_PORT_4: 0.0
UOPS_PORT_5: 2.75
UOPS_PORT_6: 7.35
UOPS_PORT_7: 0.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.1
With loop_count=100 and unroll_count=100
Code:
0: 0f 01 f9 rdtscp
Show nanoBench command
Results:
Instructions retired: 1.02
Core cycles: 32.06
Reference cycles: 28.49
UOPS_EXECUTED.THREAD: 20.01
RETIRE_SLOTS: 20.01
UOPS_MITE: 0.01
UOPS_MS: 20.0
UOPS_PORT_0: 4.0
UOPS_PORT_1: 6.0
UOPS_PORT_2: 0.0
UOPS_PORT_3: 0.0
UOPS_PORT_4: 0.0
UOPS_PORT_5: 2.75
UOPS_PORT_6: 7.26
UOPS_PORT_7: 0.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.01