RDRAND (R32) - Throughput and Uops
With 1 independent instruction
With unroll_count=10 and no inner loop
Code:
0: 41 0f c7 f0 rdrand r8d
Show nanoBench command
Results:
Instructions retired: 1.0
Core cycles: 1548.8
Reference cycles: 513.05
UOPS_EXECUTED.THREAD: 56.35
RETIRE_SLOTS: 54.12
UOPS_MITE: -0.16
UOPS_MS: 62.2
UOPS_PORT_0: 9.7
UOPS_PORT_1: 18.42
UOPS_PORT_5: 9.9
UOPS_PORT_6: 17.4
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.0
With unroll_count=10, no inner loop, and 1 NOP
Code:
0: 41 0f c7 f0 rdrand r8d 4: 90 nop
Show nanoBench command
Results:
Instructions retired: 2.0
Core cycles: 1548.38
Reference cycles: 513.05
UOPS_EXECUTED.THREAD: 56.45
RETIRE_SLOTS: 54.8
UOPS_MITE: 1.0
UOPS_MS: 62.2
UOPS_PORT_0: 10.1
UOPS_PORT_1: 18.0
UOPS_PORT_5: 10.0
UOPS_PORT_6: 17.28
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0
With loop_count=10 and unroll_count=1
Code:
0: 41 0f c7 f0 rdrand r8d
Show nanoBench command
Results:
Instructions retired: 3.0
Core cycles: 1548.83
Reference cycles: 513.57
UOPS_EXECUTED.THREAD: 58.4
RETIRE_SLOTS: 55.75
UOPS_MITE: 1.87
UOPS_MS: 62.2
UOPS_PORT_0: 9.72
UOPS_PORT_1: 18.47
UOPS_PORT_5: 10.7
UOPS_PORT_6: 18.28
UOPS_PORT_23: 0.0
UOPS_PORT_49: 0.0
UOPS_PORT_78: 1.0
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0