RDRAND (R32) - Throughput and Uops
With 1 independent instruction
With unroll_count=10 and no inner loop
Code:
0: 41 0f c7 f0 rdrand r8d
Results:
Instructions retired: 1.0
Core cycles: 13825.83
Reference cycles: 8263.6
UOPS_EXECUTED.THREAD: 79.9
RETIRE_SLOTS: 64.6
UOPS_MITE: 0.0
UOPS_MS: 91.2
UOPS_PORT_0: 8.0
UOPS_PORT_1: 13.9
UOPS_PORT_5: 12.0
UOPS_PORT_6: 23.33
UOPS_PORT_23: 1.0
UOPS_PORT_49: 9.18
UOPS_PORT_78: 12.18
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 0.0
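As a quick sanity check on the counters above, the per-port values can be summed and compared with UOPS_EXECUTED.THREAD, and the core-to-reference cycle ratio can be computed. The following is a minimal sketch, not part of nanoBench: the helper names `parse_counters` and `port_uop_sum` are hypothetical, and it assumes the `UOPS_PORT_*` counters cover disjoint port groups so that adding them is meaningful.

```python
def parse_counters(text):
    """Parse 'NAME: value' lines into a dict mapping name -> float."""
    counters = {}
    for line in text.strip().splitlines():
        # Split on the last colon so the whole counter name is kept intact.
        name, _, value = line.rpartition(":")
        counters[name.strip()] = float(value)
    return counters


def port_uop_sum(counters):
    """Sum all per-port uop counters (assumed to be disjoint port groups)."""
    return sum(v for k, v in counters.items() if k.startswith("UOPS_PORT_"))


# Values copied from the unroll_count=10, no-inner-loop results above.
RESULTS = """
Core cycles: 13825.83
Reference cycles: 8263.6
UOPS_EXECUTED.THREAD: 79.9
UOPS_PORT_0: 8.0
UOPS_PORT_1: 13.9
UOPS_PORT_5: 12.0
UOPS_PORT_6: 23.33
UOPS_PORT_23: 1.0
UOPS_PORT_49: 9.18
UOPS_PORT_78: 12.18
"""

counters = parse_counters(RESULTS)
total_port_uops = port_uop_sum(counters)
cycle_ratio = counters["Core cycles"] / counters["Reference cycles"]

print(f"sum of port uops:       {total_port_uops:.2f}")
print(f"UOPS_EXECUTED.THREAD:   {counters['UOPS_EXECUTED.THREAD']:.2f}")
print(f"core/reference cycles:  {cycle_ratio:.4f}")
```

If the port sum lands close to UOPS_EXECUTED.THREAD, the per-port breakdown accounts for essentially all executed uops. The same helpers can be pointed at the other result blocks on this page.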
With unroll_count=10, no inner loop, and 1 NOP
Code:
0: 41 0f c7 f0 rdrand r8d
4: 90 nop
Results:
Instructions retired: 2.0
Core cycles: 13838.95
Reference cycles: 8270.9
UOPS_EXECUTED.THREAD: 79.85
RETIRE_SLOTS: 65.6
UOPS_MITE: 1.8
UOPS_MS: 91.2
UOPS_PORT_0: 8.0
UOPS_PORT_1: 13.9
UOPS_PORT_5: 12.0
UOPS_PORT_6: 23.28
UOPS_PORT_23: 1.0
UOPS_PORT_49: 9.3
UOPS_PORT_78: 12.3
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0
With loop_count=2 and unroll_count=1
Code:
0: 41 0f c7 f0 rdrand r8d
Results:
Instructions retired: 3.0
Core cycles: 13599.08
Reference cycles: 8127.33
UOPS_EXECUTED.THREAD: 80.5
RETIRE_SLOTS: 64.0
UOPS_MITE: 0.09
UOPS_MS: 92.0
UOPS_PORT_0: 8.0
UOPS_PORT_1: 14.0
UOPS_PORT_5: 12.0
UOPS_PORT_6: 23.75
UOPS_PORT_23: 1.0
UOPS_PORT_49: 9.25
UOPS_PORT_78: 12.25
DIV_CYCLES: 0.0
ILD_STALL.LCP: 0.0
UOPS_MITE>=1: 1.0