MOVSB - Throughput and Uops
With unroll_count=500 and no inner loop
Code:
0: a4 movs BYTE PTR es:[rdi],BYTE PTR ds:[rsi]
Results:
Instructions retired: 1.0
Core cycles: 6.0
Reference cycles: 6.01
UOPS_RETIRED.ANY: 4.0
UOPS_MS: 4.0
With loop_count=1000 and unroll_count=10
Code:
0: a4 movs BYTE PTR es:[rdi],BYTE PTR ds:[rsi]
Results:
Instructions retired: 1.2
Core cycles: 6.12
Reference cycles: 6.12
UOPS_RETIRED.ANY: 4.2
UOPS_MS: 4.0
With loop_count=100 and unroll_count=100
Code:
0: a4 movs BYTE PTR es:[rdi],BYTE PTR ds:[rsi]
Results:
Instructions retired: 1.02
Core cycles: 6.03
Reference cycles: 6.03
UOPS_RETIRED.ANY: 4.02
UOPS_MS: 4.0
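The small differences between the three configurations in "Instructions retired" and UOPS_RETIRED.ANY are consistent with a fixed per-iteration loop overhead being averaged over the unrolled copies. The sketch below is only an illustration of that arithmetic: the function name is hypothetical, and the overhead of 2 instructions and 2 uops per loop iteration is an assumption inferred from the numbers above (the page does not state it). It does not attempt to model the cycle counts.

```python
def per_instruction_count(base, unroll_count, loop_overhead):
    """Average counter value per unrolled copy of the instruction.

    base          -- counter value for one MOVSB without any loop (from the
                     unroll_count=500 run above)
    unroll_count  -- number of copies of the instruction in the loop body
    loop_overhead -- extra counts added once per loop iteration
                     (ASSUMED to be 2 here; not stated in the source)
    """
    return (base * unroll_count + loop_overhead) / unroll_count


ASSUMED_OVERHEAD = 2  # assumption: 2 instructions / 2 uops per iteration

for unroll in (10, 100):
    instr = per_instruction_count(1.0, unroll, ASSUMED_OVERHEAD)
    uops = per_instruction_count(4.0, unroll, ASSUMED_OVERHEAD)
    print(f"unroll_count={unroll}: instructions={instr:.2f}, uops={uops:.2f}")
```

Under that assumption the function reproduces the reported 1.2 / 4.2 (unroll_count=10) and 1.02 / 4.02 (unroll_count=100). UOPS_MS stays at 4.0 in every run, which would fit loop overhead that does not come from the microcode sequencer.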