[cpu] remove branch prediction logic #678

stnolting · 2023-08-30T17:25:31Z

This PR removes the CPU front end 's "branch prediction" logic that was used to halt instruction fetch while a branch/jump instruction is in progress until the destination address is available (resulting in less bus traffic / congestion).

However, benchmarks show that this prediction actually lowers performance:

Exemplary CoreMark run with branch prediction:

NEORV32: Hardware Performance Monitors (low words only)
 > Active clock cycles:          2199671829
 > Retired instructions:         596184106
 > Retired compr. instructions:  349937868
 > Instr.-fetch wait cycles:     34008506
 > Instr.-issue wait cycles:     286243045
 > Multi-cycle ALU wait cycles:  99331050
 > Load operations:              108277848
 > Store operations:             28390960
 > Load/store wait cycles:       5984501
 > Unconditional jumps:          16292334
 > Conditional branches (all):   115064467
 > Conditional branches (taken): 58094389
 > Entered traps:                0
 > Illegal operations:           0

Exemplary CoreMark run without branch prediction:

NEORV32: Hardware Performance Monitors (low words only)
 > Active clock cycles:          2188425528 (faster!)
 > Retired instructions:         596184106
 > Retired compr. instructions:  349937868
 > Instr.-fetch wait cycles:     0
 > Instr.-issue wait cycles:     263816491
 > Multi-cycle ALU wait cycles:  99331050
 > Load operations:              108277848
 > Store operations:             28390960
 > Load/store wait cycles:       17164754
 > Unconditional jumps:          16292334
 > Conditional branches (all):   115064467
 > Conditional branches (taken): 58094389
 > Entered traps:                0
 > Illegal operations:           0

Adding caches results in the same speed up factor when the prediction logic is removed. Additionally, removing the prediction logic reduces core size and relaxes the critical path (= the branch taken / not taken logic).

[cpu] remove branch prediction logic

8f40d51

stnolting added HW hardware-related optimization Make things faster, smaller and more efficient labels Aug 30, 2023

stnolting self-assigned this Aug 30, 2023

[CHANGELOG] add v1.8.8.5

111c884

stnolting marked this pull request as ready for review August 30, 2023 19:04

stnolting merged commit 1df13f5 into main Aug 31, 2023
8 checks passed

stnolting deleted the front_end branch August 31, 2023 08:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[cpu] remove branch prediction logic #678

[cpu] remove branch prediction logic #678

stnolting commented Aug 30, 2023 •

edited

Loading

[cpu] remove branch prediction logic #678

[cpu] remove branch prediction logic #678

Conversation

stnolting commented Aug 30, 2023 • edited Loading

stnolting commented Aug 30, 2023 •

edited

Loading