On 1/29/2024 12:56 AM, Bonita Montero wrote:
On 28.01.2024 at 20:18, Chris M. Thomasson wrote:
Try padding and aligning the blocks. iirc, std::vector works with
alignas. Actually, it's pretty nice.
I'm testing all 64 offsets. Whether logical offset zero ends up at
physical offset one in the cache line doesn't matter, since physical
offset zero would then be occupied by logical offset 63.
You don't want to straddle any cache lines. ...
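A minimal sketch of the padding and alignment idea discussed above, assuming 64-byte cache lines and C++17 or later (where std::vector's default allocator honors over-aligned types). The names `kCacheLine`, `Block`, `straddles_line` and `all_blocks_aligned` are hypothetical and not taken from the thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Assumption: 64-byte cache lines, as on the x86 CPUs named in this thread.
constexpr std::size_t kCacheLine = 64;

// Hypothetical block type, padded and aligned to exactly one cache line.
struct alignas(kCacheLine) Block {
    unsigned char bytes[kCacheLine];
};
static_assert(sizeof(Block) == kCacheLine, "one block per cache line");
static_assert(alignof(Block) == kCacheLine, "block starts on a line boundary");

// True if the byte range [p, p + size) crosses a cache-line boundary.
inline bool straddles_line(const void* p, std::size_t size) {
    const auto addr = reinterpret_cast<std::uintptr_t>(p);
    return size != 0 &&
           (addr / kCacheLine) != ((addr + size - 1) / kCacheLine);
}

// True if no element of the vector straddles a cache line.
inline bool all_blocks_aligned(const std::vector<Block>& v) {
    for (const Block& b : v) {
        if (straddles_line(&b, sizeof b)) return false;
    }
    return true;
}
```

With this layout, `std::vector<Block> v(16);` followed by `all_blocks_aligned(v)` should return true, while an access of 64 bytes starting at any non-zero offset within a line is reported as straddling.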
On my Windows 7950X Zen4 computer, scanning memory in both directions
runs at the same speed. On my Linux 3990X Zen2 computer, scanning forward
is 22% faster. On my small Linux PC, an HP EliteDesk Mini PC with a
Skylake Pentium G4400, scanning memory forward is about 38% faster.
My first guess would have been that the prefetchers between the memory
levels are equally effective in both directions. So I'd like to see some
results from you.
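For anyone who wants to reproduce the comparison, here is a rough sketch of a forward versus backward scan with a timing helper. It is not the benchmark used in the post; `scan_forward`, `scan_backward` and `time_scan` are hypothetical names. The sketch assumes the buffer is chosen much larger than the last-level cache so that the prefetchers, not the caches, dominate.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <vector>

// Sums the buffer from the lowest address to the highest.
inline std::uint64_t scan_forward(const std::vector<std::uint64_t>& buf) {
    std::uint64_t sum = 0;
    for (std::size_t i = 0; i < buf.size(); ++i) sum += buf[i];
    return sum;
}

// Sums the buffer from the highest address to the lowest.
inline std::uint64_t scan_backward(const std::vector<std::uint64_t>& buf) {
    std::uint64_t sum = 0;
    for (std::size_t i = buf.size(); i-- > 0;) sum += buf[i];
    return sum;
}

// Times a single scan and returns the elapsed seconds. The sum is handed
// back through `checksum` so the compiler cannot discard the loop.
template <typename Scan>
double time_scan(Scan scan, const std::vector<std::uint64_t>& buf,
                 std::uint64_t& checksum) {
    const auto t0 = std::chrono::steady_clock::now();
    checksum = scan(buf);
    const auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(t1 - t0).count();
}
```

Calling `time_scan(scan_forward, buf, sum)` and `time_scan(scan_backward, buf, sum)` on the same buffer gives two timings to compare. Results will vary with compiler flags, vectorization and the first-touch cost of freshly allocated pages, so it is worth running each direction several times and discarding the first pass.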