Skip to content

Instantly share code, notes, and snippets.

@Prototyped
Created January 4, 2022 22:23
Show Gist options
  • Select an option

  • Save Prototyped/c4acf157fb1067a99fb42aba52198eed to your computer and use it in GitHub Desktop.

Select an option

Save Prototyped/c4acf157fb1067a99fb42aba52198eed to your computer and use it in GitHub Desktop.
$ time ./stream_haswell.bin
-------------------------------------------------------------
STREAM version $Revision: 5.10 $
-------------------------------------------------------------
This system uses 8 bytes per array element.
-------------------------------------------------------------
Array size = 269000000 (elements), Offset = 0 (elements)
Memory per array = 2052.3 MiB (= 2.0 GiB).
Total memory required = 6156.9 MiB (= 6.0 GiB).
Each kernel will be executed 100 times.
The *best* time for each kernel (excluding the first iteration)
will be used to compute the reported bandwidth.
-------------------------------------------------------------
Number of Threads requested = 8
Number of Threads counted = 8
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 169420 microseconds.
(= 169420 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Best Rate MB/s Avg time Min time Max time
Copy: 18343.9 0.235524 0.234628 0.248754
Scale: 18313.0 0.236159 0.235025 0.246510
Add: 20642.9 0.313888 0.312747 0.333817
Triad: 20619.1 0.314565 0.313107 0.333907
-------------------------------------------------------------
Solution Validates: avg error less than 1.000000e-13 on all three arrays
-------------------------------------------------------------
./stream_haswell.bin 872.29s user 3.48s system 786% cpu 1:51.28 total
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment