Created
January 4, 2022 22:23
-
-
Save Prototyped/c4acf157fb1067a99fb42aba52198eed to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| $ time ./stream_haswell.bin | |
| ------------------------------------------------------------- | |
| STREAM version $Revision: 5.10 $ | |
| ------------------------------------------------------------- | |
| This system uses 8 bytes per array element. | |
| ------------------------------------------------------------- | |
| Array size = 269000000 (elements), Offset = 0 (elements) | |
| Memory per array = 2052.3 MiB (= 2.0 GiB). | |
| Total memory required = 6156.9 MiB (= 6.0 GiB). | |
| Each kernel will be executed 100 times. | |
| The *best* time for each kernel (excluding the first iteration) | |
| will be used to compute the reported bandwidth. | |
| ------------------------------------------------------------- | |
| Number of Threads requested = 8 | |
| Number of Threads counted = 8 | |
| ------------------------------------------------------------- | |
| Your clock granularity/precision appears to be 1 microseconds. | |
| Each test below will take on the order of 169420 microseconds. | |
| (= 169420 clock ticks) | |
| Increase the size of the arrays if this shows that | |
| you are not getting at least 20 clock ticks per test. | |
| ------------------------------------------------------------- | |
| WARNING -- The above is only a rough guideline. | |
| For best results, please be sure you know the | |
| precision of your system timer. | |
| ------------------------------------------------------------- | |
| Function Best Rate MB/s Avg time Min time Max time | |
| Copy: 18343.9 0.235524 0.234628 0.248754 | |
| Scale: 18313.0 0.236159 0.235025 0.246510 | |
| Add: 20642.9 0.313888 0.312747 0.333817 | |
| Triad: 20619.1 0.314565 0.313107 0.333907 | |
| ------------------------------------------------------------- | |
| Solution Validates: avg error less than 1.000000e-13 on all three arrays | |
| ------------------------------------------------------------- | |
| ./stream_haswell.bin 872.29s user 3.48s system 786% cpu 1:51.28 total | |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment