https://www.youtube.com/watch?v=pHqcHzxx6I8
- A 2048 x 2048 matrix multiplication takes about 28 ms on 1 CPU core.
- The same operation on 1 GPU takes roughly 209 microseconds (
$\mu s$ ).- Question: This is a massive speedup! what specific kind of CPU and GPU were they using for this comparison?