Intel seems stuck at 2P and HT still has massively lower latency but 80 GByte/s ...

erichocean · on Jan 22, 2014

Cool project! I wonder if you'd get similar perf from CPUs if you could used Intel's ISPC compiler[0] with the same GPU algorithms. I've found that GPU algorithms also perform substantially better on plain old CPUs, IMO because they use memory bandwidth more effectively.

I too would like to see how far those PCI Express busses can be pushed. :)

BTW We're adopting Intel's DPDK[1] approach to get massive packet processing performance on a single machine. So far we're liking it, but we'll see as it's not in production yet.

[0] http://ispc.github.io/ [1] http://www.intel.com/content/www/us/en/intelligent-systems/i...