xenonite
I don't see any vectorizing or parallelizing hints to the compiler in the source code
... /Qipo
Parallelization is cheating.
We're already in a heavily multi-threaded environment, and we're not interested in single-threaded performance.
Have you ever run a performance profiler on the complete SVP program?
A very long time ago; I don't remember the actual numbers.