This seems to lack any sort of unrolling, so the speed will be much worse than it could be. If we just want lame optimizations, I would argue for intrinsics rather than ASM. No hard objections though. -- Rémi Denis-Courmont http://www.remlab.net/