[x265] [PATCH] asm: Optimized sad_64xN for better cache performance. Reduced lea instruction by half. Performance gain is average +5x w.r.t. previous asm code

chen chenm003 at 163.com
Thu Oct 31 14:19:23 CET 2013


Right.

The exception is pixel_sad_64x32: it loops only 2 times. I am not sure which is better there, looping 4 times or unrolling it completely.
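
To make the trade-off concrete, here is a rough C sketch, not the actual x265 assembly. The names sad_row64 and sad_64x32_blocked are hypothetical, and the rows_per_iter parameter is only an assumed stand-in for how many rows the straight-line asm body covers per loop pass: 16 corresponds to looping 2 times, 8 to looping 4 times, and 32 to a fully unrolled routine.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* SAD of one 64-pixel row. In the asm this would be a few SIMD
 * loads plus psadbw; plain C is used here for illustration only. */
static uint32_t sad_row64(const uint8_t *a, const uint8_t *b)
{
    uint32_t sum = 0;
    for (int x = 0; x < 64; x++)
        sum += (uint32_t)abs((int)a[x] - (int)b[x]);
    return sum;
}

/* Hypothetical helper: SAD of a 64x32 block where each outer-loop
 * pass covers rows_per_iter rows (must divide 32).
 *   rows_per_iter = 16 -> outer loop runs 2 times
 *   rows_per_iter =  8 -> outer loop runs 4 times
 *   rows_per_iter = 32 -> outer loop runs once (fully unrolled in asm) */
static uint32_t sad_64x32_blocked(const uint8_t *a, ptrdiff_t stride_a,
                                  const uint8_t *b, ptrdiff_t stride_b,
                                  int rows_per_iter)
{
    uint32_t sum = 0;
    for (int y = 0; y < 32; y += rows_per_iter) {
        /* In asm this inner loop would be written out as straight-line code. */
        for (int r = 0; r < rows_per_iter; r++)
            sum += sad_row64(a + (y + r) * stride_a, b + (y + r) * stride_b);
    }
    return sum;
}
```

All three settings give the same result; they differ only in code size and in how many loop branches are executed, so which one is faster would have to be measured.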

 