[x265] Fwd: [PATCH] blockcopy_sp_8x2, optimized asm code

chen chenm003 at 163.com
Fri Nov 8 12:43:16 CET 2013


>+movh       [r0],       m0
>+movhps     [r0 + r1],  m0

>>change movh to movlps is better, movh+movhps is mixed float and integer path
Will movh+movhps cause any problem ? I thought movh will be faster.
In old CPU, the data across float and integer path need extra latency
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20131108/65e37f9e/attachment.html>


More information about the x265-devel mailing list