[x265] [PATCH] asm: 10bpp code of sse_ss for 12x16, 24x32, 48x64 and 64xN blocks

chen chenm003 at 163.com
Mon Dec 2 14:45:04 CET 2013


>+%macro SSD_TWO 2
>+cglobal pixel_ssd_ss_%1x%2, 4,7,6
>+    FIX_STRIDES r1, r3
>+    pxor    m0,  m0
>+    mov     r4d, %2/2
>+    lea     r5,  [r1 * 2]
>+    lea     r6,  [r3 * 2]
>+.loop
>+    mova    m1,  [r0]
>+    mova    m2,  [r0 + 16]
>+    mova    m3,  [r0 + 32]
>+    mova    m4,  [r0 + 48]
>+    psubw   m1,  [r2]
>+    psubw   m2,  [r2 + 16]
>+    psubw   m3,  [r2 + 32]
>+    psubw   m4,  [r2 + 48]
This style need memory is alignment, please check and confirm it in our encoder.
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20131202/c59f472a/attachment.html>


More information about the x265-devel mailing list