[x265] [PATCH Review Only] asm: assembly code for pixel_sad_8x32
chen
chenm003 at 163.com
Wed Oct 30 16:03:33 CET 2013
>+ PROCESS_SAD_8x4
>+ PROCESS_SAD_8x4
>+
>+ movq m1, [r2]
>+ movq m2, [r2 + r3]
>+ lea r2, [r2 + 2 * r3]
>+ movq m3, [r0]
>+ movq m4, [r0 + r1]
>+ lea r0, [r0 + 2 * r1]
>+ punpcklqdq m1, m2
>+ punpcklqdq m3, m4
>+ psadbw m1, m3
>+ paddd m0, m1
>+ movq m1, [r2]
>+ movq m2, [r2 + r3]
>+ movq m3, [r0]
>+ movq m4, [r0 + r1]
>+ punpcklqdq m1, m2
>+ punpcklqdq m3, m4
>+ psadbw m1, m3
>+ paddd m0, m1
You only want to drop the two trailing LEA instructions, so don't duplicate the whole code block. Instead, pass a parameter to PROCESS_SAD_8x4 and control the LEAs with %if, like below:
PROCESS_SAD_8x4 1
PROCESS_SAD_8x4 1
PROCESS_SAD_8x4 0
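
A minimal sketch of what the parameterized macro could look like. It assumes the existing macro body matches the four-row sequence quoted above (the macro definition itself is not shown in this message) and that the x86inc register names m0-m4 and r0-r3 are used as in the patch. The parameter meaning (nonzero = advance the pointers at the end) is my choice for illustration:

```nasm
; Hypothetical parameterized form of PROCESS_SAD_8x4.
; %1 != 0 : advance r0/r2 past the last two rows (another call follows)
; %1 == 0 : skip the two trailing LEAs (final call, pointers not needed again)
%macro PROCESS_SAD_8x4 1
    ; rows 0-1: two 8-byte rows from each buffer, packed into one register
    movq        m1, [r2]
    movq        m2, [r2 + r3]
    lea         r2, [r2 + 2 * r3]
    movq        m3, [r0]
    movq        m4, [r0 + r1]
    lea         r0, [r0 + 2 * r1]
    punpcklqdq  m1, m2
    punpcklqdq  m3, m4
    psadbw      m1, m3
    paddd       m0, m1              ; accumulate partial SADs in m0

    ; rows 2-3: same pattern
    movq        m1, [r2]
    movq        m2, [r2 + r3]
    movq        m3, [r0]
    movq        m4, [r0 + r1]
%if %1
    ; only needed when another group of rows will be processed afterwards
    lea         r2, [r2 + 2 * r3]
    lea         r0, [r0 + 2 * r1]
%endif
    punpcklqdq  m1, m2
    punpcklqdq  m3, m4
    psadbw      m1, m3
    paddd       m0, m1
%endmacro
```

With this form, every call except the last passes 1 and the last passes 0, so the function body stays a plain list of macro invocations and the assembled output is the same as the hand-expanded version in the patch.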