[x265] [PATCH] asm: 16bpp assembly code for intra_pred_ang16 - all modes

chen chenm003 at 163.com
Tue Feb 25 16:57:34 CET 2014


>+cglobal intra_pred_ang16_33, 4,7,8
>+    xor         r6d,       r6d
>+    inc         r6d
>+    xchg        r2,        r3
>+    lea         r3,        [ang_table + 16 * 16]
You don't need the value that xchg leaves in r3 (the next lea overwrites r3), so 'mov r2, r3' is faster.
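A sketch of the suggested sequence, assuming nothing later in this function or in ang16_mode_3_33 reads the original r2 value:

    mov         r2,        r3                          ; copy only, old r2 is dropped
    lea         r3,        [ang_table + 16 * 16]       ; r3 is overwritten here anyway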
 
>+    add         r1,        r1
>+    lea         r4,        [r1 * 3]
>+
>+    call        ang16_mode_3_33
Moving ang16_mode_3_33 next to this function should give better instruction cache performance.


>+    lea         r2,        [r2 + 16]
>+    lea         r0,        [r0 + 16]
>+
>+    call        ang16_mode_3_33
>+
>+    RET
>+