[x265] [PATCH] asm: 16bpp assembly code for intra_pred_ang16 - all modes
chen
chenm003 at 163.com
Tue Feb 25 16:57:34 CET 2014
>+cglobal intra_pred_ang16_33, 4,7,8
>+ xor r6d, r6d
>+ inc r6d
>+ xchg r2, r3
>+ lea r3, [ang_table + 16 * 16]
You don't need the swapped value in r3 (the next 'lea' overwrites it), so 'mov r2, r3' is faster than 'xchg r2, r3'.
>+ add r1, r1
>+ lea r4, [r1 * 3]
>+
>+ call ang16_mode_3_33
Moving ang16_mode_3_33 next to this function would give better instruction cache performance.
>+ lea r2, [r2 + 16]
>+ lea r0, [r0 + 16]
>+
>+ call ang16_mode_3_33
>+
>+ RET
>+