[x265] [PATCH] asm code for blockcopy_ps_16x4
chen
chenm003 at 163.com
Mon Nov 11 11:36:49 CET 2013
>+movu m1, [r2]
>+punpcklbw m2, m1, m0
Here have a hide register copy, try to avoid it by SSE4.1 "pmovzxbw m2, m1"
>+movu [r0], m2
>+punpckhbw m1, m0
>+movu [r0 + 16], m1
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20131111/751bc458/attachment-0001.html>
More information about the x265-devel
mailing list