[x265] [PATCH] Luma_hpp[16x16] avx2 asm code : improved 2307c->1610c

chen chenm003 at 163.com
Thu Nov 13 22:32:52 CET 2014

At 2014-11-13 13:39:58,aasaipriya at multicorewareinc.com wrote:
># HG changeset patch
># User Aasaipriya
># Date 1415856424 -19800
>#      Thu Nov 13 10:57:04 2014 +0530
># Node ID c6994599bf61ba8c5a0ede52e6062196b340dce4
># Parent  f0a17f2b4c22ff8aa05afb40e3360e5ac03590a6
>Luma_hpp[16x16] avx2 asm code : improved 2307c->1610c
>
>diff -r f0a17f2b4c22 -r c6994599bf61 source/common/x86/ipfilter8.asm
>--- a/source/common/x86/ipfilter8.asm	Wed Nov 12 15:57:27 2014 +0530
>+++ b/source/common/x86/ipfilter8.asm	Thu Nov 13 10:57:04 2014 +0530
>@@ -897,8 +897,7 @@
>     packuswb        m4, m4
>     vpermq          m4, m4, 11011000b
>     pshufd          xm4, xm4, 11011000b
>-    movq            [r2], xm4
>-    movhps          [r2 + 8], xm4
>+    movdqu          [r2],xm4
>     lea             r0, [r0 + r1]
>     lea             r2, [r2 + r3]
>     dec             r4d
Using movu instead of movdqu would be better here. Please also send the full patch.
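
For reference, the hunk above replaces two 8-byte stores (movq to [r2], movhps to [r2 + 8]) with one unaligned 16-byte store. The C sketch below is only an illustration of why the two forms leave the same bytes in memory; it is not code from x265. The function names store_split, store_single and check_stores_match are hypothetical, the intrinsics merely correspond to the instructions named in the comments (a compiler is free to emit something else), and the memcpy branch is a plain byte-level model for targets without SSE2. It also assumes that movu is the x86inc.asm alias for the unaligned vector move, which the patch itself does not confirm.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>
#if defined(__SSE2__)
#include <emmintrin.h>
#endif

/* Old sequence: low 8 bytes first, then high 8 bytes at offset 8. */
static void store_split(uint8_t *dst, const uint8_t src[16])
{
#if defined(__SSE2__)
    __m128i v = _mm_loadu_si128((const __m128i *)src);
    /* corresponds to: movq [dst], xmm */
    _mm_storel_epi64((__m128i *)dst, v);
    /* corresponds to: movhps [dst + 8], xmm */
    _mm_storeh_pi((__m64 *)(dst + 8), _mm_castsi128_ps(v));
#else
    /* portable model of the same two 8-byte stores */
    memcpy(dst, src, 8);
    memcpy(dst + 8, src + 8, 8);
#endif
}

/* New sequence: a single unaligned 16-byte store. */
static void store_single(uint8_t *dst, const uint8_t src[16])
{
#if defined(__SSE2__)
    __m128i v = _mm_loadu_si128((const __m128i *)src);
    /* corresponds to: movdqu [dst], xmm */
    _mm_storeu_si128((__m128i *)dst, v);
#else
    /* portable model of one 16-byte store */
    memcpy(dst, src, 16);
#endif
}

/* Hypothetical self-check: both variants must write identical bytes. */
static void check_stores_match(void)
{
    uint8_t src[16], a[16], b[16];
    int i;

    for (i = 0; i < 16; i++)
        src[i] = (uint8_t)(i * 7 + 1);

    /* different fill patterns, so an incomplete store would be visible */
    memset(a, 0x00, sizeof(a));
    memset(b, 0xFF, sizeof(b));

    store_split(a, src);
    store_single(b, src);

    assert(memcmp(a, b, sizeof(a)) == 0);
    assert(memcmp(a, src, sizeof(a)) == 0);
}
```

Calling check_stores_match() from a small main() is enough to exercise both variants. The sketch says nothing about cycle counts; the 2307c->1610c figure in the subject comes from the patch author's own measurement.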
 