<div style="line-height:1.7;color:#000000;font-size:14px;font-family:arial"><div> </div><pre><br>At 2015-03-06 08:19:55,dtyx265@gmail.com wrote:
># HG changeset patch
># User David T Yuen <dtyx265@gmail.com>
># Date 1425588355 28800
># Node ID 3b2512cba57e1af6ba21c445122a2b5b2efd87dc
># Parent  39111ae2fd72824c60f827023bd6b55b92dc1c41
>asm: intrapred dc8 sse2 high bit
>
>This replaces the C code on processors that support SSE2 through SSSE3.
>The code is backported from the SSE4 high-bit-depth intrapred dc8.
>
>./test/TestBench --testbench intrapred | grep intra_dc_8x8
>intra_dc_8x8[f=0]   1.64x       437.56      719.58
>intra_dc_8x8[f=1]   1.55x       747.51      1157.56
>
>diff -r 39111ae2fd72 -r 3b2512cba57e source/common/x86/asm-primitives.cpp
>--- a/source/common/x86/asm-primitives.cpp     Thu Mar 05 12:21:23 2015 -0800
>+++ b/source/common/x86/asm-primitives.cpp     Thu Mar 05 12:45:55 2015 -0800
>@@ -869,6 +869,7 @@
>         ALL_LUMA_TU_S(transpose, transpose, sse2);

>         p.cu[BLOCK_4x4].intra_pred[DC_IDX] = x265_intra_pred_dc4_sse2;
>+        p.cu[BLOCK_8x8].intra_pred[DC_IDX] = x265_intra_pred_dc8_sse2;

>         p.cu[BLOCK_4x4].intra_pred[PLANAR_IDX] = x265_intra_pred_planar4_sse2;

>diff -r 39111ae2fd72 -r 3b2512cba57e source/common/x86/intrapred16.asm
>--- a/source/common/x86/intrapred16.asm        Thu Mar 05 12:21:23 2015 -0800
>+++ b/source/common/x86/intrapred16.asm        Thu Mar 05 12:45:55 2015 -0800
>@@ -161,6 +161,89 @@
> ;-----------------------------------------------------------------------------------
> ; void intra_pred_dc(pixel* dst, intptr_t dstStride, pixel* above, int, int filter)
> ;-----------------------------------------------------------------------------------
>+INIT_XMM sse2
>+cglobal intra_pred_dc8, 5, 8, 2
x86 compatibility issue</pre><pre><br>>+    movu            m0,            [r2 + 34]
>+    movu            m1,            [r2 + 2]
>+
>+    paddw           m0,            m1
>+    movhlps         m1,            m0
</pre></div>