<div dir="ltr"><div class="gmail_default" style="font-family:arial,helvetica,sans-serif;font-size:small">The real function name is sa8d_8x8_avx2, whereas the common function name is sa8d_8x8_avx2_avx2; that is why we got the proper call. The two are different.<br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Dec 1, 2015 at 9:08 PM, chen <span dir="ltr">&lt;<a href="mailto:chenm003@163.com" target="_blank">chenm003@163.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="line-height:1.7;color:#000000;font-size:14px;font-family:arial"><div><br></div><pre><div><div class="h5"><br>At 2015-12-01 18:58:43, <a href="mailto:dnyaneshwar@multicorewareinc.com" target="_blank">dnyaneshwar@multicorewareinc.com</a> wrote:
># HG changeset patch
># User Dnyaneshwar G &lt;<a href="mailto:dnyaneshwar@multicorewareinc.com" target="_blank">dnyaneshwar@multicorewareinc.com</a>&gt;
># Date 1448962785 -19800
># Tue Dec 01 15:09:45 2015 +0530
># Node ID f8b0ce4e9f4092a38d8095961825e734a34f112e
># Parent e2e507ffe752d6c193a219b242c433bdc55f39f7
>asm: SA8D avx2 asm code for main12
>
>sa8d[ 8x8] 4.70x 564.58 2652.82
>sa8d[ 8x16] 4.00x 1358.06 5429.52
>sa8d[16x16] 5.57x 2013.70 11212.47
>sa8d[16x32] 3.90x 5610.47 21883.35
>sa8d[32x32] 5.36x 8274.18 44361.61
>sa8d[32x64] 3.86x 23024.04 88901.80
>sa8d[64x64] 4.35x 45509.79 198165.11
>
>diff -r e2e507ffe752 -r f8b0ce4e9f40 source/common/x86/asm-primitives.cpp
>--- a/source/common/x86/asm-primitives.cpp Mon Nov 30 11:23:38 2015 +0530
>+++ b/source/common/x86/asm-primitives.cpp Tue Dec 01 15:09:45 2015 +0530
>@@ -1313,6 +1313,9 @@
> }
> if (cpuMask & X265_CPU_AVX2)
> {
>+#if X265_DEPTH == 12
>+ ASSIGN_SA8D(avx2);
>+#endif
> p.cu[BLOCK_4x4].intra_filter = PFX(intra_filter_4x4_avx2);
>
> // TODO: the planecopy_sp is really planecopy_SC now, must be fix it
>diff -r e2e507ffe752 -r f8b0ce4e9f40 source/common/x86/pixel-a.asm
>--- a/source/common/x86/pixel-a.asm Mon Nov 30 11:23:38 2015 +0530
>+++ b/source/common/x86/pixel-a.asm Tue Dec 01 15:09:45 2015 +0530
>@@ -6499,6 +6499,1357 @@
> %endif ; !ARCH_X86_64
> %endmacro ; SA8D
>
>+
>+%if ARCH_X86_64 == 1 && BIT_DEPTH == 12
>+INIT_YMM avx2
>+cglobal sa8d_8x8_avx2
<br></div></div>The real function name is sa8d_8x8_avx2_avx2; we are lucky that the call below uses the correct name.</pre></div><br>_______________________________________________<br>
x265-devel mailing list<br>
<a href="mailto:x265-devel@videolan.org">x265-devel@videolan.org</a><br>
<a href="https://mailman.videolan.org/listinfo/x265-devel" rel="noreferrer" target="_blank">https://mailman.videolan.org/listinfo/x265-devel</a><br>
<br></blockquote></div><br></div>