[x265] [PATCH] asm: avx2 code for sse_ss[16x16] for 10 bpp (1935 -> 1030)
Sumalatha Polureddy
sumalatha at multicorewareinc.com
Thu May 21 08:32:16 CEST 2015
please ignore this patch, will send new patch with changed commit message
Regards
Sumalatha
On Thu, May 21, 2015 at 11:58 AM, <sumalatha at multicorewareinc.com> wrote:
> # HG changeset patch
> # User Sumalatha Polureddy
> # Date 1432189730 -19800
> # Thu May 21 11:58:50 2015 +0530
> # Node ID f33e89178120b981ae3ec0e717a74bc608741722
> # Parent dc4fcfc574ade14ecc841797ad08be9753fad58e
> asm: avx2 code for sse_ss[16x16] for 10 bpp (1935 -> 1030)
>
> sse2:
> sse_ss[32x32] 4.71x 1935.46 9113.11
>
> avx2:
> sse_ss[32x32] 8.85x 1030.08 9112.05
>
> diff -r dc4fcfc574ad -r f33e89178120 source/common/x86/asm-primitives.cpp
> --- a/source/common/x86/asm-primitives.cpp Wed May 20 12:17:44 2015
> -0500
> +++ b/source/common/x86/asm-primitives.cpp Thu May 21 11:58:50 2015
> +0530
> @@ -1259,6 +1259,7 @@
>
> p.cu[BLOCK_32x32].ssd_s = x265_pixel_ssd_s_32_avx2;
> p.cu[BLOCK_16x16].sse_ss = x265_pixel_ssd_ss_16x16_avx2;
> + p.cu[BLOCK_32x32].sse_ss = x265_pixel_ssd_ss_32x32_avx2;
>
> p.quant = x265_quant_avx2;
> p.nquant = x265_nquant_avx2;
> diff -r dc4fcfc574ad -r f33e89178120 source/common/x86/ssd-a.asm
> --- a/source/common/x86/ssd-a.asm Wed May 20 12:17:44 2015 -0500
> +++ b/source/common/x86/ssd-a.asm Thu May 21 11:58:50 2015 +0530
> @@ -347,6 +347,7 @@
> INIT_YMM avx2
> SSD_ONE 16, 8
> SSD_ONE 16, 16
> +SSD_ONE 32, 32
> %endif ; HIGH_BIT_DEPTH
>
>
> ;-----------------------------------------------------------------------------
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20150521/ebee7c4f/attachment.html>
More information about the x265-devel
mailing list