[x265] [PATCH 0 of 6 ] SAO SSE4 asm code for HIGH_BIT_DEPTH
Dnyaneshwar Gorade
dnyaneshwar at multicorewareinc.com
Mon Jun 22 17:26:02 CEST 2015
Okay. Will check IACA report and try pxor for m0 and buffer 1023.
On Mon, Jun 22, 2015 at 8:24 PM, chen <chenm003 at 163.com> wrote:
> right
>
> some comment:
> 'psignb X, [pb_128]' equal to 'psubb X, 0, X', in AVX2, second type
> faster, in SSE4, choice depends on IACA report
>
> in PMINSW, you buffer ZERO into M0, and use pw_1023 directly, could you
> try buffer pw_1023 and use PXOR to get ZERO?
>
>
> At 2015-06-22 20:50:32,dnyaneshwar at multicorewareinc.com wrote:
> >SAO_EO_0 8.97x 974.03 8740.81
> >SAO_EO_1 10.18x 492.67 5017.42
> >SAO_EO_1_2Rows 11.21x 900.82 10095.86
> >SAO_EO_2[0] 6.27x 207.22 1298.92
> >SAO_EO_2[1] 8.92x 555.20 4949.69
> >SAO_EO_3[0] 4.97x 236.72 1177.29
> >SAO_EO_3[1] 8.67x 551.14 4778.67
> >SAO_BO_0 7.50x
> 23127.89 173346.66
> >_______________________________________________
> >x265-devel mailing list
> >x265-devel at videolan.org
> >https://mailman.videolan.org/listinfo/x265-devel
>
>
> _______________________________________________
> x265-devel mailing list
> x265-devel at videolan.org
> https://mailman.videolan.org/listinfo/x265-devel
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20150622/19533fc9/attachment-0001.html>
More information about the x265-devel
mailing list