[x265] [PATCH] asm: idct[8x8] sse2 12232.53 -> 3480 over c code, 3555 -> over intrinsic

dave dtyx265 at gmail.com
Tue Dec 9 01:42:51 CET 2014


On 12/08/2014 03:35 PM, chen wrote:
> This is an x64-only version, but you didn't check the environment.
I will add a check.
> And you use rsp without allocating space.
Sorry.  On amd64 there are 128 bytes of space beyond the end of the stack
(the red zone below rsp) that are free to use if the function makes no
calls, at least on Linux, and the patch does not use all 128 bytes.  It
looks like some of the other primitives also use this space.  Is there
another primitive that allocates space that I can use as an example?
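
For reference, here is a minimal sketch of explicit stack allocation in plain NASM syntax. It is not taken from x265 and does not use the project's own macros. The routine name scratch_copy16 is hypothetical, and the System V AMD64 calling convention (rdi = src, rsi = dst) is assumed:

```nasm
; Hypothetical leaf routine, not from x265.
; Assumes the System V AMD64 calling convention: rdi = src, rsi = dst.
; Assemble with: nasm -f elf64
global scratch_copy16

section .text
scratch_copy16:
    sub     rsp, 24            ; reserve scratch space explicitly instead of
                               ; writing below rsp; 16 bytes of data plus 8
                               ; so rsp is 16-byte aligned after the call
    movdqu  xmm0, [rdi]        ; load 16 bytes from src (may be unaligned)
    movdqa  [rsp], xmm0        ; spill to the reserved, aligned scratch slot
    movdqa  xmm1, [rsp]        ; reload from the scratch slot
    movdqu  [rsi], xmm1        ; store 16 bytes to dst (may be unaligned)
    add     rsp, 24            ; release the scratch space
    ret
```

Adjusting rsp this way does not depend on the red zone, which the Windows x64 calling convention does not provide. Note that the argument registers differ on Windows, so the sketch as written only applies to System V targets.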
> At 2014-12-09 07:09:04,dtyx265 at gmail.com wrote:
> ># HG changeset patch
> ># User David T Yuen <dtyx265 at gmail.com>
> ># Date 1418080004 28800
> ># Node ID c16530ca89e4f55c2c1fd17bd2cc4640589b096b
> ># Parent  53f7efef5ebda6d5ff03e868f2b702c385d72ddd
> >asm: idct[8x8] sse2 12232.53 -> 3480 over c code, 3555 -> over intrinsic
> >
