[x265] Size vs. speed

chen chenm003 at 163.com
Wed Jun 3 23:34:32 CEST 2015


My goal is cover most of function, after that, we can evalerate size
and speed to find a balance point in every asm function.
 
Thanks,
Min

At 2015-06-04 05:23:22,dave <dtyx265 at gmail.com> wrote:
>Steve and Min or anyone else with an opinion,
>
>For many of the primitives that I have submitted I unrolled loops with 
>%rep to improve performance but for nested loops I only unrolled the 
>inner loop leaving the outer loop intact.  For my last submission, 
>interp_4tap_horiz_pp/s, it is entirely unrolled with %rep, as is the 
>sse4 version, though in a different way.  This probably generates 
>considerably larger executables, especially for the larger sizes.  Is 
>there any preference on this?  Are x265's goals purely performance 
>related over memory usage?
>
>thanks,
>Dave
>_______________________________________________
>x265-devel mailing list
>x265-devel at videolan.org
>https://mailman.videolan.org/listinfo/x265-devel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20150604/1653bd04/attachment.html>


More information about the x265-devel mailing list