[x265] Size vs. speed
chen
chenm003 at 163.com
Wed Jun 3 23:34:32 CEST 2015
My goal is to cover most of the functions first. After that, we can evaluate
size and speed to find a balance point for every asm function.
Thanks,
Min
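
For readers following the trade-off raised in the quoted message below, here
is a minimal C sketch of the same idea. It is not x265 code: the coefficients
and function names are hypothetical, chosen only for illustration. In the asm
primitives the unrolling is done with %rep, the NASM/yasm preprocessor
directive that repeats a block at assembly time, so each repetition is emitted
into the binary. The sketch keeps the outer (pixel) loop and unrolls only the
inner (tap) loop, which is the middle ground between a fully rolled and a
fully unrolled function.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 4-tap coefficients, for illustration only. */
static const int16_t kTaps[4] = { -4, 36, 36, -4 };

/* Fully rolled: pixel loop and tap loop both remain loops (smallest code).
 * src must hold at least width + 3 samples. */
static void filter_rolled(const uint8_t *src, int16_t *dst, size_t width)
{
    assert(src != NULL && dst != NULL);
    for (size_t x = 0; x < width; x++) {
        int sum = 0;
        for (int t = 0; t < 4; t++)
            sum += kTaps[t] * src[x + t];
        dst[x] = (int16_t)sum;
    }
}

/* Inner (tap) loop unrolled by hand, outer (pixel) loop left intact.
 * Fully unrolling the outer loop as well would repeat this body once per
 * pixel, so code size would grow with the block width. */
static void filter_inner_unrolled(const uint8_t *src, int16_t *dst, size_t width)
{
    assert(src != NULL && dst != NULL);
    for (size_t x = 0; x < width; x++) {
        dst[x] = (int16_t)(kTaps[0] * src[x]
                         + kTaps[1] * src[x + 1]
                         + kTaps[2] * src[x + 2]
                         + kTaps[3] * src[x + 3]);
    }
}

/* Returns 1 if both variants produce identical output on a small test row. */
static int variants_agree(void)
{
    enum { WIDTH = 8 };
    uint8_t src[WIDTH + 3];
    int16_t a[WIDTH], b[WIDTH];

    for (size_t i = 0; i < WIDTH + 3; i++)
        src[i] = (uint8_t)(i * 23 + 5);

    filter_rolled(src, a, WIDTH);
    filter_inner_unrolled(src, b, WIDTH);

    for (size_t i = 0; i < WIDTH; i++)
        if (a[i] != b[i])
            return 0;
    return 1;
}
```

Calling variants_agree() from a test harness checks that unrolling changes
only the code shape, not the result. Which variant is faster, and by how much,
has to be measured per function and per block size.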
At 2015-06-04 05:23:22,dave <dtyx265 at gmail.com> wrote:
>Steve and Min or anyone else with an opinion,
>
>For many of the primitives that I have submitted I unrolled loops with
>%rep to improve performance but for nested loops I only unrolled the
>inner loop leaving the outer loop intact. For my last submission,
>interp_4tap_horiz_pp/s, it is entirely unrolled with %rep, as is the
>sse4 version, though in a different way. This probably generates
>considerably larger executables, especially for the larger sizes. Is
>there any preference on this? Are x265's goals purely performance
>related over memory usage?
>
>thanks,
>Dave