[x265] Size vs. speed
dave
dtyx265 at gmail.com
Wed Jun 3 23:23:22 CEST 2015
Steve and Min or anyone else with an opinion,
For many of the primitives I have submitted, I unrolled loops with
%rep to improve performance, but for nested loops I unrolled only the
inner loop and left the outer loop intact. My last submission,
interp_4tap_horiz_pp/s, is entirely unrolled with %rep, as is the
sse4 version, though in a different way. This probably generates
considerably larger executables, especially for the larger block
sizes. Is there any preference on this? Do x265's goals put
performance strictly ahead of memory usage?
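
To make the two approaches concrete, here is a minimal C sketch of the
trade-off. It is not the submitted assembly: the block size (4x4), the
helper tap4(), the ROW macro, and both function names are hypothetical,
and rounding, shifting, and clipping are left out. The first function
keeps the outer (row) loop and unrolls only the inner one; the second
expands every row, which is roughly what wrapping the whole body in
%rep does, so the code grows with the block height.

```c
#include <stdint.h>

/* Hypothetical block size, chosen only for illustration. */
enum { W = 4, H = 4 };

/* Raw 4-tap filter sum for one output pixel (no rounding or clipping). */
static int tap4(const uint8_t *s, const int8_t *c)
{
    return c[0] * s[0] + c[1] * s[1] + c[2] * s[2] + c[3] * s[3];
}

/* Outer loop kept, inner loop unrolled: one copy of the row body. */
static void filter_inner_unrolled(const uint8_t *src, int stride,
                                  const int8_t *c, int *dst)
{
    for (int y = 0; y < H; y++) {
        const uint8_t *s = src + y * stride;
        int *d = dst + y * W;
        d[0] = tap4(s + 0, c);
        d[1] = tap4(s + 1, c);
        d[2] = tap4(s + 2, c);
        d[3] = tap4(s + 3, c);
    }
}

/* One fully expanded row; stands in for a single %rep iteration. */
#define ROW(y)                                                  \
    do {                                                        \
        dst[(y) * W + 0] = tap4(src + (y) * stride + 0, c);     \
        dst[(y) * W + 1] = tap4(src + (y) * stride + 1, c);     \
        dst[(y) * W + 2] = tap4(src + (y) * stride + 2, c);     \
        dst[(y) * W + 3] = tap4(src + (y) * stride + 3, c);     \
    } while (0)

/* Both loops unrolled: H copies of the row body, no loop counter. */
static void filter_fully_unrolled(const uint8_t *src, int stride,
                                  const int8_t *c, int *dst)
{
    ROW(0);
    ROW(1);
    ROW(2);
    ROW(3);
}
```

Both functions produce the same output for the same input. The
difference is only in how many copies of the row body end up in the
binary: one in the first case, H in the second.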
thanks,
Dave