[x265] Size vs. speed

dave dtyx265 at gmail.com
Wed Jun 3 23:23:22 CEST 2015


Steve and Min or anyone else with an opinion,

For many of the primitives I have submitted, I unrolled loops with 
%rep to improve performance, but for nested loops I unrolled only the 
inner loop and left the outer loop intact.  My last submission, 
interp_4tap_horiz_pp/s, is entirely unrolled with %rep, as is the 
sse4 version, though in a different way.  This probably generates 
considerably larger executables, especially for the larger block sizes.  
Is there any preference on this?  Do x265's goals put performance 
ahead of memory usage in every case?
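
To make the two styles concrete, here is a rough C sketch rather than 
the actual assembly.  The function names, the 4x4 block size, and the 
coefficients are all assumptions made up for the illustration; the 
macros only stand in for what %rep does in the asm source.

```c
#include <assert.h>
#include <stdint.h>

enum { W = 4, H = 4, TAPS = 4 };   /* hypothetical 4x4 block, 4-tap filter */

/* Example coefficients only (they sum to 64); not taken from x265's tables. */
static const int16_t coef[TAPS] = { -4, 36, 36, -4 };

/* One filtered output sample from four neighbouring input pixels. */
#define PIXEL(p) ((int16_t)((p)[0] * coef[0] + (p)[1] * coef[1] + \
                            (p)[2] * coef[2] + (p)[3] * coef[3]))

/* One row of W samples, written out by hand: the "inner loop unrolled" part. */
#define ROW(s, d) do {            \
        (d)[0] = PIXEL((s) + 0);  \
        (d)[1] = PIXEL((s) + 1);  \
        (d)[2] = PIXEL((s) + 2);  \
        (d)[3] = PIXEL((s) + 3);  \
    } while (0)

/* Style 1: inner loop unrolled, outer loop over rows left intact.
 * Code size stays roughly constant as the block height grows. */
static void filter_inner_unrolled(const uint8_t *src, int stride, int16_t *dst)
{
    assert(src && dst);
    for (int y = 0; y < H; y++)
        ROW(src + y * stride, dst + y * W);
}

/* Style 2: both loops unrolled.  No loop counter or branch remains,
 * but the emitted code grows with W * H. */
static void filter_fully_unrolled(const uint8_t *src, int stride, int16_t *dst)
{
    assert(src && dst);
    ROW(src + 0 * stride, dst + 0 * W);
    ROW(src + 1 * stride, dst + 1 * W);
    ROW(src + 2 * stride, dst + 2 * W);
    ROW(src + 3 * stride, dst + 3 * W);
}
```

Both functions produce the same output; each source row must have at 
least W + TAPS - 1 readable pixels.  The only difference is whether 
the row loop survives in the generated code, which is the size versus 
speed trade-off I am asking about.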

thanks,
Dave

