[x265] Performance hit with revision 341049cdcc5c

chen chenm003 at 163.com
Thu Aug 29 14:38:26 CEST 2013


It seems to be the interpolation part. We want to remove the 16 pre-calculated interpolation buffers and replace them with just-in-time interpolation. The primitives work for this is lagging behind; we will improve it soon.

At 2013-08-29 20:15:19,"Nikos Barkas" <nikbar2004 at yahoo.com> wrote:

I note a small but measurable performance hit with revision 341049cdcc5c on Windows, compared to earlier revisions. This is on a MinGW build.


My standard encode test run normally takes approximately 57 seconds to complete and hits 1.74-1.75 fps. This revision takes approximately 64.4 seconds, with performance at 1.55 fps.
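
For reference, the figures above work out to roughly a 13% longer run time and roughly an 11% lower frame rate. A minimal sketch of that arithmetic follows; the helper names are hypothetical, and 1.745 fps is assumed as the midpoint of the reported 1.74-1.75 range.

```python
# Relative change between the two runs reported above.
# Assumption: 1.745 fps is used as the midpoint of the "1.74-1.75 fps" range.

def percent_increase(before, after):
    """Percentage by which `after` exceeds `before`."""
    return (after - before) / before * 100.0

def percent_decrease(before, after):
    """Percentage by which `after` falls short of `before`."""
    return (before - after) / before * 100.0

time_pct = percent_increase(57.0, 64.4)   # wall-clock seconds
fps_pct = percent_decrease(1.745, 1.55)   # frames per second

print(f"run time increase: {time_pct:.1f}%")
print(f"frame rate decrease: {fps_pct:.1f}%")
```

Both numbers describe the same slowdown: the run time is divided by the old value and the frame rate drop by the old (higher) value, so the two percentages differ slightly.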


Looking at the commit, it seems a blockcpy call was replaced with considerably more complicated logic. Some performance impact is probably to be expected, especially if the method that uses this logic runs continuously, but I thought I'd bring it up because it looks like we have a bottleneck here, and it could be a good idea to consider options for optimizing this part.


Best regards
Nikos