[x264-devel] [PATCH] Speedups of various sorts involving macroblock_encode and RDO.
Jason Garrett-Glaser
darkshikari at gmail.com
Fri Mar 14 16:10:58 CET 2008
The 4th speedup was simply changing CABAC copy (which is done on every
single RDO call, even the single-partition i4x4 RDO calls) from
memcpy() to a custom ASM function.
On 32-bit Windows, this cuts the clock cost from ~200 clocks to ~53
clocks. Compare this to a total cost of ~9000 clocks for
full-macroblock RDO, and I would assume 10-20 times less for
single-partition RDO.
Dark Shikari
On Fri, Mar 14, 2008 at 2:53 AM, Guillaume POIRIER <poirierg at gmail.com> wrote:
> Hello,
>
>
> On 3/13/08, Jason Garrett-Glaser <darkshikari at gmail.com> wrote:
> > I apologize, Gmail appears to have destroyed the formatting of my patch.
> >
> > The correct version can be found at http://pastebin.com/f55b55c85 .
>
> What speed-up did you measure regarding C cabac VS ASM cabac, and
> overall speed-up? With what encoding parameters?
>
> Guillaume
> --
> A soldier will fight long and hard for a bit of colored ribbon.
> -- Napoleon Bonaparte
> _______________________________________________
> x264-devel mailing list
> x264-devel at videolan.org
> http://mailman.videolan.org/listinfo/x264-devel
>
More information about the x264-devel
mailing list