[x264-devel] [PATCH] zigzag SSE2

Axel Zeuner axel.zeuner at gmx.de
Sun May 4 08:50:29 CEST 2008


Hello,

On Friday 02 May 2008 14:30:02 Guillaume Poirier wrote:
> Hello,
>
> Axel Zeuner wrote:
> > Hello,
> > two patches against git HEAD are attached:
> > - x264-zigzag-sse2.diff contains SSE2 implementations of the zigzag
> > functions. - x264-timeasm.diff contains timeasm, a timing code to check
> > the effects of the changes made. The program is a hack, it does no checks
> > and was tested only on linux x86/x86-64 using gcc.
> >
> > I would like to see results on other processors in 32-bit and 64-bit mode
> > before one may start discuss about inclusion of these functions into git.
> >
> > Two results as printed by timeasm follow:
>
> I tested your patch on 2 Core2 machines:
Thank you very much. I am glad to see that the latencies of all functions are 
less than or equal as the latencies of their MMX/MMXEXT/C counterparts also 
on Core2.

Regards,
Axel


More information about the x264-devel mailing list