[x264-devel] [PATCH] zigzag SSE2
Axel Zeuner
axel.zeuner at gmx.de
Sun May 4 08:57:29 CEST 2008
Hello,
On Friday 02 May 2008 15:33:45 Holger Lubitz wrote:
> hi,
>
> > two patches against git HEAD are attached:
>
> hmm. it seems my patches haven't made it there yet.
Sorry, I did not know about it.
>
> i only did the scan_4x4_frame and 8x8_frame so far, but i think something
> similar can be done to the subs. 4x4 was somewhere below 10 cycles
> (dark_shikari measured 8 on core2, i had 6.something on amd64, pengvado
> saw 10 on opteron). the 8x8 was 40 cycles on my amd64, core2 differed
> with alignment due to cache split, but i think the range was 38-57.
> (4x4 was pure mmx, 8x8 used pshufw)
How these measurements were done? oprofile?
Regards,
Axel
More information about the x264-devel
mailing list