[x264-devel] [PATCH] zigzag SSE2
Axel Zeuner
axel.zeuner at gmx.de
Sun May 4 09:13:43 CEST 2008
Hello,
On Friday 02 May 2008 15:41:50 Holger Lubitz wrote:
> > - x264-timeasm.diff contains timeasm, a timing code to check the effects
> > of the changes made. The program is a hack, it does no checks and was
> > tested only on linux x86/x86-64 using gcc.
>
> ah, one more comment. you do the rdtsc after synchronizing with cpuid.
> you'll see more interesting results without, because the latency of the
> final writes is likely to be hidden. and if you want it synchronized
> you can have that without cpuid if you use rdtscp.
I want to see the latencies - it shows me when the results are ready,
otherwise only the decoding speed is measured if the code generates less
uOps/mOps than entries in the reorder buffers/schedulers exist.
But you are right, one should measure both times.
rdtscp does not exist on all athlons (before F-stepping?, I know it is not
available on E6 stepping).
Regards
Axel
More information about the x264-devel
mailing list