[x264-devel] commit: x86 asm for high-bit-depth pixel metrics (Oskar Arvidsson )
git at videolan.org
git at videolan.org
Fri Nov 19 23:50:12 CET 2010
x264 | branch: master | Oskar Arvidsson <oskar at irock.se> | Sat Oct 30 20:16:33 2010 +0200| [80174bfdd30da508576d318323af57b2cd821c5b] | committer: Jason Garrett-Glaser
x86 asm for high-bit-depth pixel metrics
Overall speed change from these 6 asm patches: ~4.4x.
But there's still tons more asm to do -- patches welcome!
Breakdown from this patch:
~13x faster SAD than C.
~11.5x faster SATD than C (only MMX done).
~18.5x faster SA8D than C.
~19.2x faster hadamard_ac than C.
~8.3x faster SSD than C.
~12.4x faster VAR than C.
~3-4.2x faster intra SAD than C.
~7.9x faster intra SATD than C.
> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=80174bfdd30da508576d318323af57b2cd821c5b
---
Makefile | 7 +-
common/pixel.c | 150 +++++++++--
common/x86/pixel-a.asm | 698 ++++++++++++++++++++++++++++++++++++++----------
common/x86/pixel.h | 59 +++--
common/x86/sad16-a.asm | 432 ++++++++++++++++++++++++++++++
common/x86/x86util.asm | 26 ++
6 files changed, 1175 insertions(+), 197 deletions(-)
Diff: http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=80174bfdd30da508576d318323af57b2cd821c5b
More information about the x264-devel
mailing list