[x264-devel] commit: x86 asm for high-bit-depth pixel metrics (Oskar Arvidsson )

git at videolan.org git at videolan.org
Fri Nov 19 23:50:12 CET 2010


x264 | branch: master | Oskar Arvidsson <oskar at irock.se> | Sat Oct 30 20:16:33 2010 +0200| [80174bfdd30da508576d318323af57b2cd821c5b] | committer: Jason Garrett-Glaser 

x86 asm for high-bit-depth pixel metrics
Overall speed change from these 6 asm patches: ~4.4x.
But there's still tons more asm to do -- patches welcome!

Breakdown from this patch:
~13x faster SAD than C.
~11.5x faster SATD than C (only MMX done).
~18.5x faster SA8D than C.
~19.2x faster hadamard_ac than C.
~8.3x faster SSD than C.
~12.4x faster VAR than C.
~3-4.2x faster intra SAD than C.
~7.9x faster intra SATD than C.

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=80174bfdd30da508576d318323af57b2cd821c5b
---

 Makefile               |    7 +-
 common/pixel.c         |  150 +++++++++--
 common/x86/pixel-a.asm |  698 ++++++++++++++++++++++++++++++++++++++----------
 common/x86/pixel.h     |   59 +++--
 common/x86/sad16-a.asm |  432 ++++++++++++++++++++++++++++++
 common/x86/x86util.asm |   26 ++
 6 files changed, 1175 insertions(+), 197 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=80174bfdd30da508576d318323af57b2cd821c5b


More information about the x264-devel mailing list