[x264-devel] SSSE3/SSE4/AVX 9-way fully merged i8x8 analysis (sa8d_x9)

Loren Merritt git at videolan.org
Sat Oct 22 02:30:27 CEST 2011


x264 | branch: master | Loren Merritt <pengvado at akuvian.org> | Mon Oct 10 05:42:36 2011 +0000| [2f0384dcd68bb85f98fb566b70b863b40082c83e] | committer: Jason Garrett-Glaser

SSSE3/SSE4/AVX 9-way fully merged i8x8 analysis (sa8d_x9)
x86_64 only for now, due to register requirements (like sa8d_x3).
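
For readers unfamiliar with the metric, here is a plain-C sketch of what a 9-way SA8D comparison computes. It is an illustration only, not the assembly added by this commit. It assumes SA8D is the sum of absolute coefficients of an 8x8 Hadamard transform of the source-minus-prediction residual, normalized as (sum + 2) >> 2, and that the nine candidates are the nine i8x8 intra prediction modes. The names hadamard8, sa8d_8x8_ref and best_of_9, the flat preds layout, and the mode_cost array are hypothetical.

```c
#include <limits.h>
#include <stdint.h>
#include <stdlib.h>

/* In-place, unnormalized 8-point Hadamard transform over elements
 * spaced `stride` ints apart (three butterfly stages). */
static void hadamard8(int *v, int stride)
{
    for (int len = 1; len < 8; len <<= 1)
        for (int i = 0; i < 8; i += len << 1)
            for (int j = i; j < i + len; j++) {
                int a = v[j * stride];
                int b = v[(j + len) * stride];
                v[j * stride]         = a + b;
                v[(j + len) * stride] = a - b;
            }
}

/* SA8D-style cost of one 8x8 block.
 * Assumption: the result is normalized as (sum + 2) >> 2. */
static int sa8d_8x8_ref(const uint8_t *src, int src_stride,
                        const uint8_t *pred, int pred_stride)
{
    int d[64];
    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            d[y * 8 + x] = src[y * src_stride + x] - pred[y * pred_stride + x];

    for (int y = 0; y < 8; y++) hadamard8(d + y * 8, 1); /* rows */
    for (int x = 0; x < 8; x++) hadamard8(d + x, 8);     /* columns */

    int sum = 0;
    for (int i = 0; i < 64; i++)
        sum += abs(d[i]);
    return (sum + 2) >> 2;
}

/* Hypothetical driver: score nine candidate predictions (stored back to
 * back, 64 bytes each, stride 8) and return the index of the cheapest.
 * mode_cost[m] is an assumed per-mode signalling cost added to each score. */
static int best_of_9(const uint8_t *src, int src_stride,
                     const uint8_t *preds, const int *mode_cost,
                     int *best_cost)
{
    int best_mode = 0;
    int best = INT_MAX;
    for (int m = 0; m < 9; m++) {
        int cost = sa8d_8x8_ref(src, src_stride, preds + m * 64, 8) + mode_cost[m];
        if (cost < best) {
            best = cost;
            best_mode = m;
        }
    }
    *best_cost = best;
    return best_mode;
}
```

The scalar version transforms each candidate separately. A merged SIMD routine can presumably share work across the nine candidates, which is consistent with the cycle counts reported in this message.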

i8x8 analysis cycles (per partition):
penryn    sandybridge  bulldozer
616->600  482->374     418->356   preset=faster
892->632  725->387     598->373   preset=medium
948->650  789->409     673->383   preset=slower
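
To compare the columns, it may help to turn each entry into a relative saving. The helper below is a small sketch that reads an entry such as 725->387 as cycles before -> cycles after this commit (an interpretation of the table, not something the message states explicitly). The name percent_saved is hypothetical.

```c
/* Percentage of cycles saved when a cost drops from `before` to `after`.
 * Assumes before > 0. */
static double percent_saved(int before, int after)
{
    return 100.0 * (before - after) / before;
}
```

For example, percent_saved(725, 387) for sandybridge at preset=medium comes out at roughly 47%.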

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=2f0384dcd68bb85f98fb566b70b863b40082c83e
---

 common/pixel.c         |    9 ++
 common/x86/pixel-a.asm |  320 +++++++++++++++++++++++++++++++++++++++++++++---
 common/x86/pixel.h     |    3 +
 common/x86/x86util.asm |   16 +++
 4 files changed, 332 insertions(+), 16 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=2f0384dcd68bb85f98fb566b70b863b40082c83e

