[x264-devel] Optimize x86 intra_predict_4x4 and 8x8

Loren Merritt git at videolan.org
Wed Sep 21 21:34:40 CEST 2011


x264 | branch: master | Loren Merritt <pengvado at akuvian.org> | Sat Aug 13 19:01:22 2011 +0000| [265cfae4441ffe76cf6885e5a2448b945deb9b0c] | committer: Jason Garrett-Glaser

Optimize x86 intra_predict_4x4 and 8x8

High bit depth Penryn, Sandybridge cycles:
4x4_ddl: 11->10,  9-> 8
4x4_ddr: 15->13, 12->11
4x4_hd:        , 15->12
4x4_hu:        , 14->13
4x4_vr:  15->14, 14->12
8x8_ddl: 32->19, 19->14
8x8_ddr: 42->19, 21->14
8x8_hd:        , 15->13
8x8_hu:  21->17, 16->12
8x8_vr:  33->19,

8-bit Penryn, Sandybridge cycles:
4x4_ddr: 24->15,
4x4_hd:  24->16,
4x4_hu:  23->15,
4x4_vr:  23->16,
4x4_vl:  10-> 9,
8x8_ddl: 23->15,
8x8_hd:        , 17->14
8x8_hu:        , 15->14
8x8_vr:  20->16, 17->13

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=265cfae4441ffe76cf6885e5a2448b945deb9b0c
---

 common/osdep.h           |   15 +-
 common/x86/const-a.asm   |    1 -
 common/x86/predict-a.asm | 1216 ++++++++++++++++++++++++----------------------
 common/x86/predict-c.c   |   29 +-
 common/x86/x86util.asm   |   16 +
 encoder/analyse.c        |    4 +-
 encoder/macroblock.c     |    2 +-
 encoder/rdo.c            |    2 +-
 tools/checkasm.c         |    8 +-
 9 files changed, 686 insertions(+), 607 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=265cfae4441ffe76cf6885e5a2448b945deb9b0c


More information about the x264-devel mailing list