[x264-devel] x86: AVX2 plane_copy_deinterleave

Henrik Gramner git at videolan.org
Tue Jan 24 21:14:12 CET 2017


x264 | branch: master | Henrik Gramner <henrik at gramner.com> | Tue Jan 17 21:59:47 2017 +0100| [c22c10ddb21e9f5af1da83d37122e6f7388e1342] | committer: Henrik Gramner

x86: AVX2 plane_copy_deinterleave

50% faster than SSSE3 in 8-bit.
25% faster than AVX in high bit-depth.

Also drop the MMX versions of deinterleave functions in favor of SSE2.
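
For readers unfamiliar with the operation being optimized, here is a minimal
scalar sketch of a plane deinterleave for the 8-bit case. It is not taken from
the x264 source: the function name, parameter names, and element-based strides
are illustrative assumptions. It only shows the data movement that a SIMD
version would vectorize, namely splitting an interleaved plane (U0 V0 U1 V1 ...)
into two separate planes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar sketch, not the x264 implementation.
 * Splits an interleaved 8-bit plane (U0 V0 U1 V1 ...) into two planes.
 * Strides are given in elements; width is the number of output pixels
 * per row, so each source row holds 2 * width samples. */
static void deinterleave_sketch(uint8_t *dst_u, ptrdiff_t stride_u,
                                uint8_t *dst_v, ptrdiff_t stride_v,
                                const uint8_t *src, ptrdiff_t stride_src,
                                int width, int height)
{
    assert(width >= 0 && height >= 0);

    for (int y = 0; y < height; y++) {
        for (int x = 0; x < width; x++) {
            dst_u[x] = src[2 * x];     /* even samples go to the first plane */
            dst_v[x] = src[2 * x + 1]; /* odd samples go to the second plane */
        }
        /* Advance all three pointers to the next row. */
        dst_u += stride_u;
        dst_v += stride_v;
        src   += stride_src;
    }
}
```

A high bit-depth variant would presumably use a 16-bit sample type in place of
uint8_t; the loop structure stays the same.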

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=c22c10ddb21e9f5af1da83d37122e6f7388e1342
---

 common/x86/mc-a2.asm | 109 ++++++++++++++++++++++++++-------------------------
 common/x86/mc-c.c    |  20 ++++------
 encoder/encoder.c    |   4 +-
 tools/checkasm.c     |   2 +-
 4 files changed, 65 insertions(+), 70 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=c22c10ddb21e9f5af1da83d37122e6f7388e1342

