[x264-devel] x86: Add asm for mbtree fixed point conversion
Henrik Gramner
git at videolan.org
Tue Apr 12 20:36:17 CEST 2016
x264 | branch: master | Henrik Gramner <henrik at gramner.com> | Fri Mar 4 17:53:08 2016 +0100| [c82c7374938f4342971adf8b2495c3a1bbe621c4] | committer: Henrik Gramner
x86: Add asm for mbtree fixed point conversion
The QP offsets of each macroblock are stored as floats internally and
converted to big-endian Q8.8 fixed point numbers when written to the 2-pass
stats file, and converted back to floats when read from the stats file.
Add SSSE3 and AVX2 implementations for conversions in both directions.
About 8x faster than C on Haswell.
> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=c82c7374938f4342971adf8b2495c3a1bbe621c4
---
common/mc.c | 15 +++++++
common/mc.h | 3 +-
common/x86/mc-a2.asm | 113 +++++++++++++++++++++++++++++++++++++++++++++++++
common/x86/mc-c.c | 10 +++++
encoder/ratecontrol.c | 10 +----
tools/checkasm.c | 54 +++++++++++++++++++++++
6 files changed, 196 insertions(+), 9 deletions(-)
Diff: http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=c82c7374938f4342971adf8b2495c3a1bbe621c4
More information about the x264-devel
mailing list