[x264-devel] x86: Use SSE instead of SSE2 for copying data
Henrik Gramner
git at videolan.org
Wed Feb 27 00:18:03 CET 2013
x264 | branch: master | Henrik Gramner <henrik at gramner.com> | Sun Jan 27 23:01:59 2013 +0100| [c63a518d43bb3822342513eb4af109551e86fbd2] | committer: Jason Garrett-Glaser
x86: Use SSE instead of SSE2 for copying data
Reduces code size because movaps/movups are one byte shorter than movdqa/movdqu.
Also merge MMX and SSE versions of memcpy_aligned into a single macro.
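
To make the size claim concrete, here is a small sketch (not part of the commit) that lists the machine-code bytes for a 16-byte load from [rax] into xmm0 in each of the four forms. The SSE2 integer moves carry a mandatory 66 or F3 prefix that the SSE float moves lack. The array and function names are hypothetical, chosen only for this illustration, and the register/addressing choice is an assumption; other operands change the ModRM byte but not the one-byte difference.

```c
#include <stddef.h>

/* Encodings of a 16-byte load from [rax] into xmm0 (no REX prefix needed). */
static const unsigned char enc_movaps[] = { 0x0F, 0x28, 0x00 };       /* movaps xmm0, [rax] */
static const unsigned char enc_movdqa[] = { 0x66, 0x0F, 0x6F, 0x00 }; /* movdqa xmm0, [rax] */
static const unsigned char enc_movups[] = { 0x0F, 0x10, 0x00 };       /* movups xmm0, [rax] */
static const unsigned char enc_movdqu[] = { 0xF3, 0x0F, 0x6F, 0x00 }; /* movdqu xmm0, [rax] */

/* Bytes saved per instruction by using the SSE form of an aligned move. */
size_t aligned_saving(void)
{
    return sizeof enc_movdqa - sizeof enc_movaps;
}

/* Bytes saved per instruction by using the SSE form of an unaligned move. */
size_t unaligned_saving(void)
{
    return sizeof enc_movdqu - sizeof enc_movups;
}
```

Both helpers compare only encoded lengths; the instructions move the same 128 bits either way, which is why the substitution is safe for plain data copies.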
> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=c63a518d43bb3822342513eb4af109551e86fbd2
---
common/cpu.c | 2 +-
common/pixel.c | 4 ++
common/x86/mc-a.asm | 4 +-
common/x86/mc-a2.asm | 92 +++++++++++++++++++---------------------------
common/x86/mc-c.c | 44 +++++++++++-----------
common/x86/predict-a.asm | 18 ++++-----
common/x86/predict-c.c | 20 +++++++---
common/x86/predict.h | 8 ++--
common/x86/x86inc.asm | 4 +-
tools/checkasm.c | 5 ++-
10 files changed, 102 insertions(+), 99 deletions(-)
Diff: http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=c63a518d43bb3822342513eb4af109551e86fbd2