[x264-devel] x86: optimize and clean up predictor checking
Jason Garrett-Glaser
git at videolan.org
Wed Feb 27 00:18:04 CET 2013
x264 | branch: master | Jason Garrett-Glaser <jason at x264.com> | Fri Jan 18 22:55:46 2013 -0800| [91049858f8a051e87efcbe97285657fa3ef9a639] | committer: Jason Garrett-Glaser
x86: optimize and clean up predictor checking
Branchlessly handle elimination of candidates in MMX roundclip asm.
Add a new asm function, similar to roundclip, except without the round part.
Optimize and organize the C code, and make both subme>=3 and subme<3 consistent.
Add lots of explanatory comments and try to make things a little more understandable.
~5-10% faster with subme>=3, ~15-20% faster with subme<3.
> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=91049858f8a051e87efcbe97285657fa3ef9a639
---
common/common.h | 47 ++++++++++----
common/x86/util.h | 152 ++++++++++++++++++++++++++++++++++++----------
encoder/analyse.c | 12 ++--
encoder/me.c | 168 ++++++++++++++++++++++++++++++++-------------------
encoder/slicetype.c | 16 ++---
5 files changed, 275 insertions(+), 120 deletions(-)
Diff: http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=91049858f8a051e87efcbe97285657fa3ef9a639
More information about the x264-devel
mailing list