[x264-devel] x86: optimize and clean up predictor checking

Jason Garrett-Glaser git at videolan.org
Wed Feb 27 00:18:04 CET 2013


x264 | branch: master | Jason Garrett-Glaser <jason at x264.com> | Fri Jan 18 22:55:46 2013 -0800| [91049858f8a051e87efcbe97285657fa3ef9a639] | committer: Jason Garrett-Glaser

x86: optimize and clean up predictor checking
Branchlessly handle elimination of candidates in MMX roundclip asm.
Add a new asm function, similar to roundclip, except without the round part.
Optimize and organize the C code, and make both subme>=3 and subme<3 consistent.
Add lots of explanatory comments and try to make things a little more understandable.
~5-10% faster with subme>=3, ~15-20% faster with subme<3.

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=91049858f8a051e87efcbe97285657fa3ef9a639
---

 common/common.h     |   47 ++++++++++----
 common/x86/util.h   |  152 ++++++++++++++++++++++++++++++++++++----------
 encoder/analyse.c   |   12 ++--
 encoder/me.c        |  168 ++++++++++++++++++++++++++++++++-------------------
 encoder/slicetype.c |   16 ++---
 5 files changed, 275 insertions(+), 120 deletions(-)

Diff:   http://git.videolan.org/gitweb.cgi/x264.git/?a=commitdiff;h=91049858f8a051e87efcbe97285657fa3ef9a639


More information about the x264-devel mailing list