[x264-devel] ppc: Use the vec_xst_len for partial stores

Luca Barbato git at videolan.org
Tue Mar 12 19:31:57 CET 2019


x264 | branch: master | Luca Barbato <lu_zero at gentoo.org> | Thu Aug 23 08:30:37 2018 +0000| [40688108dd13fc0bf1847a6dfc1cf86a728654fb] | committer: Anton Mitrofanov

ppc: Use the vec_xst_len for partial stores

Seems to give about a 1-2% overall speedup on --slow.

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=40688108dd13fc0bf1847a6dfc1cf86a728654fb
---

 common/ppc/ppccommon.h | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/common/ppc/ppccommon.h b/common/ppc/ppccommon.h
index 51936e0a..8df69380 100644
--- a/common/ppc/ppccommon.h
+++ b/common/ppc/ppccommon.h
@@ -148,8 +148,12 @@ typedef union {
 /***********************************************************************
  * VEC_STORE##n:  stores n bytes from vector v to address p
  **********************************************************************/
+#ifndef __POWER9_VECTOR__
 #define VEC_STORE8( v, p ) \
     vec_vsx_st( vec_xxpermdi( v, vec_vsx_ld( 0, p ), 1 ), 0, p )
+#else
+#define VEC_STORE8( v, p ) vec_xst_len( v, p, 8 )
+#endif
 
 /***********************************************************************
  * VEC_TRANSPOSE_8



More information about the x264-devel mailing list