[x264-devel] aarch64: Clear the upper half of int parameters in x264_plane_copy_core_neon
Janne Grunau
git at videolan.org
Thu Dec 1 21:01:43 CET 2016
x264 | branch: master | Janne Grunau <janne-x264 at jannau.net> | Wed Nov 16 10:49:14 2016 +0200| [a91e95fca2222ac0731e987a07f4b11c670f4556] | committer: Henrik Gramner
aarch64: Clear the upper half of int parameters in x264_plane_copy_core_neon
> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=a91e95fca2222ac0731e987a07f4b11c670f4556
---
common/aarch64/mc-a.S | 5 +++--
1 file changed, 3 insertions(+), 2 deletions(-)
diff --git a/common/aarch64/mc-a.S b/common/aarch64/mc-a.S
index 3a99fbe..9dcb4db 100644
--- a/common/aarch64/mc-a.S
+++ b/common/aarch64/mc-a.S
@@ -1254,8 +1254,9 @@ load_deinterleave_chroma:
endfunc
function x264_plane_copy_core_neon, export=1
- add x8, x4, #15
- and x4, x8, #~15
+ add w8, w4, #15 // 32-bit write clears the upper 32-bit the register
+ and w4, w8, #~15
+ // safe use of the full reg since negative width makes no sense
sub x1, x1, x4
sub x3, x3, x4
1:
More information about the x264-devel
mailing list