[x264-devel] Fix rare overflow in 10-bit intra_satd_x3_16x16 asm

Anton Mitrofanov git at videolan.org
Wed Mar 7 03:20:12 CET 2012


x264 | branch: master | Anton Mitrofanov <Bugmaster at narod.ru> | Tue Feb 21 12:58:22 2012 -0800| [b17c247178a24c218843639c3f46bcfde0edab0a] | committer: Jason Garrett-Glaser

Fix rare overflow in 10-bit intra_satd_x3_16x16 asm

> http://git.videolan.org/gitweb.cgi/x264.git/?a=commit;h=b17c247178a24c218843639c3f46bcfde0edab0a
---

 common/x86/pixel-a.asm |   11 ++++++++---
 1 files changed, 8 insertions(+), 3 deletions(-)

diff --git a/common/x86/pixel-a.asm b/common/x86/pixel-a.asm
index 8d935ef..adeeff0 100644
--- a/common/x86/pixel-a.asm
+++ b/common/x86/pixel-a.asm
@@ -2007,9 +2007,14 @@ cglobal intra_satd_x3_16x16, 0,5
     inc         r4
     jl  .loop_x
 %if HIGH_BIT_DEPTH
-    mova        m7, [pw_1]
-    pmaddwd     m4, m7
-    pmaddwd     m0, m7
+    psrld       m7, m4, 16
+    pslld       m4, 16
+    psrld       m4, 16
+    paddd       m4, m7
+    psrld       m7, m0, 16
+    pslld       m0, 16
+    psrld       m0, 16
+    paddd       m0, m7
     paddd       m4, [sums+32]
     paddd       m0, [sums+24]
     mova [sums+32], m4
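
The commit message does not spell out the failure. One reading of the patch is that the packed 16-bit partial sums in m4 and m0 are unsigned and, at 10-bit depth, can occasionally exceed 32767. pmaddwd treats its word inputs as signed, so multiplying by pw_1 and adding adjacent words would read such a value as negative. The replacement sequence zero-extends each 16-bit half of every dword before adding. The scalar C sketch below models one 32-bit lane under that assumption. The function names (sext16, pair_sum_signed, pair_sum_unsigned) are hypothetical and are not part of x264.

```c
#include <stdint.h>

/* Interpret the low 16 bits of x as a signed word, as pmaddwd does. */
static int32_t sext16(uint32_t x)
{
    int32_t v = (int32_t)(x & 0xFFFFu);
    return v >= 0x8000 ? v - 0x10000 : v;
}

/* Old code, one dword lane: pmaddwd against pw_1.
 * Both 16-bit halves are multiplied by 1 as signed words, then added. */
static int32_t pair_sum_signed(uint32_t lane)
{
    return sext16(lane) + sext16(lane >> 16);
}

/* New code, one dword lane: shift-based unsigned pair sum. */
static uint32_t pair_sum_unsigned(uint32_t lane)
{
    uint32_t hi = lane >> 16;           /* psrld m7, m4, 16           */
    uint32_t lo = (lane << 16) >> 16;   /* pslld m4, 16; psrld m4, 16 */
    return lo + hi;                     /* paddd m4, m7               */
}
```

For a lane whose low word is 0x9000 (36864) and whose high word is 1, pair_sum_unsigned returns 36865, while pair_sum_signed returns -28671. The two agree whenever both words are at most 0x7FFF, which would be consistent with the overflow being rare.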


