[x265] [PATCH] asm : saturation bug fix for luma_vss asm routine

chen chenm003 at 163.com
Wed Jan 29 09:26:41 CET 2014


>@@ -5105,8 +5108,9 @@
>     pmaddwd    m5, [r6 + 3 * 16]
>     paddd      m1, m5                          ;m1=[1+2+3+4+5+6+7+8]  Row2 end
>     psrad      m1, 6
>-
>-    packssdw   m0, m1
>+    pand       m1, m7
>+
>+    packusdw   m0, m1
> 
>     movlps     [r2], m0
>     movhps     [r2 + r3], m0

PAND + PACKUSDW may avoid overflow problem, but it is wrong way here
as you said, you got a result value 0x8D84, it is overflow on 16bits, so we need to find really reason,
I check the HM code, it use Short, so I suggest you catch input data and put into HM to check HM's output.
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20140129/ef4c03f9/attachment.html>


More information about the x265-devel mailing list