[x265] [PATCH] asm : saturation bug fix for luma_vss asm routine

Deepthi Nandakumar deepthi at multicorewareinc.com
Thu Jan 30 15:29:14 CET 2014


This patch is pending, right Nabajit? I havent pushed the luma_vss
/chroma_vss assembly patches or the testbench edits to luma_vss.




On Wed, Jan 29, 2014 at 1:56 PM, chen <chenm003 at 163.com> wrote:

> >@@ -5105,8 +5108,9 @@
> >     pmaddwd    m5, [r6 + 3 * 16]
>
> >     paddd      m1, m5                          ;m1=[1+2+3+4+5+6+7+8]  Row2 end
> >     psrad      m1, 6
> >-
> >-    packssdw   m0, m1
> >+    pand       m1, m7
> >+
> >+    packusdw   m0, m1
> >
> >     movlps     [r2], m0
> >     movhps     [r2 + r3], m0
> PAND + PACKUSDW may avoid overflow problem, but it is wrong way here
> as you said, you got a result value 0x8D84, it is overflow on 16bits, so
> we need to find really reason,
> I check the HM code, it use Short, so I suggest you catch input data and
> put into HM to check HM's output.
>
>
> _______________________________________________
> x265-devel mailing list
> x265-devel at videolan.org
> https://mailman.videolan.org/listinfo/x265-devel
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20140130/7c872db7/attachment.html>


More information about the x265-devel mailing list