[x265] [PATCH 2 of 2] fix Main12 satd overflow bug up to SSE4, (fixes #180)

chen chenm003 at 163.com
Wed Sep 16 20:19:50 CEST 2015


Please ignore this patch; I forgot to clean up the debug code.
I will send it again.


At 2015-09-17 02:14:30,"Min Chen" <chenm003 at 163.com> wrote:
># HG changeset patch
># User Min Chen <chenm003 at 163.com>
># Date 1442423261 18000
># Node ID 61769ca2358186b36c2498b3eaa8aeded95af6fd
># Parent  e35d3260b5138abf09bf79808bf55104606c4626
>fix Main12 satd overflow bug up to SSE4, (fixes #180)
>---
> source/common/x86/asm-primitives.cpp |  106 ++++++-
> source/common/x86/pixel-a.asm        |  635 +++++++++-------------------------
> 2 files changed, 264 insertions(+), 477 deletions(-)
>
>diff -r e35d3260b513 -r 61769ca23581 source/common/x86/asm-primitives.cpp
>--- a/source/common/x86/asm-primitives.cpp	Wed Sep 16 12:07:40 2015 -0500
>+++ b/source/common/x86/asm-primitives.cpp	Wed Sep 16 12:07:41 2015 -0500
>@@ -871,6 +871,84 @@
> 
> #if HIGH_BIT_DEPTH
> 
>+#define BITS_PER_SUM (8 * sizeof(sum_t))
>+
>+#define HADAMARD4(d0, d1, d2, d3, s0, s1, s2, s3) { \
>+        sum2_t t0 = s0 + s1; \
>+        sum2_t t1 = s0 - s1; \
>+        sum2_t t2 = s2 + s3; \
>+        sum2_t t3 = s2 - s3; \
>+        d0 = t0 + t2; \
>+        d2 = t0 - t2; \
>+        d1 = t1 + t3; \
>+        d3 = t1 - t3; \
>+}
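
For readers of the archive: the quoted HADAMARD4 macro is a 4-point butterfly (two sums and two differences, then combined again). The sketch below restates it as a standalone function and gives a rough worst-case bound for a 4x4 transform. The function names and the use of plain int64_t are assumptions made for illustration; they are not the sum_t/sum2_t types or any code from the patch, and the bound is only a plausible reason why 12-bit input needs wider intermediates, not a description of the actual bug.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the HADAMARD4 macro: same butterfly, but on
// plain 64-bit integers (an assumption, not the x265 sum2_t type).
std::array<int64_t, 4> hadamard4(const std::array<int64_t, 4>& s)
{
    int64_t t0 = s[0] + s[1];
    int64_t t1 = s[0] - s[1];
    int64_t t2 = s[2] + s[3];
    int64_t t3 = s[2] - s[3];
    // Output order d0, d1, d2, d3 as in the macro.
    std::array<int64_t, 4> d = {{ t0 + t2, t1 + t3, t0 - t2, t1 - t3 }};
    return d;
}

// Largest coefficient magnitude after one horizontal and one vertical
// 4-point pass, when every input has magnitude at most maxAbs.
int64_t worstCase4x4(int64_t maxAbs)
{
    assert(maxAbs >= 0);
    std::array<int64_t, 4> in = {{ maxAbs, maxAbs, maxAbs, maxAbs }};
    std::array<int64_t, 4> row = hadamard4(in);      // row[0] == 4 * maxAbs
    std::array<int64_t, 4> colIn = {{ row[0], row[0], row[0], row[0] }};
    std::array<int64_t, 4> col = hadamard4(colIn);   // col[0] == 16 * maxAbs
    return col[0];
}
```

With 12-bit samples a pixel difference can reach 4095 in magnitude, so worstCase4x4(4095) gives 16 * 4095 = 65520, which no longer fits in a signed 16-bit lane (maximum 32767).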