<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">FYI, If what I submitted performs
better than the sse4 code then I suggest either improving the sse4
code with ssse3 and sse4 instructions or removing it.<br>
<br>
On 05/29/2015 10:12 AM, chen wrote:<br>
</div>
<blockquote
cite="mid:7ec097f1.db8.14da0a958e8.Coremail.chenm003@163.com"
type="cite">
<div
style="line-height:1.7;color:#000000;font-size:14px;font-family:arial">right,thanks
<div><br>
.</div>
<pre>
At 2015-05-30 01:01:15,<a class="moz-txt-link-abbreviated" href="mailto:dtyx265@gmail.com">dtyx265@gmail.com</a> wrote:
># HG changeset patch
># User David T Yuen <a class="moz-txt-link-rfc2396E" href="mailto:dtyx265@gmail.com"><dtyx265@gmail.com></a>
># Date 1432917446 25200
># Node ID 2d5efe979f6b9c8db275ecb53767e4bcff1da659
># Parent 12f0ed28ba0eb29f2df0bb8adbc5f3cfb40a6361
>asm: interp_8tap_vert_pX sse2
>
>This code replaces c code for sse2. It is the combination of the sse4 macros into
>one for smaller code size with no sacrifice in function and a few tweeks for performance.
>The original sse4 macros only use up to sse2 code so this code may perform better with the
>tweeks which include unrolling the inner loop which eliminated the need to use the stack
>to hold the counter for one of the loops and replaced incrementing the source register
>with address offsets.
>
</pre>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
x265-devel mailing list
<a class="moz-txt-link-abbreviated" href="mailto:x265-devel@videolan.org">x265-devel@videolan.org</a>
<a class="moz-txt-link-freetext" href="https://mailman.videolan.org/listinfo/x265-devel">https://mailman.videolan.org/listinfo/x265-devel</a>
</pre>
</blockquote>
<br>
</body>
</html>