<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Aptos;
panose-1:2 11 0 4 2 2 2 2 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:12.0pt;
font-family:"Aptos",sans-serif;}
pre
{mso-style-priority:99;
mso-style-link:"HTML Preformatted Char";
margin:0cm;
font-size:10.0pt;
font-family:"Courier New";}
span.HTMLPreformattedChar
{mso-style-name:"HTML Preformatted Char";
mso-style-priority:99;
mso-style-link:"HTML Preformatted";
font-family:Consolas;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;
mso-ligatures:none;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-GB" link="#467886" vlink="#96607D" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Hi Chen,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Thank you for the approval.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Yes, EOR and ADD have the same instruction cost (latency / throughput / pipeline) on AArch64 across the Neoverse cores (Nx, Vx), so either choice should be equally optimal.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Regards,<br>
Micro<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<div id="mail-editor-reference-message-container">
<div>
<div>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal" style="margin-bottom:12.0pt"><b><span style="color:black">From:
</span></b><span style="color:black">chen &lt;chenm003@163.com&gt;<br>
<b>Date: </b>Wednesday, 9 April 2025 at 07:10<br>
<b>To: </b>Development for x265 &lt;x265-devel@videolan.org&gt;<br>
<b>Cc: </b>nd &lt;nd@arm.com&gt;, Micro Daryl Robles &lt;MicroDaryl.Robles@arm.com&gt;<br>
<b>Subject: </b>Re:[x265] [PATCH 0/2] Add Neon impl of findPosFirstLast<o:p></o:p></span></p>
</div>
<div>
<div id="spnEditorContent">
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black">Hi Micro,<o:p></o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black"><o:p> </o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black">The code looks good to me. I have no further comments, thank you.<o:p></o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black"><o:p> </o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black">btw: for absSumSign, do EOR and ADD have the same instruction cost on Neoverse?<o:p></o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black"><o:p> </o:p></span></p>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black">Regards,<br>
Chen<o:p></o:p></span></p>
</div>
<p style="margin:0cm"><span style="font-size:10.5pt;font-family:"Arial",sans-serif;color:black"><o:p> </o:p></span></p>
<pre><span style="color:black"><br>At 2025-04-08 23:13:29, "Micro Daryl Robles" &lt;microdaryl.robles@arm.com&gt; wrote:<o:p></o:p></span></pre>
<pre><span style="color:black">>Hi,<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>This patch series adds a Neon intrinsic implementation of<o:p></o:p></span></pre>
<pre><span style="color:black">>findPosFirstLast.<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>Also, we are submitting a proposal to rename CLZ/CTZ to BSR/BSF, as the<o:p></o:p></span></pre>
<pre><span style="color:black">>current CLZ macro does not actually count leading zeros. Instead, it<o:p></o:p></span></pre>
<pre><span style="color:black">>returns the index of the highest set bit, which aligns with the behavior<o:p></o:p></span></pre>
<pre><span style="color:black">>of BSR.<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>This series is based on the master branch.<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>Many thanks,<o:p></o:p></span></pre>
<pre><span style="color:black">>Micro<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>Micro Daryl Robles (2):<o:p></o:p></span></pre>
<pre><span style="color:black">> AArch64: Add Neon implementation of findPosFirstLast<o:p></o:p></span></pre>
<pre><span style="color:black">> Rename CLZ/CTZ to BSR/BSF<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">> source/common/aarch64/dct-prim.cpp | 55 ++++++++++++++++++++++++++++-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/aarch64/dct-prim.h | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/bitstream.cpp | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/dct.cpp | 4 +--<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/ppc/dct_altivec.cpp | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/quant.cpp | 8 ++---<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/threading.h | 18 ++++++----<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/threadpool.cpp | 10 +++---<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/wavefront.cpp | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/common/x86/pixel-util8.asm | 4 +--<o:p></o:p></span></pre>
<pre><span style="color:black">> source/encoder/entropy.cpp | 10 +++---<o:p></o:p></span></pre>
<pre><span style="color:black">> source/encoder/frameencoder.cpp | 4 +--<o:p></o:p></span></pre>
<pre><span style="color:black">> source/encoder/slicetype.cpp | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/encoder/weightPrediction.cpp | 2 +-<o:p></o:p></span></pre>
<pre><span style="color:black">> source/test/pixelharness.cpp | 12 +++++--<o:p></o:p></span></pre>
<pre><span style="color:black">> 15 files changed, 102 insertions(+), 35 deletions(-)<o:p></o:p></span></pre>
<pre><span style="color:black">><o:p> </o:p></span></pre>
<pre><span style="color:black">>-- <o:p></o:p></span></pre>
<pre><span style="color:black">>2.34.1<o:p></o:p></span></pre>
</div>
</div>
</div>
</div>
</div>
</body>
</html>