<div style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><p style="margin: 0;">Hello,</p><div style="margin: 0;"><br></div><div style="margin: 0;">Thank your patch, I make some comments.</div><div style="margin: 0;"><br></div><p style="margin: 0;">+#ifdef __MACH__</p><p style="margin: 0;">+# define MACH</p><p style="margin: 0;">+#else</p><p style="margin: 0;">+# define MACH #</p><div style="margin: 0;">This is not good idea to bypass .const_data</div><div style="margin: 0;"><br></div><div style="margin: 0;"><div style="margin: 0;">+ ld1 {v0.s}[0], [x0], x1</div><div style="margin: 0;">+ ld1 {v0.s}[1], [x0], x1</div><div style="margin: 0;">+ ushll v0.8h, v0.8b, #0</div><div>...</div><div><div>+ // row[0-1]</div><div>+ mul v16.8h, v0.8h, v24.8h</div></div><div>Why not MULL?</div></div><p style="margin: 0;">+ ext v21.16b, v0.16b, v1.16b, #8</p><p style="margin: 0;">+ mul v17.8h, v21.8h, v24.8h</p><p style="margin: 0;">+ orr v0.16b, v1.16b, v1.16b</p><div>This is equal to MOV, I guess compiler will replace to right instruction on ARM64</div><div><br></div><p style="margin: 0;">+ // sum row[0-7]</p><p style="margin: 0;">+ dup v18.2d, v16.d[1]</p><p style="margin: 0;">+ dup v19.2d, v17.d[1]</p><p style="margin: 0;">+ add v16.4h, v16.4h, v18.4h</p><p style="margin: 0;">+ add v17.4h, v17.4h, v19.4h</p><div>How about ADDP?</div><div style="position:relative;zoom:1"></div><div id="divNeteaseMailCard"></div><p style="margin: 0;"><br></p><p> 2021-07-02 01:18:42£¬"Pop, Sebastian" <spop@amazon.com> </p><blockquote id="isReplyContent" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">
<style><!--
_font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
_font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
_page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt">Hi,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">the attached patch ports to arm64 the following kernels:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">luma_vpp[ 4x4] 18.77x 27.66 519.22<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">luma_vpp[ 4x8] 22.73x 45.35 1030.72<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">luma_vpp[ 4x16] 25.10x 82.32 2066.41<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Ok to commit?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Thanks,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt">Sebastian<o:p></o:p></span></p>
</div>
</blockquote></div>