<div style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><div style="margin: 0;">Hi,</div><div style="margin: 0;"><br></div><div style="margin: 0;">I just a little comments.</div><div style="margin: 0;"><br></div><div style="margin: 0;">+.macro addAvg_start</div><p style="margin: 0;">+ lsl x3, x3, #1</p><p style="margin: 0;">+ lsl x4, x4, #1</p><p style="margin: 0;">+ movrel x11, addAvg_offset</p><div style="margin: 0;">+ ld1 {v30.8h}, [x11]</div><div style="margin: 0px;">All of value in the addAvg_offset is 0x40, why not DUP?</div><div style="margin: 0px;"><br></div><p style="margin: 0;">+ add v0.8h, v0.8h, v1.8h</p><div style="margin: 0;">+ saddl v16.4s, v0.4h, v30.4h</div><div style="margin: 0;">immediate use v0 may make pipeline stall</div><div style="margin: 0;"><br></div><p style="margin: 0;">+ saddl2 v17.4s, v0.8h, v30.8h</p><p style="margin: 0;">+ add v2.8h, v2.8h, v3.8h</p><p style="margin: 0;">+ saddl v18.4s, v2.4h, v30.4h</p><p style="margin: 0;">+ saddl2 v19.4s, v2.8h, v30.8h</p><div><br></div><p style="margin: 0;"> 2021-07-27 09:01:32£¬"Pop, Sebastian" <spop@amazon.com> </p><blockquote id="isReplyContent" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">
<style><!--
_font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
_font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
_page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Hi,</span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">the attached patch ports to arm64 the following kernels:</span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 4x4] 22.03x 9.87 217.35<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 8x8] 41.06x 21.01 862.77<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 4x4] 21.07x 10.31 217.20<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 4x8] 23.19x 17.87 414.44<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 8x4] 35.10x 12.46 437.40<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 4x2] 13.23x 8.01 105.94<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 4x8] 23.17x 17.89 414.54<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[16x16] 50.38x 63.28 3187.50<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x8] 38.47x 21.93 843.59<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x16] 44.45x 38.55 1713.69<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 16x8] 47.63x 33.70 1605.09<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x4] 34.13x 12.86 439.01<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x8] 39.22x 21.87 857.94<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 8x16] 42.08x 40.88 1720.30<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 4x8] 23.03x 17.93 413.10<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 4x16] 24.58x 32.44 797.45<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 16x4] 44.62x 18.13 809.08<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x2] 28.08x 8.17 229.29<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x4] 34.00x 12.82 435.82<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[16x12] 50.69x 48.05 2435.74<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x6] 38.48x 17.07 656.91<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x12] 42.95x 30.00 1288.53<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 4x16] 25.31x 31.73 802.95<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[12x16] 35.76x 67.70 2421.01<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 6x8] 19.93x 30.26 603.15<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 6x16] 20.47x 57.31 1172.97<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[32x32] 48.23x 254.84 12291.57<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[16x16] 49.59x 63.82 3164.65<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[16x32] 51.79x 123.15 6377.69<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[32x16] 49.46x 128.27 6343.50<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 16x8] 48.03x 33.75 1620.91<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[16x16] 50.35x 62.86 3164.73<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[16x32] 51.75x 122.50 6339.62<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x16] 43.78x 38.62 1690.74<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x32] 45.53x 72.44 3298.22<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 32x8] 47.93x 65.87 3156.92<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 16x4] 43.43x 18.64 809.56<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 16x8] 47.47x 33.64 1596.84<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[32x24] 49.16x 191.04 9392.00<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[16x12] 49.27x 48.68 2398.20<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[16x24] 50.96x 93.21 4750.37<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[ 8x32] 45.61x 72.32 3298.91<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 4x16] 24.65x 32.30 796.37<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 4x32] 25.97x 60.57 1572.78<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[24x32] 46.28x 204.88 9481.85<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[12x16] 35.58x 68.07 2422.33<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[12x32] 37.35x 130.66 4879.55<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[64x64] 45.30x 1066.50 48309.83<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[32x32] 48.17x 255.22 12293.77<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[32x64] 48.67x 505.28 24591.01<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[64x32] 45.22x 535.51 24215.25<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[32x16] 48.63x 130.26 6334.18<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[32x32] 48.33x 255.33 12341.31<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[32x64] 48.88x 504.10 24641.61<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[16x32] 51.87x 123.09 6384.44<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[16x64] 53.21x 242.70 12914.20<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[64x16] 44.87x 270.22 12125.58<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 32x8] 46.57x 66.57 3100.05<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[32x16] 48.76x 129.97 6336.97<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[64x48] 46.57x 800.90 37301.68<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[32x24] 49.21x 192.49 9473.39<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[32x48] 49.02x 379.97 18627.41<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[16x64] 53.24x 242.72 12922.55<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[ 8x32] 44.63x 74.53 3326.18<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[ 8x64] 48.12x 138.94 6686.57<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> addAvg[48x64] 47.97x 754.41 36187.82<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i420] addAvg[24x32] 45.60x 205.26 9360.26<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> [i422] addAvg[24x64] 45.69x 408.96 18684.47<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Ok to commit?</span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Thanks,</span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Sebastian</span><span style="color:black"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
</div>
</blockquote></div>