<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
span.EmailStyle19
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt">With the patch attached.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="color:black">From: </span></b><span style="color:black">"Pop, Sebastian" <spop@amazon.com><br>
<b>Date: </b>Thursday, July 22, 2021 at 3:13 PM<br>
<b>To: </b>"x265-devel@videolan.org" <x265-devel@videolan.org><br>
<b>Subject: </b>[arm64] port count_nonzero, blkfill, and copy_{ss,sp,ps}<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
</div>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Hi,</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">the attached patch ports to arm64 the following kernels:</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">count_nonzero[4x4] 19.23x 2.95 56.77</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">count_nonzero[8x8] 32.07x 7.11 228.15</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">count_nonzero[16x16] 35.16x 26.09 917.23</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">count_nonzero[32x32] 37.31x 98.07 3658.49</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> blkfill[4x4] 31.39x 3.72 116.84</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> blkfill[8x8] 85.97x 5.78 497.26</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> blkfill[16x16] 102.63x 16.28 1670.56</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> blkfill[32x32] 100.07x 62.89 6293.62</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ss[ 4x4] 16.87x 6.21 104.78</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_sp[ 4x4] 16.21x 6.34 102.69</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ps[ 4x4] 18.06x 5.91 106.69</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ss[ 8x8] 51.50x 8.30 427.52</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_sp[ 8x8] 43.34x 9.32 403.79</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ps[ 8x8] 49.00x 8.50 416.36</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ss[ 4x4] 15.40x 6.62 101.98</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ps[ 4x4] 16.50x 6.26 103.28</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_sp[ 4x4] 14.14x 6.82 96.48</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ss[ 4x8] 25.79x 8.28 213.57</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ps[ 4x8] 24.74x 8.62 213.35</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_sp[ 4x8] 22.01x 9.27 204.03</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ss[16x16] 82.20x 19.79 1626.69</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_sp[16x16] 85.13x 18.78 1599.19</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ps[16x16] 72.51x 22.28 1615.58</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ss[ 8x8] 49.16x 8.49 417.24</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ps[ 8x8] 46.52x 8.71 405.34</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_sp[ 8x8] 42.68x 9.47 404.13</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ss[ 8x16] 56.55x 14.98 847.42</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ps[ 8x16] 57.71x 15.12 872.39</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_sp[ 8x16] 49.76x 16.83 837.44</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ss[32x32] 98.60x 67.47 6652.77</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_sp[32x32] 96.31x 65.07 6266.88</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ps[32x32] 77.71x 81.02 6295.59</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ss[16x16] 83.93x 20.52 1722.55</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ps[16x16] 72.66x 22.13 1608.30</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_sp[16x16] 85.67x 18.73 1604.77</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ss[16x32] 91.45x 36.56 3343.09</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ps[16x32] 75.73x 42.40 3211.16</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_sp[16x32] 91.93x 34.32 3154.89</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ss[64x64] 104.11x 254.52 26498.82</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_sp[64x64] 98.81x 252.38 24937.40</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_ps[64x64] 80.97x 308.55 24983.04</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ss[32x32] 99.49x 67.40 6706.31</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_ps[32x32] 76.50x 81.51 6235.63</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i420] copy_sp[32x32] 97.43x 65.84 6414.64</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ss[32x64] 102.57x 129.82 13315.36</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_ps[32x64] 78.95x 159.47 12590.31</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt">[i422] copy_sp[32x64] 99.54x 128.29 12769.10</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_cnt[4x4] 13.91x 7.48 104.10</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_cnt[8x8] 31.01x 12.69 393.40</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_cnt[16x16] 42.88x 36.23 1553.66</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> copy_cnt[32x32] 47.43x 129.19 6127.58</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Ok to commit?</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black"> </span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Thanks,</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt;color:black">Sebastian</span><o:p></o:p></p>
<p class="MsoNormal"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
</div>
</body>
</html>