<div data-ntes="ntes_mail_body_root" style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><div id="spnEditorContent"><p style="margin: 0;">Hi George,</p><p style="margin: 0;"><br></p><p style="margin: 0;">Thank for the improve patch.</p><p style="margin: 0;">I just a little comments below,</p><p style="margin: 0;"><br></p></div><pre>At 2025-03-08 00:41:05, "George Steed" <george.steed@arm.com> wrote:
> source/common/aarch64/pixel-util.S | 94 +++++++++++++-----------------
> 1 file changed, 42 insertions(+), 52 deletions(-)
>
>diff --git a/source/common/aarch64/pixel-util.S b/source/common/aarch64/pixel-util.S
>index d8b3f4365..6635e52b1 100644
>--- a/source/common/aarch64/pixel-util.S
>+++ b/source/common/aarch64/pixel-util.S
>@@ -2213,27 +2213,25 @@ endfunc
> //     const uint16_t* scanCG4x4, // x6
> //     const int trSize)          // x7
> function PFX(scanPosLast_neon)
>-.Loop_spl:
>-    // position of current CG
>+    ldr             q28, [x10]              // v28 = mask for pmovmskb
>+    add             x10, x7, x7             // 2*x7
>+    add             x11, x7, x7, lsl #1     // 3*x7
>+    add             x9, x4, #1              // CG count
>+
<div>>+1:</div><div>This is GCC style label, please keep generic style of local label</div><div><br></div><div><br></div>>     // coeffFlag = reverse_bit(w15) in 16-bit
>-    rbit            w12, w15
>-    lsr             w12, w12, #16
>-    fmov            s30, w12
>+    rbit            w12, w13
<div>>+    and             w12, w12, #0xffff</div><div>Is this necessary?</div><div><br></div>>     strh            w12, [x3], #2


>-    // compute coeffNum = popcount(coeffFlag)
>-    cnt             v30.8b, v30.8b
>-    addp            v30.8b, v30.8b, v30.8b
>-    fmov            w6, s30
<div>>-    sub             x5, x5, x6</div><div>We are not need 64bits x5</div><div><br></div>>-    strb            w6, [x4], #1
>-
>-    cbnz            x5, .Loop_spl
<div>>+    cbnz            x5, 1b</div><div>Same x5 here</div><div><br></div></pre></div>