<div data-ntes="ntes_mail_body_root" style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><div id="spnEditorContent"><p style="margin: 0;">Hi George,</p><p style="margin: 0;"><br></p><p style="margin: 0;">Thank for the improve patch.</p><p style="margin: 0;">I just a little comments below,</p><p style="margin: 0;"><br></p></div><pre>At 2025-03-08 00:41:05, "George Steed" <george.steed@arm.com> wrote:
> source/common/aarch64/pixel-util.S | 94 +++++++++++++-----------------
> 1 file changed, 42 insertions(+), 52 deletions(-)
>
>diff --git a/source/common/aarch64/pixel-util.S b/source/common/aarch64/pixel-util.S
>index d8b3f4365..6635e52b1 100644
>--- a/source/common/aarch64/pixel-util.S
>+++ b/source/common/aarch64/pixel-util.S
>@@ -2213,27 +2213,25 @@ endfunc
> // const uint16_t* scanCG4x4, // x6
> // const int trSize) // x7
> function PFX(scanPosLast_neon)
>-.Loop_spl:
>- // position of current CG
>+ ldr q28, [x10] // v28 = mask for pmovmskb
>+ add x10, x7, x7 // 2*x7
>+ add x11, x7, x7, lsl #1 // 3*x7
>+ add x9, x4, #1 // CG count
>+
<div>>+1:</div><div>This is GCC style label, please keep generic style of local label</div><div><br></div><div><br></div>> // coeffFlag = reverse_bit(w15) in 16-bit
>- rbit w12, w15
>- lsr w12, w12, #16
>- fmov s30, w12
>+ rbit w12, w13
<div>>+ and w12, w12, #0xffff</div><div>Is this necessary?</div><div><br></div>> strh w12, [x3], #2
>
>- // compute coeffNum = popcount(coeffFlag)
>- cnt v30.8b, v30.8b
>- addp v30.8b, v30.8b, v30.8b
>- fmov w6, s30
<div>>- sub x5, x5, x6</div><div>We are not need 64bits x5</div><div><br></div>>- strb w6, [x4], #1
>-
>- cbnz x5, .Loop_spl
<div>>+ cbnz x5, 1b</div><div>Same x5 here</div><div><br></div></pre></div>