+    ld2             {v0.16b,v1.16b}, [x1], #32
+    ld2             {v2.16b,v3.16b}, [x1], x2
+    ld2             {v4.16b,v5.16b}, [x1], #32
+    ld2             {v6.16b,v7.16b}, [x1], x2
+    uaddl           v16.8h, v0.8b, v1.8b
+    uaddl2          v17.8h, v0.16b, v1.16b
LD2+UADDL equal to LD1+ADDLP
btw: excuse me, other patches need more time, probability review on weekend.
Regards,
Min Chen
 2021-07-30 06:13:34,"Pop, Sebastian" <spop at amazon.com> 
Hi,
the attached patch ports to arm64 the following kernels:
 
       scale1D_128to64  68.89x   12.06           830.58
        scale2D_64to32  62.21x   220.95          13744.77
 
Ok to commit?
 
Thanks,
Sebastian
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210730/6f9d7ccb/attachment.html>