[x265] [arm64] port scale1D_128to64 and scale2D_64to32

chen chenm003 at 163.com
Fri Jul 30 00:14:44 UTC 2021


+    ld2             {v0.16b,v1.16b}, [x1], #32

+    ld2             {v2.16b,v3.16b}, [x1], x2

+    ld2             {v4.16b,v5.16b}, [x1], #32

+    ld2             {v6.16b,v7.16b}, [x1], x2

+    uaddl           v16.8h, v0.8b, v1.8b

+    uaddl2          v17.8h, v0.16b, v1.16b

LD2+UADDL equal to LD1+ADDLP




btw: excuse me, other patches need more time, probability review on weekend.


Regards,
Min Chen


 2021-07-30 06:13:34,"Pop, Sebastian" <spop at amazon.com> 

Hi,

the attached patch ports to arm64 the following kernels:

 

       scale1D_128to64  68.89x   12.06           830.58

        scale2D_64to32  62.21x   220.95          13744.77

 

Ok to commit?

 

Thanks,

Sebastian

 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210730/6f9d7ccb/attachment.html>


More information about the x265-devel mailing list