[x265] [arm64] port cpy2Dto1D_{shl, shr} and cpy1Dto2D_{shl, shr}

chen chenm003 at 163.com
Tue Jul 27 14:52:20 UTC 2021


Looks good, thanks.




On 2021-07-27 02:53:10, "Pop, Sebastian" <spop at amazon.com> wrote:

Hi,

The attached patch ports the following kernels to arm64:

 

    kernel                         speedup   optimized   C reference
    cpy2Dto1D_shl[4x4]             15.69x    6.73        105.60
    cpy2Dto1D_shr[4x4]             12.97x    6.65        86.28
    cpy2Dto1D_shl[8x8]             43.32x    8.85        383.16
    cpy2Dto1D_shr[8x8]             34.56x    9.75        336.91
    cpy2Dto1D_shl[16x16]           52.93x    21.95       1161.97
    cpy2Dto1D_shr[16x16]           52.10x    27.88       1452.72
    cpy2Dto1D_shl[32x32]           68.29x    89.12       6085.54
    cpy2Dto1D_shr[32x32]           38.55x    105.73      4076.24

    cpy1Dto2D_shl[4x4]             19.04x    5.63        107.16
    cpy1Dto2D_shl_aligned[4x4]     19.22x    5.60        107.68
    cpy1Dto2D_shr[4x4]             15.32x    6.52        99.89
    cpy1Dto2D_shl[8x8]             47.59x    8.27        393.34
    cpy1Dto2D_shl_aligned[8x8]     47.22x    8.28        390.90
    cpy1Dto2D_shr[8x8]             36.68x    9.74        357.15
    cpy1Dto2D_shl[16x16]           71.02x    21.51       1527.64
    cpy1Dto2D_shl_aligned[16x16]   69.37x    21.71       1506.23
    cpy1Dto2D_shr[16x16]           39.06x    28.23       1102.52
    cpy1Dto2D_shl[32x32]           68.19x    89.34       6092.00
    cpy1Dto2D_shl_aligned[32x32]   70.01x    89.26       6248.95
    cpy1Dto2D_shr[32x32]           56.47x    105.90      5979.45
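
For readers unfamiliar with these kernels, here is a minimal scalar sketch of what the kernel names suggest: copying a square block between a strided 2D layout and a packed 1D buffer while shifting each 16-bit value left or right. The function names, the signatures, and the rounding in the right-shift variants are assumptions made for illustration; none of this is taken from the patch or from the x265 sources.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical scalar helpers. Names and signatures are illustrative
// assumptions, not the x265 primitives or the arm64 code in the patch.

// Strided 2D block -> packed 1D buffer, each value scaled by 2^shift.
// Multiplication is used instead of << so negative inputs stay well defined.
template <int size>
void copy2DTo1DShiftLeft(int16_t* dst, const int16_t* src,
                         std::ptrdiff_t srcStride, int shift)
{
    for (int y = 0; y < size; y++)
    {
        for (int x = 0; x < size; x++)
            dst[x] = static_cast<int16_t>(src[x] * (1 << shift));
        src += srcStride;   // next row of the 2D source
        dst += size;        // next row of the packed destination
    }
}

// Strided 2D block -> packed 1D buffer, each value shifted right.
// Assumption: half of the divisor is added before shifting, and shift >= 1.
// Relies on arithmetic right shift for negative values.
template <int size>
void copy2DTo1DShiftRight(int16_t* dst, const int16_t* src,
                          std::ptrdiff_t srcStride, int shift)
{
    const int round = 1 << (shift - 1);
    for (int y = 0; y < size; y++)
    {
        for (int x = 0; x < size; x++)
            dst[x] = static_cast<int16_t>((src[x] + round) >> shift);
        src += srcStride;
        dst += size;
    }
}

// Packed 1D buffer -> strided 2D block, each value scaled by 2^shift.
template <int size>
void copy1DTo2DShiftLeft(int16_t* dst, const int16_t* src,
                         std::ptrdiff_t dstStride, int shift)
{
    for (int y = 0; y < size; y++)
    {
        for (int x = 0; x < size; x++)
            dst[x] = static_cast<int16_t>(src[x] * (1 << shift));
        src += size;
        dst += dstStride;
    }
}

// Packed 1D buffer -> strided 2D block, each value shifted right
// (same rounding assumption as copy2DTo1DShiftRight, shift >= 1).
template <int size>
void copy1DTo2DShiftRight(int16_t* dst, const int16_t* src,
                          std::ptrdiff_t dstStride, int shift)
{
    const int round = 1 << (shift - 1);
    for (int y = 0; y < size; y++)
    {
        for (int x = 0; x < size; x++)
            dst[x] = static_cast<int16_t>((src[x] + round) >> shift);
        src += size;
        dst += dstStride;
    }
}
```

Under these assumptions, packing a block with copy2DTo1DShiftLeft<4> and unpacking it with copy1DTo2DShiftRight<4> using the same shift returns the original values, as long as the shifted values fit in 16 bits. A vectorized port would process several of the inner-loop elements per instruction, which is consistent with the speedups growing with block size in the table above.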

 

Ok to commit?

 

Thanks,

Sebastian