[x265] [arm64] port cpy2Dto1D_{shl,shr} and cpy1Dto2D_{shl,shr}

Pop, Sebastian spop at amazon.com
Mon Jul 26 18:53:10 UTC 2021


Hi,
the attached patch ports to arm64 the following kernels:

    cpy2Dto1D_shl[4x4]  15.69x   6.73            105.60
    cpy2Dto1D_shr[4x4]  12.97x   6.65            86.28
    cpy2Dto1D_shl[8x8]  43.32x   8.85            383.16
    cpy2Dto1D_shr[8x8]  34.56x   9.75            336.91
  cpy2Dto1D_shl[16x16]  52.93x   21.95           1161.97
  cpy2Dto1D_shr[16x16]  52.10x   27.88           1452.72
  cpy2Dto1D_shl[32x32]  68.29x   89.12           6085.54
  cpy2Dto1D_shr[32x32]  38.55x   105.73          4076.24

    cpy1Dto2D_shl[4x4]  19.04x   5.63            107.16
cpy1Dto2D_shl_aligned[4x4]      19.22x   5.60            107.68
    cpy1Dto2D_shr[4x4]  15.32x   6.52            99.89
    cpy1Dto2D_shl[8x8]  47.59x   8.27            393.34
cpy1Dto2D_shl_aligned[8x8]      47.22x   8.28            390.90
    cpy1Dto2D_shr[8x8]  36.68x   9.74            357.15
  cpy1Dto2D_shl[16x16]  71.02x   21.51           1527.64
cpy1Dto2D_shl_aligned[16x16]    69.37x   21.71           1506.23
  cpy1Dto2D_shr[16x16]  39.06x   28.23           1102.52
  cpy1Dto2D_shl[32x32]  68.19x   89.34           6092.00
cpy1Dto2D_shl_aligned[32x32]    70.01x   89.26           6248.95
  cpy1Dto2D_shr[32x32]  56.47x   105.90          5979.45

Ok to commit?

Thanks,
Sebastian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210726/9ee16747/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-arm64-port-cpy2Dto1D_-shl-shr-and-cpy1Dto2D_-shl-shr.patch
Type: application/octet-stream
Size: 14745 bytes
Desc: 0001-arm64-port-cpy2Dto1D_-shl-shr-and-cpy1Dto2D_-shl-shr.patch
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210726/9ee16747/attachment-0001.obj>


More information about the x265-devel mailing list