[x265] [arm64] port cpy2Dto1D_{shl,shr} and cpy1Dto2D_{shl,shr}
Pop, Sebastian
spop at amazon.com
Mon Jul 26 18:53:10 UTC 2021
Hi,
the attached patch ports to arm64 the following kernels:
kernel                         speedup  arm64 asm  C reference
cpy2Dto1D_shl[4x4]              15.69x       6.73       105.60
cpy2Dto1D_shr[4x4]              12.97x       6.65        86.28
cpy2Dto1D_shl[8x8]              43.32x       8.85       383.16
cpy2Dto1D_shr[8x8]              34.56x       9.75       336.91
cpy2Dto1D_shl[16x16]            52.93x      21.95      1161.97
cpy2Dto1D_shr[16x16]            52.10x      27.88      1452.72
cpy2Dto1D_shl[32x32]            68.29x      89.12      6085.54
cpy2Dto1D_shr[32x32]            38.55x     105.73      4076.24
cpy1Dto2D_shl[4x4]              19.04x       5.63       107.16
cpy1Dto2D_shl_aligned[4x4]      19.22x       5.60       107.68
cpy1Dto2D_shr[4x4]              15.32x       6.52        99.89
cpy1Dto2D_shl[8x8]              47.59x       8.27       393.34
cpy1Dto2D_shl_aligned[8x8]      47.22x       8.28       390.90
cpy1Dto2D_shr[8x8]              36.68x       9.74       357.15
cpy1Dto2D_shl[16x16]            71.02x      21.51      1527.64
cpy1Dto2D_shl_aligned[16x16]    69.37x      21.71      1506.23
cpy1Dto2D_shr[16x16]            39.06x      28.23      1102.52
cpy1Dto2D_shl[32x32]            68.19x      89.34      6092.00
cpy1Dto2D_shl_aligned[32x32]    70.01x      89.26      6248.95
cpy1Dto2D_shr[32x32]            56.47x     105.90      5979.45
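
For readers unfamiliar with these primitives, here is a minimal scalar sketch of what the four kernel families compute. It is not taken from the attached patch. The semantics are an assumption inferred from the kernel names and from the general shape of x265's C primitives: the 2Dto1D kernels pack a strided block into a contiguous buffer, the 1Dto2D kernels do the reverse, shl scales up by 2^shift, and shr is assumed to be a rounding right shift. The `_ref` function names and the runtime `size` parameter are hypothetical.

```c
#include <stddef.h>
#include <stdint.h>

/* Pack a size x size block from a strided 2D buffer into a contiguous
 * 1D buffer, scaling each value by 2^shift. Multiplication is used in
 * place of << so that negative inputs stay well defined in C. */
static void cpy2Dto1D_shl_ref(int16_t *dst, const int16_t *src,
                              ptrdiff_t srcStride, int size, int shift)
{
    for (int i = 0; i < size; i++) {
        for (int j = 0; j < size; j++)
            dst[j] = (int16_t)(src[j] * (1 << shift));
        src += srcStride;   /* next row of the 2D source */
        dst += size;        /* next packed row of the 1D destination */
    }
}

/* Same copy with a rounding right shift (assumes shift >= 1 and an
 * arithmetic >> on negative values, as on common compilers). */
static void cpy2Dto1D_shr_ref(int16_t *dst, const int16_t *src,
                              ptrdiff_t srcStride, int size, int shift)
{
    const int round = 1 << (shift - 1);
    for (int i = 0; i < size; i++) {
        for (int j = 0; j < size; j++)
            dst[j] = (int16_t)((src[j] + round) >> shift);
        src += srcStride;
        dst += size;
    }
}

/* Unpack a contiguous 1D buffer into a strided 2D block, scaling up. */
static void cpy1Dto2D_shl_ref(int16_t *dst, const int16_t *src,
                              ptrdiff_t dstStride, int size, int shift)
{
    for (int i = 0; i < size; i++) {
        for (int j = 0; j < size; j++)
            dst[j] = (int16_t)(src[j] * (1 << shift));
        src += size;        /* next packed row of the 1D source */
        dst += dstStride;   /* next row of the 2D destination */
    }
}

/* Unpack with a rounding right shift (same assumptions as above). */
static void cpy1Dto2D_shr_ref(int16_t *dst, const int16_t *src,
                              ptrdiff_t dstStride, int size, int shift)
{
    const int round = 1 << (shift - 1);
    for (int i = 0; i < size; i++) {
        for (int j = 0; j < size; j++)
            dst[j] = (int16_t)((src[j] + round) >> shift);
        src += size;
        dst += dstStride;
    }
}
```

Each inner loop is an independent per-element shift over a contiguous row, which is the kind of loop that vectorizes well and is consistent with the speedups growing with block size.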
Ok to commit?
Thanks,
Sebastian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-arm64-port-cpy2Dto1D_-shl-shr-and-cpy1Dto2D_-shl-shr.patch
Type: application/octet-stream
Size: 14745 bytes
Desc: 0001-arm64-port-cpy2Dto1D_-shl-shr-and-cpy1Dto2D_-shl-shr.patch
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210726/9ee16747/attachment-0001.obj>