[x265] [arm64] port cpy2Dto1D_{shl, shr} and cpy1Dto2D_{shl, shr}
chen
chenm003 at 163.com
Tue Jul 27 14:52:20 UTC 2021
Looks good, thanks.
2021-07-27 02:53:10,"Pop, Sebastian" <spop at amazon.com>
Hi,
the attached patch ports to arm64 the following kernels:
cpy2Dto1D_shl[4x4] 15.69x 6.73 105.60
cpy2Dto1D_shr[4x4] 12.97x 6.65 86.28
cpy2Dto1D_shl[8x8] 43.32x 8.85 383.16
cpy2Dto1D_shr[8x8] 34.56x 9.75 336.91
cpy2Dto1D_shl[16x16] 52.93x 21.95 1161.97
cpy2Dto1D_shr[16x16] 52.10x 27.88 1452.72
cpy2Dto1D_shl[32x32] 68.29x 89.12 6085.54
cpy2Dto1D_shr[32x32] 38.55x 105.73 4076.24
cpy1Dto2D_shl[4x4] 19.04x 5.63 107.16
cpy1Dto2D_shl_aligned[4x4] 19.22x 5.60 107.68
cpy1Dto2D_shr[4x4] 15.32x 6.52 99.89
cpy1Dto2D_shl[8x8] 47.59x 8.27 393.34
cpy1Dto2D_shl_aligned[8x8] 47.22x 8.28 390.90
cpy1Dto2D_shr[8x8] 36.68x 9.74 357.15
cpy1Dto2D_shl[16x16] 71.02x 21.51 1527.64
cpy1Dto2D_shl_aligned[16x16] 69.37x 21.71 1506.23
cpy1Dto2D_shr[16x16] 39.06x 28.23 1102.52
cpy1Dto2D_shl[32x32] 68.19x 89.34 6092.00
cpy1Dto2D_shl_aligned[32x32] 70.01x 89.26 6248.95
cpy1Dto2D_shr[32x32] 56.47x 105.90 5979.45
Ok to commit?
Thanks,
Sebastian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210727/c50653c1/attachment.html>
More information about the x265-devel
mailing list