[x265] [arm64] port copy_pp
Pop, Sebastian
spop at amazon.com
Thu Jul 22 21:52:09 UTC 2021
Hi,
the attached patch ports to arm64 the following kernels:
copy_pp[ 4x4] 16.61x 5.83 96.85
copy_pp[ 8x8] 50.88x 7.41 377.22
[i420] copy_pp[ 4x4] 14.85x 6.30 93.47
[i422] copy_pp[ 4x8] 22.97x 8.22 188.87
copy_pp[ 8x4] 34.93x 5.30 185.06
[i420] copy_pp[ 4x2] 8.28x 5.04 41.73
[i422] copy_pp[ 4x4] 15.72x 6.29 98.81
copy_pp[ 4x8] 23.64x 8.14 192.46
[i420] copy_pp[ 2x4] 6.75x 6.33 42.71
copy_pp[16x16] 66.32x 15.32 1015.63
[i420] copy_pp[ 8x8] 49.04x 7.65 375.08
[i422] copy_pp[ 8x16] 50.99x 14.65 746.82
copy_pp[ 16x8] 97.64x 7.68 750.23
[i420] copy_pp[ 8x4] 31.06x 5.67 176.22
[i422] copy_pp[ 8x8] 48.80x 7.64 372.59
copy_pp[ 8x16] 51.36x 14.72 755.96
[i420] copy_pp[ 4x8] 23.39x 8.45 197.71
[i422] copy_pp[ 4x16] 15.13x 14.94 226.00
copy_pp[ 16x4] 64.05x 5.41 346.35
[i420] copy_pp[ 8x2] 18.09x 4.68 84.73
[i422] copy_pp[ 8x4] 32.38x 5.68 183.86
copy_pp[16x12] 100.95x 11.20 1130.36
[i420] copy_pp[ 8x6] 41.51x 6.66 276.47
[i422] copy_pp[ 8x12] 51.20x 11.00 562.99
copy_pp[ 4x16] 15.67x 14.85 232.74
[i420] copy_pp[ 2x8] 9.44x 8.94 84.34
[i422] copy_pp[ 2x16] 7.96x 22.26 177.21
copy_pp[12x16] 47.22x 23.99 1133.07
[i420] copy_pp[ 6x8] 21.64x 12.94 280.07
[i422] copy_pp[ 6x16] 26.33x 24.38 641.85
copy_pp[32x32] 143.86x 38.77 5577.15
[i420] copy_pp[16x16] 100.09x 14.85 1486.50
[i422] copy_pp[16x32] 96.01x 30.22 2901.08
copy_pp[32x16] 114.02x 21.69 2472.80
[i420] copy_pp[ 16x8] 85.78x 7.86 674.12
[i422] copy_pp[16x16] 91.62x 14.84 1359.77
copy_pp[16x32] 73.04x 30.64 2238.22
[i420] copy_pp[ 8x16] 49.44x 14.62 722.68
[i422] copy_pp[ 8x32] 53.63x 30.08 1613.46
copy_pp[ 32x8] 116.42x 11.64 1355.32
[i420] copy_pp[ 16x4] 58.93x 5.84 344.42
[i422] copy_pp[ 16x8] 88.45x 7.89 697.98
copy_pp[32x24] 135.31x 29.74 4024.18
[i420] copy_pp[16x12] 92.30x 11.27 1039.98
[i422] copy_pp[16x24] 90.90x 22.38 2034.18
copy_pp[ 8x32] 48.70x 30.02 1462.12
[i420] copy_pp[ 4x16] 24.31x 14.94 363.09
[i422] copy_pp[ 4x32] 23.94x 30.43 728.37
copy_pp[24x32] 91.74x 46.34 4251.20
[i420] copy_pp[12x16] 47.35x 24.36 1153.65
[i422] copy_pp[12x32] 43.63x 46.28 2019.02
copy_pp[64x64] 176.98x 129.35 22893.08
[i420] copy_pp[32x32] 144.42x 39.10 5646.94
[i422] copy_pp[32x64] 153.73x 74.20 11406.21
copy_pp[64x32] 171.82x 66.15 11365.43
[i420] copy_pp[32x16] 141.05x 21.61 3048.16
[i422] copy_pp[32x32] 150.24x 38.93 5848.20
copy_pp[32x64] 153.59x 73.28 11254.45
[i420] copy_pp[16x32] 92.93x 30.68 2850.89
[i422] copy_pp[16x64] 95.23x 61.54 5860.84
copy_pp[64x16] 173.45x 35.12 6091.41
[i420] copy_pp[ 32x8] 126.73x 11.92 1510.15
[i422] copy_pp[32x16] 129.62x 21.63 2803.73
copy_pp[64x48] 167.00x 97.67 16311.01
[i420] copy_pp[32x24] 132.80x 30.21 4011.50
[i422] copy_pp[32x48] 154.76x 55.84 8642.34
copy_pp[16x64] 90.05x 61.47 5534.98
[i420] copy_pp[ 8x32] 50.51x 30.05 1517.87
[i422] copy_pp[ 8x64] 48.10x 61.61 2963.15
copy_pp[48x64] 174.26x 98.45 17156.06
[i420] copy_pp[24x32] 90.07x 46.54 4192.25
[i422] copy_pp[24x64] 92.13x 93.10 8577.37
Ok to commit?
Thanks,
Sebastian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210722/5381d82a/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-arm64-port-copy_pp.patch
Type: application/octet-stream
Size: 16737 bytes
Desc: 0001-arm64-port-copy_pp.patch
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210722/5381d82a/attachment-0001.obj>
More information about the x265-devel
mailing list