[x265] [arm64] add missing p2s functions
Pop, Sebastian
spop at amazon.com
Wed Aug 11 23:46:08 UTC 2021
Hi,
One benefit of using the x86 macros to register the functions is that it automatically exposes the functions that have not yet been implemented.
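As a rough illustration of why macro-based registration exposes gaps (a sketch only, not the actual x265 code): every name below (`p2s_table`, `DEFINE_KERNEL`, `ALL_SIZES`, `REGISTER`, the `_neon` suffix, the three sizes) is made up for the example, and the kernel bodies are placeholders rather than the real p2s arithmetic. The assumption is that one macro expands to an assignment per block size, so a size without an implementation refers to a symbol that does not exist and the build stops.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical kernel signature and dispatch table, for illustration only.
typedef void (*p2s_fn)(const uint8_t* src, intptr_t srcStride,
                       int16_t* dst, intptr_t dstStride);

enum BlockSize { B_2x4, B_4x2, B_8x2, NUM_BLOCKS };

static p2s_fn p2s_table[NUM_BLOCKS]; // zero-initialized: all entries start null

// Stand-in for hand-written kernels. The body only widens each pixel to
// 16 bits; it is a placeholder, not the real conversion.
#define DEFINE_KERNEL(W, H)                                                   \
    static void p2s_##W##x##H##_neon(const uint8_t* src, intptr_t srcStride,  \
                                     int16_t* dst, intptr_t dstStride)        \
    {                                                                         \
        for (int y = 0; y < H; y++)                                           \
            for (int x = 0; x < W; x++)                                       \
                dst[y * dstStride + x] = src[y * srcStride + x];              \
    }

// Remove any one of these lines and setup_p2s_table() below no longer
// builds, which is how the missing kernel gets noticed.
DEFINE_KERNEL(2, 4)
DEFINE_KERNEL(4, 2)
DEFINE_KERNEL(8, 2)

// Single list of sizes; the registration macro is applied to every entry.
#define ALL_SIZES(X) X(2, 4) X(4, 2) X(8, 2)
#define REGISTER(W, H) p2s_table[B_##W##x##H] = p2s_##W##x##H##_neon;

static void setup_p2s_table()
{
    ALL_SIZES(REGISTER)
}
```

With kernels written in assembly the failure would presumably show up as an undefined symbol at link time rather than as a compile error, but the effect is the same: the list of sizes drives the registration, so nothing can be silently skipped.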
The attached patch adds the following missing kernels (columns: speedup over the C version, time of the optimized version, time of the C version):
chroma_p2s[ 2x4] 7.23x 7.77 56.14
chroma_p2s_aligned[ 2x4] 7.36x 7.82 57.53
chroma_p2s[ 2x8] 9.33x 11.42 106.57
chroma_p2s_aligned[ 2x8] 11.46x 11.40 130.65
chroma_p2s[ 2x16] 12.81x 20.87 267.46
chroma_p2s_aligned[ 2x16] 13.18x 20.45 269.52
chroma_p2s[ 4x2] 9.27x 5.94 55.11
chroma_p2s_aligned[ 4x2] 9.26x 6.05 56.01
chroma_p2s[ 4x32] 28.73x 30.52 876.99
chroma_p2s_aligned[ 4x32] 33.27x 30.85 1026.31
chroma_p2s[ 6x8] 15.33x 19.92 305.40
chroma_p2s_aligned[ 6x8] 15.41x 19.92 306.91
chroma_p2s[ 6x16] 16.28x 37.47 609.82
chroma_p2s_aligned[ 6x16] 16.53x 37.42 618.63
chroma_p2s[ 8x2] 17.47x 5.87 102.58
chroma_p2s_aligned[ 8x2] 16.97x 5.95 100.98
chroma_p2s[ 8x6] 36.08x 8.35 301.40
chroma_p2s_aligned[ 8x6] 35.69x 8.39 299.25
chroma_p2s[ 8x12] 45.62x 13.12 598.47
chroma_p2s_aligned[ 8x12] 46.13x 13.09 603.87
chroma_p2s[ 8x64] 51.21x 62.76 3214.13
chroma_p2s_aligned[ 8x64] 52.85x 62.51 3303.60
chroma_p2s[12x32] 45.45x 52.36 2379.91
chroma_p2s_aligned[12x32] 45.66x 51.82 2365.99
chroma_p2s[16x24] 59.90x 39.38 2358.73
chroma_p2s_aligned[16x24] 59.66x 39.39 2350.24
chroma_p2s[24x64] 59.29x 159.27 9444.02
chroma_p2s_aligned[24x64] 59.44x 159.39 9473.78
chroma_p2s[32x48] 59.84x 155.86 9326.65
chroma_p2s_aligned[32x48] 59.81x 155.81 9318.74
The patch also uses the macros to declare the functions, which makes the include file much easier to read.
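To give an idea of what macro-generated declarations can look like (again a sketch under assumptions, not the contents of the patch): `P2S_ARGS`, `DECL_P2S`, `CHROMA_SIZES` and the `demo_` function names are hypothetical, and the C++ definitions at the end only stand in for the assembly so that the example links.

```cpp
#include <cassert>
#include <cstdint>

// Shared argument list, written once (hypothetical signature).
#define P2S_ARGS const uint8_t* src, intptr_t srcStride, \
                 int16_t* dst, intptr_t dstStride

// One entry per size declares both the plain and the aligned variant.
#define DECL_P2S(W, H)                                     \
    extern "C" void demo_p2s_##W##x##H(P2S_ARGS);          \
    extern "C" void demo_p2s_aligned_##W##x##H(P2S_ARGS);

// The header then reduces to a list of sizes instead of a wall of prototypes.
#define CHROMA_SIZES(X) X(2, 4) X(2, 8) X(4, 2)
CHROMA_SIZES(DECL_P2S)

// Placeholder definitions standing in for the assembly implementations.
// They only widen pixels to 16 bits and do not perform the real conversion.
#define DEF_ONE(NAME, W, H)                                       \
    extern "C" void NAME(P2S_ARGS)                                \
    {                                                             \
        for (int y = 0; y < H; y++)                               \
            for (int x = 0; x < W; x++)                           \
                dst[y * dstStride + x] = src[y * srcStride + x];  \
    }
#define DEF_P2S(W, H)                      \
    DEF_ONE(demo_p2s_##W##x##H, W, H)      \
    DEF_ONE(demo_p2s_aligned_##W##x##H, W, H)
CHROMA_SIZES(DEF_P2S)
```

Adding a new block size then means adding one entry to the size list, and both variants get a matching prototype.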
Ok to commit?
Thanks,
Sebastian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-arm64-add-missing-p2s-functions.patch
Type: application/octet-stream
Size: 59458 bytes
Desc: 0001-arm64-add-missing-p2s-functions.patch
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210811/5f7ed32c/attachment-0001.obj>