<div data-ntes="ntes_mail_body_root" style="line-height:1.7;color:#000000;font-size:14px;font-family:Arial"><div id="spnEditorContent"><p style="margin: 0;">Thanks for the patches. I have some comments:</p><p style="margin: 0;"><br></p><p style="margin: 0;">* The current version supports pixels of up to 12 bits, so sse_pp is equivalent to sse_ss. That said, a separate <span style="font-family: arial; white-space: pre-wrap;">16-bit version is not a bad idea.</span></p><pre style="width: 1298.64px; word-break: break-word !important;">* In the code below, which is better: LD1 or LDR?</pre><pre style="width: 1298.64px; word-break: break-word !important;">+ ld1 {v16.8h-v17.8h}, [x0], x1
+ ld1 {v18.8h-v19.8h}, [x2], x3
</pre></div><div style="position:relative;zoom:1"></div><div id="divNeteaseMailCard"></div><p style="margin: 0;"><br></p><pre>At 2024-12-10 23:59:15, "Gerda Zsejke More" &lt;gerdazsejke.more@arm.com&gt; wrote:
>Hi,
>
>This patch series adds Neon and SVE asm implementation of HBD SSE_PP, SSE_SS and SSD_S functions.
>The added HBD SSE_SS and SSD_S SVE implementation is suitable for SBD as well, so enable it for that.
>Delete unused Neon intrinsics functions for SSE and SSD_S.
>
>This series is based on the master branch.
>
>Many thanks,
>Gerda
>
>Gerda Zsejke More (11):
> Avoid aliasing HBD SSE_PP functions for AArch64 platforms
> AArch64: Add Neon asm implementation of HBD SSE_PP
> AArch64: Add SVE asm implementation of HBD SSE_PP
> AArch64: Add Neon asm implementation of HBD SSE_SS
> AArch64: Add SVE asm implementation of HBD SSE_SS
> AArch64: Enable existing SSE_SS SVE impl for SBD
> AArch64: Delete sse_neon implementation
> AArch64: Add Neon asm implementation of HBD SSD_S
> AArch64: Add SVE asm implementation of HBD SSD_S
> AArch64: Enable existing SSD_S SVE impl for SBD
> AArch64: Delete pixel_ssd_s_neon implementation
>
> source/common/CMakeLists.txt | 4 +-
> source/common/aarch64/asm-primitives.cpp | 84 +--
> source/common/aarch64/pixel-prim.cpp | 89 ----
> source/common/aarch64/ssd-a-sve.S | 483 +++++++++++++++++
> source/common/aarch64/ssd-a-sve2.S | 626 -----------------------
> source/common/aarch64/ssd-a.S | 525 +++++++++++++++++++
> source/common/primitives.cpp | 2 +
> 7 files changed, 1063 insertions(+), 750 deletions(-)
> create mode 100644 source/common/aarch64/ssd-a-sve.S
> delete mode 100644 source/common/aarch64/ssd-a-sve2.S
>
>--
>2.39.5 (Apple Git-154)
>
>_______________________________________________
>x265-devel mailing list
>x265-devel@videolan.org
>https://mailman.videolan.org/listinfo/x265-devel
</pre></div>