<div dir="ltr">All the patches of this series have been pushed to the master branch. <br clear="all"><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><b>__________________________</b></div><div><b>Karam Singh</b></div><div><b>Ph.D. IIT Guwahati</b></div><div><font size="1">Senior Software (Video Coding) Engineer </font></div><div><font size="1">Mobile: +91 8011279030</font></div><div><font size="1">Block 9A, 6th floor, DLF Cyber City</font></div><div><font size="1">Manapakkam, Chennai 600 089</font></div></div></div></div><br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Mon, Sep 2, 2024 at 9:20 AM chen <<a href="mailto:chenm003@163.com">chenm003@163.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div style="line-height:1.7;color:rgb(0,0,0);font-size:14px;font-family:Arial"><div id="m_-1356655377911013488spnEditorContent"><p style="margin:0px">Hi Hari,</p><p style="margin:0px"><br></p><p style="margin:0px">Thanks for the patches. I have no comments on this initial version.</p><p style="margin:0px">In future versions, we may improve the algorithm.</p><p style="margin:0px">For example, a block of width=12 could be split into two blocks of width=8, which would have cache benefits.</p><p style="margin:0px"><br></p></div><div style="zoom:1"></div><div id="m_-1356655377911013488divNeteaseMailCard"></div><div style="margin:0px">Regards,</div><div style="margin:0px">Chen</div><pre><br>At 2024-08-31 03:18:48, "Hari Limaye" <<a href="mailto:hari.limaye@arm.com" target="_blank">hari.limaye@arm.com</a>> wrote:
>This patch series optimises the existing Neon intrinsics implementations of the ipfilter primitives, and removes the assembly implementations in favour of these new implementations.
>
>Relative performance observed for the new Neon intrinsics implementations, compared to the existing assembly implementations, is in the respective commit messages.
>
>Many thanks,
>Hari
>
>Hari Limaye (12):
> Test: Remove check for unused coeffIdx in ipfilter tests
> Move ipfilter primitives into X265_NS
> AArch64: Move ipfilter primitives into X265_NS
> AArch64: Support all block sizes in p2s Neon
> AArch64: Optimise low bitdepth interp_horiz_pp_neon
> AArch64: Optimise low bitdepth interp_horiz_ps_neon
> AArch64: Optimise low bitdepth interp_vert_ss_neon
> AArch64: Optimise low bitdepth interp_vert_pp_neon
> AArch64: Optimise low bitdepth interp_vert_ps_neon
> AArch64: Optimise low bitdepth interp_vert_sp_neon
> AArch64: Define all low bitdepth Neon ipfilter primitives
> AArch64: Remove Assembly ipfilter primitives
>
> source/common/CMakeLists.txt | 4 +-
> source/common/aarch64/asm-primitives.cpp | 186 --
> source/common/aarch64/filter-prim.cpp | 2877 ++++++++++++++++++----
> source/common/aarch64/fun-decls.h | 15 -
> source/common/aarch64/ipfilter-common.S | 1436 -----------
> source/common/aarch64/ipfilter-sve2.S | 1282 ----------
> source/common/aarch64/ipfilter.S | 1054 --------
> source/common/aarch64/mem-neon.h | 193 ++
> source/common/ipfilter.cpp | 8 +-
> source/test/ipfilterharness.cpp | 24 +-
> 10 files changed, 2580 insertions(+), 4499 deletions(-)
> delete mode 100644 source/common/aarch64/ipfilter-common.S
> delete mode 100644 source/common/aarch64/ipfilter-sve2.S
> delete mode 100644 source/common/aarch64/ipfilter.S
>
>--
>2.42.1
>
>_______________________________________________
>x265-devel mailing list
><a href="mailto:x265-devel@videolan.org" target="_blank">x265-devel@videolan.org</a>
><a href="https://mailman.videolan.org/listinfo/x265-devel" target="_blank">https://mailman.videolan.org/listinfo/x265-devel</a>
</pre></div>
</blockquote></div>