[x265] [arm64] port filterPixelToShort

Pop, Sebastian spop at amazon.com
Thu Jun 24 01:09:09 UTC 2021


> +.macro filterPixelToShort_64xN h
> +function x265_filterPixelToShort_64x\h\()_neon
> +    add             x3, x3, x3
> +    sub             x3, x3, #0x40
> +    movi            v4.8h, #0xe0, lsl #8
> +.rept \h
> I guess unrolling N times is not a good idea: the code section becomes too large, which will most probably cause instruction cache evictions and misses.

Please see the attached amended patch, which now uses a loop instead.
Ok to commit?
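
For reference, here is a rough scalar sketch in C of what the routine computes, with the rows handled by a loop rather than by repeating the body once per row. It is not taken from the attached patch. The function name, the SHIFT and OFFSET constants, and the element-based strides are assumptions for an 8-bit build; the only value grounded in the quoted code is the 0xe000 constant loaded into v4, which is -8192 as a signed 16-bit value.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed constants for 8-bit pixels: scale by 1 << 6, then subtract 8192.
 * Adding 0xe000 as int16 (the v4 constant in the quoted code) is the same
 * as subtracting 8192. */
enum { SHIFT = 6, OFFSET = 8192 };

/* Hypothetical scalar reference: widen each pixel to 16 bits, shift it left
 * and subtract the offset. Strides are counted in elements, not bytes. */
static void pixel_to_short_ref(const uint8_t *src, ptrdiff_t src_stride,
                               int16_t *dst, ptrdiff_t dst_stride,
                               int width, int height)
{
    assert(width > 0 && height > 0);
    /* One loop iteration per row keeps the code size independent of height,
     * unlike an expansion that repeats the row body height times. */
    for (int row = 0; row < height; row++) {
        for (int col = 0; col < width; col++)
            dst[col] = (int16_t)((src[col] << SHIFT) - OFFSET);
        src += src_stride;
        dst += dst_stride;
    }
}
```

With these assumed constants an input pixel of 128 maps to 0, and the full 0..255 input range maps to -8192..8128, so every result fits in int16_t.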

Thanks,
Sebastian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-arm64-port-x265_filterPixelToShort_-_neon.patch
Type: application/octet-stream
Size: 13360 bytes
Desc: 0001-arm64-port-x265_filterPixelToShort_-_neon.patch
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20210624/809b8118/attachment.obj>

