[x264-devel] [PATCH 2/2] arm: Implement x264_mbtree_propagate_{cost, list}_neon
Henrik Gramner
henrik at gramner.com
Thu Sep 3 16:36:45 CEST 2015
On Thu, Sep 3, 2015 at 8:30 AM, Martin Storsjö <martin at martin.st> wrote:
> The cost function could be simplified to avoid having to clobber
> q4/q5, but this requires reordering instructions which increase
> the total runtime.
>
> checkasm timing Cortex-A7 A8 A9
> mbtree_propagate_cost_c 63702 155835 62829
> mbtree_propagate_cost_neon 17199 10454 11106
>
> mbtree_propagate_list_c 104203 108949 84532
> mbtree_propagate_list_neon 82035 78348 60410
Applied.
More information about the x264-devel
mailing list