[x265] [PATCH 2 of 2] asm:intra pred planar32 sse2 high bit
Steve Borho
steve at borho.org
Tue Mar 10 16:56:19 CET 2015
On 03/10, dave wrote:
> On 03/09/2015 11:40 PM, Steve Borho wrote:
<snip>
> >No, but the command line option --cu-stats does show how much it is
> >called (but not how long it took)
> >
> This produces some interesting numbers.
>
> Without using registers for constants
>
> x265 [info]: I32: Intra 100%(DC 0% P 40% Ang 58%)
>
> encoded 2000 frames in 95.98s (20.84 fps), 1020.04 kb/s
>
> With using registers for constants
>
> x265 [info]: I32: Intra 99%(DC 39% P 16% Ang 43%)
>
> encoded 2000 frames in 93.10s (21.48 fps), 1008.63 kb/s
>
> I just added --cu-stats to the same command options that I used
> previously and I ran it several times and got exactly the same
> percentages. Times varied by less than a second for each build. So
> how can simple register usage in one primitive affect intra pred
> decisions?
It shouldn't. A performance-only change should never be able to affect
the encoder output (or any coding decisions), so the behavior must be
wrong in one of the two cases.
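
One way to narrow this down is to compare the primitive's output against a
plain reference on random inputs. Below is a minimal sketch of that idea in
Python, not x265 code: planar_ref follows the HEVC planar formula, while
planar_fast is a hypothetical stand-in for an optimized version that hoists
row constants. All function names and the 10-bit default are assumptions made
for illustration.

```python
import random


def planar_ref(top, left, top_right, bottom_left, log2_n):
    """Reference planar prediction for an N x N block (N = 1 << log2_n)."""
    n = 1 << log2_n
    pred = [[0] * n for _ in range(n)]
    for y in range(n):
        for x in range(n):
            pred[y][x] = ((n - 1 - x) * left[y] + (x + 1) * top_right
                          + (n - 1 - y) * top[x] + (y + 1) * bottom_left
                          + n) >> (log2_n + 1)
    return pred


def planar_fast(top, left, top_right, bottom_left, log2_n):
    """Hypothetical 'optimized' variant: same math, per-row terms hoisted."""
    n = 1 << log2_n
    shift = log2_n + 1
    pred = []
    for y in range(n):
        # Terms that do not depend on x are computed once per row.
        row_const = (y + 1) * bottom_left + n
        weight_y = n - 1 - y
        left_y = left[y]
        pred.append([((n - 1 - x) * left_y + (x + 1) * top_right
                      + weight_y * top[x] + row_const) >> shift
                     for x in range(n)])
    return pred


def count_mismatches(trials=100, log2_n=5, bit_depth=10, seed=0):
    """Count random inputs on which the two versions disagree."""
    rng = random.Random(seed)
    max_val = (1 << bit_depth) - 1
    n = 1 << log2_n
    mismatches = 0
    for _ in range(trials):
        top = [rng.randint(0, max_val) for _ in range(n)]
        left = [rng.randint(0, max_val) for _ in range(n)]
        top_right = rng.randint(0, max_val)
        bottom_left = rng.randint(0, max_val)
        ref = planar_ref(top, left, top_right, bottom_left, log2_n)
        fast = planar_fast(top, left, top_right, bottom_left, log2_n)
        if ref != fast:
            mismatches += 1
    return mismatches


if __name__ == "__main__":
    print("mismatching blocks:", count_mismatches())
```

Any nonzero mismatch count would mean the two versions are not bit-exact,
which is the kind of difference that could change the mode decisions seen
in the --cu-stats output.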
--
Steve Borho