[x265] [PATCH 2 of 2] asm:intra pred planar32 sse2 high bit

Steve Borho steve at borho.org
Tue Mar 10 04:25:01 CET 2015


On 03/09, dave wrote:
> Interesting.  Performance is almost identical
> 
> original code
> 
> /x265 -I 1 --input ~/Videos/bridge-close-cif/bridge-close.y4m -o
> bridge-close.y4m
> y4m  [info]: 352x288 fps 30/1 i420p8 frames 0 - 1999 of 2000
> x265 [info]: HEVC encoder version 1.5+162-4d1d54d28cb1
> x265 [info]: build info [Linux][GCC 4.7.2][64 bit] 16bpp
> x265 [info]: using cpu capabilities: MMX2 SSE2Slow SlowCTZ
> x265 [info]: Main 10 profile, Level-2 (Main tier)
> x265 [info]: Thread pool created using 2 threads
> x265 [info]: frame threads / pool features       : 1 / wpp(5 rows)
> x265 [info]: Internal bit depth                  : 10
> x265 [info]: Coding QT: max CU size, min CU size : 64 / 8
> x265 [info]: Residual QT: max TU size, max depth : 32 / 1 inter / 1 intra
> x265 [info]: ME / range / subpel / merge         : hex / 57 / 2 / 2
> x265 [info]: Keyframe min / max / scenecut       : 1 / 1 / 40
> x265 [info]: Lookahead / bframes / badapt        : 20 / 0 / 0
> x265 [info]: b-pyramid / weightp / weightb / refs: 0 / 1 / 0 / 3
> x265 [info]: Rate Control / AQ-Strength / CUTree : CRF-28.0 / 1.0 / 1
> x265 [info]: tools: rd=3 psy-rd=0.30 deblock sao signhide tmvp
> x265 [info]: frame I:   2000, Avg QP:33.08  kb/s: 973.45
> x265 [info]: global :   2000, Avg QP:33.08  kb/s: 973.45
> x265 [info]: consecutive B-frames: 100.0%
> 
> encoded 2000 frames in 414.26s (4.83 fps), 973.45 kb/s
> 
> and using registers to hold constants
> 
> ./x265 -I 1 --input ~/Videos/bridge-close-cif/bridge-close.y4m -o
> bridge-close.y4m
> y4m  [info]: 352x288 fps 30/1 i420p8 frames 0 - 1999 of 2000
> x265 [info]: HEVC encoder version 1.5+162-4d1d54d28cb1
> x265 [info]: build info [Linux][GCC 4.7.2][64 bit] 16bpp
> x265 [info]: using cpu capabilities: MMX2 SSE2Slow SlowCTZ
> x265 [info]: Main 10 profile, Level-2 (Main tier)
> x265 [info]: Thread pool created using 2 threads
> x265 [info]: frame threads / pool features       : 1 / wpp(5 rows)
> x265 [info]: Internal bit depth                  : 10
> x265 [info]: Coding QT: max CU size, min CU size : 64 / 8
> x265 [info]: Residual QT: max TU size, max depth : 32 / 1 inter / 1 intra
> x265 [info]: ME / range / subpel / merge         : hex / 57 / 2 / 2
> x265 [info]: Keyframe min / max / scenecut       : 1 / 1 / 40
> x265 [info]: Lookahead / bframes / badapt        : 20 / 0 / 0
> x265 [info]: b-pyramid / weightp / weightb / refs: 0 / 1 / 0 / 3
> x265 [info]: Rate Control / AQ-Strength / CUTree : CRF-28.0 / 1.0 / 1
> x265 [info]: tools: rd=3 psy-rd=0.30 deblock sao signhide tmvp
> x265 [info]: frame I:   2000, Avg QP:33.08  kb/s: 973.45
> x265 [info]: global :   2000, Avg QP:33.08  kb/s: 973.45
> x265 [info]: consecutive B-frames: 100.0%
> 
> encoded 2000 frames in 414.28s (4.83 fps), 973.45 kb/s
> 
> The closest I could find to forcing intra planar32 to be used is -I 1

If you enable DETAILED_CU_STATS it will report the amount of time spent
in intra analysis.

-- 
Steve Borho


More information about the x265-devel mailing list