[x265] x265 CPU utilization very low on a multi-numa sockets server
Pradeep Ramachandran
pradeep at multicorewareinc.com
Tue Jul 28 07:55:22 CEST 2015
My bad in reading the command line - 20 is the fps of the video, and not
the # frames to encode. Ignore that part of the comment above.
Pradeep.
On Tue, Jul 28, 2015 at 11:16 AM, Pradeep Ramachandran <
pradeep at multicorewareinc.com> wrote:
> Ximing Cheng,
> To increase CPU utilization, you will have to run a higher resolution
> video (targeting a higher bit rate) in a more advanced setting for a longer
> duration - a 20 frames of 1080p video encoded with a target bit rate of
> 1200 in 'fast' preset may not be sufficient to fill out 72 contexts. With
> higher resolution videos we do see that heavily threaded machines get
> decent utilization; doesn't hit a 100% all the time but it is better than
> 35%.
>
> Also, I can try reproducing your result if you can you give your complete
> command line - the console output shows that some of the default settings
> have been altered (like frame threads) but your command line doesn't list
> them.
>
> Thanks,
> Pradeep.
>
> On Tue, Jul 28, 2015 at 7:28 AM, Ximing Cheng <chengximing1989 at gmail.com>
> wrote:
>
>> Hi, I am testing x265 with a two numa nodes server, each node has 36
>> cores. The x265 version is 1.7 release with command line
>>
>> ./x265 --input-res 1920x1080 --input input.yuv --bitrate 1200
>> --vbv-maxrate 1380 --fps 20 --early-skip --preset fast -o test1.hevc
>>
>> but when ruuning on the server, CPU utilization ranges from 27% ~ 35% (<
>> 40%) which means most of the CPU cores are not busy.
>>
>> x265 [info]: HEVC encoder version 1.7x265 [info]: build info [Linux][GCC 4.4.6][64 bit] 8bppx265 [info]: using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX AVX2 FMA3 LZCNT BMI2x265 [warning]: --psnr used with AQ on: results will be invalid!x265 [warning]: --tune psnr should be used if attempting to benchmark psnr!x265 [info]: Main profile, Level-4 (Main tier)x265 [info]: Thread pool 0 using 36 threads on NUMA node 0x265 [info]: Thread pool 1 using 36 threads on NUMA node 1x265 [info]: frame threads / pool features : 16 / wpp(34 rows)+pmodex265 [warning]: VBV maxrate specified, but no bufsize, ignoredx265 [info]: Coding QT: max CU size, min CU size : 32 / 8x265 [info]: Residual QT: max TU size, max depth : 32 / 2 inter / 2 intrax265 [info]: ME / range / subpel / merge : star / 57 / 1 / 2x265 [info]: Keyframe min / max / scenecut : 20 / 250 / 40x265 [info]: Lookahead / bframes / badapt : 60 / 4 / 2x265 [info]: b-pyramid / weightp / weightb / refs: 1 / 1 / 1 / 1x265 [info]: AQ: mode / str / qg-size / cu-tree : 1 / 0.3 / 32 / 1x265 [info]: Rate Control / qCompress : ABR-1200 kbps / 0.60x265 [info]: tools: rect amp rd=4 rdoq=2 early-skip signhide tmvp b-intra
>>
>>
>> _______________________________________________
>> x265-devel mailing list
>> x265-devel at videolan.org
>> https://mailman.videolan.org/listinfo/x265-devel
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.videolan.org/pipermail/x265-devel/attachments/20150728/9799cf71/attachment.html>
More information about the x265-devel
mailing list