[x264-devel] x264 thread scalability (was unaligned accesses in IA64)
Mauricio Alvarez
lokifo at gmail.com
Thu Feb 28 18:06:13 CET 2008
On Tue, Feb 26, 2008 at 1:06 PM, Mauricio Alvarez <lokifo at gmail.com> wrote:
> I'm testing X264 on an Itanium machine . I want to test the parallel
> scalability of the threading model implemented in X264.
BTW, Just some numbers of my experiments with the multiprocessor machine:
- icc compiler (-O2)
- input sequence: 1088p25_blue_sky, 100 frames
- x264 command: x264 --qp=20 --bframes 2 --keyint 100 --no-b-adapt
--scenecut=-1 --b-bias=0 --ref 1 --fps 25 --frames 100
--threads=${PTHREAD_NUM_THREADS} -o 1088p25_blue_sky.h264
1088p25_blue_sky.yuv 1920x1088
- time results are average of 10 executions
processors - time [s] - speed-up
1 100.84 1
2 65.08 1.55
4 31.13 3.24
8 18.33 5.5
12 13.22 7.63
16 9.56 10.54
20 9.15 11.02
24 8.15 12.37
28 8.01 12.58
32 6.14 16.43
36 5.93 17.01
40 6.2 16.28
44 5.98 16.87
48 6.22 16.21
52 6.38 15.8
56 6.37 15.83
results in terms of parallel scalability are similar to those reported
in http://www.via.ecp.fr/via/ml/x264-devel/2007-02/msg00064-en.html
but in absolute time I obtaining less performance I think due to the
exceptions that the program is getting due to the unaligned accesses.
regards,
Mauricio A.
More information about the x264-devel
mailing list