[x264-devel] unaligned accesses in IA64

Mauricio Alvarez lokifo at gmail.com
Tue Feb 26 13:06:56 CET 2008


Hi,

I'm testing X264 on an Itanium machine . I want to test the parallel
scalability of the threading model implemented in X264.

The machine is like this:
- 64 cpus Dual Core Montecito(IA-64). Each one of the 128 cores works
at 1,6 GHz, with a 8MB L3 cache and 533 MHz Bus.
- 512 GB RAM (8 GB per blade).
- OS: SuSe Linux 2.6.16.27-0.9

I have compiled the code using both gcc (4.1.0) and icc (v 10.0), and
I execute it using the following command line:

./x264 --qp=20 --fps 25 --frames 100 --progress  --threads=${NTHREADS}
-o 1088p25_blue_sky.h264 1088p25_blue_sky.yuv 1920x1088

And, I'm getting these results, with some messages about unaligned accesses:
(these results are for the version compiled with icc)
x264 [info]: using cpu capabilities: none!
x264(8689): unaligned access to 0x607fffffff32ee5c, ip=0x40000000000afb71
x264(8689): unaligned access to 0x607fffffff32ee64, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee6c, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee74, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee7c, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee5c, ip=0x40000000000afb71
x264(8689): unaligned access to 0x607fffffff32ee64, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee6c, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee74, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee7c, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee5c, ip=0x40000000000afb71
x264(8689): unaligned access to 0x607fffffff32ee64, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee6c, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee74, ip=0x40000000000aff31
x264(8689): unaligned access to 0x607fffffff32ee7c, ip=0x40000000000aff31
x264 [info]: slice I:1     Avg QP:17.00  size:293253  PSNR Mean
Y:46.51 U:48.54 V:49.18 Avg:47.16 Global:47.16
x264 [info]: slice P:9     Avg QP:20.00  size: 53870  PSNR Mean
Y:44.51 U:45.50 V:46.23 Avg:44.91 Global:44.91
x264 [info]: mb I  I16..4: 56.6%  0.0% 43.4%
x264 [info]: mb P  I16..4: 25.7%  0.0%  0.4%  P16..4: 24.1% 14.9%
5.1%  0.0%  0.0%    skip:29.7%
x264 [info]: SSIM Mean Y:0.9813082
x264 [info]: PSNR Mean Y:44.709 U:45.806 V:46.523 Avg:45.136
Global:45.090 kb/s:15561.58

encoded 10 frames, 0.81 fps, 15570.36 kb/s

I found a previous thread related to this problem, but seems that
there was no solution to it:
http://www.via.ecp.fr/via/ml/x264-devel/2006-04/msg00069.html

Do you have any suggestion on how to solve this issue? At the end the
video is coded correctly but I suppose that having those unaligned
accesses have a big performance penalty.

Thanks in advance,

Mauricio Alvarez


More information about the x264-devel mailing list