[x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2

dave dtyx265 at gmail.com
Thu Apr 30 22:38:05 CEST 2015


On 04/30/2015 12:43 PM, Steve Borho wrote:
> On 04/30, dave wrote:
>> On 04/30/2015 09:28 AM, Steve Borho wrote:
>>> On 04/30, dave wrote:
>>>> I submitted a patch fixing this but I was only able to test it for 64-bit.
>>>> When trying to build for 32-bit I ran into the following build error.
>>>>
>>>> Linking CXX executable x265
>>>> libx265.so.56: undefined reference to `dlsym'
>>>> libx265.so.56: undefined reference to `dlopen'
>>>> collect2: error: ld returned 1 exit status
>>>> make[2]: *** [x265] Error 1
>>>> make[1]: *** [CMakeFiles/cli.dir/all] Error 2
>>>> make: *** [all] Error 2
>>> do you have a 32bit version of libdl installed?
>>>
>> Yes.
>>
>> ls -l lib32/libdl*
>> -rw-r--r-- 1 root root 9768 Feb 22 03:45 lib32/libdl-2.13.so
>> lrwxrwxrwx 1 root root   13 Feb 22 03:45 lib32/libdl.so.2 -> libdl-2.13.so
> Not sure what's causing this. cmake should be checking for libdl and
> adding that to the library list if it is present. Can you tell if -ldl
> is being passed to the linker?
>
When I toggle cmake to show all options,

LIBDL    LIBDL-NOTFOUND

I see the same for LIBRT and all the NUMA stuff but I am guessing those 
aren't required.

I turned on CMAKE_VERBOSE_MAKEFILE and logged the output, no -ldl or any 
mention of libdl.

I changed LIBDL-NOTFOUND to /lib32/libdl.so.2 and was able to build and 
test.

32-bit..
./test/TestBench --testbench interp | grep luma_h
luma_hpp[  4x4]        2.01x      1867.65       3762.72
luma_hps[  4x4]        1.90x      4550.19       8629.97
luma_hpp[  8x8]        2.38x      6087.71       14482.83
luma_hps[  8x8]        1.95x      11010.22      21510.71
luma_hpp[16x16]        2.71x      23922.61      64745.14
luma_hps[16x16]        2.33x      32951.08      76901.44
luma_hpp[32x32]        2.81x      94440.19      265020.06
luma_hps[32x32]        2.24x      111068.30      248300.09
luma_hpp[64x64]        2.61x      376556.16      983163.88
luma_hps[64x64]        2.19x      402714.62      882300.50
luma_hpp[  8x4]        2.34x      3110.44       7265.88
luma_hps[  8x4]        1.92x      8213.40       15790.23
luma_hpp[  4x8]        2.11x      3627.76       7652.79
luma_hps[  4x8]        1.91x      6084.59       11629.97
luma_hpp[ 16x8]        2.72x      11995.22      32633.96
luma_hps[ 16x8]        2.32x      21585.11      50085.57
luma_hpp[ 8x16]        2.37x      12222.63      29025.33
luma_hps[ 8x16]        1.99x      16649.99      33118.82
luma_hpp[32x16]        2.63x      47409.25      124461.49
luma_hps[32x16]        2.23x      65630.83      146272.73
luma_hpp[16x32]        2.71x      47632.11      128912.72
luma_hps[16x32]        2.34x      55819.18      130416.08
luma_hpp[64x32]        2.60x      189231.47      492174.06
luma_hps[64x32]        2.19x      221716.19      485533.44
luma_hpp[32x64]        2.65x      188249.28      498872.19
luma_hps[32x64]        2.24x      201548.78      451323.59
luma_hpp[16x12]        2.70x      18042.62      48710.91
luma_hps[16x12]        2.35x      27225.15      63847.45
luma_hpp[12x16]        2.31x      18982.63      43757.87
luma_hps[12x16]        1.92x      25584.95      49156.58
luma_hpp[ 16x4]        2.87x      6058.06       17385.17
luma_hps[ 16x4]        2.31x      15944.99      36844.97
luma_hpp[ 4x16]        2.14x      7312.49       15642.68
luma_hps[ 4x16]        1.96x      8990.00       17631.11
luma_hpp[32x24]        2.63x      70876.62      186707.23
luma_hps[32x24]        2.23x      88340.04      196993.19
luma_hpp[24x32]        2.66x      71127.98      189195.44
luma_hps[24x32]        2.26x      83403.52      188429.67
luma_hpp[ 32x8]        2.63x      23754.62      62378.40
luma_hps[ 32x8]        2.23x      42879.46      95483.74
luma_hpp[ 8x32]        2.44x      23913.38      58450.92
luma_hps[ 8x32]        1.98x      28051.65      55436.18
luma_hpp[64x48]        2.61x      282386.12      736332.88
luma_hps[64x48]        2.19x      312061.47      683854.44
luma_hpp[48x64]        2.62x      282449.75      741238.62
luma_hps[48x64]        2.21x      302046.09      667564.06
luma_hpp[64x16]        2.60x      94443.56      245561.27
luma_hps[64x16]        2.19x      130718.59      285776.12
luma_hpp[16x64]        2.73x      94895.83      258934.14
luma_hps[16x64]        2.34x      101004.30      236416.59


thanks on the libdl tip.



More information about the x265-devel mailing list