[x265] [PATCH] asm: dct8 sse2 1.88x improvement over c code

Steve Borho steve at borho.org
Fri Feb 20 17:49:29 CET 2015


On Fri, Feb 20, 2015 at 10:06 AM, dave <dtyx265 at gmail.com> wrote:
> On 02/19/2015 04:58 PM, Steve Borho wrote:
>>
>> On 02/19, dtyx265 at gmail.com wrote:
>>>
>>> # HG changeset patch
>>> # User David T Yuen <dtyx265 at gmail.com>
>>> # Date 1424385856 28800
>>> # Node ID 28287b57013e9c43488bfba1570ded5cfb4af16d
>>> # Parent  039ea966d5ebccab1de2c3766fb7b4f125d2020a
>>> asm: dct8 sse2 1.88x improvement over c code
>>>
>>> This is backported from dct8 sse4
>>
>> it would be helpful for reviewers if you could tell us which CPUs this
>> will help and by how much.
>
> This is intended for cpus that support up to sse3 and until an ssse3 dct8
> primitive is developed, also ssse3.  The following is from testing on my
> sse3 system
>
> dct8x8            1.88x      11202.50      21044.87
>
>> If your CPU does not have SSE4 would the
>> encoder use the C reference if this primitive were not present?
>
> Yes, this replaces the C reference on systems with sse2 to ssse3.

thanks, queued with this info added to the commit message

-- 
Steve Borho


More information about the x265-devel mailing list