[vlc-devel] [PATCH 3/6] copy: remove need for cache memory in SSE routines
Jean-Baptiste Kempf
jb at videolan.org
Sat Jun 14 15:33:00 CEST 2014
On 13 Jun, Jean-Yves Avenard wrote :
> SSE code used a 16-bytes aligned memory buffer to perform memory operations.
> Rewrite code to directory work on source and destination frame, regardless of memory alignment.
>
> This speeds up NV12->YV12 conversion by over 100% by halving the number of memory writes and reads.
So, you remove the CopyFromUswc part, right?
Is that not too slow when using DxVA?
https://software.intel.com/en-us/articles/copying-accelerated-video-decode-frame-buffers
--
Jean-Baptiste Kempf
http://www.jbkempf.com/ - +33 672 704 734
Sent from my Electronic Device
More information about the vlc-devel
mailing list