Alan Kelly
4aeedf4c2a
libswscale/x86/yuv2yuvX: Removes unrolling for mmx and mmxext
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
(cherry picked from commit 3ce8d09244
)
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
4 years ago
James Almer
1a555d3c60
swscale/x86/yuv2yuvX: use the movsxdifnidn helper macro
Simplifies code
Signed-off-by: James Almer <jamrial@gmail.com>
4 years ago
James Almer
ebb48d85a0
swscale/x86/yuv2yuvX: use movq to load 8 bytes in all non-AVX2 functions
mova expands to movq on non-XMM functions
Signed-off-by: James Almer <jamrial@gmail.com>
4 years ago
James Almer
d512ebbaed
swscale/x86/yuv2yuvX: use the SPLATW helper macro
Simplifies code
Signed-off-by: James Almer <jamrial@gmail.com>
4 years ago
James Almer
c2bf1dcace
swscale/x86/swscale: fix compilation with old yasm
Where AVX2 may not be supported.
Signed-off-by: James Almer <jamrial@gmail.com>
4 years ago
Alan Kelly
554c2bc708
swscale: move yuv2yuvX_sse3 to yasm, unrolls main loop
And other small optimizations for ~20% speedup.
4 years ago