April 2015 Archives by date
Starting: Wed Apr 1 00:23:50 CEST 2015
Ending: Thu Apr 30 22:38:05 CEST 2015
Messages: 612
- [x265] [PATCH] cmake: support PGO and NATIVE_BUILD for Intel C++ (icpc)
Steve Borho
- [x265] [PATCH] cli: move raw bitstream output to separate file
Xinyue Lu
- [x265] [PATCH] cli: move raw bitstream output to separate file
Steve Borho
- [x265] [PATCH] asm:intra_pred_ang4_2 improved by ~4% 134.99 -> 129.95
chen
- [x265] [PATCH 1 of 3] asm: reduce binary size
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 3] asm: intra_pred_ang32_21 improved by ~27% over SSE4, 3439.25c -> 2504.30c
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 3] asm: intra_pred_ang32_22 improved by ~5% over AVX2, 2308.11c -> 2207.80c
praveen at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for dst4x4
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for idst4x4
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] frameencoder: increase refLagRows by one when subme has multiple hpel iteration steps
santhoshini at multicorewareinc.com
- [x265] [PATCH] replace for loops with memcpy
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] x86inc: alignment all of const to 32-bytes
Min Chen
- [x265] Basic understanding
Apurw Potuwar
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 8bpp code for convert_p2s[4xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[8xN], convert_p2s[16xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[32xN],[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 code for chroma_p2s for i420, i422, i444, reuse the luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Divya Manivannan
- [x265] [PATCH] asm: sse4 chroma_p2s[4x2](2.29x), ssse3 chroma_p2s[8x2](3.60x) for i420
rajesh at multicorewareinc.com
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
Rajesh Paulraj
- [x265] [PATCH] frameencoder: increase refLagRows by one when subme has multiple hpel iteration steps
Steve Borho
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
rajesh at multicorewareinc.com
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
Steve Borho
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
Steve Borho
- [x265] [PATCH] asm:intra_pred_ang4_2 improved by ~4% 134.99 -> 129.95
Steve Borho
- [x265] [PATCH 0 of 7 ] asm: nits and tweaks for intra_pred_ang4_2-9 sse2
dtyx265 at gmail.com
- [x265] [PATCH 1 of 7] asm:intra_pred_ang4_3_sse2 improved ~4.5% 684.95 -> 654.99 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 2 of 7] asm:intra_pred_ang4_4_sse2 improved ~3% 642.49 -> 624.99 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 3 of 7] asm: intra_pred_ang4_5_sse2 improved ~2.5% 642.50 -> 627.50 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 4 of 7] asm: intra_pred_ang4_6_sse2 improved ~3% 612.49 -> 592.50 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 5 of 7] asm: intra_pred_ang4_7_sse2 improved ~6.5% 634.99 -> 592.50 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 6 of 7] asm: intra_pred_ang4_8_sse2 improved ~5% 609.99 -> 577.50 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 7 of 7] asm: intra_pred_ang4_9_sse2 improved ~5% 605.00 -> 572.56 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 0 of 7 ] asm: nits and tweaks for intra_pred_ang4_2-9 sse2
Steve Borho
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Steve Borho
- [x265] [PATCH] asm: intra_pred_ang4_5_sse2 improved ~2.5% 642.50 -> 627.50 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH 1 of 3] api: comment nit
Steve Borho
- [x265] [PATCH 2 of 3] api: remove public funcdef and export for deprecated x265_setup_primitives()
Steve Borho
- [x265] [PATCH 3 of 3] api: introduce x265_api_get()
Steve Borho
- [x265] [PATCH 00 of 11 ] asm:intra pred 4x4 modes 10-18, 26 and transposed modes
dtyx265 at gmail.com
- [x265] [PATCH 01 of 11] asm: intra_pred_ang4_10_sse2
dtyx265 at gmail.com
- [x265] [PATCH 02 of 11] asm: intra_pred_ang4_11_sse2
dtyx265 at gmail.com
- [x265] [PATCH 03 of 11] asm: intra_pred_ang4_12_sse2
dtyx265 at gmail.com
- [x265] [PATCH 04 of 11] asm: intra_pred_ang4_13_sse2
dtyx265 at gmail.com
- [x265] [PATCH 05 of 11] asm: intra_pred_ang4_14_sse2
dtyx265 at gmail.com
- [x265] [PATCH 06 of 11] asm: intra_pred_ang4_15_sse2
dtyx265 at gmail.com
- [x265] [PATCH 07 of 11] asm: intra_pred_ang4_16_sse2
dtyx265 at gmail.com
- [x265] [PATCH 08 of 11] asm: intra_pred_ang4_17_sse2
dtyx265 at gmail.com
- [x265] [PATCH 09 of 11] asm: intra_pred_ang4_18_sse2
dtyx265 at gmail.com
- [x265] [PATCH 10 of 11] asm: intra_pred_ang4_26_sse2
dtyx265 at gmail.com
- [x265] [PATCH 11 of 11] asm: intra pred 4x4 modes 19-25 and 27-33
dtyx265 at gmail.com
- [x265] [PATCH 3 of 7] asm: intra_pred_ang4_5_sse2 improved ~2.5% 642.50 -> 627.50 with nits and tweaks
dave
- [x265] [PATCH 3 of 7] asm: intra_pred_ang4_5_sse2 improved ~2.5% 642.50 -> 627.50 with nits and tweaks
Steve Borho
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
chen
- [x265] [PATCH 1 of 7] asm:intra_pred_ang4_3_sse2 improved ~4.5% 684.95 -> 654.99 with nits and tweaks
chen
- [x265] [PATCH 4 of 7] asm: intra_pred_ang4_6_sse2 improved ~3% 612.49 -> 592.50 with nits and tweaks
chen
- [x265] [PATCH 1 of 7] asm: intra_pred_ang32_23 improved by ~10% over AVX2, 1925.55c -> 1738.47c
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 7] asm: intra_pred_ang32_24 improved by ~5% over AVX2
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 7] asm: cleanup unused constant table
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 7] asm: intra_pred_ang4_27 improved by ~35% over SSE4, 146.71c -> 95.03c
praveen at multicorewareinc.com
- [x265] [PATCH 5 of 7] asm: intra_pred_ang4_28 improved by ~38% over SSE4, 152.87c -> 94.98c
praveen at multicorewareinc.com
- [x265] [PATCH 6 of 7] asm: intra_pred_ang4_29 improved by ~45% over SSE4, 157.78c -> 88.14c
praveen at multicorewareinc.com
- [x265] [PATCH 7 of 7] asm: intra_pred_ang4_30 improve by ~38% over SSE4, 160.00c -> 99.99c
praveen at multicorewareinc.com
- [x265] [PATCH 00 of 11 ] asm:intra pred 4x4 modes 10-18, 26 and transposed modes
chen
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Divya Manivannan
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Divya Manivannan
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
Steve Borho
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
chen
- [x265] [PATCH] cli: fill pts if source does not provide one
Xinyue Lu
- [x265] [PATCH] asm: avx2 code for weight_sp() for 8bpp
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for weight_sp() for 8bpp
chen
- [x265] [PATCH] asm: avx2 code for weight_sp() for 8bpp
Sumalatha Polureddy
- [x265] cli: unify x265 log function
Xinyue Lu
- [x265] [PATCH 01 of 11] asm: intra_pred_ang4_31, improved by ~43% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 02 of 11] asm: intra_pred_ang4_32 improved by ~47% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 03 of 11] asm: intra_pred_ang4_33 improved by ~44% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 04 of 11] asm: intra_pred_ang4_24 improved by ~38% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 05 of 11] asm: intra_pred_ang4_25 improved ~37% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 06 of 11] asm: intra_pred_ang4_23 improved by ~48 over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 07 of 11] asm: intra_pred_ang4_22 improved by ~46% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 08 of 11] asm: intra_pred_ang4_21 improved by ~50% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 09 of 11] asm: intra_pred_ang4_20 improved by ~52% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 10 of 11] asm: intra_pred_ang4_19 improved by ~60% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 11 of 11] asm: reduce code size with macro 'INTRA_PRED_STORE_4x4'
praveen at multicorewareinc.com
- [x265] [PATCH] asm: saoCuOrgE0 avx2 code: 756c->629c
Divya Manivannan
- [x265] [PATCH] asm: reduce count of register in calSign, 213c -> 202c
Min Chen
- [x265] [PATCH 1 of 7] asm:intra_pred_ang4_3_sse2 improved ~4.5% 684.95 -> 654.99 with nits and tweaks
dave
- [x265] [PATCH 00 of 11 ] asm:intra pred 4x4 modes 10-18, 26 and transposed modes
dave
- [x265] [PATCH Review only] primivites: rename luma_p2s to convert_p2s and move into PU
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: intra_pred_ang4_6_sse2 improved ~2% 622.50 -> 609.99 with nits and tweaks
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra_pred_ang4_6_sse2 improved ~2% 622.50 -> 609.99 with nits and tweaks
Steve Borho
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
Deepthi Nandakumar
- [x265] [PATCH 1 of 2] api: introduce x265_api_get()
Steve Borho
- [x265] [PATCH 2 of 2] cli: use multi-lib APIs as a (weak) demonstration
Steve Borho
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Steve Borho
- [x265] cli: unify x265 log function
Steve Borho
- [x265] [PATCH Review only] primivites: rename luma_p2s to convert_p2s and move into PU
Steve Borho
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Steve Borho
- [x265] [ANN] x265 1.6 is released
Steve Borho
- [x265] [PATCH] asm: remove duplicate constant pw_256 and alignment nits
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH for review] cli: Timebase, PTS and output module
Xinyue Lu
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Divya Manivannan
- [x265] [PATCH] asm: avx2 code for intrapred_planar16x16
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for intra_planar_32x32
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for intra_dc_32x32
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH 1 of 9] asm: intra_pred_ang4_17 improved by ~57% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 9] asm: intra_pred_ang4_16 improved by ~49% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 9] asm: intra_pred_ang4_15 improved by ~53% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 9] asm: intra_pred_ang4_14 improved by ~43% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 5 of 9] asm: intra_pred_ang4_13 improved by ~43% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 6 of 9] asm: intra_pred_ang4_12 improved by ~35% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 7 of 9] asm: intra_pred_ang4_11 improved by ~31% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 8 of 9] asm: intra_pred_ang4_9 improved by ~35% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 9 of 9] asm: reduce code size with macro 'INTRA_PRED_TRANS_STORE_4x4'
praveen at multicorewareinc.com
- [x265] [PATCH 1 of 3] asm: general calSign to accelerate sao
Min Chen
- [x265] [PATCH 2 of 3] asm: reduce 1 register in quant_avx2
Min Chen
- [x265] [PATCH 3 of 3] improve fillReferenceSamples by merge pixel fill
Min Chen
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 8bpp code for convert_p2s[4xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[8xN], convert_p2s[16xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[32xN],[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 code for chroma_p2s for i420, i422, i444, reuse the luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 chroma_p2s[4x2](2.29x), ssse3 chroma_p2s[8x2](3.60x) for i420
rajesh at multicorewareinc.com
- [x265] [PATCH for review] cli: Timebase, PTS and output module
Steve Borho
- [x265] [PATCH 1 of 3] asm: general calSign to accelerate sao
Steve Borho
- [x265] [PATCH 1 of 9] asm: intra_pred_ang4_17 improved by ~57% over SSE4
Steve Borho
- [x265] [PATCH] asm: avx2 code for intra_dc_32x32
Steve Borho
- [x265] [PATCH] primivites: rename luma_p2s to convert_p2s and move into PU
Steve Borho
- [x265] [PATCH 00 of 18 ] asm:intra_pred_ang4 16 bit all modes
dtyx265 at gmail.com
- [x265] [PATCH 01 of 18] asm: intra_pred_ang4_2_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 02 of 18] asm: intra_pred_ang4_3_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 03 of 18] asm: intra_pred_ang4_4_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 04 of 18] asm: intra_pred_ang4_5_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 05 of 18] asm: intra_pred_ang4_6_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 06 of 18] asm: intra_pred_ang4_7_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 07 of 18] asm: intra_pred_ang4_8_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 08 of 18] asm: intra_pred_ang4_9_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 09 of 18] asm: intra_pred_ang4_10_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 10 of 18] asm: intra_pred_ang4_26_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 11 of 18] asm: intra_pred_ang4_11_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 12 of 18] asm: intra_pred_ang4_12_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 13 of 18] asm: intra_pred_ang4_13_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 14 of 18] asm: intra_pred_ang4_14_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 15 of 18] asm: intra_pred_ang4_15_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 16 of 18] asm: intra_pred_ang4_16_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 17 of 18] asm: intra_pred_ang4_17_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 18 of 18] asm: intra_pred_ang4_18_sse2 16-bit
dtyx265 at gmail.com
- [x265] [PATCH 00 of 18 ] asm:intra_pred_ang4 16 bit all modes
Steve Borho
- [x265] [PATCH 00 of 18 ] asm:intra_pred_ang4 16 bit all modes
Steve Borho
- [x265] [PATCH for review] cli: Timebase, PTS and output module
Xinyue Lu
- [x265] [PATCH 00 of 18 ] asm:intra_pred_ang4 16 bit all modes
dave
- [x265] [PATCH for review] cli: Timebase, PTS and output module
Steve Borho
- [x265] [PATCH] encoder: do not disable the thread pool if lookahead-slices is enabled
Steve Borho
- [x265] [PATCH 02 of 18] asm: intra_pred_ang4_3_sse2 16-bit
chen
- [x265] [PATCH 10 of 18] asm: intra_pred_ang4_26_sse2 16-bit
chen
- [x265] [PATCH 18 of 18] asm: intra_pred_ang4_18_sse2 16-bit
chen
- [x265] [PATCH 00 of 18 ] asm:intra_pred_ang4 16 bit all modes
chen
- [x265] [PATCH 10 of 18] asm: intra_pred_ang4_26_sse2 16-bit
Steve Borho
- [x265] [PATCH] asm:intra_pred4_x filtering
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra_pred_ang4_26_sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra_pred_ang4_18
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra_pred_ang4_18
Steve Borho
- [x265] [PATCH] asm: intra_pred_ang4_26_sse2
chen
- [x265] [PATCH] asm: intra_pred_ang4_26_sse2
dtyx265 at gmail.com
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
deepthi at multicorewareinc.com
- [x265] [PATCH] AQ/VBV: separate VBV cost accumulation from QP averaging for AQ
deepthi at multicorewareinc.com
- [x265] [PATCH] AQ/VBV: separate VBV cost accumulation from QP averaging for AQ
Deepthi Nandakumar
- [x265] [PATCH] api: add --allow-non-conformance param, default to False
Steve Borho
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
Steve Borho
- [x265] [PATCH] asm: luma_hps[12x16] avx2 - improved 3779c->2482c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: luma_hps[24x32] avx2 - improved 11545c->6843c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hps[24x32] avx2 - improved 4458c->3583c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: luma_hvpp[16x16] - 11.39x 5226c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
Deepthi Nandakumar
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[12x16](9.82x), convert_p2s[24x32](13.61x)
rajesh at multicorewareinc.com
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
Deepthi Nandakumar
- [x265] [PATCH] asm: improve the old avx2 code for sad[32x24]
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 6] asm: intra_pred_ang4_8 improved by ~24% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 6] asm: intra_pred_ang4_7 improved by ~42% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 6] asm: intra_pred_ang4_6 improved by ~36% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 6] asm: intra_pred_ang4_5 improved by ~41% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 5 of 6] asm: intra_pred_ang4_4 improved by ~44% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 6 of 6] asm: intra_pred_ang4_3 improved by ~41% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH] sao: modify C and SSE4 code for saoCuOrgE0 to process 2 rows
Divya Manivannan
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[12x16](9.82x), convert_p2s[24x32](13.61x)
Rajesh Paulraj
- [x265] [PATCH] asm: saoCuOrgE0 avx2 code: 756c->629c
Divya Manivannan
- [x265] [PATCH 1 of 2] aq: implementation of fine-grained adaptive quantization
deepthi at multicorewareinc.com
- [x265] [PATCH 2 of 2] aq: add cost of sub-LCU level QP to RD costs
deepthi at multicorewareinc.com
- [x265] [PATCH] asm: improve the old avx2 code for sad[64x64]
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: improve old avx2 code for sad[64x48]
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 3] fix count of shift overflow bug in Quant::getSigCoeffGroupCtxInc
Min Chen
- [x265] [PATCH 2 of 3] improve rdoQuant by more parameters on getSigCoeffGroupCtxInc and calcPatternSigCtx
Min Chen
- [x265] [PATCH] improve rdoQuant by reduce type convert and condition check
Min Chen
- [x265] [PATCH] asm: ssse3 8bpp code for convert_p2s[12xN], [24xN], [48x64]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 8bpp code for chroma_p2s[6xN] for i420, i422
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for chroma_p2s[8x6](4.74x) for i420
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 8bpp code for chroma_p2s i422, reuse luma code
rajesh at multicorewareinc.com
- [x265] [PATCH for review] cli: annex_b format switch
Steve Borho
- [x265] [PATCH 1 of 2] aq: implementation of fine-grained adaptive quantization
Steve Borho
- [x265] [PATCH 1 of 2] aq: implementation of fine-grained adaptive quantization
Steve Borho
- [x265] [PATCH for review] cli: annex_b format switch
Steve Borho
- [x265] [PATCH] cli: rewrite pts_queue to use new/delete
Xinyue Lu
- [x265] [PATCH] cli: rewrite pts_queue to use new/delete
Steve Borho
- [x265] [PATCH] level: allow unbounded level 8.5 to be used for lossless encodes
Steve Borho
- [x265] [PATCH] asm: improve avx2 code for add_ps[32x32] (1428 -> 1312)
sumalatha at multicorewareinc.com
- [x265] Parameter --qgsize not functional?
Mario *LigH* Rohkrämer
- [x265] Parameter --qgsize not functional?
Deepthi Nandakumar
- [x265] Parameter --qgsize not functional?
Mario *LigH* Rohkrämer
- [x265] [PATCH algorithm modify] simplify coeff group clear when CG decide to not encode
Min Chen
- [x265] [PATCH 1 of 7] asm: intra_pred_ang16_11 improved by ~27% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 7] asm: fix 32-bit build issue
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 7] asm: intra_pred_ang16_9 improved by ~28% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 7] asm: intra_pred_ang16_8 improved by ~28% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 5 of 7] asm: intra_pred_ang16_7 improved by ~22% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 6 of 7] asm: optimize code size with macro 'INTRA_PRED_ANG16_CAL_ROW'
praveen at multicorewareinc.com
- [x265] [PATCH 7 of 7] asm: optimize buffer address using registers
praveen at multicorewareinc.com
- [x265] [PATCH] asm: avx2 8bpp code for convert_p2s[32xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 8bpp code for convert_p2s[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] [PATCH for review] cli: annex_b format switch
Steve Borho
- [x265] [PATCH 2 of 7] asm: fix 32-bit build issue
Steve Borho
- [x265] [PATCH] asm: avx2 8bpp code for convert_p2s[64xN]
Steve Borho
- [x265] [PATCH algorithm modify] simplify coeff group clear when CG decide to not encode
Steve Borho
- [x265] [PATCH] rc-test: revise rc test cases - use more test clips, additional rc features
aarthi at multicorewareinc.com
- [x265] [PATCH] rc-test: revise rc test cases - use more test clips, additional rc features
Steve Borho
- [x265] [PATCH] aq: implementation of fine-grained adaptive quantization
Steve Borho
- [x265] [PATCH] sao: add C and sse4 code of saoCuOrgE1 to process 2 rows
Divya Manivannan
- [x265] [PATCH 2 of 2] aq: add cost of sub-LCU level QP to RD costs
Deepthi Nandakumar
- [x265] [PATCH] search: add RDcost measurement of DeltaQP to lower rdLevels
deepthi at multicorewareinc.com
- [x265] [PATCH] asm: saoCuOrgE1 avx2 code: 403c->331c
Divya Manivannan
- [x265] [PATCH] asm: avx2 version convert_p2s[48x64], 4069c -> 3043c
Min Chen
- [x265] [PATCH] asm: saoCuOrgE1_2Rows avx2 code: 657c->525c
Divya Manivannan
- [x265] [PATCH] asm: improve avx2 code sub_ps[32x32] 1402 -> 1360
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: improve avx2 code sub_ps[32x32] 1402 -> 1360
chen
- [x265] [PATCH 1 of 4] asm: intra_pred_ang16_6 improved by ~19% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 4] asm: intra_pred_ang16_5 improved by ~16% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 4] asm: intra_pred_ang16_4 improved by ~23% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 4] asm: intra_pred_ang16_3 improved by ~25% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH] optimize maxAbsLevel cost compute logic in rdoQuant()
Min Chen
- [x265] [PATCH] asm: improve avx2 8bpp code for convert_p2s[32xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: improve avx2 8bpp code for convert_p2s[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH 1 of 2] rc: separate frame bits predictor objects for BRef and B frames
aarthi at multicorewareinc.com
- [x265] [PATCH 2 of 2] rc: tune initial predictor values for better frame size predictions in vbv lookahead
aarthi at multicorewareinc.com
- [x265] [PATCH 1 of 2] rc: separate frame bits predictor objects for BRef and B frames
Steve Borho
- [x265] [PATCH 2 of 2] rc: tune initial predictor values for better frame size predictions in vbv lookahead
Steve Borho
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] Next steps on my road map
Xinyue Lu
- [x265] [PATCH for review] cli: annex_b format switch
Steve Borho
- [x265] Next steps on my road map
Steve Borho
- [x265] Next steps on my road map
Tom Vaughan
- [x265] Next steps on my road map
Xinyue Lu
- [x265] [PATCH 2 of 2] rc: tune initial predictor values for better frame size predictions in vbv lookahead
Aarthi Priya Thirumalai
- [x265] [PATCH 1 of 2] rc: separate frame bits predictor objects for BRef and B frames
Aarthi Priya Thirumalai
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] [PATCH] optimize c1c2 context set update logic in rdoQuant
Min Chen
- [x265] [PATCH 1 of 9] asm: intra_pred_ang16_12 improved by ~20% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 9] asm: intra_pred_ang16_13 improved by ~9% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 3 of 9] asm: correct register count
praveen at multicorewareinc.com
- [x265] [PATCH 4 of 9] asm: intra_pred_ang8_13 improved by ~16% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 5 of 9] asm: intra_pred_ang8_14 improved by ~15% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 6 of 9] asm: intra_pred_ang8_15 improved by ~5% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 7 of 9] asm: intra_pred_ang8_23 improved by ~18% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 8 of 9] asm: intra_pred_ang8_22 improved by ~14% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 9 of 9] asm: intra_pred_ang8_21 improved by ~5% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH] asm: sse4 8bpp code for chroma_p2s[2xN]
rajesh at multicorewareinc.com
- [x265] [PATCH for review] cli: annex_b format switch
Steve Borho
- [x265] [PATCH] cmake: do not allow full path of libnuma to be used in x265.pc
Steve Borho
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] intrapred8_allangs.asm
dave
- [x265] [PATCH] asm: avx2 code for planecopy_sp
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 8bpp code for convert_p2s[24xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 8bpp code for chroma_p2s[32xN], [24xN], reuse the luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx code for chroma sa8d, reused luma code
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 2] change costUncoded[] coordinate system from Raster to Zigzag
Min Chen
- [x265] [PATCH 2 of 2] nits: CRLF convert
Min Chen
- [x265] [PATCH] avoid calculate rateIncUp and rateIncDown when sigHide disabled
Min Chen
- [x265] [PATCH] asm: saoCuOrgB0 avx2 code: 23780c->18441c
Divya Manivannan
- [x265] [PATCH 1 of 2] asm: intra_pred_ang8_20 improved by ~4% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 2 of 2] asm: intra_pred_ang8_16 improved by ~3% over SSE4
praveen at multicorewareinc.com
- [x265] intrapred8_allangs.asm
Steve Borho
- [x265] [PATCH] avoid calculate rateIncUp and rateIncDown when sigHide disabled
Steve Borho
- [x265] [PATCH] avoid calculate rateIncUp and rateIncDown when sigHide disabled
chen
- [x265] [PATCH] avoid calculate rateIncUp and rateIncDown when sigHide disabled
chen
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
chen
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
dave
- [x265] [PATCH 1 of 2] docs: mention that Windows NUMA APIs require Windows 7 build target
Steve Borho
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Steve Borho
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
chen
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
chen
- [x265] [PATCH] asm: improve avx2 code sub_ps[32x32] 1402 -> 1360
Sumalatha Polureddy
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Mario *LigH* Rohkrämer
- [x265] [PATCH for review] cli: annex_b format switch
Mario *LigH* Rohkrämer
- [x265] [PATCH for review] cli: annex_b format switch
Mario *LigH* Rohkrämer
- [x265] [PATCH for review] cli: annex_b format switch
Xinyue Lu
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Xinyue Lu
- [x265] [PATCH] simplify rdoQuant() logic on ctxSet
Min Chen
- [x265] [PATCH] analysis: re-order RD 0/4 analysis to do splits before ME or intra
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: at RD 0/4 avoid motion references if not used by split blocks
ashok at multicorewareinc.com
- [x265] [PATCH] stats: profile effectiveness of reference limit masks
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: skip intra in RD 0/4 if split was analyzed and no split CUs used intra
ashok at multicorewareinc.com
- [x265] [PATCH] stats: RD 0/4 profile effectiveness of avoiding intra if split CUs did not select it
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: model the effectiveness of --limit-ref with RD 0/4
ashok at multicorewareinc.com
- [x265] [PATCH] analysis: respect X265_REF_LIMIT_DEPTH with RD 0/4
ashok at multicorewareinc.com
- [x265] [PATCH] cli: connect --limit-refs to param.limitReferences
ashok at multicorewareinc.com
- [x265] [PATCH] stats: with the CU reference limit, even 8x8 can have skipped motion searches
ashok at multicorewareinc.com
- [x265] [PATCH] asm: intra_pred_ang32_18 improved by ~44% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH] asm: intra_pred_ang32_18 improved by ~44% over SSE4
chen
- [x265] [PATCH 1 of 2] api: add SMPTE ST 2086 mastering display color metadata
Steve Borho
- [x265] [PATCH 2 of 2] doc: clarify that --pools strings might need shell escaping (closes #121)
Steve Borho
- [x265] [PATCH] cli: add an output preview feature, activated by --recon-y4m-exec
Steve Borho
- [x265] [PATCH] cmake: specify default Windows target O/S as Win7, to enable NUMA APIs
Steve Borho
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Steve Borho
- [x265] [PATCH] simplify rdoQuant() logic on ctxSet
Steve Borho
- [x265] [PATCH] asm: improve avx2 code sub_ps[32x32] 1402 -> 1360
Steve Borho
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Tim W.
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: intra pred all_angs_pred_4x4 sse2
chen
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Steve Borho
- [x265] [PATCH] sao: add saoCuOrgE3_2Rows function to process 2 rows
Divya Manivannan
- [x265] [PATCH] disable SIGPIPE on Windows platform
Min Chen
- [x265] [PATCH] disable SIGPIPE on Windows platform
Xinyue Lu
- [x265] [PATCH] asm: intra_pred_ang32_18 improved by ~45% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH] asm: improve sub_ps[16x16] (477 -> 461) and reduce code size
sumalatha at multicorewareinc.com
- [x265] [PATCH] disable SIGPIPE on Windows platform
Deepthi Nandakumar
- [x265] [PATCH] disable SIGPIPE on Windows platform
chen
- [x265] [PATCH] asm: avx2 code for satd_32xN
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx code for chroma copy_ss 32x64, reused luma code (2616 -> 1313)
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 3] improve rdoQuant() by reduce count of code group scan
Min Chen
- [x265] [PATCH 2 of 3] improve rdoQuant() by use non-zero coeff group mask to reduce count of coeff scan
Min Chen
- [x265] [PATCH 3 of 3] improve rdoQuant() by block fill on non-zero coeff group
Min Chen
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[4xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[8xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[16xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[32xN],[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[24xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: improve algorithm logic on saoCuOrgE3
Min Chen
- [x265] [PATCH] stats: with the CU reference limit, even 8x8 can have skipped motion searches
Steve Borho
- [x265] [PATCH 0 of 3 ] asm: chroma_hpp for i422
aasaipriya at multicorewareinc.com
- [x265] [PATCH 1 of 3] asm: chroma_hpp[12x32] for i422 - improved 2997c->2295c
aasaipriya at multicorewareinc.com
- [x265] [PATCH 2 of 3] asm: chroma_hpp[24x64] for i422 - improved 9272c->8212c
aasaipriya at multicorewareinc.com
- [x265] [PATCH 3 of 3] asm: chroma_hpp[2x16] for i422 - improved 595c->500c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: saoCuOrgE3 avx2 code: 502c->473c
Divya Manivannan
- [x265] [PATCH] asm: improve algorithm logic on saoCuOrgE3
Deepthi Nandakumar
- [x265] [PATCH 0 of 3 ] asm: chroma_hpp for i422
Deepthi Nandakumar
- [x265] [PATCH] asm: improve algorithm logic on saoCuOrgE3
Deepthi Nandakumar
- [x265] [PATCH 0 of 3 ] asm: chroma_hpp for i422
Aasaipriya Chandran
- [x265] [PATCH] asm: avx2 code for chroma add_ps, reused luma code
sumalatha at multicorewareinc.com
- [x265] [PATCH 2 of 2] cmake: add build option for Windows to target Win7 to enable NUMA APIs
Tim W.
- [x265] [PATCH] asm: add macro to sub_ps module to reduce code size
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for chroma sub_ps module, reused luma code
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for convert_p2s[12xN], [48x64]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: ssse3 10bit code for chroma_p2s[4x2], [8x2], [8x6]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 10bit code for chroma_p2s[6xN] for i420, i422
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse4 10bit code for chroma_p2s[2xN] for i420, i422
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: sse version 10bit code for chroma_p2s, reuse luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: new optimized algorithm for satd, improved ~30% over previous algorithm
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for satd_16xN, improved over ~70% than SSE code
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hps[64x64, 64x48, 64x32, 64x16] for i444 - improved 21540c->14767c, 18551c->14129c, 17096c->12742c, 6216c->3923c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hpp i422[4xN, 8xN, 16xN, 32xN]
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for satd_48x64 and 64xN, improved over ~100% than SSE
dnyaneshwar at multicorewareinc.com
- [x265] Fwd: [PATCH] asm: avx2 code for satd_48x64 and 64xN, improved over ~100% than SSE
Praveen Tiwari
- [x265] Fwd: [PATCH] asm: avx2 code for satd_48x64 and 64xN, improved over ~100% than SSE
Dnyaneshwar Gorade
- [x265] [PATCH] asm: avx2 10bit code for convert_p2s[16xN]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 10bit code for convert_p2s[32xN],[64xN]
rajesh at multicorewareinc.com
- [x265] [PATCH 0 of 8 ] asm: interp_4tap_horiz_pp
dtyx265 at gmail.com
- [x265] [PATCH 1 of 8] asm: interp_4tap_horiz_pp_2x4_sse3
dtyx265 at gmail.com
- [x265] [PATCH 2 of 8] asm: interp_4tap_horiz_pp_2x8_sse3
dtyx265 at gmail.com
- [x265] [PATCH 3 of 8] asm: interp_4tap_horiz_pp_2x16_sse3
dtyx265 at gmail.com
- [x265] [PATCH 4 of 8] asm: interp_4tap_horiz_pp_4x2_sse3
dtyx265 at gmail.com
- [x265] [PATCH 5 of 8] asm: interp_4tap_horiz_pp_4x4_sse3
dtyx265 at gmail.com
- [x265] [PATCH 6 of 8] asm: interp_4tap_horiz_pp_4x8_sse3
dtyx265 at gmail.com
- [x265] [PATCH 7 of 8] asm: interp_4tap_horiz_pp_4x16_sse3
dtyx265 at gmail.com
- [x265] [PATCH 8 of 8] asm: interp_4tap_horiz_pp_4x32_sse3
dtyx265 at gmail.com
- [x265] [PATCH 0 of 8 ] asm: interp_4tap_horiz_pp
chen
- [x265] [PATCH] asm: chroma_hpp[64x64, 64x48, 64x32, 64x16] for i444 - improved 22990c->14176c, 17897c->10791c, 12050c->7186c, 5655c->3266c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: avx2 10bit code for convert_p2s[24xN],[48x64]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for chroma addAvg for all partitions
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 10bit code for chroma_p2s[16xN], [24xN], [32xN], reuse luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hpp[4xN, 8xN, 16xN, 32xN, 12x16, 24x32] for i444
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hps[4xN, 8xN, 16xN, 32xN, 2x8]
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: chroma_hps[4xN, 8xN, 16xN, 32xN, 24x32] for i444
aasaipriya at multicorewareinc.com
- [x265] [PATCH] asm: intra_allangs4x4 improved by ~61% over SSE4
praveen at multicorewareinc.com
- [x265] [PATCH 1 of 5] rdoQuant: improve encoder ~3.5% by modify all zero-coeff group scan and cost compute logic
Min Chen
- [x265] [PATCH 2 of 5] rdoQuant: fast zero cost compute path
Min Chen
- [x265] [PATCH 3 of 5] rdoQuant: improve coeff group block clean code
Min Chen
- [x265] [PATCH 4 of 5] rdoQuant: fast zero-coeff path
Min Chen
- [x265] [PATCH 5 of 5] rdoQuant: move cgRdStats.sigCost0 outside from loop
Min Chen
- [x265] [PATCH 0 of 8 ] asm: interp_4tap_horiz_pp
dtyx265 at gmail.com
- [x265] [PATCH 1 of 8] asm: interp_4tap_horiz_pp_2x4_sse3
dtyx265 at gmail.com
- [x265] [PATCH 2 of 8] asm: interp_4tap_horiz_pp_2x8_sse3
dtyx265 at gmail.com
- [x265] [PATCH 3 of 8] asm: interp_4tap_horiz_pp_2x16_sse3
dtyx265 at gmail.com
- [x265] [PATCH 4 of 8] asm: interp_4tap_horiz_pp_4x2_sse3
dtyx265 at gmail.com
- [x265] [PATCH 5 of 8] asm: interp_4tap_horiz_pp_4x4_sse3
dtyx265 at gmail.com
- [x265] [PATCH 6 of 8] asm: interp_4tap_horiz_pp_4x8_sse3
dtyx265 at gmail.com
- [x265] [PATCH 7 of 8] asm: interp_4tap_horiz_pp_4x16_sse3
dtyx265 at gmail.com
- [x265] [PATCH 8 of 8] asm: interp_4tap_horiz_pp_4x32_sse3
dtyx265 at gmail.com
- [x265] [PATCH 1 of 5] rdoQuant: improve encoder ~3.5% by modify all zero-coeff group scan and cost compute logic
Steve Borho
- [x265] [PATCH] asm: avx2 code for chroma addAvg for all partitions
Steve Borho
- [x265] [PATCH] asm: new optimized algorithm for satd, improved ~30% over previous algorithm
Steve Borho
- [x265] [PATCH] cli: fix incorrect timebase source
Xinyue Lu
- [x265] [PATCH] cli: fix incorrect timebase source
Steve Borho
- [x265] [PATCH 1 of 5] rdoQuant: improve encoder ~3.5% by modify all zero-coeff group scan and cost compute logic
chen
- [x265] [PATCH 0 of 8 ] asm: interp_4tap_horiz_pp
chen
- [x265] [PATCH 0 of 7 ] Fine grained AQ
deepthi at multicorewareinc.com
- [x265] [PATCH 1 of 7] AQ: Re-enable fine grained adaptive quantization
deepthi at multicorewareinc.com
- [x265] [PATCH 2 of 7] search: add RDcost measurement of DeltaQP to lower rdLevels
deepthi at multicorewareinc.com
- [x265] [PATCH 3 of 7] encoder: ignore param->rc.qgSize when delta-qp coding is disabled
deepthi at multicorewareinc.com
- [x265] [PATCH 4 of 7] param: show quant-group size in logs, move AQ config into its own line
deepthi at multicorewareinc.com
- [x265] [PATCH 5 of 7] encoder: give param->rc.qgSize a sane default when dqp is not used
deepthi at multicorewareinc.com
- [x265] [PATCH 6 of 7] tests: add coverage for --qg-size
deepthi at multicorewareinc.com
- [x265] [PATCH 7 of 7] entropy: after encodeCU, the CU structures need to be reset with the right QP
deepthi at multicorewareinc.com
- [x265] [PATCH] asm: leading space nit
dtyx265 at gmail.com
- [x265] [PATCH 0 of 8 ] asm: interp_4tap_horiz_pp
dave
- [x265] [PATCH 1 of 5] rdoQuant: improve encoder ~3.5% by modify all zero-coeff group scan and cost compute logic
Steve Borho
- [x265] [PATCH 7 of 7] entropy: after encodeCU, the CU structures need to be reset with the right QP
Steve Borho
- [x265] [PATCH] fix build warning in quant.cpp
Min Chen
- [x265] [PATCH] fix build warning in quant.cpp
Steve Borho
- [x265] [PATCH] fix build warning in quant.cpp
Steve Borho
- [x265] [PATCH] clean asm findPosLast() output buffer to avoid debug check failure
Min Chen
- [x265] [PATCH] fix build warning in quant.cpp
chen
- [x265] [PATCH] asm: new optimized algorithm for satd, improved ~30% over previous algorithm
Dnyaneshwar Gorade
- [x265] GCC 4.8.2: Warning in threadpool.cpp (X265_MIN), [non-]enum. in cond.expr.
Mario *LigH* Rohkrämer
- [x265] [PATCH] asm: ssse3 version of findPosFirstLast, 365c -> 75c
Min Chen
- [x265] [PATCH] analysis: removed gcc warnings from RD 0/4 performance improvement series patches
ashok at multicorewareinc.com
- [x265] [PATCH] stats: with the CU reference limit, even 8x8 can have skipped motion searches
Ashok Kumar Mishra
- [x265] [PATCH] sao: modify saoCuOrgE3_2Rows C code and add sse4 code
Divya Manivannan
- [x265] [PATCH] clean asm findPosLast() output buffer to avoid debug check failure
Steve Borho
- [x265] [PATCH] sao: modify saoCuOrgE3_2Rows C code and add sse4 code
Steve Borho
- [x265] [PATCH] asm: generic x64 version of findPosLast
Min Chen
- [x265] [PATCH] slicetype: select best mvp using neighbor mv's satd cost for Lowres ME
gopu at multicorewareinc.com
- [x265] [PATCH] asm: avx code for chroma satd functions for all partitions of 422
sumalatha at multicorewareinc.com
- [x265] [PATCH] slicetype: select best mvp using neighbor mvs satd cost for Lowres ME
gopu at multicorewareinc.com
- [x265] [PATCH] slicetype: select best mvp using neighbor mvs satd cost for Lowres ME
Steve Borho
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
chen
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
dave
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
chen
- [x265] [PATCH] asm: interp_4tap_horiz_pp sse3
chen
- [x265] [PATCH] motion: lowres mvc[] are measured in slicetype no need to measure again in ME
gopu at multicorewareinc.com
- [x265] [PATCH] sao: remove saoCuOrgE3_2Rows function and modify saoCuOrgE3 primitive to handle width=16 separately
Divya Manivannan
- [x265] [PATCH] asm: saoCuOrgE3 avx2 code for width>16: improve 508c->427c
Divya Manivannan
- [x265] [PATCH] asm: avx2 code for satd_16xN, improved over ~50% than SSE code
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for satd_48x64 and 64xN, improved over ~50% than SSE
dnyaneshwar at multicorewareinc.com
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
gopu at multicorewareinc.com
- [x265] [PATCH 1 of 6] asm: fix bug in generic version findPosLast_x64 and improve testbench on it
Min Chen
- [x265] [PATCH 2 of 6] asm: rename findPosLast to scanPosLast and modify its API
Min Chen
- [x265] [PATCH 3 of 6] testbench: support BMI2
Min Chen
- [x265] [PATCH 4 of 6] testbench: fix testbench crash when no coeff in block
Min Chen
- [x265] [PATCH 5 of 6] asm: avx2+bmi2 version of scanPosLast, 27.6k -> 6.8k cycles
Min Chen
- [x265] [PATCH 6 of 6] testbench: fix table fault when trSize more than 8
Min Chen
- [x265] [PATCH] asm: avx2 10bit code for scale2D_64to32
rajesh at multicorewareinc.com
- [x265] [PATCH] motion: lowres mvc[] are measured in slicetype no need to measure again in ME
Steve Borho
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
Steve Borho
- [x265] [PATCH] sao: remove saoCuOrgE3_2Rows function and modify saoCuOrgE3 primitive to handle width=16 separately
Steve Borho
- [x265] [PATCH] Invoke macro to setup sse2 intrapred dc and planar primitives
dtyx265 at gmail.com
- [x265] [PATCH] Invoke macro to setup sse2 intrapred dc and planar primitives
Steve Borho
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
Gopu Govindaswamy
- [x265] [PATCH] asm: avx2 code chroma vss filter for i422
sumalatha at multicorewareinc.com
- [x265] [PATCH] sao: remove saoCuOrgE3_2Rows function and modify saoCuOrgE3 primitive to handle width=16 separately
Divya Manivannan
- [x265] [PATCH] asm: avx2 code for chroma vss filter for i444
sumalatha at multicorewareinc.com
- [x265] [PATCH] sao: modify saoCuOrgE2 primitive to handle width=16 separately
Divya Manivannan
- [x265] [PATCH] asm: saoCuOrgE2[0] avx2 code: improve 154c->128c
Divya Manivannan
- [x265] [PATCH] asm: saoCuOrgE2[1] avx2 code: improve 449c->292c
Divya Manivannan
- [x265] [PATCH] asm: avx2 10bit code for sub_ps[16x16], [32x32], [64x64]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 10bit code for sub_ps for chroma sizes 16xN, 32xN, reuse luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for chroma vsp filter for i422
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for chroma vsp filter for i444
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for sign primitive: improve 204c->114c
Divya Manivannan
- [x265] [PATCH] asm: add pixel restoration part in saoCuOrgE2 primitive
Divya Manivannan
- [x265] [PATCH] doc: add document that contains the reason for two versions of sao primitives
Divya Manivannan
- [x265] [PATCH] asm: avx2 code for chroma vps filter for i422
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: avx2 code for chroma vps filter for i444
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 3] fix build fault on WinXP
Min Chen
- [x265] [PATCH 2 of 3] rdoQuant: split coeff cost into psy and non-psy path
Min Chen
- [x265] [PATCH 3 of 3] rdoQuant: optimize getSigCtxInc()
Min Chen
- [x265] [PATCH] asm: avx2 10bit code for add_ps[16x16], [32x32], [64x64]
rajesh at multicorewareinc.com
- [x265] [PATCH] asm: avx2 10bit code for add_ps for chroma sizes 16xN, 32xN, reuse luma code
rajesh at multicorewareinc.com
- [x265] [PATCH] doc: add document that contains the reason for two versions of sao primitives
Steve Borho
- [x265] [PATCH 1 of 3] fix build fault on WinXP
Steve Borho
- [x265] [PATCH 0 of 4 ] refactor commits for fine-grained AQ
Steve Borho
- [x265] [PATCH 1 of 4] analysis: rename m_qp to m_aqQP for clarity
Steve Borho
- [x265] [PATCH 2 of 4] analysis: simplify CTU QP init loops
Steve Borho
- [x265] [PATCH 3 of 4] analysis: keep per-CU AQ QPs in cuGeom index order, simplify arguments
Steve Borho
- [x265] [PATCH 4 of 4] analysis: hoist all adaptive-quant work to recursive callers
Steve Borho
- [x265] [PATCH] analysis: removed gcc warnings from RD 0/4 performance improvement series patches
Steve Borho
- [x265] [PATCH 1 of 2] analysis: configure slave quant QP prior to pmode intra RDO for RD 0..4
Steve Borho
- [x265] [PATCH 2 of 2] search: drop bMergeOnly argument to predInterSearch()
Steve Borho
- [x265] [PATCH] analysis: always configure quant QP directly after setting RD lambda
Steve Borho
- [x265] [PATCH] analysis: always configure quant QP directly after setting RD lambda
Steve Borho
- [x265] [PATCH 0 of 6 REV2] refactor commits for fine-grained AQ,
Steve Borho
- [x265] [PATCH 1 of 6 REV2] analysis: rename m_qp to m_aqQP for clarity
Steve Borho
- [x265] [PATCH 2 of 6 REV2] analysis: simplify CTU QP init loops
Steve Borho
- [x265] [PATCH 3 of 6 REV2] analysis: keep per-CU AQ QPs in cuGeom index order, simplify arguments
Steve Borho
- [x265] [PATCH 4 of 6 REV2] analysis: hoist all adaptive-quant work to recursive callers
Steve Borho
- [x265] [PATCH 5 of 6 REV2] analysis: configure slave quant QP prior to pmode intra RDO for RD 0..4
Steve Borho
- [x265] [PATCH 6 of 6 REV2] analysis: always configure quant QP directly after setting RD lambda
Steve Borho
- [x265] [PATCH 6 of 6 REV2] analysis: always configure quant QP directly after setting RD lambda
Steve Borho
- [x265] [PATCH] rc: fix cost issues in predicting row size during mid frame vbv encodes
aarthi at multicorewareinc.com
- [x265] [PATCH] doc: add document that contains the reason for two versions of sao primitives
Divya Manivannan
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
Gopu Govindaswamy
- [x265] [PATCH 3 of 6 REV2] analysis: keep per-CU AQ QPs in cuGeom index order, simplify arguments
Deepthi Nandakumar
- [x265] [PATCH 1 of 2] asm: avx2 code for chroma vpp filter for i422
sumalatha at multicorewareinc.com
- [x265] [PATCH 2 of 2] asm: avx2 code for chroma vpp filter for i444
sumalatha at multicorewareinc.com
- [x265] [PATCH] asm: filter_vsp and filter_vss for Nx64, 32x48 in I422
Divya Manivannan
- [x265] [PATCH] asm: filter_vsp[8x12], filter_vss[8x12] for I422 in avx2
Divya Manivannan
- [x265] [PATCH 6 of 6 REV2] analysis: always configure quant QP directly after setting RD lambda
Aarthi Priya Thirumalai
- [x265] [PATCH 6 of 6 REV2] analysis: always configure quant QP directly after setting RD lambda
Deepthi Nandakumar
- [x265] [PATCH 1 of 4] modify m_psyRdoqScale from int64 to int32 because dynamic range is [0, 50]*256
Min Chen
- [x265] [PATCH 2 of 4] modify lambda from int64 to int32 because dynamic range less than 21 bits
Min Chen
- [x265] [PATCH 3 of 4] force type convert since multiplication result up to 34-bits
Min Chen
- [x265] [PATCH 4 of 4] rdoQuant: reduce address operators by swap order on array significantBits[][]
Min Chen
- [x265] [PATCH] asm: filter_vsp[6x16], filter_vss[6x16] in avx2
Divya Manivannan
- [x265] [PATCH] sao: add comment for the reason of two versions of sao primitives
Divya Manivannan
- [x265] [PATCH 3 of 6 REV2] analysis: keep per-CU AQ QPs in cuGeom index order, simplify arguments
Steve Borho
- [x265] [PATCH] sao: add comment for the reason of two versions of sao primitives
chen
- [x265] [PATCH] level: do not try to configure color space in x265_param_apply_profile()
Steve Borho
- [x265] [PATCH] rc: extract final average QP from the coded CTU structure
Steve Borho
- [x265] [PATCH 1 of 2] analysis: remove m_aqQP[], determine AQ QPs on demand
Steve Borho
- [x265] [PATCH 2 of 2] cudata: cu index is no longer necessary again
Steve Borho
- [x265] Next steps on my road map
Xinyue Lu
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: filter_vsp, filter_vss for 2x16 in avx2
Divya Manivannan
- [x265] [PATCH] asm: filter_vsp, filter_vss for 16x24 in avx2
Divya Manivannan
- [x265] [PATCH] asm: filter_vsp, filter_vss for 12x32 in avx2
Divya Manivannan
- [x265] [PATCH] asm: remove tab_c_526336, it is duplicate to pd_526336
Min Chen
- [x265] [PATCH] asm: filter_vsp, filter_vss for 4x32 in avx2
Divya Manivannan
- [x265] [PATCH] asm: avx2 code for sad[16x8] for 10 bpp (398 -> 254)
sumalatha at multicorewareinc.com
- [x265] [PATCH 1 of 3] asm: use prefix const to avoid unaligned crash
Min Chen
- [x265] [PATCH 2 of 3] asm: remove interp4_hps_shuf, it is duplicate to interp4_hpp_shuf
Min Chen
- [x265] [PATCH 3 of 3] simplify logic on posOffset in codeCoeffNxN()
Min Chen
- [x265] [PATCH] log: make qTreeCnt as stack arrays to avoid non determinism in 2 pass
aarthi at multicorewareinc.com
- [x265] [PATCH 1 of 3] asm: use prefix const to avoid unaligned crash
Steve Borho
- [x265] [PATCH] log: make qTreeCnt as stack arrays to avoid non determinism in 2 pass
Steve Borho
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
Steve Borho
- [x265] [PATCH 1 of 3] asm: use prefix const to avoid unaligned crash
Steve Borho
- [x265] [PATCH 1 of 4] api: allow libx265 to forward x265_api_get() calls
Steve Borho
- [x265] [PATCH 2 of 4] cli: add -P short option for --profile
Steve Borho
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Steve Borho
- [x265] [PATCH 4 of 4] doc: describe new multi-lib behavior
Steve Borho
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Xinyue Lu
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Steve Borho
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Xinyue Lu
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Deepthi Nandakumar
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Xinyue Lu
- [x265] [PATCH] asm: filter_vsp, filter_vss for 2x4 in avx2
Divya Manivannan
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: filter_vsp, filter_vss for 64xN, 48x64 in avx2
Divya Manivannan
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Deepthi Nandakumar
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Xinyue Lu
- [x265] Proposed solution to mixed 8/10 bit libraries
Deepthi Nandakumar
- [x265] [PATCH 3 of 4] cli: allow -P/--profile to influence requested API bit-depth
Deepthi Nandakumar
- [x265] [PATCH] search: move MVP Index selection to motion estimate to avoid code duplication
gopu at multicorewareinc.com
- [x265] [PATCH] asm: filter_vpp, filter_vps for 16x64 in avx2
Divya Manivannan
- [x265] [PATCH] asm: filter_vpp, filter_vps for 8x64 in avx2
Divya Manivannan
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
gopu at multicorewareinc.com
- [x265] [PATCH] asm: filter_vpp, filter_vps for 32x64, 32x48 in avx2
Divya Manivannan
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
Deepthi Nandakumar
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH] search: remove the merge candidates from the motion candidate list
Steve Borho
- [x265] [PATCH] search: move MVP Index selection to motion estimate to avoid code duplication
Steve Borho
- [x265] [PATCH 0 of 4 ] ME cleanups and minor improvements
Steve Borho
- [x265] [PATCH 1 of 4] search: cleanup checkBestMVP(), no behavior change
Steve Borho
- [x265] [PATCH 2 of 4] search: introduce selectMVP helper method
Steve Borho
- [x265] [PATCH 3 of 4] search: do not clip MVP in setSearchRange()
Steve Borho
- [x265] [PATCH 4 of 4] search: allow AMP to use motion estimation for 64x64 CUs
Steve Borho
- [x265] [PATCH] asm: interp_8tap_hv_pp_8x8 sse3
dtyx265 at gmail.com
- [x265] [PATCH] asm: chroma_hpp[48x64] for i444 - improved 17498c->13381c
aasaipriya at multicorewareinc.com
- [x265] [PATCH] api: clarify docs and use of x265_api_get()
deepthi at multicorewareinc.com
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
Deepthi Nandakumar
- [x265] [PATCH] api: add an error message for an api_get() failure
deepthi at multicorewareinc.com
- [x265] [PATCH] asm: filter_vpp, filter_vps for 12x32 in avx2
Divya Manivannan
- [x265] [PATCH] asm: filter_vpp, filter_vps for 8x12 in avx2
Divya Manivannan
- [x265] [PATCH 1 of 8] convert sigCtx table from [4][4] to [16]
Min Chen
- [x265] [PATCH 2 of 8] pre-compute abs coeff and simplify scan table
Min Chen
- [x265] [PATCH 3 of 8] remove reduce check on firstC2FlagIdx
Min Chen
- [x265] [PATCH 4 of 8] fast RD path on encode coeff remain code in codeCoeffNxN()
Min Chen
- [x265] [PATCH 5 of 8] improve compute on baseLevel by 2-bits encode code
Min Chen
- [x265] [PATCH 6 of 8] simplify compute on get codeNumber length
Min Chen
- [x265] [PATCH 7 of 8] faster clip operator on goRiceParam
Min Chen
- [x265] [PATCH 8 of 8] simplify logic on get coeff remain cost in codeCoeffNxN()
Min Chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
chen
- [x265] [PATCH] asm: interp_8tap_hv_pp_8x8 sse3
chen
- [x265] [PATCH 1 of 2] fix check failure in Entropy::writeCoefRemainExGolomb()
Min Chen
- [x265] [PATCH 2 of 2] asm: downgrade x265_interp_8tap_hv_pp_8x8 from SSE4 to SSSE3
Min Chen
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dtyx265 at gmail.com
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH] asm: filter_vpp, filter_vps for 2x4 in avx2
Divya Manivannan
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
Steve Borho
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
Steve Borho
- [x265] [PATCH 4 of 8] fast RD path on encode coeff remain code in codeCoeffNxN()
Steve Borho
- [x265] [PATCH] asm: interp_8tap_horiz pp and ps sse2
dave
Last message date: Thu Apr 30 22:38:05 CEST 2015
Archived on: Thu Apr 30 22:39:43 CEST 2015
This archive was generated by Pipermail 0.09 (Mailman edition).