Commits · WIP_HEVC_UAPI_V6 · Benjamin Gaignard / for-upstream

May 25, 2022

media: hantro: Allows luma and chroma depth to be different · b23ffb5a


Luma and chroma depth are set on different hardware registers.
Even if they aren't identical the bitstream can be compliant
to HEVC specifications and decoded by the hardware.

With this patch TSUNEQBD_A_MAIN10_Technicolor_2 conformance test
is successfully decoded.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

b23ffb5a

May 24, 2022

media: Hantro: Correct G2 init qp field · 876e2dd7

Benjamin Gaignard authored 2 years ago


Documentation said that g2 init_qp field use bits 24 to 30 of
the 8th register.
Change the field mask to be able to set 7 bits and not only 6 of them.

Conformance test INITQP_B_Main10_Sony_1 decoding is OK with this
patch.

Fixes: cb5dd5a0 ("media: hantro: Introduce G2/HEVC decoder")
Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

876e2dd7

media: Hantro: HEVC: Allows 10-bit bitstream · 1e95e044
Benjamin Gaignard authored 2 years ago
```
Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>
```
1e95e044

media: hantro: imx8m: Enable 10bit decoding · e05a5688

Benjamin Gaignard authored 2 years ago


Expose 10bit pixel formats to enable 10bit decoding in IMX8M SoCs.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

e05a5688

media: hantro: postproc: Configure output regs to support 10bit · e24842f8

Benjamin Gaignard authored 2 years ago


Move output format in postproc and make sure that 10bit configuration
is correctly set.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

e24842f8

May 23, 2022

media: hantro: Store HEVC bit depth in context · 3f18da29

Benjamin Gaignard authored 2 years ago


Store HEVC bit depth in context.
Bit depth is equal to hevc sps bit_depth_luma_minus8 + 8.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

3f18da29

media: hantro: HEVC: Fix auxillary buffer size calculation · 5f0cee5f

Benjamin Gaignard authored 2 years ago


SAO and FILTER buffers size depend of the bit depth.
Make sure we have enough space for 10bit bitstreams.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

5f0cee5f

media: hantro: Store VP9 bit depth in context · 41ef8196

Benjamin Gaignard authored 2 years ago


Now that we have proper infrastructure for postprocessing 10-bit
formats, store VP9 bit depth in context.

Signed-off-by: Jernej Skrabec <jernej.skrabec@gmail.com>

41ef8196

media: hantro: postproc: Fix legacy regs configuration · b3e79d4d

Benjamin Gaignard authored 2 years ago


Some postproc legacy registers were set in VP9 code. Move them to
postproc and fix their value.

Signed-off-by: Jernej Skrabec <jernej.skrabec@gmail.com>
Reviewed-by: Ezequiel Garcia <ezequiel@vanguardiasur.com.ar>

b3e79d4d

media: hantro: postproc: Fix buffer size calculation · 335409d5

Jernej Skrabec authored 3 years ago and

Benjamin Gaignard committed 2 years ago

When allocating aux buffers for postprocessing, it's assumed that base
buffer size is the same as that of output. Coincidentally, that's true
most of the time, but not always. 10-bit source also needs aux buffer
size which is appropriate for 10-bit native format, even if the output
format is 8-bit. Similarly, mv sizes and other extra buffer size also
depends on source width/height, not destination.

Signed-off-by: Jernej Skrabec <jernej.skrabec@gmail.com>
Reviewed-by: Ezequiel Garcia <ezequiel@vanguardiasur.com.ar>

335409d5

media: hantro: Support format filtering by depth · 179657ea

Jernej Skrabec authored 3 years ago and

Benjamin Gaignard committed 2 years ago


In preparation for supporting 10-bit formats, add mechanism which will
filter formats based on pixel depth.

Hantro G2 supports only one decoding format natively and that is based
on bit depth of current video frame. Additionally, it makes no sense to
upconvert bitness, so filter those out too.

Signed-off-by: Jernej Skrabec <jernej.skrabec@gmail.com>

179657ea

media: Add P010 tiled format · e0476ffb

Ezequiel Garcia authored 3 years ago and

Benjamin Gaignard committed 2 years ago


Add P010 tiled format

Signed-off-by: Ezequiel Garcia <ezequiel@vanguardiasur.com.ar>
[rebased and updated pixel format name]
Signed-off-by: Jernej Skrabec <jernej.skrabec@gmail.com>

e0476ffb

media: hantro: Be more accurate on pixel formats step_width constraints · a2013926

Benjamin Gaignard authored 2 years ago


On Hantro G2 decoder on IMX8MQ strides requirements aren't the same
for NV12_4L4 and NV12 pixel formats. The first one use a 4 bytes padding
while the last one needs 8 bytes.
To be sure to provide the correct stride in all cases we need:
- to relax the constraints on codec formats so set step_width to 4
- use capture queue format and not the output queue format when applying
  the pixel format constraints.
- put the correct step_width constraints on each pixel format.

Move HEVC SPS validation in hantro_hevc.c to be able to perform it
when setting sps control and when starting to decode the bitstream.
Add a new test in HEVC SPS validation function to check if resolution
is still matching the hardware constraints.

With this SAODBLK_A_MainConcept_4 and SAODBLK_B_MainConcept_4 conformance
tests files are correctly decoded with both NV12 and NV12_4L4 pixel formats.
These two files have a resolution of 1016x760.
If, for the both pixel formats, step_width equal 16 than the selected
capture resolution is 1024x768 which is wrong for NV12_4L4 (which expect
1016x760) on Hantro G2 on IMX8MQ.

Add defines for various resolutions.
For other variants than Hantro G2 on IMX8M keep the same step_width to avoid
regressions.

Fluster HEVC test score is now 128/147 vs 126/147 with the both pixel
formats as decoder output.
Fluster VP9 test score stay at 147/303.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

a2013926

media: hantro: HEVC: Fix reference frames management · a6d643f3

Benjamin Gaignard authored 2 years ago


PoC shall be int the range of -2^31 to 2^31 -1
(HEVC spec section 8.3.1 Decoding process for picture order count).
The current way to know if an entry in reference picture array is free
is to test if PoC = UNUSED_REF. Since UNUSED_REF is defined as '-1' that
could lead to decode issue if one PoC also equal '-1'.
PoC with value = '-1' exists in conformance test SLIST_B_Sony_9.

Change the way unused entries are managed in reference pictures array to
avoid using PoC to detect then.

This patch doesn't change fluster HEVC score.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

a6d643f3

media: uapi: move HEVC stateless controls out of staging · 7d5fa367

Benjamin Gaignard authored 2 years ago


HEVC uAPI is used by 2 mainline drivers (Hantro, Cedrus)
and at least 2 out-of-tree drivers (rkvdec, RPi).
The uAPI has been reviewed so it is time to make it 'public' by
un-staging it.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

7d5fa367

media: uapi: Change data_bit_offset definition · 962d28ce

Benjamin Gaignard authored 3 years ago


'F.7.3.6.1 General slice segment header syntax' section of HEVC
specification describes that a slice header always end aligned on
byte boundary, therefore we only need to provide the data offset in bytes.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

962d28ce

media: uapi: HEVC: fix padding in v4l2 control structures · 6ec99a81

Benjamin Gaignard authored 3 years ago


Fix padding where needed to remove holes and stay align on cache boundaries

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

6ec99a81

media: hantro: Stop using Hantro dedicated control · b157b431

Benjamin Gaignard authored 2 years ago


The number of bits to skip in the slice header can be computed
in the driver by using sps, pps and decode_params information.
This allow to remove Hantro dedicated control.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

b157b431

media: controls: Log HEVC stateless control in .std_log · 010e17e0
Benjamin Gaignard authored 3 years ago
```
Simply print the type of the control.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>
```
010e17e0

media: uapi: Move the HEVC stateless control type out of staging · 2186f1ee

Benjamin Gaignard authored 3 years ago


Move the HEVC stateless controls types out of staging,
and re-number them.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

2186f1ee

media: uapi: Add V4L2_CID_STATELESS_HEVC_ENTRY_POINT_OFFSETS control · 1ff2efa0

Benjamin Gaignard authored 3 years ago


The number of 'entry point offset' can be very variable.
Instead of using a large static array define a v4l2 dynamic array
of U32 (V4L2_CTRL_TYPE_U32).
The number of entry point offsets is reported by the elems field
and in struct v4l2_ctrl_hevc_slice_params.num_entry_point_offsets
field.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

1ff2efa0

media: uapi: Move parsed HEVC pixel format out of staging · 557872ca

Benjamin Gaignard authored 3 years ago


Move HEVC pixel format since we are ready to stabilize the uAPI

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

557872ca

media: uapi: HEVC: Define V4L2_CID_STATELESS_HEVC_SLICE_PARAMS as a dynamic array · 08c4adad

Benjamin Gaignard authored 3 years ago


Make explicit that V4L2_CID_STATELESS_HEVC_SLICE_PARAMS control is
a dynamic array control type.
Some drivers may be able to receive multiple slices in one control
to improve decoding performance.

Define the max size of the dynamic that can driver can set in .dims = {}.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

08c4adad

media: uapi: HEVC: Add documentation to uAPI structure · 88bcd5f0

Benjamin Gaignard authored 3 years ago


Add kernel-doc documentation for all the HEVC structures.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

88bcd5f0

media: uapi: HEVC: Add SEI pic struct flags · 40dc2723

Benjamin Gaignard authored 2 years ago

The possible values for the field_pic field in the v4l2_hevc_dpb_entry
structure are defined in the table D.2 in HEVC specification section D.3.3.
Add flags and documentation for each of them.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

40dc2723

media: uapi: HEVC: Change pic_order_cnt definition in v4l2_hevc_dpb_entry · e93043b5

Benjamin Gaignard authored 2 years ago


The HEVC specification describes the following:
"PicOrderCntVal is derived as follows:
PicOrderCntVal = PicOrderCntMsb + slice_pic_order_cnt_lsb
The value of PicOrderCntVal shall be in the range of −2^31 to 2^31 − 1, inclusive."

To match with these definitions change __u16	pic_order_cnt[2]
into __s32	pic_order_cnt_val.
Change v4l2_ctrl_hevc_slice_params->slice_pic_order_cnt to __s32 too.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

e93043b5

media: uapi: HEVC: Rename HEVC stateless controls with STATELESS prefix · 54279148

Benjamin Gaignard authored 3 years ago

Change HEVC stateless controls names to V4L2_CID_STATELESS_HEVC instead
of V4L2_CID_MPEG_VIDEO_HEVC be coherent with v4l2 naming convention.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>
Reviewed-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>

54279148

media: uapi: HEVC: Add missing fields in HEVC controls · a86487c2

Benjamin Gaignard authored 3 years ago

Complete the HEVC controls with missing fields from H.265 specifications.
Even if these fields aren't used by the current mainlined drivers
they will be required for (at least) the rkvdec driver.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

a86487c2

vivid: add dynamic array test control · 42457ab3

Hans Verkuil authored 3 years ago and

Benjamin Gaignard committed 2 years ago


Add a dynamic array test control to help test support for this
feature.

Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>

42457ab3

v4l2-ctrls: add support for dynamically allocated arrays. · 0ddc91bc

Hans Verkuil authored 3 years ago and

Benjamin Gaignard committed 2 years ago


Implement support for dynamically allocated arrays.

Most of the changes concern keeping track of the number of elements
of the array and the number of elements allocated for the array and
reallocating memory if needed.

Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>

0ddc91bc

videodev2.h: add V4L2_CTRL_FLAG_DYNAMIC_ARRAY · 7cae9f26

Hans Verkuil authored 3 years ago and

Benjamin Gaignard committed 2 years ago

Add a new flag that indicates that this control is a dynamically sized
array. Also document this flag.

Currently dynamically sized arrays are limited to one dimensional arrays,
but that might change in the future if there is a need for it.

The initial use-case of dynamic arrays are stateless codecs. A frame
can be divided in many slices, so you want to provide an array containing
slice information for each slice. Typically the number of slices is small,
but the standard allow for hundreds or thousands of slices. Dynamic arrays
are a good solution since sizing the array for the worst case would waste
substantial amounts of memory.

Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>

7cae9f26

ARM64: dts: freescale: IMX8MQ: Set VPU G2 frequency to 300MHz · 73c1c08f

Benjamin Gaignard authored 2 years ago


Hardware documentation said that G2 max frequency is 300MHz.
Fix dts to be aligned with this value.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>

73c1c08f

media: Add P010 video format · 16a527e2

Benjamin Gaignard authored 3 years ago

P010 is a YUV format with 10-bits per component with interleaved UV.

Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com>
Acked-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>

16a527e2

May 17, 2022

media: hantro: Enable HOLD_CAPTURE_BUF for H.264 · 340ce50f

Nicolas Dufresne authored 2 years ago


This is needed to optimize field decoding. Each field will be
decoded into the same capture buffer. To be able to queue multiple
buffers, we need to be able to ask the driver to hold the capture
buffer.

Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Reviewed-by: Sebastian Fricke <sebastian.fricke@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

340ce50f

media: hantro: Add H.264 field decoding support · 11442b7c

Nicolas Dufresne authored 2 years ago


This adds the required code to support field decoding. While most of
the code is derived from Rockchip and VSI reference code, the
reduction of the reference list to 16 entries was found by
trial and errors. The list consists of all the references with the
opposite field parity.

The strategy is to deduplicate the reference picture that points
to the same storage (same index). The choice of opposite parity has
been made to keep the other field of the current field pair in the
list. This method may not be robust if a field was lost.

[hverkuil: fix typos in the comment before deduplicate_reflist()]
[hverkuil: document new cur_poc field]

Signed-off-by: Jonas Karlman <jonas@kwiboo.se>
Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

11442b7c

media: hantro: h264: Make dpb entry management more robust · 3630e493

Jonas Karlman authored 2 years ago


The driver maintains stable slot locations for reference pictures. This
change makes the code more robust by using the reference_ts as key and
by marking all entries invalid right from the start.

Signed-off-by: Jonas Karlman <jonas@kwiboo.se>
Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

3630e493

media: hantro: Stop using H.264 parameter pic_num · 83141070

Nicolas Dufresne authored 2 years ago


The hardware expects FrameNumWrap or long_term_frame_idx. Picture
numbers are per field, and are mostly used during the memory
management process, which is done in userland. This fixes two
ITU conformance tests:

  - MR6_BT_B
  - MR8_BT_B

Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Reviewed-by: Sebastian Fricke <sebastian.fricke@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

83141070

media: rkvdec: Enable capture buffer holding for H264 · ed7bb87d

Nicolas Dufresne authored 2 years ago


In order to support interlaced video decoding, the driver must
allow holding the capture buffer so that the second field can
be decoded into it.

Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Reviewed-by: Sebastian Fricke <sebastian.fricke@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

ed7bb87d

media: rkvdec-h264: Add field decoding support · 6f32ea37

Nicolas Dufresne authored 2 years ago


This makes use of the new feature in the reference builder to program
up to 32 references when doing field decoding. It also signals the
parity (top or bottom) of the field to the hardware.

Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Reviewed-by: Sebastian Fricke <sebastian.fricke@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

6f32ea37

media: rkvdec: Ensure decoded resolution fit coded resolution · 5e57a860

Jonas Karlman authored 2 years ago


Ensure decoded CAPTURE buffer resolution is larger or equal to the coded
OUTPUT buffer resolution.

Signed-off-by: Jonas Karlman <jonas@kwiboo.se>
Signed-off-by: Nicolas Dufresne <nicolas.dufresne@collabora.com>
Reviewed-by: Sebastian Fricke <sebastian.fricke@collabora.com>
Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>

5e57a860