kernel_optimize_test/drivers/gpu
Sean Keely 0e9f449221 drm/amdkfd: Account for SH/SE count when setting up cu masks.
[ Upstream commit 1ec06c2dee679e9f089e78ed20cb74ee90155f61 ]

On systems with multiple SH per SE compute_static_thread_mgmt_se#
is split into independent masks, one for each SH, in the upper and
lower 16 bits.  We need to detect this and apply cu masking to each
SH.  The cu mask bits are assigned first to each SE, then to
alternate SHs, then finally to higher CU id.  This ensures that
the maximum number of SPIs are engaged as early as possible while
balancing CU assignment to each SH.

v2: Use max SH/SE rather than max SH in cu_per_sh.

v3: Fix comment blocks, ensure se_mask is initially zero filled,
    and correctly assign se.sh.cu positions to unset bits in cu_mask.

Signed-off-by: Sean Keely <Sean.Keely@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Sasha Levin <sashal@kernel.org>
2021-09-18 13:40:33 +02:00
..
drm drm/amdkfd: Account for SH/SE count when setting up cu masks. 2021-09-18 13:40:33 +02:00
host1x gpu: host1x: Split up client initalization and registration 2021-06-18 10:00:04 +02:00
ipu-v3 gpu: ipu-v3: Fix i.MX IPU-v3 offset calculations for (semi)planar U/V formats 2021-09-08 08:49:00 +02:00
trace
vga
Makefile