Mesa 24.2.0 Release Notes / 2024-08-14¶
Mesa 24.2.0 is a new development release. People who are concerned with stability and reliability should stick with a previous release or wait for Mesa 24.2.1.
Mesa 24.2.0 implements the OpenGL 4.6 API, but the version reported by glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. Some drivers don’t support all the features required in OpenGL 4.6. OpenGL 4.6 is only available if requested at context creation. Compatibility contexts may report a lower version depending on each driver.
Mesa 24.2.0 implements the Vulkan 1.3 API, but the version reported by the apiVersion property of the VkPhysicalDeviceProperties struct depends on the particular driver being used.
SHA256 checksum¶
c02bb72cea290f78b11895a0c95c7c92394f180d7ff66d4a762ec6950a58addf mesa-24.2.0.tar.xz
New features¶
VK_KHR_dynamic_rendering_local_read on RADV
VK_EXT_legacy_vertex_attributes on lavapipe, ANV, Turnip and RADV
VK_MESA_image_alignment_control on RADV
VK_EXT_shader_replicated_composites on ANV, dozen, hasvk, lavapipe, nvk, RADV, and Turnip
VK_KHR_maintenance5 on v3dv
VK_KHR_maintenance7 on RADV
VK_EXT_depth_clamp_zero_one on v3dv
GL_ARB_depth_clamp on v3d
Defaulting to a new shader cache implementation, reducing filesystem overhead.
Bug fixes¶
u_debug_stack_test.capture_not_overwritten fails on i386
turnip: vulkaninfo crashed
turnip-freereno Build error (/usr/local/include/xf86drm.h:40:10: fatal error: drm.h: No such file or directory)
tu: compilation failing when compiling turnip with only kgsl and X11 enabled
samplerCube constructor in fragment shader no longer converting uvec2 bindless texture handles ( segmentation fault , core dumped )
anv: gen9.5 flakiness in dEQP-VK.multiview.dynamic_rendering.depth.*
vaapi decoding corruption with green blocks
interpolateAt precision lowering unhandled by glsl_to_nir()
[anv] CS2 crashes on LNL
[anv] Dota2 does not start on LNL
[radeonsi][bisected][regression] glClientWaitSync() quickly times out with INT64_MAX timeout
d3d10umd: Build regression in 24.2.0-devel
zink/tu: glcts flake on a750
nouveau: advertises GL_EXT_memory_object without implementing `*UID` callbacks
LIBGL_DRIVERS_PATH gone
[Regression][Vulkan][TGL][Bisected]vkCmdCopyQueryPoolResults failed to write buffer with compute pipeline on Mesa 24.1
Worms Revolution: not rendering explosion effects
crash on pushbuf_validate nvc0_blit do_blit_framebuffer
piglit: cl-api-build-program crashes
i915g: glGenerateMipmap() fails with 2048×2048 textures
[radeonsi] Asterix & Obelix XXLRomastered: river misrendered (completely black)
Build fails without Vulkan
No dependency check for PyYAML in meson.build
GPU Hang in Metal Gear Rising Revengeance
VK_ERROR_DEVICE_LOST A770 DXVK Fallout 3
[Bisected] Recent compile issue in libnak
anv: Wrong push constant values for bytes?
anv: dEQP-VK.protected_memory tests GPU hang on MTL
RustiCL (or maybe not…): radeonsi freezes after 2 hours of simulation, zink works just fine
ci_run_n_monitor.py doesn’t monitor manual jobs
Crash in util_idalloc_resize due to glBindTexture with a way-too-large ID
mesa-24.1.2 fails to compile: ast_to_hir.cpp:5371:39: error: ‘%s’ directive argument is null
[regression][bisected] VMware Xv video displays as black rectangle
Blender 4.2,4.3 crashes when rendering with motion blur on RDNA3 cards (OpenGL/radeonsi)
nvk: regression with multiple games crashing
Transparent background in Blender 3D view with nouveau
turnip: latest git does not build anymore
ACO Unimplemented intrinsic instr
RADV/ACO: assert on per-sample interpolation
radv: large descriptor layout creation is slow
Gnome shell (wayland) crashes when opening any window
DRI Intel drivers fix a problem in Redhat 7 (Mesa 18), but are not included for Redhat 8 (Mesa versions v23, v24)
Vulkan: ../src/nouveau/vulkan/nvk_physical_device.c:1109: VK_ERROR_INCOMPATIBLE_DRIVER
RADV: Smooth lines affect triangle rendering
[armhf build error][regression] error: ‘StringMapIterator’ was not declared in this scope; did you mean ‘llvm::StringMapIterator’?
Build fails with latest llvm 19: error: no matching function for call to unwrap(LLVMOpaqueModule*&)
tu: support KHR_8bit_storage
Incorrect colours on desktop and apps
nir: Incorrect nir_opt_algebraic semantics for signed integer constants causing end-to-end miscompiles
src/gallium/frontends/clover/meson.build:93:40: ERROR: Unknown variable “idep_mesaclc”.
panfrost: mpv is broken on T604
Nightly CI is broken
[radv] GPU hang in Starfield on RX 5700 XT
anv, isl, iris: Clarify and improve CCS + FCV on gfx12
isl: CPCB horizontal and vertical alignment requirements unknown
Indika: flickering black artifacting on the snow
intel/isl: Split Xe2 changes into new files of Xe2.
rusticl: Generated rusticl_mesa_bindings.c fails to find include
isl: Remove 512B pitch requirement for non-displayable CCS
MESA 24.1 - broken zink OpenGL under Windows
Blue flickering rectangles on AMD RX 7600
GPU hangs on AMD Radeon RX 6400 on a fragment shader
v3dv: vkcube-wayland crashes
intel/brw: scoreboarding regression
regression in !29436 for radv+angle on stoney
[radv][regression] Starfield invisible terrain on a 7900 XTX
free_zombie_shaders() leave context in a bad state (access violation occurs)
r300: X11 fails to start with the modesetting driver (glamor is broken with R300/R400 gpus).
[NINE]Far Cry 1 trees flicker regression[bisected][traces]
Vulkan: Most sync2 implementations are missing new access flags
Incorrect buffer_list advance when writing disjoint image descriptors
ANV: Block shadows in Cyberpunk on Intel A770
ACO ERROR: Temporary never defined or are defined after use
[ANV] Graphics memory allocation in Total War: Warhammer 3
DG2: God of War trace fails to play
Borderlands trace fails to play on dg2
NVK: Vulkan apps simply terminated with segfault under wayland and Xwayland
NVK: VK_ERROR_OUT_OF_DEVICE_MEMORY on swapchain creation
anv/zink regression: piglit.spec.arb_fragment_layer_viewport.layer-no-gs
[anv] failures when upgrading vulkancts 1.3.6 -> 1.3.7 on intel mesa ci
RustiCL: deadlock when calling clGetProfilingInfo() on callbacks
[Intel][Vulkan][Gen12] Vulkan compute shader is 3x slower than the same OpenCL kernel
turnip: Broken AHB support
zink: nir validation failures in Sparse code
nir: nir_opt_varyings uses more stack than musl libc has
dEQP-VK.pipeline.pipeline_library.shader_module_identifier.pipeline_from_id.graphics regression
freedreno + perfetto missing dependency on adreno_common.xml.h
anv: unbounded shader cache
radv: Crash due to nir validation fail in Enshrouded
vulkan/wsi/wayland: valgrid reports invalid read in `vk_free` call in `wsi_wl_surface_analytics_fini`
android: sRGB configs no longer exist after !27709
bisected: turnip: deqp regressions
aco: Radeonsi unable to use rusticl
anv: clean up default_pipeline_cache in anv_device
[24.1-rc4] fatal error: intel/dev/intel_wa.h: No such file or directory
Turnip driver is crashing since turnip: ANB/AHB support got merged
vcn: rewinding attached video in Totem cause [mmhub] page fault
When using amd gpu deinterlace, tv bt709 properties mapping to 2 chroma
a530: ir3_context_error assertion (unknown vertex shader output name: VARYING_SLOT_EDGE)
VCN decoding freezes the whole system
[RDNA2 [AV1] [VAAPI] hw decoding glitches in Thorium 123.0.6312.133 after https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28960
radv regression bisected: Flickering textures (vega)
[Regression][Bisected] EGL/Wayland: QT applications terminated by SIGSEGV (Address boundary error) when using dGPU
WSI: Support VK_IMAGE_ASPECT_MEMORY_PLANE_i_BIT_EXT for DRM Modifiers in Vulkan
nvk: Tracker issue for gamescope support
nvk: Implement VK_EXT_image_drm_format_modifier
nvk: NVK_DEBUG=zero_memory is hitting an assert
nvk: Implement VK_EXT_conservative_rasterization
zink sparse: Improper semaphore handling
zink sparse: Reference to mip tails should be refcounted
radv: Enshrouded GPU hang on RX 6800
NVK Zink: Wrong color in Unigine Valley benchmark
intel vulkan incremental build takes forever
intel vulkan incremental build takes forever
24.0.6: build fails
shader with multidimensional array in shader storage buffer takes too long to compile
panforst: T604 issue with using u32 for flat varyings
lp_screen.c:601:4: error: ‘snprintf’ will always be truncated; specified size is 16, but format string expands to at least 17
[anv] FINISHME: support YUV colorspace with DRM format modifiers
gen9/11 test became flaky: piglit.spec.!opengl 1_4.blendminmax
mesa 24 intel A770 KOTOR black shadow smoke scenes
nvk: Implement VK_EXT_pipeline_robustness
[bisected][regression] kitty fails to start due to `glfwWindowHint(GLFW_SRGB_CAPABLE,true)`
r600: bisected 5eb0136a3c561 breaks a number of piglits
[bdw][bisected][regression] assertion failure in nir_validate.c
Graphical glitches in RPCS3 after updating Vulkan Intel drivers
[R600] OpenGL and VDPAU regression in Mesa 23.3.0 - some bitmaps get distorted.
VAAPI radeonsi: VBAQ broken with HEVC
tu: weird fail in packing tests
radv/video: 10-bit support
radv: vkCmdWaitEvents2 is broken
anv: add support for EDS3::extendedDynamicState3AlphaToCoverageEnable
ci: switch from CI_JOB_JWT to id_tokens
Zink: enabled extensions and features may not match
anv: share embedded samplers
Changes¶
Adam Jackson (8):
mesa: Enable EXT_shadow_samplers for GLES2
gallium: Rename ${target}/target.c to ${target}/{$target}_target.c
treewide: Include mesa_interface.h not dri_interface.h
mesa_interface: Set ourselves free
mesa_interface: Move out of GL/internal/
gallium/dril: Compatibility stub for the legacy DRI loader interface
dri: Let dril handle the DRI driver link farm
gallium/meson: Deconflate swrast/softpipe/llvmpipe
Adrian Perez de Castro (1):
Revert “egl/wayland: Remove EGL_WL_create_wayland_buffer_from_image”
Alejandro Piñeiro (29):
v3dv/cmd_buffer: always bind pipeline static state
v3dv/ci: dEQP-VK.dynamic_state.*.double_static_bind are fixed now
v3dv: port dynamic state tracking to use Mesa Vulkan
v3dv: provide implementation for vkCmdBindVertexBuffers2
v3dv: provide implementation for CmdSetViewportWithCount
v3dv: CullMode and FrontFace are dynamic now
v3dv: DepthBoundsTestEnable is dynamic now
v3dv: move depth CFG bits setting to cmd buffer emission
v3dv: ez_state/incompatible_ez_test could be recomputed at cmd_buffer
v3dv: PrimitiveTopology is now dynamic
v3dv: StencilOp and StencilTestEnable are now dynamic
v3dv/ci: update expected list due VK_EXT_extended_dynamic_state
v3dv: enable VK_EXT_extended_dynamic_state
v3dv/cmd_buffer: missing updates due PrimitiveTopology being dynamic now
v3dv: fixes StencilTestEnable handling
v3dv: PrimitiveRestartEnable is now dynamic.
v3dv: DepthBiasEnable is dynamic now
v3dv: SetRasterizerDiscardEnable is dynamic now
v3dv: enable VK_EXT_extended_dynamic_state2
v3dv: add debug option to disable custom pipeline caches for meta operations
v3dv/meta_clear: take into account multiview for the custom clear pipeline caches
v3dv/meta_clear: use v3dv_renderpass used as parameter
v3dv/device: compute maxDescriptorSet*Limits multiplying per-stage by 4
v3dv/device: set DescriptorUpdateAfterBind limits
v3d/devinfo: unify comment style
broadcom: move HW-dependant constants to v3d_device_info
v3d,v3dv: document cl_emit_with_prepacked
v3dv/pipeline: ensure vk_graphics_pipeline_all_state alive when still needed
drm-shim: stub synobj_timeline_wait and query ioctl
Aleksi Sapon (5):
lavapipe: fixes for macOS support
lavapipe: build “Windows” check should use the host machine, not the `platforms` option.
util: fix memory related OS calls on macOS
wsi: fix compilation on macOS
util: macOS support for cnd_monotonic
Alessandro Astone (1):
egl/gbm: Walk device list to initialize DRM platform
Alex Deucher (1):
ac/surface: fix version check for gfx12 DCC
Alexandre Marquet (2):
pan/mdg: quirk to disable auto32
panfrost: implement SFBD raw format support on v4
Alexandros Frantzis (2):
egl/wayland: Pass dri2_wl_formats to create_dri_image
egl/wayland: Fail EGL surface creation if opaque format is unsupported
Ali Homafar (1):
lavapipe: Set ICD api_version to 1.3
Alyssa Rosenzweig (222):
vulkan: optimize vk_dynamic_graphics_state_any_dirty
vulkan: add helper to fill out spirv caps automatically
nir/lower_subgroups: add filter
nir/lower_subgroups: add generic scan/reduce lower
nir/lower_subgroups: relax ballot_type_to_uint
nir/lower_robust_access: also handle image derefs
docs: add header-stub for vk_enum_to_str
vulkan: add vk_debug_ignored_stype helper
nvk: use common stype debug
broadcom: use common stype debug
pvr: use common stype debug
anv,hasvk: use common stype debug
dzn: use common stype debug
nir: add is_first_fan_agx sysval
nir: add texops for AGX border colour emulation
nir: add quad_ballot_agx intrinsic
nir,agx: add depth=never workaround
nir,agx: fix load_active_subgroup_index
compiler: add ACCESS_IN_BOUNDS_AGX
agx: optimize and/or with booleans
agx: enable more lowering
agx: fix query LOD of array
agx: fix some ms texture packing
agx: handle cross-workgroup memory barriers
agx: allow 8-bit bcsel
agx: fix phi translation corruption
agx: fix load_helper_invocation with sample shading
agx: fix frag sidefx with sample shading
agx: handle subgroup barriers
agx: fix spilling inside sample loop
agx: switch to demote internally
agx: lower nir_intrinsic_load_num_subgroups
agx: delete unreachable blocks
agx: model more subgroup ops
agx: lower shuffle
agx: handle non-immediate shuffles in divergent CF
agx: handle quad_broadcast
agx: handle quad swaps
agx: add missing b2b16 implementation
agx: forbid uniforms on ballots
agx: lower 8-bit subgroups
agx: flesh out subgroup lowering
agx: report uses_txf
agx: expose agx_link_libagx
agx: document another sample_mask restriction
agx: reserve scratch registers for mem<–>mem swaps
agx: optimize txf with lod 0
agx: fix bogus unit test
agx: stash early_fragment_test info
agx: handle quad reduce
agx: implement quad_ballot
agx: lower more quad ops
agx: optimize elect()
agx: fix UB in cursor comparison
ail: constify everything
asahi: mark eMRT loads as in-bounds
asahi: calculate validity when unpacking
asahi: agx_translate_sample_count
asahi: assert bo size > 0
asahi: unwrap pointless null check
asahi: implement PIPE_CAP_QUERY_MEMORY_INFO
asahi: rm unused #include
asahi: resize key
asahi: cleanup fs epilog link info
asahi: move agx_link_varyings_vs_fs
asahi: fix prolog emit
asahi: pack UVS key properly
asahi: plumb shader stage into info
asahi: get debug in common
asahi: rm deadcode
asahi: drop rgb10a2_sint rendering
asahi: add missing rgba4 format
asahi: fix 1D array atomics
asahi: fix txf/image_load robustness with arrays
asahi: rework VBO lower for divisor=0
asahi: mv AGX_MAX_OCCLUSION_QUERIES define
asahi: handle agx_ppp_fragment_face_2 with no info
asahi: clarify format code in image lowering
asahi: fix rgb565 blending
asahi: fix store_output component/offset
asahi: fix sample ID with multiblock epilogs
asahi: lower texture instructions with epilogs
asahi: fix cull unknown bits
asahi: simplify image atomic lowering
asahi: move primitive MSAA field
asahi: free libagx if we don’t use a ralloc memctx
asahi: eliminate troublesome empty uniforms
asahi: rearrange VS uniforms
asahi: set src_type for store_output
asahi: rm dead code
asahi: add agx_index_size_to_B helper
asahi: move some GS lowering into lower_gs
asahi: don’t use load_num_vertices in geometry shaders
asahi: mv vertex_id_for_topology_class into GS lowering
asahi: rm another num_vertices use
asahi: rm dated comment
asahi: rm unused lower
asahi: rm num_vertices uses
asahi: rm redundant input_vertices
asahi: mv initialization of grid z for indirect GS
asahi: rm more dead lowering
asahi: rm always true param
asahi: update comment
asahi: update comment for maint5
asahi: eliminate num_workgroups for VS->GS + VS->TCS
asahi: drop bogus assertion
asahi: pack tilebuffer usc word ahead-of-time
asahi: add agx_ppp_push_merged helper
asahi: use ppp_merge
asahi: don’t allocate varyings ourselves
asahi: don’t allocate for ppp updates
asahi: extend varying linking for tri fan weirdness
asahi: plumb tri fan flatshading through common
asahi: don’t ralloc in agx_fast_link
asahi: extend epilog key for force early frag handling
asahi: don’t reserve extra UVS space for layer
libagx: use sub_group_scan_inclusive_add
libagx: add query copy kernel
libagx: don’t use get_group_id()
asahi/decode: QoL improvements
asahi: track imports for decode
asahi: clean up bg/eot counts
asahi: rename meta -> bg/eot
asahi: don’t allocate for USC words
asahi: split frag shader words
asahi: split CDM Launch words
asahi: unify naming for COUNTS structs
nir/tex_instr_result_size: handle subpass_ms
nir/lower_point_size: support lowered i/o
asahi/decode: drop Apple-specific decode check
libagx: rm unused field
libagx: fix static assert
libagx: fix triangle fan + prim restart + GS/XFB
libagx: drop unused !indexed path
libagx: add libagx_copy_xfb_counters helper
asahi: be robust against out of sync shader info
agx: fix draw param gather for sw vs
asahi: split param structs for GS internal kernel
agx: rework libagx I/O lowering
asahi: add missing lowerings
asahi: force bindless for eMRT
asahi: bounds check eMRT stores
asahi: support bigger buffer textures
asahi: add AGX_TEXTURE_FLAG_CLAMP_TO_0 flag
agx: handle discard with force early tests
asahi: pack blend key
agx: switch to combined clip/cull
asahi: add flag controlling sample mask without MSAA
asahi: use scalar outputs for rast shaders
asahi: move null descriptor routines to common
asahi: implement rba2 for uniform texel buffers
asahi: implement rba2 for storage texel buffers
agx: prepare for lower_wpos_center
asahi: extract agx_calculate_vbo_clamp
agx: fix indirect CF accounting
mesa: fix duplicate initializer
nir/lower_wpos_center: clean up
nir/builtin_builder: factor out nir_build_texture_query
asahi: use nir_build_texture_query
gallium: remove ability to probe asahi
asahi: add broken bits of unstable Linux UAPI
agx: fix 64-bit bcsel ingestion
agx: fix fmin/fmax with (-0, 0) pair
libagx: fix uint8_t definition
libagx: make index buffer fetch robust
libagx: generalize query copies
asahi: implement rba2 semantics for vbo
asahi: fix vbo clamp with stride=0
asahi: implement robustness2 for msaa image stores
asahi: be more clever about GS side effects
nir/lower_robust_access: handle MSAA images
nir: add nir_metadata_control_flow
treewide: use nir_metadata_control_flow
nir: document restriction on load_smem_amd constantness
vulkan: reference count vk_descriptor_update_template
vulkan: handle push DUT with emulated secondaries
vulkan: fix potential UAF with vk_cmd_enqueue_CmdPushDescriptorSetKHR
vulkan: handle enqueueing CmdPushConstants2KHR
vulkan: handle enqueueing CmdPushDescriptorSet2KHR
lvp: use common push constant enqueue
lvp: use common push descriptor set enqueue
lvp: fix silly casting for sampler desc updates
lvp: use common descriptor update templates
nir/format_convert: remove unorm bit size assert
nir: add nir_def_replace helper
treewide: use nir_def_replace sometimes
agx: fix insidious ballot optimizer bug
agx: add unit test for ballot bug
agx: set discard_is_demote
nir: add nir_break_if helper
treewide: use nir_break_if
nir: fix miscompiles with rules with INT32_MIN
nir/algebraic: explicitly suffix constants
nir/opt_constant_folding: fix array size define
zink: move print_pipeline_stats
zink: print pipeline stats for compute shader-db
util: add dui/uid helpers
nir: add nir_alu_instr float controls queries
nir/search: use ALU float control helpers
nir: use MIN2/MAX2 opcodes for imin/umax folding
nir: strengthen fmin/fmax definitions with signed zero
glsl/float64: handle signed zero with min/max
nir/lower_double_ops: handle signed zero with min/max
nir/lower_alu: remove dead #define
nir: add lower_fminmax_signed_zero
agx: set lower_fminmax_signed_zero
agx: do not flush denorms for fp16 fmin/fmax
asahi: drop old comment
asahi: drop stale comment
asahi: make agx_pack opencl compatible
asahi: tuck in null query check
libagx: specify heap size explicitly
asahi,libagx: tessellate on device
libagx: add kernel for incrementing CS counter
asahi: handle CS pipeline stat with indirect dispatch
libagx: handle VS/IA pipeline stats on GPU
asahi: eliminate load_num_workgroups from TCS unrolled ID
nouveau/drm-shim: set ram_user
nvk: add instruction count exec property
nir/opt_constant_folding: fix array size define, pt 2
zink: remove extraneous \n with shaderdb
zink: match shader-db report.py format
Amber (1):
tu: Disable depth and stencil tests when attachment state requires it
Amit Pundir (1):
android: Fix zink build failure
Amol Surati (1):
nine: avoid using post-compacted indices with state expecting pre-compacted ones
Antoine Coutant (1):
drisw: fix build without dri3
Antonio Ospite (2):
meson: fix deprecation warning in create-android-cross-file.sh
android: simplify building libgallium_dri on Android
Arthur Huillet (4):
nvk: update 3d classes for conservative raster
nvk: implement VK_EXT_conservative_rasterization
nvk: import SPH headers files from open-gpu-doc
nvk: generate Rust bindings from SPH header files
Asahi Lina (1):
asahi: Make asahi_clc build work on x86_64->x86 builds
Bas Nieuwenhuizen (5):
radv: Use zerovram for Enshrouded.
util/disk_cache: Fix cache marker refresh.
util/disk_cache: Delete the old multifile cache if using the default.
util/cache_test: Add tests for old cache deletion.
relnotes: Add an entry about the new cache default.
Benjamin Lee (1):
vk/graphics_state: Add last bits for extraPrimitiveOverestimationSize
Boris Brezillon (81):
panvk: Prepare things so we can push sysvals to our push uniform buffer
panvk: Put dynamic uniform buffers at the end of the UBO array
panvk: Move the dynamic SSBO descriptors to their own UBO
panvk: Prepare dynamic buffer descriptors at bind time
panvk: Lower sysvals to push uniforms
panvk: Kill panvk_sysval_vec4
panfrost: do not write outside num_wg_sysval
panfrost: Add the BO containing fragment program descriptor to the batch
pan/kmod: Fix a syncobj leak in the panthor backend
pan/kmod: Make default allocator thread-safe
panvk: clang-format fixups
panvk: Kill panvk_queue_get_device()
panvk: Get rid of panvk_descriptor_state::dirty
panvk: Move panvk_cmd_state::batch to panvk_cmd_buffer::cur_batch
panvk: Kill unused dynamic state bits
panvk: Get rid of special attribute support
panvk: Split the graphics and compute state at the cmd_buffer level
panvk: Split compute/graphics pipeline objects
panvk: Use memory pools to store pipeline shaders/descriptors
panvk: Kill the panvk_pipeline_builder object
panvk: Transition the graphics pipeline logic to vk_graphics_pipeline_state
panvk: Fully transition to vk_viewport_state
panvk: Fully transition to vk_rasterization_state
panvk: Fully transition to vk_input_assembly_state
panvk: Use vk_color_blend_state to fill our blend constant
panvk: Fully transition to vk_depth_stencil_state
panvk: Fully transition to vk_multisample_state
panvk: Set unused attribute buffers descriptors to zero
panvk: Rename non_vs_attribs into img_attribs
panvk: Prevent re-emission of image attributes used in vertex shaders
panvk: Move VS attribute/buffer state to panvk_cmd_graphics_state
panvk: Emit VS-accessible image attributes at a fixed offset
panvk: Leave holes in the attribute locations used by a shader
panvk: Fully transition to vk_vertex_binding_state
pan/blend: Move constant inlining out of pan_blend_create_shader()
pan/blend: Expose pan_blend_create_shader()
panvk: Add a blend library to deal blend shaders/descriptors
panvk: Don’t pass the stage to shader_create()
panvk: Simplify shader initialization in the pipeline logic
panvk: Fix/simplify the shader linking logic
panvk: Replace the stages array in panvk_draw_info by vs/fs fields
panvk: Move fs_rsd fields to an fs sub-struct
panvk: Emit the fragment shader RSD dynamically
panvk: Lower global memory IOs
panvk: Clean Midgard leftovers in the cmd_close_batch() path
panvk: Use vk_pipeline_shader_stage_to_nir()
panvk: Kill cmd_get_tiler_context()
panvk: Make sure we dump memory mappings before crashing
pan/decode: Be robust to NULL texture payload
pan/desc: Add missing format in translate_s_format()
pan/jc: Drop unused pool argument passed to pan_jc_add_job()
panvk: Add a render state to panvk_cmd_graphics_state
panvk: Take VK_RENDERING_{RESUM,SUSPEND}ING_BIT flags into account
panvk: Force a preload when the render area is not 32x32 aligned
panvk: Skip depth/stencil attachments with non-matching aspect mask
panvk: Fix dynamic rendering with images containing both depth and stencil
panvk: Make sure we don’t lose clear-only operations
panvk: Make sure we run the fragment shader if alpha_to_coverage is enabled
panvk: Make sure replay of command buffers containing Dispatch calls works
panvk: Override the default GetRender[in]AreaGranularityKHR()
egl: Use gbm_bo_create_with_modifiers2() when the surface has non-zero flags
panvk: Fix formatting around OpaqueCaptureAddress implementation
panvk/ci: Flag exact_sampling.*.edge_right test as fails
pan/bi: Make sure global loads/stores don’t exceed 16 bytes
pan/bi: Fix dynamic indexing of push constants
panvk: Fix Cube/2DArray/3D img -> buf copies
panvk: Don’t bail out when allocationSize is zero in AllocateMemory()
panvk: Prepare for Valhall image views
panvk: Prepare for Valhall buffer views
panvk: Prepare things for compiling valhall source files
panvk: Extend Valhall descriptor set implementation to support Bifrost
panvk: Overhaul the Bifrost descriptor set implementation
panvk: Refcount private BOs
panvk: Store private BOs in lists instead of dynarrays
panvk: Prepare panvk_mempool for shared device memory pools
panvk: Use memory pools for internal GPU data attached to vulkan objects
pan/desc: Extend pan_emit_fbd() to support multilayer rendering
pan/desc: Prepare things for fragment job chaining
pan/blitter: Let pan_preload_fb() callers queue the jobs to the job chain
panvk: Use IDVS jobs when we can
panvk: Add support for layered rendering
Caio Oliveira (35):
intel/brw: Don’t print IP as part of the dump
intel/brw: Hide register pressure information in dumps
intel/brw: Use `vNN` instead of `vgrfNN` when printing instructions
intel/brw: Fix commas when dumping instructions
spirv: Add MESA_SPIRV_DEBUG=values to dump all values
intel/brw: Track the number of uses of each def in def_analysis
intel/brw: Fix typo in DPAS emission code
intel/brw: Add unit tests for scoreboard handling FIXED_GRF with stride
intel/brw: Make component_size() consistent between VGRF and FIXED_GRF
glsl: Fix warning related to tg4_offsets in release mode
intel/brw: Print SWSB information when dumping instructions
intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task
anv: Use brw_nir_lower_cs_intrinsics for lowering Mesh/Task LocalID
intel/brw: Remove unused brw_reg related functions
intel/brw: Remove RALLOC helper from fs_reg
intel/brw: Remove unused variable from test
intel/brw: Move fs_reg data members up to brw_reg
intel/brw: Use public inheritance for fs_reg/brw_reg
intel/brw: Move most member functions from fs_reg to brw_reg
intel/brw: Remove conversion from fs_reg to brw_reg
intel/brw: Replace some fs_reg constructors with functions
intel/brw: Remove duplicated functions between fs_reg/brw_reg
intel/brw: Rename brw_reg() helper to brw_make_reg()
intel/brw: Make fs_reg an alias of brw_reg
intel/brw: Replace uses of fs_reg with brw_reg
intel/brw: Rename fs_reg_* helpers to brw_reg_*
intel/brw: Move brw_reg helpers into brw_reg.h
intel/brw: Don’t set acc_wr_control for Xe2
intel/brw: Use brw_inst_set_group() to set QtrCtrl and NibCtrl
intel/brw: Account for reg_unit() in assembler
intel/brw: Don’t print extra newlines in assembler
intel/brw: Split off assembler logic into library
spirv: Don’t warn about FPFastMathMode if not OpenCL
intel/brw: Convert missing uses of ralloc to linear in fs_live_variables
intel/elk: Convert missing uses of ralloc to linear in fs_live_variables
Chia-I Wu (8):
gallium: add pipe_picture_desc::flush_flags
frontends/va: track whether there are imported/exported surfaces
frontends/va: set PIPE_FLUSH_ASYNC when possible
radeonsi: prep for pipe_picture_desc::flush_flags
radeonsi: respect pipe_picture_desc::flush_flags
radv: check gs_copy_shader directly for executable props
radv: make radv_pipeline_has_ngg static
drm-shim: intercept access as well
Christian Gmeiner (42):
isaspec: Add method to get the displayname of BitSetEnumValue
isaspec: Improve ‘meta’ handling
etnaviv: isa: Drop capturing of python output
etnaviv: isa: Add clang-format special comments
etnaviv: isa: Print dst_full for ALU
etnaviv: isa: Switch to enum isa_thread
etnaviv: isa: Add more flags to etna_inst
etnaviv: isa: Rework modeling of left shift for store/load
etnaviv: isa: Add name for full writemask
mr-label-maker: Add teflon marker
etnaviv: isa: Do src swizzle with isaspec
clc: Always use spir for 32 bit
etnaviv: Zero init all srcs passed to etna_emit_alu(..)
ci: uprev mold to 2.32.0
gallium: Add vkms entrypoint
nak: Move nak_optimize_nir declaration to nak_private.h
meson: Update proc-macro2 subproject
meson: Update syn subproject
meson: Add pest rust dependencies
meson: Add roxmltree rust dependency
meson: Add indexmap rust dependencies
etnaviv: isa: Add meta elements to instructions
etnaviv: isa: Generate Rust FFI bindings for asm.h
etnaviv: isa: Make header C++ safe
etnaviv: isa: Add meson version check
etnaviv: isa: Add IsaParser proc_macro_derive
etnaviv: isa: Add struct etna_asm_result
etnaviv: isa: Make etna_asm_result usable in Rust
etnaviv: isa: Add EtnaAsmResultExt trait
etnaviv: isa: Add parser module
etnaviv: isa: Add C function impl
etnaviv: isa: Add cli assembler
etnaviv: isa: Extend disasm test
ci/etnaviv: Drop shaders@glsl-bug-110796 line
etnaviv: isa: Drop 1:1 mapping of opc to bits
etnaviv: isa: Add support for extended instructions
nak: Update comment about explicit padding
etnaviv: isa: Add support for bitset’s displayname
etnaviv: isa: Rework branch instruction
nak: Set has_imad32 conditionally
nak: Move imad late optimization to nir
dri: fix driver names
Christopher Michael (3):
v3d: Move spec@arb_texture_view@rendering-formats, Crash in broadcom-rpi4-fails
v3d: Move spec@!opengl 1.1@getteximage-formats, Fail in broadcom-rpi4-fails
broadcom: fix issue of ‘addr’ is used uninitialized
Colin Marc (3):
radv/video: don’t truncate frame_num and POC to 32
vulkan/video: generate profile_tier_level structure correctly
vulkan/video: correctly set sub-layer ordering in H.265 VPS/SPS
Collabora’s Gfx CI Team (6):
Uprev Piglit to 7aa7bc1b01d57b4b091c4fc82a94a6ff47f38ebf
Uprev Piglit to 8a6ce9c6fc5c8039665655bca4904d5601c6dba0
Uprev Piglit to e180f96239edba441f22f58dfc852cafb902844a
Uprev Piglit to fdf3fc09deb6beecdf212e65a16c645112540b59
Uprev Piglit to 647d0725024f72bc49bbc91c686c5f61168a1fe8
Uprev Piglit to 582f5490a124c27c26d3a452fee03a8c85fa9a5c
Cong Liu (1):
nir: Fix out-of-bounds access in ntt_emit_store_output()
Connor Abbott (92):
ir3: Add scan_clusters_macro to ir3_valid_flags()
ir3: Add scan_clusters.macro to is_subgroup_cond_mov_macro()
ir3: Validate tied sources better
ir3/ra: Don’t demote movmsk instructions to non-shared
ir3: Rewrite postsched dependency handling
ir3/legalize: Use define for register size
ir3: Rewrite regmask implementation
ir3/ra: Prepare for shared half-regs
ir3/ra: Fix printing shared reg file
ir3/ra: Prepare for shared phis
ir3: Fix lowering shared parallel copies with immed src
ir3/lower_pcopy: Fix immed/const flags for copy from shared
ir3: Fix shared parallel copy validation
ir3: Don’t use swz with shared registers
ir3/lower_copies: Handle HW bug with shared half-floats
ir3/lower_copies: Fix “inaccessible” half reg lowering with shared regs
ir3/ra: Use ra_reg_get_num() for validating num
ir3: Use INVALID_REG in array store
ir3: Reset num when creating parallel copies
ir3: Validate that shared registers are in-bound
ir3: Allow propagation of normal->shared copies
ir3: Moves with shared destination are always legal
ir3/legalize: Take (ss) into account in WaR hazards
ir3/legalize: Remove bad (eq) micro-optimization
ir3/legalize: any/all/getone are non-prefetch helper users
ir3: Use correct category for OPC_PUSH_CONSTS_LOAD_MACRO
ir3: Add support for “scalar ALU”
ir3: Implement source restrictions for shared ALU
ir3: Validate scalar ALU sources
ir3: Immediate source for stc is invalid
ir3: Don’t emit single-source collects
ir3/cp: Support swapping mad srcs for shared regs
ir3/cf: Don’t fold shared conversions
ir3: Distinguish lowered shared->normal moves
ir3: Add support for ldc.u
ir3: Add builder support for shared immediates
ir3: Create reduce identity directly
ir3: Make type_flags() return a bitmask enum
ir3: Support scalar ALU in the builder
ir3: Add scalar ALU-specific passes
ir3: Get sources before emitting scan_clusters.macro
ir3: Rewrite shared reg handling when translating from NIR
ir3: Directly use shared registers when possible
ir3/nir: Fix imadsh_mix16 definition
ir3: Use scalar ALU instructions when possible
ir3: Don’t scalarize all SSBO instructions
ir3: Don’t manually scalarize SSBO loads
freedreno/a7xx: Add AQE-related registers from kgsl
freedreno/a7xx: Add A7XX_HLSQ_DP_STR location from kgsl
freedreno/crashdec: Initial a7xx support
freedreno: Update HLSQ_*_CMD registers for a7xx
docs/android: Fix example meson cross file again
ir3: Put VS->TCS barrier after preamble
ir3/legalize: Insert dummy bary.f after preamble
freedreno,ir3: Add has_early_preamble
tu: Workaround early preamble HW bug
freedreno/a6xx: Workaround early preamble HW bug
ir3: Add ir3_info::early_preamble
tu: Implement early preamble
freedreno/a6xx: Implement early preamble
ir3: Enable early preamble
tu: Use image aspects for feedback loops
tu: Support VK_EXT_attachment_feedback_loop_dynamic_state
tu: Use a7xx terminology for flushes
freedreno, tu: Use CLEAN events on a7xx
tu: Fix unaligned indirect command synchronization
tu: Don’t WFI after every dispatch
freedreno/a7xx: Fix register file size
ir3: Make sure constlen includes stc/ldc.k/ldg.k instructions
freedreno: Disable early preamble on a6xx gen4
ir3, tu, freedreno: Move early_preamble to ir3_shader
tu: Add early preamble statistic
ir3: Introduce elect_any_ir3
ir3: Use elect_any_ir3 in preambles
freedreno: Fix RBBM_NC_MODE_CNTL variants
tu: Add support for aligned substreams
ir3: Fix UBO size with indirect driver params
tu: Make cs writeable for GMEM loads when FDM is enabled
tu: Fix fdm_apply_load_coords patchpoint size
tu: Support VK_EXT_fragment_density_map on a750
tu: Support bufferDeviceAddressCaptureReplay on kgsl
freedreno: Fix decoding primitive counter events on a7xx
tu: Add VPC hardware workaround for a750
ir3: Fix stg/ldg immediate offset on a7xx
nir/instr_set: Return the matching instruction
nir/instr_set: Don’t remove matching instruction
ir3: Split out bindless tex/samp encoding
ir3: Don’t consider r63.x as a GPR
ir3: Plumb through descriptor prefetch intrinsics
ir3: Make preamble rematerialization common code
ir3: Expand preamble rematerialization
ir3: Add descriptor prefetching optimization on a7xx
Constantine Shablia (5):
pan/bi: fix 1D array tex coord lowering
pan/bi: clean up tex coord lowering
panfrost: report correct MAX_VARYINGS
panvk: remove descriptor pool counters
panvk: enable KHR and EXT BDA
Corentin Noël (8):
zink: Always call deinit_multi_pool_overflow when destroying zink_descriptor_pool_multi
ci: Allow to override the virglrenderer render server
venus: sync protocol for VK_KHR_maintenance5
venus: enable VK_KHR_maintenance5
venus/ci: add more recently found flakes
wsi: Make sure to return a valid wayland id string
venus/ci: Update expectations
ci: Make sure to install libraries in the right directory on debian
Craig Stout (8):
util: detect_os: add DETECT_OS_FUCHSIA and DETECT_OS_POSIX_LITE
util: u_thread: add Fuchsia support
util: os_misc: add Fuchsia support
util: u_dl: add Fuchsia support
util: os_time: add Fuchsia support
vulkan/util: add missing dependencies
meson: remove unnecessary line continuation
vulkan/runtime: add spirv_info_h to vulkan_lite_runtime_header_gen_deps
Daniel Lundqvist (1):
radeonsi: Fix unused variable when LLVM is not used for AMD.
Daniel Schürmann (69):
aco/ra: fix kill flags after renaming fixed Operands
aco/ra: assert that the register file is empty after register allocation completed
aco/lower_phis: simplify check for uniform predecessors
aco: introduce aco_opcode::p_boolean_phi
aco/vn: copy-propagate trivial phis
aco/lower_phis: generalize init_state() so that it works with any scalar phis
aco/lower_phis: implement SGPR phi lowering
aco: use SGPR phi lowering for uniform phis in divergent merge blocks
aco: use SGPR phi lowering for all loop header phis
aco: use SGPR phi lowering for all scalar phis
aco/optimizer: remove p_linear_phi handling from optimizer
radv: mark nir_opt_loop() as not idempotent
radv: move nir_opt_dead_cf() before nir_opt_loop()
panfrost: skip gles-3.0-transform-feedback-uniform-buffer-object on Mali G52 and G57
nir/loop_analyze: adjust negative (or huge) iteration count check for bit size
nir/opt_if: don’t split ALU of phi into otherwise empty blocks
nir/opt_loop: add loop peeling optimization
aco/ra: fix handling of killed operands in compact_relocate_vars()
aco/ra: Fix array access when finding register for subdword variables
aco/ra: refactor get_reg_simple() with increased stride.
aco/ra: move can_write_m0() check into get_reg_specified()
aco/ra: re-use registers from killed operands
aco/ra: change heuristic to first fit
aco/ra: use round robin register allocation
aco/assembler: fix MTBUF opcode encoding on GFX11
aco/assembler: slightly refactor MTBUF assembly for more readability
aco/assembler: fix GFX67 MTBUF opcode encoding
aco/scheduler: remove unused register_demand parameter
aco: move live var information into struct Program
aco/reindex_ssa: replace live_var parameter with boolean
aco: make aco::monotonic_buffer_resource declaration visible for aco::IDSet
aco: use aco::monotonic_allocator for IDSet
spirv: make gl_HelperInvocation volatile if demote is being used
radv: emit discard as demote by default
nir: introduce discard_is_demote compiler option
nir/opt_peephole_select: handle nir_terminate{_if}
nir: remove nir_intrinsic_discard
zink: pass zink_screen to nir_to_spirv().
nir/shader_info: remove uses_demote
spirv: workaround for tests assuming that OpKill terminates invocations or loops
aco/scheduler: fix register_demand validation debug code
aco/spill: Unconditionally add 2 SGPRs to live-in demand
aco: calculate register demand per instruction as maximum necessary to execute the instruction
aco: track and use the live-in register demand per basic block
aco: remove get_demand_before()
aco/live_var_analysis: slightly refactor handling of additional register demand for Operand copies
aco/live_var_analysis: ignore dead phis
aco/spill: don’t remove spilled phis
aco/ra: use live_in_demand in should_compact_linear_vgprs()
aco: add RegisterDemand member to Instruction
aco/util: skip empty blocks in IDSet::insert(IDSet)
aco/live_var_analysis: refactor using ctx struct
aco/live_var_analysis: ignore phi definition and operand demand at predecessors
aco/live_var_analysis: inline block->register_demand updates
aco/live_var_analysis: remove unused includes
aco/live_var_analysis: use separate allocator for temporary live sets
aco/ra: remove special-casing of p_logical_end
nir: implement loop invariant code motion (LICM) pass
radv: use NIR loop invariant code motion pass
nir/opt_sink: ignore loops without backedge
aco: compute live-in variables in addition to live-out variables
aco/ra: use live-in variables directly rather than computing them
aco/spill: use live-in variables directly rather than computing them
aco/cssa: use live-in variables instead of live-out variables
aco/validate: use live-in variables for RA validation
aco/print_ir: print live-in instead of live-out variables
aco: remove live-out variables from IR
aco/spill: Don’t add phi definitions to live-in variables
util/disk_cache: enable Mesa-DB disk cache by default
Daniel Stone (27):
Revert “ci: disable g52”
gbm: Support fixed-rate compression allocation
venus/ci: Fix timeout
venus/ci: Significantly reduce CTS fraction
venus/ci: Temporarily disable jobs
dri: Fix BGR format exclusion
egl/surfaceless: Enable RGBA configs
egl/gbm: Enable RGBA configs
egl/dri2: Use createImageFromNames for DRM buffers
dri: Remove old createImageWithModifiers
dri: Remove createImageFromFds
dri: Stop answering DRI_IMAGE_ATTRIB_FORMAT
gallium/dri: Delete unused helper function
gallium/dri: Drop mesa_format indirection for lookup
loader/dri3: Use FourCC for create-image entrypoints
egl/x11: Update to createImageFromNames
dri: Delete createImageFromName
dri: Unify createImage and createImageWithModifiers
egl/x11: Remove __DRI_IMAGE_FORMAT remnants
loader/dri3: Use FourCC for buffer allocations
u_format: Rewrite format table to use YAML
format: Generate endian-independent format aliases
format: Generate sRGB<->linear conversions from table
u_format: Reword introduction
build: Check for PyYAML in Meson build
dri: Allow INVALID for modifier-less drivers
gbm/dri: Remove erroneous assert
Danylo Piliaiev (38):
tu: Handle non-overlapping WaW hazard with buffer copy/fill/update
tu/a7xx: Don’t set FLUSH_PER_OVERLAP_AND_OVERWRITE for feedback loops
tu/a750: Disable HW binning when there is GS
freedreno/devices: Add support for Adreno A32 (G3x Gen 2)
util/u_trace: Allow mixing of ArgStruct and Arg
tu: Add more info to renderpass tracepoint
vulkan/wsi: Make current_frame usable in all cases
util/u_trace: Pass explicit frame_nr argument to delimit frames
tu: Use current_frame from vk device to delimit u_trace frames
anv: Use current_frame from vk device to delimit u_trace frames
freedreno: Make fd_pps_driver.h usable without including other FD sources
turnip/msm: Do rd dump only when there are commands in submission
turnip: Implement VK_EXT_depth_clamp_zero_one
freedreno/a7xx: Update TPL1_DBG_ECO_CNTL1 to fix UBWC corruption
ir3/a7xx: Fix FS consts corruption when other FS has zero constlen
tu: Add LRZ disable reason to renderpass tracepoint
util/u_trace: Add support for fixed-length string params in tracepoints
tu: Add attachments’ UBWC info to renderpass tracepoint
freedreno/rddecompiler: Make possible to use original shader
freedreno/replay: Fix replaying without SET_IOVA
freedreno/ir3: mova has special meaning for (r) flag
ir3: Correctly assemble mova1 with (r) on const
tu: Fix issues with render_pass tracepoint
freedreno: Rename TPL1_DBG_ECO_CNTL1.UBWC_WORKAROUND into TP_UBWC_FLAG_HINT
tu: Add enable_tp_ubwc_flag_hint feature to a7xx
freedreno/devices: Turn off enable_tp_ubwc_flag_hint for a740 by default
freedreno/devices: Fix magic regs for Adreno A32
freedreno: Describe LRZ feedback mechanism
freedreno/devices: Define and appropriately set has_lrz_feedback
tu: Use LRZ feedback in gmem
tu: Enable LRZ feedback in sysmem
freedreno: Use LRZ feedback in gmem
ir3: Print bindless samp/tex ids for tex prefetch
ir3/tests: Make possible to specify raw instr value as uint64
ir3/tests: Make possible to add generated disasm tests
ir3: Fix decoding of stib.b/ldib.b with offset
turnip/kgsl: Support external memory via ION/DMABUF buffers
tu: Have single Flush/Invalidate memory entrypoints
Dave Airlie (27):
radv/video/encode: fix quality params on v2 hw.
Revert “zink: use a slab allocator for zink_kopper_present_info”
nvk: Only enable WSI modifiers if the extension is supported.
draw/texture: handle mip_offset[0] being != 0 for layered textures.
nouveau/nvc0: increase overallocation on shader bo to 2K
nvidia: fixup classes import and import new classes.
nouveau/push: add support for m2mf/i2mf to dumper
nouveau/nvc0: add support for using common pushbuf dumper
radv/video: fix layered decode h264/5 tests.
radv/video: use vcn ip versions for encoder detection.
ac/radv/radeonsi: move av1 ctx/probs size/filling to common code.
ac/radv/radeon: move film grain init to common code.
st/mesa: drop u_simple_shaders.h include where not used.
gallivm: create a pass manager wrapper.
gallivm: move ppc denorm disable to inline
gallivm: split some code out from init module.
gallivm: make lp_bld_coro.h c++ include safe.
gallivm: export target init code for orc-jit to reuse
gallivm: split out generating LLVM Mattrs
llvmpipe: Introduce llvmpipe_memory_allocation
nvk: use 2k overallocation for shader heap.
anv/video: use correct offset for MPR row store scratch buffer.
radv/video: advertise mutable/extended for dst video images.
draw/orcjit: supply stub function for tcs coro
llvmpipe/cs/orcjit: add stub function name for coro
gallivm/sample: fix sampling indirect from vertex shaders
nvc0: fix null ptr deref on fermi due to debug changes.
David (Ming Qiang) Wu (2):
radeonsi/vcn: set accurate size for dec header and index_codec
radeonsi/vcn: support DPB_MAX_RES on VCN5
David Heidelberg (58):
turnip: rename tu_queue_submit struct to follow ODR
ci: fail pipeline for users who got access to restricted traces
ci/traces: majanes has no longer access to the restricted traces
ci/deqp: correct EGL_EXT_config_select_group detection
egl/x11: Move RGBA visuals in the second config selection group
mailmap: add Freya Gentz entry
etnaviv: migrate from piglit include to generic deqp and toml spec
freedreno/ci: move platform to the deqp toml file for a530
freedreno/ci: move the disabled jobs from include to the main file
freedreno/ci: Switch a306_* to deqp-runner
freedreno/ci: do not depend on single job rules for another jobs
freedreno/ci: switch a306 to weston
freedreno/ci: re-enable a306_piglit
ci/panfrost: disable G52 until machines gets fixed
ci: drop unused piglit-test and integrate it into piglit-traces-test
freedreno/ci: Drop duplicated include and add missing stages
freedreno/ci: Implement nightly piglit job for Adreno 630 and 618
ci/freedreno: update expectations from the nightly run
ci: bump ANGLE
ci: Revert “ci: update failures list with angle for jsl, tgl”
ci/intel: add new jsl flake
ci/panfrost: Revert “ci/panfrost: disable G52 until machines gets fixed”
ci/alpine: re-enable Mold linker
ci/etnaviv: add flakes from nightly runs
winsys/i915: depends on intel_wa.h
subprojects: uprev perfetto to v45.0
ci/r300: update flake list from nightly reports
ci/nouveau: move disabled jobs back from include into main gitlab-ci.yml
ci/nouveau: separate HW definition from SW
ci/nouveau: adjust and add DEVICE_TYPE
ci/freedreno: a3xx will never have Vulkan support
docs: correct svga3d redirected URLs
ci/radv: dEQP-GLES3.functional.polygon_offset.fixed16_render_with_units passes now
ci: re-enable shader-db for nouveau
ci: do not build Nine in debian-build-testing
ci/piglit: be explicit about what we building
ci/lava: enable Piglit OpenCL tests so we can test rusticl on the HW
ci/lava: do not build Vulkan for armhf images
ci/lava: move wayland-protocols to the main section
ci/freedreno: document new failure after piglit update
ci/etnaviv: skip Vulkan tests on GC2000
ci/etnaviv: remove duplicated line from skips
mailmap: update my email
ci/arm64: rustify the build
ci/lava: add support for RustiCL
ci/meson: reuse meson installation
ci: move (c)bindgen to own shell script
ci/radv: Document recent flake
ci/lava: the containers take sometimes more than 60m
ci: propagate RUSTICL_ENABLE and DEBUG variables to the DUTs
rusticl: add -cl-std only when it’s not defined
ci/freedreno: some A306 tests now pass/skip since proper GL detection in Piglit
ci: introduce tool for comparing nightly runs
util: bump blake3 from 1.3.3 to 1.5.1, improve armv7 and aarch64 performance
build: pass licensing information in SPDX form
intel/debug: allow silencing CL warnings
llvmpipe: Silence “possibly uninitialized value” warning for ssbo_limit (cont)
ci/alpine: use llvm variables
David Rosca (39):
radv/video: Set correct bit depth and format for 10bit input
radv/video: Check encode profiles and bit depth in capabilities query
radv/video: Report maxBitrate in encode capabilities
radeonsi/vcn: Allocate session buffer in VRAM
radeonsi/vcn: Fix 10bit HEVC VPS general_profile_compatibility_flags
radeonsi/vcn: Only enable VBAQ with rate control mode
frontends/va: Fix AV1 slice_data_offset with multiple slice data buffers
Revert “radeonsi/vcn: AV1 skip the redundant bs resize”
frontends/va: Only increment slice offset after first slice parameters
radeonsi: Update buffer for other planes in si_alloc_resource
frontends/va: Store slice types for H264 decode
radeonsi/vcn: Ensure DPB has as many buffers as references
radeonsi/vcn: Allow duplicate buffers in DPB
radeonsi/vcn: Ensure at least one reference for H264 P/B frames
frontends/va: Fix leak when destroying VAEncCodedBufferType
radeonsi/vcn: Avoid copy when resizing bitstream buffer
frontends/va: Send all bitstream buffers to driver at once
frontends/va: Fix crash in vaRenderPicture when decoder is NULL
radv/video: Add missing VCN 3.0.2 to decoder init switch
radeonsi: Make si_compute_clear_image work with 422 subsampled formats
gallium/vl: Init shaders on first use
frontends/va: Don’t require exact match for packed headers
gallium: Add is_video_target_buffer_supported
radeonsi: Implement is_video_target_buffer_supported
frontends/va: Use is_video_target_buffer_supported for EFC
frontends/va: Rework EFC logic
frontends/va: Check if target buffer is supported in vlVaEndPicture
gallium: Remove PIPE_VIDEO_CAP_EFC_SUPPORTED
frontends/va: Simplify AV1 slice parameters handling
frontends/va: Move slice_data_offset to context
frontends/va: Rename slice_idx to have_slice_params and move to context
frontends/va: Support multi elements slice parameter buffers for H264/5
gallium: Remove pipe_h264_picture_desc.slice_parameter.slice_count
radeonsi/vcn: Limit size to target size in AV1 decode
radeonsi: Add debug option to enable low latency encode
radeonsi/vcn: Add low latency encode support
frontends/va: Support frame rate per temporal layer for AV1
radeonsi/vcn: Support 10bit RGB for EFC input
radeonsi/vcn: Add decode DPB buffers as CS dependency
Deborah Brouwer (1):
ci/lava: Detect a6xx gpu recovery failures
Derek Foreman (13):
wsi/wayland: refactor wayland dispatch
egl/wayland: Use loader_wayland_dispatch
perfetto: Add flows
wsi/wayland: Add perfetto flows to image acquisition and presentation
wsi/wayland: Add flow id to presentation feedback
wsi/wayland: Add timing debugging
perfetto: Add simple support for counters
wsi/wayland: Add latency information to perfetto profiling
perfetto: Add some functions for timestamped events
wsi/wayland: Add a perfetto track for image presentation
wsi/wayland: Add tracepoint in wsi_wl_swapchain_wait_for_present
wsi/wayland: Fix use after free from improperly stored VkAllocationCallbacks
wsi/wayland: Use different queue names for different queries
Dmitry Baryshkov (1):
freedreno/registers: drop display-related register files
Dmitry Osipenko (1):
venus: make cross-device optional
Doug Brown (1):
xa: add missing stride setup in renderer_draw_yuv
Dr. David Alan Gilbert (1):
treewide: Cleanup unused structs
Dylan Baker (12):
meson: use glslang –depfile argument when possible
clc: remove check for null pointer that cannot be true in llvm_mod_to_spirv
compiler/glcpp: don’t recalculate macro
intel/compiler: move predicated_break out of backend loop
anv/grl: add some validation that we’re not going to overflow
egl/wayland: fix memory leak in error handling case
compilers/clc: Add missing break statements.
mesa: fix memory leak when using shader cache
util/glsl2spirv: fixup the generated depfile when copying sources
tgsi_to_nir: free disk cache value if the size is wrong
crocus: properly free resources on BO allocation failure
crocus: check for depth+stencil before creating resource
Echo J (3):
nvk: Add sha1_h as a dependency
d3d10umd: Use pipe_resource_usage enum in translate_resource_usage()
util: Fix the integer addition in os_time_get_absolute_timeout()
Eli Schwartz (2):
meson: create libglsl declared dependency to propagate order-only deps
meson: add various generated header dependencies as order-only deps
Emma Anholt (13):
nir,panfrost,agx: Fix driver PIXEL_COORD_INTEGER setting and drop workaround.
dri: Fix a pasteo in dri2_from_names()
dri: Consistently use createImageWithModifiers2()
dri: Consistently use createImageFromFds2(), not createImageFromFds()
dri: Replace createImageFromDmaBufs() with createImageFromDmaBufs3()
dri: Drop old createImageFromRenderbuffer()
dri: Consistently use createImageFromDmabufs() not createImageFromFds()
dri: Drop createImageFromFds2() in favor of createImageFromDmaBufs()
dri: Move EGL image lookup/validate setup to dri_init_screen()
mesa: Drop some version checking around ValidateEGLImage
dri: Collapse dri2_validate_egl_image() into dri_validate_egl_image()
dri: Fold lookup_egl_image_validated into its one caller
dri: Drop the old lookupEGLImage wrapper function.
Eric Engestrom (295):
VERSION: bump to 20.2
docs: reset new_features.txt
docs: add release notes for 24.0.6
docs: update calendar for 24.0.6
docs: add an extra 24.0.x release
docs: add sha256sum for 24.0.6
docs: update calendar for 24.1.0-rc1
ci: fix container rules on release branches and tags
panvk/ci: add WSI testing to all the deqp-vk jobs
lavapipe/ci: add WSI testing to all the deqp-vk jobs
freedreno/ci: add flake
lavapipe/ci: add flakes
ci: pass MESA_VK_ABORT_ON_DEVICE_LOSS through to the DUT
rpi3/ci: drop duplicate comment without any corresponding actual skip line
v3dv/ci: skip all the WSI tests, they are way too flaky to be worth it
spirv: deduplicate default debug log level
v3dv/ci: add rpi5 failure
ci: mark microsoft farm as offline
meson: simplify `-gsplit-dwarf` compiler argument check
egl+glx: fix two #ifdef that should be #if like the rest
meson: always set USE_LIBGLVND
meson: use bool.to_int() instead of manually converting
lavapipe/ci: drop fixed test from failures
lavapipe/ci: add the rest of the failures introduced by the 1.3.8.2 uprev
lavapipe/ci: skip another test that goes over the timeout
meson: move tsan-blacklist.txt to build-support with the other build support files
llvmpipe/ci: fix indentation
llvmpipe/ci: only run jobs when their corresponding files are changed
lavapipe/ci: fix indentation
lavapipe/ci: avoid running all lavapipe jobs when llvmpipe ci is changed
lavapipe/ci: only run jobs when their corresponding files are changed
docs: update calendar for 24.1.0-rc2
llvmpipe/ci: trigger jobs on draw & gallivm changes
lavapipe/ci: trigger jobs on draw & gallivm changes
lavapipe/ci: add flakes seen lately
lavapipe/ci: generalize flakes list to all formats for these flaky tests
lavapipe/ci: skip ray tracing tests that sometimes time out
vc4/ci: add fails seen overnight
ci: uprev mold to 2.31.0
lavapipe/ci: skip two more timing out ray query tests
ci: backport fix for gl_PointSize bug in CTS
lavapipe/ci: move a few skips out from under the “llvm jit” comment
mr-label-maker: fix yaml syntax
docs: add release notes for 24.0.7
docs: update calendar for 24.0.7
docs: add sha256sum for 24.0.7
docs: update calendar for 24.1.0-rc3
ci/debian-build-testing: drop extra nesting section
ci/shader-db: drop extra nesting section
rpi4/ci: use deqp-runner suite for vk job as well
rpi5/ci: use deqp-runner suite for vk job
microsoft/clc: fix incorrect changes that got through while the Windows CI was down
llvmpipe: wrap the push/pull in the ifdef as well
radv/ci: add navi21 flakes
zink: avoid designated initializers as they are not supported in C++ < 20
Revert “ci: fail pipeline for users who got access to restricted traces”
radeonsi/ci: document new crash (assert)
util/format: add missing null check in util_format_is_srgb()
ci: drop default VKD3D_PROTON_RESULTS file name
ci: hardcode `-vkd3d` namespace for VKD3D_PROTON_RESULTS
amd/ci: track changes to VKD3D_PROTON_RESULTS files
mr-label-maker: mark *-vkd3d.txt files as CI results expectations files
ci: reuse dead .vkd3d-proton-test to make vkd3d less radv-specific
ci: fix section_end in debian-build-testing
ci: rename debian version variable job to include the word “version”
ci: factor out all the deps to build the debian containers into .debian-container
ci: inherit the debian container building infra for test container images
ci/b2c: rename B2C_TIMEOUT_FIRST_* to B2C_TIMEOUT_FIRST_CONSOLE_ACTIVITY_*
ci/b2c: rename B2C_TIMEOUT_* to B2C_TIMEOUT_CONSOLE_ACTIVITY_*
ci/b2c: allow setting timeouts in seconds
ci: drop dead VK_CPU option
ci/piglit-traces: drop re-definition of VK_DRIVER_FILES
ci/init-stage2: set VK_DRIVER_FILES for both xorg and wayland
ci/vkd3d: un-hardcode architecture
ci/vkd3d: fix version sanity check
ci/vkd3d: fail job when failing to get driver version
ci/b2c: remove dead rules: that’s always overwritten
ci/env: move dead-code-with-comment to the end of the list to make it clearer
zink/ci: rename .zink-lvp-venus-rules to .zink-venus-lvp-rules to match the rest of the names
README: update links to our own docs
docs: update calendar for 24.1.0-rc4
mailmap: add entry to unify Roman Stratiienko’s contributions
nvk/ci: add nvk job on a GA106 (RTX 3060)
zink/ci: add zink+nvk glcts+piglit job on a GA106 (RTX 3060)
zink+nvk/ci: skip glx piglit tests as they all fail
zink+nvk/ci: skip timing out test
zink+nvk/ci: skip more tests that times out
zink+nvk/ci: document flakes seen during stress-testing
zink+nvk/ci: update expected failures
docs: add release notes for 24.0.8
docs: update calendar for 24.0.8
docs: add sha256sum for 24.0.8
docs: add release notes for 24.1.0
docs: add sha256sum for 24.1.0
docs: update calendar for 24.1.0
ci: fix build-kernel.sh -> download-prebuilt-kernel.sh
ci: drop dead variables (see previous commit)
ci: rename debian/arm*_test to debian/baremetal_arm*_test to be clear about which infra uses that
ci: prepare base debian test image for multi-arch
ci: prepare GL debian test image for multi-arch
ci: prepare VK debian test image for multi-arch
ci/image-tags: rename DEBIAN_X86_64_TEST_*_TAG to drop the x86 mention
ci: add debian/arm64_test images for gl & vk
zink/ci: rename zink-turnip collabora rule to make it unambiguous
ci/b2c: add aarch64 tests for gl & vk
turnip/ci: add vkcts jobs on the a750
turnip+zink/ci: add gl & gles CTS jobs on the a750
nvk/ci: adjust the regex for “dut is broken and needs to be rebooted”
nvk/ci: mark the job as failing in case of hangs, instead of silently rebooting
nvk/ci: add missing .test rules to avoid running nvk tests in post-merge pipeline
radv/ci: move amdgpu-specific kernel message warning to src/amd/ci/
ci/b2c: make B2C_JOB_WARN_REGEX optional
zink+nvk/ci: more KHR-GL46.packed_pixels.varied_rectangle.* flakes, so mark the group as flaky
zink+nvk/ci: add more flakes seen in nightly
zink+nvk/ci: spec@ext_external_objects@vk-vert-buf-reuse has been fixed
mr-label-maker: label src/vulkan/wsi/ as wsi
.mailmap: fix email address for @cpmichael
v3dv/ci: fix typo in `renderer_check`
ci: disable debian-build-testing until it can be fixed
vc4/ci: skip VK piglit tests
freedreno/a6xx: fix kernel -> compute handling
zink+nvk/ci: add flakes seen in latest nightly run
docs/calendar: add 24.2 branchpoint and release candidates schedule
panfrost/ci: add missing genxml trigger path
panfrost: mark tests as fixed
etnaviv/ci: skip VK piglit tests
radv/ci: document angle regressions from !29436 on stoney
zink+nvk/ci: add flakes seen in latest nightly run
docs/meson: replace deprecated pkgconfig with pkg-config
zink+nvk/ci: add flakes seen in latest nightly run
v3dv: add missing bounds check in VK_EXT_4444_formats
docs: add release notes for 24.1.1
docs: add sha256sum for 24.1.1
docs: update calendar for 24.1.1
turnip/ci: add a750 flakes seen in the latest nightly
radv/ci: fix manual rules
radv/ci: move radv manual rules into their own group
nvk+zink/ci: add another flake seen in nightly
docs: add release notes for 24.0.9
docs: update calendar for 24.0.9
docs: add sha256sum for 24.0.9
venus/ci: add flake that’s been blocking MRs
v3d/drm-shim: emulate a rpi4 instead of a rpi3
nvk+zink/ci: add another flake seen in nightly
radv/ci: document navi31 regression from !29235
ci: set a common B2C_JOB_SUCCESS_REGEX with the message that’s printed for all jobs
ci/deqp: uprev gl & gles cts
radeonsi/ci: mark a bunch of tests as fixed on vangogh
radv/ci: drop duplicate navi21-aco flakes line
radv/ci: drop duplicate navi31-aco flakes line
turnip+zink/ci: mark a dEQP-GLES(2|3).functional.rasterization.(fbo|primitives).line_(strip_|)wide as fixed
turnip/ci: add a750 flakes seen in the latest nightly
panfrost/ci: remove duplicate path
nvk+zink/ci: mark KHR-GL46.sparse_texture2_tests.SparseTexture2* as fixed
nvk+zink/ci: add flakes seen in nightly pipeline
nvk+zink/ci: consider all the `double` tests in spec@glsl-4.00@execution@built-in-functions to be flaky
freedreno/ci: disable mid-testing reboot on a750
driconf: drop param for setting default gpu vendor id in DRI_CONF_FORCE_VK_VENDOR()
egl: fix teardown when using xcb
egl: move android-specific code into an android branch
egl: ensure future platforms get their teardown implemented
egl/device: drop unnecessary intermediate variable
ci: fix meson install script
lavapipe/ci: update trace checksum following nir change
lavapipe/ci: document regression while it’s being worked on
turnip+zink/ci: mark dEQP-GLES3.functional.fbo.depth.depth_test_clamp.* tests as fixed
bin/ci: escape literal url in regex
glx: fix build -D glx-direct=false
nvk+zink/ci: mark spec@ext_image_dma_buf_import@ext_image_dma_buf_import-refcount-multithread as fixed
nvk+zink/ci: add flakes seen over the last few nightlies
asahi/lib: generate git_sha1.h for agx_device.c
ci/vkd3d: deduplicate the diff between the expectation and the results
ci/vkd3d: print a message when the expected failures file is missing
ci/vkd3d: drop override of job artifacts
ci/vkd3d: fix error message printing
ci/vkd3d: stop ignoring errors in a block where errors can’t happen
ci/vkd3d: don’t ignore errors
ci/vkd3d: group version check lines together
ci/vkd3d: limit the vulkaninfo capture to the driverInfo line
ci/vkd3d: print a real error message when failing to get the list of failing tests
ci/vkd3d: rename vkd3d test log file to end in .txt
ci/vkd3d: print URL to the vkd3d-proton.log file to make it easier to access
ci/vkd3d: put `then` on the same line as the `if` to match the rest of the code style
ci/vkd3d: drop the “clear results folder without deleting the folder” logic
ci/vkd3d: drop `quiet` wrapper
ci/vkd3d: drop redundant “vkd3d-proton execution: SUCCESS”
docs: add release notes for 24.1.2
docs: add sha256sum for 24.1.2
docs: update calendar for 24.1.2
venus/ci: fix indentation of list nested in a dict item
venus/ci: add manual/nightly venus-lavapipe-full
venus/ci: skip timed out test
nvk+zink/ci: add flakes seen over the last two nightly runs
nvk+zink/ci: catch more `double` flakes
venus+zink/ci: drop fraction and add missing timeout on zink-venus-lvp
loader: use os_get_option() to allow android to set LIBGL_DRIVERS_PATH, GBM_BACKENDS_PATH, GALLIUM_PIPE_SEARCH_DIR
gallium/hud: use os_get_option() to allow android to set GALLIUM_HUD and related vars
egl: use os_get_option() to allow android to set EGL_LOG_LEVEL
venus/ci: make sure nightly job doesn’t get retried
venus/ci: drop fixed test from fails list
docs/ci: fix indentation of list nested in a dict item
docs/ci: merge test-docs and test-docs-mr
docs/ci: auto-run test-docs in fork pipelines
docs/ci: drop .no_scheduled_pipelines-rules from test-docs
ci: reorder alpine/x86_64_build rules to fix the nightly pipelines
drm-shim: stub syncobj_timeline_signal ioctl
llvmpipe/ci: add comment for later on weird-looking code
llvmpipe/ci: fix indentation of list nested in a dict item
llvmpipe/ci: set rusticl variables in deqp-runner instead of passing them down from the job
ci: include rusticl in the arm64 build
llvmpipe,rusticl/ci: move rusticl files rule out of llvmpipe
v3d/ci: add nightly job for rusticl testing
panfrost/ci: drop duplicate job rules
panfrost/ci: split gl & vk jobs rules
radeonsi/ci: mark test as fixed
lavapipe/ci: skip timing out test
broadcom/ci: disable auto-retry on manual jobs
docs/features: mark VK_KHR_maintenance7 as implemented on anv and lvp
docs: add release notes for 24.1.3
docs: update calendar for 24.1.3
docs: add sha256sum for 24.1.3
ci_run_n_monitor: add support for new `canceling` job status
ci_run_n_monitor: be coherent about using sets for `element in group` checks
ci_run_n_monitor: use COMPLETED_STATUSES in more places
ci_run_n_monitor: add RUNNING_STATUSES and use it where appropriate
bin/ci: allow bugfixes in requirements.txt
ci: split .no-auto-retry out of .scheduled_pipeline-rules
ci: simplify setting .no-auto-retry now that it isn’t bundled with unrelated rules:
v3d/ci: include results of GL full run in expectations
v3d/ci: include results of CL run in expectations
zink+nvk/ci: ascii-sort fails
zink+nvk/ci: document regression from !30033
turnip+zink/ci: add two more CS related flakes
lvp+zink/ci: document a flake seen in a merge pipeline
v3d/ci: add disabled job for GL testing on the RPi5
v3d/ci: rename “rusticl on v3d” suite to `v3d-rusticl`
v3d/ci: add disabled job for CL testing on the RPi5
features.txt: specify that VK_EXT_depth_clamp_zero_one is only supported on v3dv/vc7+
features.txt: specify that VK_EXT_depth_clip_enable is only supported on v3dv/vc7+
features.txt: specify that GL_ARB_depth_clamp is only supported on v3d/vc7+
docs: add release notes for 24.1.4
docs: update calendar for 24.1.4
docs: add sha256sum for 24.1.4
ci: replace gallium-drivers=swrast with gallium-drivers=llvmpipe,softpipe
bin/ci_run_n_monitor: explain that the ‘Universal Recycling symbol’ ♲ emoji means these jobs were cancelled
bin/ci_run_n_monitor: add text labels next to the emojis
bin/ci_run_n_monitor: replace ♲ with 🗙 to represent cancelled jobs
meson: fix filename printed when generating devenv files
meson/megadriver: fix install message to match the rest of meson
meson/megadriver: stop removing the “master” .so file
meson/megadriver: replace hardlinks with symlinks
ci/vkd3d: fix LD_LIBRARY_PATH
v3d/ci: mark spec@amd_performance_monitor@vc4 tests as fixed
llvmpipe/ci: mark spec@!opengl 1.1@gl_select tests as fixed
Revert “bin/ci_run_n_monitor: explain that the ‘Universal Recycling symbol’ ♲ emoji means these jobs were cancelled”
VERSION: bump for 24.2.0-rc1
.pick_status.json: Update to 0cc23b652401600e57c278d8f6fe6756b13b9f6a
radeonsi/ci: skip timing out test
freedreno/ci: double job timeout for a306
freedreno/ci: document extra variants of failing tests on a618 and a630
anv+zink/ci: mark some tests as fixed
anv+zink/ci: document two tests, one failing and one crashing
anv+zink/ci: mark a couple of tests as flaky
venus/ci: skip timing out test
loader: gc loader_get_extensions_name() and __DRI_DRIVER_{GET_,}EXTENSIONS defines
.pick_status.json: Update to 3b6867f53a6718de80bbff4acb84ffd5aca8a8c8
nak: fix meson typo
venus: initialize bitset in CreateDescriptorPool()
v3d/ci: mark spec@amd_performance_monitor@vc4 tests as flaky
meson: xcb & xcb-randr are needed by the loader whenever x11 is built
.pick_status.json: Update to c30e5d44b1027ed03a8fd542829df0055d3e1a96
.pick_status.json: Update to 6cd4372460b197fea98d257217328ddc3406e6ad
docs: add stub header for u_format_gen.h
.pick_status.json: Update to c33d2db06ac0ea4d3d5372caa93bee3bbbe028c7
VERSION: bump for 24.2.0-rc2
.pick_status.json: Update to ad90bf0500e07b1bc35f87a406f284c0a7fa7049
ci/baremetal: fix logic for retrying boot when it failed
meson: don’t select the deprecated `swrast` option ourselves
meson: improve wording of “incompatible llvm options” error
ci: remove llvmpipe in the job that disables llvm
.pick_status.json: Update to aa9745427b917bb0613b753ccd59c6c1e6f07584
VERSION: bump for 24.2.0-rc3
.pick_status.json: Update to 366e7e2ddc7d3b340bbf040eca1d3223219e6122
meson,ci: remove dead `kmsro` option in `gallium-drivers`
.pick_status.json: Mark 93f9afa1e039cbf681adcc6d170aec987d9f0f65 as denominated
.pick_status.json: Mark f427c9fe233e862bfa30d0c7441ce77592ce4654 as denominated
.pick_status.json: Update to d58f7a24d1be7b8b50ebdc0c1c3ce26bd65317a5
.pick_status.json: Update to d9849ac46623797a9f56fb9d46dc52460ac477de
.pick_status.json: Update to ef88af846761ca9e642f7ed46011db7d3d6b61fd
VERSION: bump for 24.2.0-rc4
.pick_status.json: Update to c90e2bccf756004e48f9e7e71e555db0d03c1b98
ci: pass MESA_SPIRV_LOG_LEVEL from job to the test
android: fix build in multiple ways
.pick_status.json: Update to 214b6c30406f844560bdf35a54ff8a51ee248709
.pick_status.json: Update to cc2dbb8ea5329b509d79eedb6c0cbb9a1903b5ad
Eric R. Smith (8):
panfrost: add a barrier when launching xfb jobs in CSF
get_color_read_type: make sure format/type combo is legal for gles
glsl: test both inputs when sorting varyings for xfb
glsl: make the xfb varying sort stable
panfrost: fix some omissions in valhall flow control
panfrost: change default rounding mode for samplers
panfrost: fix texture.border_clamp regression for valhall
panfrost: use RGB1 component ordering for R5G6B5 pixel formats
Erico Nunes (6):
ci: lima farm maintenance
lima/ci: update piglit ci expectations
Revert “ci: lima farm maintenance”
lima: fix surface reload flags assignment
mesa/st: don’t set lower_fdot in draw_nir_options
dri: fix sun4i-drm driver name
Erik Faye-Lund (106):
panfrost: add PAN_MAX_TEXEL_BUFFER_ELEMENTS define
panfrost: clamp buffer-size to max-size
panfrost: remove nonsensical assert
panfrost: do not deref potentially null pointer
panfrost: check return-value from u_trim_pipe_prim
panfrost: assert that drmSyncobjWait returns 0
panfrost: check return-code of drmSyncobjWait
panfrost: correct first-tracking for signature
panvk: drop needless null-check
panvk: do not leak bindings
panvk: drop needless null-checks
panvk: avoid dereferencing a null-pointer
docs/panfrost: compact gpu-table
docs/panfrost: move details to separate articles
docs/panfrost: link to conformant products
panfrost: simplify panfrost_texture_num_elements
panfrost: explicitly loop over surfaces
panfrost: untangle faces from layers
util/format: correct a typo
mesa/main: rewrite mipmap generation code
mesa/main: remove unused function
mesa/main: rework GL_IMAGE_PIXEL_TYPE query
mesa/main: clean up _mesa_uncompressed_format_to_type_and_comps
mesa/main: clean up switch statement
mesa/main: do not return _REV format for uncompressed format
mesa/main: prefer non-suffixed enums
mesa/main: fixup indent
mesa/main: updates for EXT_texture_format_BGRA8888
docs: wrap long words instead of overflowing
meson: bump test-timeout
mesa/main: remove unused function
panfrost: lower maxVertexInputStride to match vulkan runtime
mesa/main: remove stale prototype
mesa/main: remove duplicate error-checks
mesa/main: require EXT_texture_integer for GL 3.0
mesa/main: do not allow RGBA_INTEGER et al in gles3
mesa/main: factor out format/type enum checking
mesa/main: use extension-helper
mesa/main: tighten rg/half-float interaction
mesa/main: use _mesa_is_gles1()-helper
mesa/main: remove needless check
mesa/main: simplify conditions
mesa/main: merge identical checks
panvk: move macro-definition to header
mailmap: invert tomeu’s mapping
mailmap: merge Robert and Bob Beckett into one
mailmap: invert my mailmapping
mailmap: map collabora.co.uk to collabora.com
mailmap: move konstantin to the right sorted position
mailmap: use consistent spelling for constantine
mailmap: update rohan’s primary email address
nir: fix utf-8 encoding-issue
Revert “docs: use html_static_path for static files”
docs: edgeflag -> edge flag
docs: zink -> Zink
docs: Anv -> ANV
docs: tgsi -> TGSI
docs: hw -> HW
docs: mooth -> smooth
docs: unify spelling of front/back-facing
docs: eg. -> e.g.
docs: url -> URL
docs: nabled -> enabled
docs: sommelier -> Sommelier
docs: remove apostrophe from uppercased
docs: utrace -> trace
docs: google -> Google
docs: Nvidia -> NVIDIA
docs: ssbo/ubo -> SSBO/UBO
docs: cpu -> CPU
docs: gpu -> GPU
docs: renderpass -> render pass
docs: spell out “stencil reference”
docs: submision -> submission
docs: Steamos -> SteamOS
docs: colour -> color
docs: occured -> occurred
docs: precidence -> precedence
docs: undifined behaviour -> undefined behavior
docs: debian -> Debian
docs: zink -> Zink
docs: vulkan -> Vulkan
docs: attachements -> attachments
docs: acress -> across
docs: pluggins -> plug-ins
docs: pusbuf -> pushbuf
docs: metadatas -> metadata
docs: use os.pardir
docs: allow out-of-tree docs build
meson: build html-docs
docs: automatically generate depfile
meson: error when missing hawkmoth
meson: allow specifying html-docs-path
ci: build docs using meson
panvk: support x11 wsi
vulkan/runtime: tne -> the
vulkan/runtime: initizlie -> initialize
vulkan/runtime: abreviation -> abbreviation
vulkan/runtime: multiesample -> multisample
vulkan/runtime: implementaiton -> implementation
docs: fix bootstrap-extension
docs/panfrost: fix numbered list
docs/panfrost: fix math-notation
docs/panfrost: use math-role more
docs/panfrost: use c:func-role for function
docs/panfrost: quote identifiers
Esdras Tarsis (1):
nvk: Enable 8bit and 16bit access in VK_KHR_workgroup_memory_explicit_layout.
Faith Ekstrand (297):
nak: Don’t saturate depth writes
nvk: Only clip Z with the guardband
nouveau/class_parser.py: Fix the docs for –out-rs
nvk: Advertise VK_EXT_pipeline_robustness
nouveau/headers: Clean up the meson a bit
spirv: Auto-generate spirv_info.h
spirv: Update the JSON and headers
spirv: Better handle duplicated enums in the JSON parser
spirv: Generate a spirv_capabilities struct
spirv: Record capabilities rather than ad-hoc bools
mesa: Stop pretending to support SPV_AMD_gcn_shader in OpenGL
spirv: Move the old AMD extensions out of capabilities
spirv: Move the printf enable out of capabilities
spirv: Add supported_capabilities to vtn_builder
spirv: Use supported_capabilities for various checks
spirv: Drop the SubgroupUniformControlFlow check
spirv: Add a table of all implemented capabilities
spirv: Check capabilities using the supported_capabilities table
spirv: Add support for specifying caps through the new struct
spirv: Use spirv_capabilities in tests
mesa: Flip the script on SPIR-V extension enabling
mesa: Use the new spirv_capabilities struct
clover: Use the new spirv_capabilities struct
rusticl: Use the new spirv_capabilities struct
vulkan: Set SPIR-V caps from supported features
radv: Use vk_physical_device_get_spirv_capabilities()
intel/kernel: Use the new capabilities struct
asahi/clc: Use the new spirv_capabilities struct
zink: Use the new spirv_capabilities struct
anv: Use spirv_capabilities for the float64 shader
ir3: Use spirv_capabilities in ir3_cmdline
microsoft: Use spirv_capabilities for spirv_to_dxil
spirv: Get rid of the old caps struct
nvk: Re-emit sample locations when rasterization samples changes
nvk/meta: Restore set_sizes[0]
nvk: Get rid of sets_dirty
nvk: Don’t rely on push_dirty for which push sets exist
nouveau/headers: Add a bool for whether or not to dump offsets
nvk/upload_queue: Only upload one line of data
nvk/upload_queue: Add some useful asserts
nvk/upload_queue: Add a _fill method
nvk: Use the upload queue for NVK_DEBUG=zero_memory
nvk: Improve the GetMemoryFdKHR error
nouveau/winsys: Take a reference to BOs found in the cache
nouveau/winsys: Make BO_LOCAL and BO_GART separate flags
nvk: Allow GART for dma-bufs
nil: Use the right PTE kind for Z32 pre-Turing
nvk: Set color/Z compression based on nil_image::compressed
nil: Default to NV_MMU_PTE_KIND_GENERIC_MEMORY on Turing+
nvk: Allow VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT
drm-uapi: Sync nouveau_drm.h
nouveau/winsys: Add back nouveau_ws_bo_new_tiled()
nvk: Support image creation with modifiers
nvk: Set tile mode and PTE kind on dedicated dma-buf BOs
nvk: Implement DRM format modifier queries
nvk: Advertise VK_EXT_queue_family_foreign
nvk: Advertise VK_EXT_image_drm_format_modifier
vulkan/wsi: Bind memory planes, not YCbCr planes.
nvk/wsi: Advertise modifier support
zink: Set workarounds.can_do_invalid_linear_modifier for NVK
nvk: Fix misc. whitespace and style issues
nvk: Go wide for query copies
nvk: Store descriptor set addresses in descriptor state
nvk: Add static asserts for nvk_buffer_address layout
nvk: Store an nvk_buffer_address for each set in the root table.
nvk: Advertise 32 descriptor sets
nvk: Move and better document set_dynamic_buffer_start
nvk: Add an NVK_MAX_SAMPLES #define
nvk: Refactor nvk_meta_begin() to use a desc helper
nvk/meta: Save and restore set_dynamic_buffer_start
nak: Emit !PT for carries on IADD3
nak: Add with -0 for fabs()
nak: Don’t emit a plop3 for immediate shift sources
nak: Encode LDC directly
vulkan: Update XML and headers to 1.3.286
spirv: Update the JSON and headers
nir: Handle cmat types in lower_variable_initializers
spirv: Handle constant cooperative matrices in OpCompositeExtract
spirv: Assert that non-vector composites have the right length
spirv: Implement SPV_EXT_replicated_composites
nvk: Advertise VK_EXT_shader_replicated_composites
anv: Advertise VK_EXT_shader_replicated_composites
hasvk: Advertise VK_EXT_shader_replicated_composites
radv: Advertise VK_EXT_shader_replicated_composites
turnip: Advertise VK_EXT_shader_replicated_composites
lavapipe: Advertise VK_EXT_shader_replicated_composites
dozen: Advertise VK_EXT_shader_replicated_composites
nir/print: Improve divergence information
nak: Fix NAK_DEBUG=serial for warp barriers
nak: Only convert the written portion of the buffer in NirInstrPrinter
nak: Fix BasicBlock::phi*() for OpAnnotate
nak: BMov is always variable-latency
nak: Only copy-prop neg into iadd2/3 if no carry is written
nak: Get rid of OpINeg
nak: Expose a BasicBlock::map_instrs() helper
nak: Add some helpers for uniform instructions and registers
nak: Add OpR2UR
nak: Clean up bindless cbuf handles
nak/ra: Move an assert
nak: Make SSARef::file() return Option<RegFile>
nak: Drop BasicBlock::new()
nak: Add a concept of uniform blocks
nak/to_cssa: Resolve phi register file mismatches
nak/ra: Spill UGPRs and UPreds
nak/ra: Never move uniform regs in non-uniform blocks
nak: Support uniform regs in lower_copy_swap()
nak/sm70: Defer ALU src processing until encode_alu()
nak/sm70: Rework ALU source encode helpers
nak/sm70: Add support for encoding uniform ALU ops
nak/sm70: Fix encoding of fadd/fsetp and friends with UGPRs
nak/sm70: Implement a bunch of uniform ops on SM75+
nak/legalize: Fold immediate sources before instructions
nak/legalize: Drop some pointless plop3 logic
nak/legalize: Be more precise about shfl and out
nak/legalize: Fix imad and ffma legalization on SM50
nak/legalize: Patch a RegFile through to copy helpers
nak/legalize: Handle uniform sources in warp instructions
nak/legalize: Ensure all SSA values for a given ref are in the same file
nak/legalize: Copy uniform vectors in non-uniform control-flow
nak/legalize: Uniform instructions can’t have cbuf sources
nak/legalize: Explicitly ignore OpPhiSrcs and OpPhiDsts
nak/calc_instr_deps: Rename a couple variables
nak/calc_instr_deps: Rewrite calc_delays() again
nak/calc_instr_deps: Add latencies for uniform instructions
nak: Add a opt_uniform_instrs() pass
nak/copy_prop: Rewrap a couple comments
nak/copy_prop: Don’t propagate UBOs into uniform instructions
nak/lower_cf: Parent scopes are never NULL
nak/lower_cf: Track block divergence
nak: Convert to LCSSA before divergence analysis
nak/lower_cf: Flag phis as convergent when possible
nak/from_nir: Clean up phi annotations
nak: Add a UniformBuilder
nak/from_nir: Emit uniform instructions when !divergent
nak/sm70: Properly encode bindless cbufs
nak/dce: Account for bindless CBuf handles
nak/calc_instr_deps: Account for bindless CBufs
nak/bitset: Add an iterator
nak/ra: Handle bindless CBufs
nak/ra: Pull searching for unused/unpinned regs into a helper
nak/ra: Rename PinnedRegAllocator to VecRegAllocator
nak/ra: Add a concept of pinned registers to RegAllocator
nak: Add OpPin and OpUnpin
nak/legalize: Allow pinned uniform vectors in non-uniform blocks
nak/legalize: Bindless cbufs must be pinned in non-uniform blocks
nak/copy_prop: Don’t propagate bindless cbufs into non-uniform blocks
nir: Add some new _nv intrinsics
nvk,nak: Switch to nir_intrinsic_ldc_nv
nak: Implement r2ur_nv
nak: Implement [un]pin_cx_handle_nv
nir: Add nir_foreach_block_in_cf_node_safe() iterators
nak: Lower non-uniform ldcx_nv to global loads
nak: Implement nir_intrinsic_ldcx_nv
nvk: Split SSBO and UBO address formats
nvk: Split write_[dynamic_]buffer_desc into UBO and SSBO variants
nvk: Align buffer descriptors
nvk: Rename nvk_cmd_buffr_get_cbuf_descriptor()
nvk: Make nvk_min_cbuf_alignment() inline
nvk/lower_descriptors: Add a descriptor_type_is_ubo/ssbo() helper
nvk: Move the zero offset optimization to load_descriptor_for_idx_intrin()
nvk: Allow the cbuf optimization for VK_DESCRIPTOR_TYPE_MUTABLE_EXT
nvk/descriptor_set_layout: Record which dynamic buffers are UBOs
nvk: Use bindless cbufs on Turing+
nvk: Be much more conservative about rebinding cbufs
nvk: Use cbuf loads for variable pointers dynamic SSBO descriptors
nvk: s/draw_idx/draw_index/g
nvk: Pass the base workgroup and global size to flush_compute_state()
nvk: Use helper macros for accessing root descriptors
nvk: Pass the queue to draw/dispatch_state_init()
nvk: Use inline constant buffer updates for CB0
nvk: Only write draw parameters to cb0 when they change
nvk: Refactor build_cbuf_map()
nak,nir: Drop r2ur_nv in favor of as_uniform
nouveau: Fix a race in nouveau_ws_bo_destroy()
nvk: Use NVK_VK_GRAPHICS_STAGE_BITS in dirty_cbufs_for_descriprots()
nvk: Dirty cbufs in CmdPushDescriptorSetWithTemplate2KHR
intel/blorp: Set nir_shader::options up-front before building
util/format_pack: Fix packing of signed 1010102 SSCALED formats
util/format_pack: Also use iround for SCALED formats
util/format_pack: Clamp SNORM values to [-1, 1] when unpacking
util/format: Round to nearest even when converting to R11G11B20F
util/format: Handle denorms when converting to R11G11B10F
nir/format_convert: Smash NaN to 0 in pack_r9g9b9e5()
nir/format_convert: Use fmin/fmax to clamp R9G9B9E5 data
nir: Add a nir_intrinsic_use for unit tests
nir: Move most of nir_format_convert to a C file
nir: Support 0 and 32 bits in some format conversion helpers
util: Make format_srgb.h C++ safe
nir: Add a format pack helper and tests
nir: Add a format unpack helper and tests
nir/format_convert: Assert that UNORM formats are <= 16 bits
ci: Update trace SHAs
vulkan/meta: Use demote instead of discard
nvk: Fix whitespace issues around conservative rasterization
nvk: Re-order conservative rasterization checks
nvk: Don’t emit conservative rasterization before Maxwell B
nvk: Silently fail to enumerate if not on nouveau
util/cnd_monotonic: Move the guts to a c file
util/cnd_monotonic: Use a void * on Windows
vulkan/wsi/wayland: Use mtx_t and u_cnd_monotonic
vulkan/wsi/x11: Use c11/threads for thread spawning
vulkan/wsi/x11: Use mtx_t and u_cnd_monotonic
vulkan/wsi/display: Use mtx_t and u_cnd_monotonic
vulkan/wsi/queue: Use mtx_t and u_cnd_monotonic
vulkan/wsi: Delete wsi_init_pthread_cond_monotonic
vulkan: Use u_cnd_monotonic for vk_sync_timeline
nvk: Why are nvk_image.c/h writeable?
nvk: Bump the sparse alignment requirement on buffers to 64K
nvk: Align sparse-bound images to the sparse binding size
nvk: Fetch debug flags from the physical device
nvk: Initialize the debug flags in nvk_instance
nvk: Add the start of a KMD abstraction
nvk/nvkmd: Implement dev and pdev for nouveau
nvk: Use the NVKMD interface for device enumeration
nvk/nvkmd: Add memory and virtual address interfaces
nvk/nvkmd: Implement the mem and va interfaces for nouveau
nvk: Add static wrappers for image/buffer binding
nvk: Use nvkmd_mem for nvk_device_memory
nvk: Use nvkmd_mem for nvk_image::linear_tiled_shadow_mem
nvk: Use nvkmd_mem for nvk_cmd_pool
nvk: Use nvkmd_mem for nvk_descriptor_pool
nvk: Use nvkmd_mem in nvk_upload_queue
nvk: Use nvkmd_mem for descriptor tables
nvk: Use nvkmd_mem for shader and event heaps
nvk: Use nvkmd_mem for query pools
nvk: Use an nvkmd_mem for the SLM area
nvk: Drop extra_bos from nvk_queue_submit_simple()
nvk: Use nvkmd_mem for the nvk_queue_state::push
nvk: Use nvkmd_mem for the zero page, VAB, and CB0
nvk/nvkmd: Add a context interface
nvk/nvkmd: Implement nvkmd_ctx for nouveau
nvk: Convert the upload queue to nvkmd_ctx
nvk: Use an nvkmd_ctx for sparse binding
nvk: Use nvkmd_ctx for queue submit
nvk: Remove the last vestages of nouveau/winsys from core NVK
nouveau/mme: Don’t dereference an empty vector
nouveau/mme: Don’t leak data_bo
nouveau/mme: Use fixed BO addresses in the MME test
nvk: Move Heaps and BO binding into nvkmd
nvk: Move debug flags int nvk_debug.h
nvk/nvmkd: Plumb parent pointers through everywhere
nvk/nvkmd: Re-implement NVK_DEBUG=vm
nvk: Do mem maps directly in nvkmd on nouveau
nvk/nvkmd: Add real mem<->bo flag translation
nvk/nvkmd: Flip the script on NO_SHARED
nvk: Drop nvk_buffer::is_local
nvk/nvkmd: Rework memory placement flags
nvk/nvkmd: Be more specific about memory alignments
nvk/nvkmd: Be a lot more pedantic about VA alignments
nvk: Put CB0 in VRAM
nvk: Put descriptors in VRAM
nouveau/push: Cache the last header DW to avoid read-back
nak/sph: Stop storing the shader model in ShaderProgramHeader
nak: Move encode_sm* to to sm*.rs
nak/sm50: Get rid of the hand-rolled align_up/down() helpers
nak: Plumb a ShaderModel trait through everywhere
nak/ra: Move the NAK_DEBUG=spill logic into RA
nak: Move RegFile::num_regs() into ShaderModel
nak: Move Instr::can_be_uniform() into ShaderModel
nak: Move instruction encoding into ShaderModel
nak/sm70: Move instruction encoding into a trait
nak/sm70: Re-organize the code a bit
nak/legalize: Move a bunch of helpers to a trait
nak/legalize: Handle OpBreak and OpBSSy specially
nak/legalize: Handle RA instructions up-front
nak/sm70: Move legalization into SM70Op
nak/sm50: Move instruction encoding into a trait
nak/sm50: Move legalization into SM50Op
nak: Add a legalize() method to ShaderModel
nak/sm50: Re-order all the ops
nak/sm50: Fix immediates for IMnMx
zink/kopper: Set VK_COMPOSITE_ALPHA_OPAQUE_BIT when PresentOpaque is set
nak: gather instr count explicitly
nvk/nvkmd: nouveau uses the OS page size
nvk: Drop the sparse alignment back down to 4096
nvk: Use the page size queried from NVKMD
nak/nir: Use an indirect load for sample locations
nak/copy_prop: Propagate OpSel with a selector of SrcRef::Zero
nak/copy_prop: Ignore the top 16 bits of OpPrmt::sel
nak: Don’t print the destination of OpIpa twice
nir,nak: Add a nir_op_prmt_nv
nak/nir: Use prmt in texture lowering
nak/nir: Use prmt for barycentric offset lowering
nak/nir: Make interpolate_at_sample more efficient
nak: Add some helpers for working with OpPrmt selectors
nak: Optimize nested OpPrmt
nak: Add a pass macro for more consistent debug printing
nak: Run copy-prop again after opt_prmt and opt_lop
nvk: Fix indirect cbuf binds pre-Turing
nvk: Don’t advertise sparse residency on Maxwell A
nvk: Reject sparse images on Maxwell A and earlier
nak/spill_values: Don’t assume no trivial phis
meson/megadriver: Don’t invoke the megadriver script with no drivers
nak: Sample locations are byte-aligned
nvk: Require color or depth/stencil attachment support for input attachments
nvk: Support STORAGE_READ_WITHOUT_FORMAT on buffers
zink: Align descriptor buffers to descriptorBufferOffsetAlignment
Francisco Jerez (33):
intel/brw/xe2+: Keep PS sample mask in the f1.0 register whether or not kill is used.
intel/brw: Don’t emit Z coordinate interpolation if CPS isn’t in use.
intel/brw/xe2+: Fix indirect extended descriptor setup for scratch space.
iris: Allocate fixed amount of space for blend state.
blorp: Allocate fixed amount of space for blend state.
intel/brw/xe2+: Don’t use SEL peephole on 64-bit moves.
intel/brw/xe2+: Fix 64-bit subgroup scan intrinsics not to rely on SEL instructions.
intel/brw/xe2+: Lower 64-bit SHUFFLE and CLUSTER_BROADCAST.
intel/xe2+: Enable native 64-bit integer arithmetic.
nir: Add option to lower 64-bit uadd_sat.
intel/brw/xe2+: Lower 64-bit integer uadd_sat.
intel/brw/xe2+: Round up spill/unspill data size to nearest reg_size multiple.
intel/xe2+: Enable native 64-bit integer arithmetic.
iris,anv/xe2+: Enable the DX10/OGL border mode for YCrCb as per Wa_14014226147.
iris,anv/xe2+: Set tessellation redistribution regions per patch to recommended values.
iris,anv/xe2+: Use pipelined variant of 3DSTATE_DRAWING_RECTANGLE.
intel/brw/xe2+: Use active-thread-only barriers available since Xe2+.
iris/xe2+: Fix format of scratch space surface address in various 3DSTATE packets.
anv/xe2+: Fix format of scratch space surface address in various 3DSTATE packets.
intel/fs/gfx20+: Fix surface state address on extended descriptors for NIR scratch intrinsics.
intel/fs/xe2+: Ask driver for PS payload registers based on barycentric load intrinsics in use.
iris/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA.
anv/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA.
intel/fs/xe2+: Don’t lower barycentric load offsets to fixed-point format on Xe2+.
intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel offset.
intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel sample.
intel/dev: Add GRF size information to the intel_device_info struct.
anv/xe2+: Align push constant ranges to GRF boundaries.
intel/brw: Implement null push constant workaround.
intel/dev: Add devinfo flag for TBIMR push constant workaround.
anv/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround.
iris/gfx12.5: Pass non-empty push constant data to PS stage for TBIMR workaround.
iris: Pin pixel hashing table BO from iris_batch submission instead of from iris_state.
Friedrich Vock (7):
aco/tests: Insert p_logical_start/end in reduce_temp tests
aco/spill: Insert p_start_linear_vgpr right after p_logical_end
radv: Use max_se instead of num_se where appropriate
radeonsi: Use max_se instead of num_se where appropriate
radv/rt: Fix memory leak when compiling libraries
aco/spill: Don’t spill phis with all-undef operands
aco: Limit rt stages to 128 vgprs
GKraats (3):
i915g: fix generation of large mipmaps
i915g: fix mipmap-layout for npots
i915g: fix max_lod at mipmap-sampling
Ganesh Belgur Ramachandra (4):
radeonsi: add GL_EXT_texture_filter_minmax extension
radeonsi: add GL_ARB_texture_filter_minmax extension
radeonsi: fix eptich on chips without image opcodes (e.g. gfx940)
amd/common: skip lane size determination for chips without image opcodes (e.g. gfx940)
Georg Lehmann (88):
aco/tests: don’t use undef for descriptors
aco/tests/post_ra: fix various validation errors
aco/lower_to_hw: fix v_cvt_pk_u16_u32 instruction format
aco/lower_to_hw: fix 16bit p_insert on gfx8
aco/tests: validate before and after post-ra tests
spirv: preserve signed zero in modf
aco/lower_to_hw: don’t use regClass to identify subdword reductions
aco: add a subdword lowering pass
aco: add tests for lower_subdword
aco/ra: remove gfx6/7 subdword paths
aco/lower_to_hw: remove gfx6/7 subdword paths
ac/nir: explicitly use pack_half_2x16_rtz
radv, radeonsi: don’t use D16 for f2f16_rtz
radv: always run nir_opt_16bit_tex_image
nir/opt_16bit_tex_image: pass options to opt_16bit_dest
nir/opt_16bit_tex_image: optimize packed conversions too
aco/gfx11+: use v_cvt_pk_u8_f32 for 8bit constant copies
aco/gfx10: use v_add_u16 with literal for constant copies
aco/tests: simplify small constant copy test
aco/gfx11+: optimize v_fma_mix throughput
zink: use bitcasts instead of pack/unpack double opcodes
aco/gfx11: use v_swap_b16
aco/optimizer: remove ineffective vcc opt
aco/optimizer: remove ineffective undef opt
aco: remove perfwarn
aco: don’t pass program to emit_bpermute
aco/lower_to_hw: add copy_constant_sgpr
aco: small constant copy optimizations
aco/lower_to_hw: use copy_constant_sgpr for masks
aco/lower_to_hw: optimize split 64bit constant copies
aco/optimizer: use p_create_vector to create mask when a copy can’t be used
nir: remove unpack_half_flush_to_zero
nir/opt_uniform_atomics: handle inverse_ballot when detecting single lane ifs
aco: optimize branching sequence with p_create_vector exec producer
nir: sink/move inverse_ballot like moves
ac: set has_pack_32_4x8
nir: lower pack_uvec4_to_uint to pack_32_4x8 if supported
nir/opt_algebraic: alternative 8bit pack_[us]norm_4x8 lowering
aco: rework how affinities for acc operands are determined
aco: add affinities for possible sopk optimizations
aco/gfx11+: fix inline constants for v_pk_fmac_f16
aco: move literal unswizzle opt to RA
aco/ra: use a switch to check vop2acc instruction support
aco: move s_add_u32 -> s_addk_i32 optimization fully to ra
amd/common: set COMPUTE_STATIC_THREAD_MGMT_SE2-3 correctly on gfx10-11
aco: add more anonymous namespaces
aco: make local functions static in files without anonymous namespace
radv: inline partial push constant loads
nir: add ford, funord, fneo, fequ, fltu, fgeu
aco: implement ford, funord, fneo, fequ, fltu, fgeu
ac/llvm: implement ford, funord, fneo, fequ, fltu, fgeu
ac/nir: enable ford, funord, fneo, fequ, fltu, fgeu
nir/opt_algebraic: look through fabs/fneg when matching fmulz/ffmaz
nir/optimize cmp(a, -0.0)
nir/opt_algebraic: optimize cmp(fneg(a), #b) and feq with fabs
nir/opt_algebraic: add various unordered/ordered patterns from aco
aco: remove ordered/unordered optimizations
aco/ir: remove unused vopc helpers
iris/ci: update trace checksums
aco/ra: fix affinity for s_addk
aco: fix s_delay_alu with salu and trans dependency
aco,nir: add dpp16_shift_amd intrinsic
radv/nir: add a pass to optimize shuffle/booleans dependent only on tid/consts
radv: use radv_nir_opt_tid_function for shuffles
radv: use radv_nir_opt_tid_function to create inverse_ballot
aco/gfx12: use trans s_delay_alu for pseudo scalar
aco/gfx12: don’t allow vgpr operands for pseudo scalar
aco/gfx11.5: select s_cvt_[ui]32_f32
aco/gfx11.5: select s_(ceil|floor|trunc|rndne)
aco: add aco_opcode::p_s_cvt_f16_f32_rtne
aco/gfx11.5: select SALU float conversions
aco/gfx11.5: fix s_fmac acc to definition
aco/gfx11.5: select SOP2 float instructions
aco/gfx11.5: select SOPC float instructions
aco/gfx11.5: select SALU fsat
aco/gfx11.5: select SALU fsign
aco/gfx11.5+: allow sgpr dst for trans ops and use pseudo scalar ops on gfx12
aco/gfx11.5: select SALU fneg/fabs
aco/gfx11.5: select SALU fquantize2f16
aco: micro optimize VALU fquantize2f16
aco: handle clustered uniform reductions correctly
nir: constant fold inverse_ballot
aco: remove optimize_cmp_subgroup_invocation
spirv: ignore more function param decorations
aco/optimizer: update temp_rc when converting to uniform bool alu
aco/gfx11+: don’t use VOP3 v_swap_b16
nir/lower_int64: replace uadd_sat with ior for find_lsb64 and ufind_msb64
aco/gfx10+: set lateKill for sgprs used by wave64 VALU writing a mask
Gert Wollny (4):
zink/kopper: Wait for last QueuePresentKHR to finish before acquiring for readback
mesa/st: don’t use base shader serialization when uniforms are not packed
r600/sfn: Set bit size for newly created store intrinsic
zink: limit minSampleShading to a maxium value of 1.0
Guilherme Gallo (3):
ci: Add S3 id_token for all jobs
ci: Use id_tokens for JWT auth
ci/lava: Fix cmdline for UART/fastboot devices
Hans-Kristian Arntzen (5):
vulkan: Update XML and headers to 1.3.285.
ac/surface: Add surface flags to prefer 4K and 64K alignment.
radv: Implement VK_MESA_image_alignment_control
wsi/common: Do not update present mode with MESA_VK_WSI_PRESENT_MODE.
wsi/x11: Bump maximum number of outstanding COMPLETE events.
Heinrich Fink (1):
zink: remove workaround of FB modifiers forcing present state
Iago Toral Quiroga (53):
v3dv: fix VK_KHR_vertex_attribute_divisor
v3d,v3dv: stop hard-coding max attrib divisor
broadcom/compiler: assert on array overflow
v3d: fix array_len when precompiling outputs for shader-db
broadcom/compiler: fix num_textures for precompiled shaders
broadcom/compiler: don’t read excess channels on image loads
broadcom/compiler: simplify v3d_vir_emit_tex
broadcom/cle: fix up shader record for V3D 7.1.10 / 2712D0
v3d: support 2712D0
v3dv: support 2712D0
broadcom/compiler: make add_node return the node index
broadcom/compiler: don’t assign payload registers to spilling setup temps
broadcom/compiler: apply payload conflict to spill setup before RA
broadcom/compiler: check if vertex shader writes point size
v3dv: only flag ‘shader writes point size’ if the shader actually writes it
v3dv: emit a default point size when drawing points
v3dv: drop unused stride field from v3dv_pipeline_vertex_binding
v3dv: fix incorrect index buffer size
v3dv: use pSizes paramater in vkCmdBindVertexBuffers2
v3dv: implement vkCmdBindIndexBuffer2KHR
v3dv: handle VkBufferUsageFlags2CreateInfoKHR
v3dv: handle VkPipelineCreateFlags2CreateInfoKHR
v3dv: lower maxVertexInputBindingStride to match vulkan runtime
v3dv: shader modules are deprecated with VK_KHR_maintenance5
v3dv: implement vkGetImageSubresourceLayout2KHR
v3dv: refactor create_image
v3dv: add a get_image_subresource_layout helper
v3dv: implement vkGetDeviceImageSubresourceLayoutKHR
v3dv: implement vkGetRenderingAreaGranularityKHR
v3dv: fix pipeline leaks when meta pipeline cache is disabled
v3dv: fix a few asserts that check layerCount instead of array_layers
v3dv: allow VK_REMAINING_ARRAY_LAYERS in VkImageSubresourceLayers
v3dv: remove blit shader restriction on depth/stencil not being linear
v3dv: disable some TLB paths for cases of linear depth/stencil stores
v3dv: support VK_FORMAT_A1B5G5R5_UNORM_PACK16_KHR
v3dv: add more checks for device loss
v3dv: fix handling of pipeline flags when pipeline init fails
v3dv: expose VK_KHR_maintenance5
broadcom/compiler: initialize payload_conflict for all initial nodes
v3dv: don’t call wsi_device_init too early
broadcom/compiler: don’t spill in between multop and umul24
broadcom/compiler: fix per-quad spilling
broadcom/compiler: validate rtop + thrsw hazard
broadcom/compiler: drop multop if we dce umul24
broadcom/compiler: add missing signal compatibilities for V3D 7.x
broadcom/compiler: add new float32 unpack modifiers in V3D 7.x
broadcom/compiler: disallow copy propagation of FMOV exclusive modifiers
broadcom/compiler: implement nir_op_fsat
v3d: don’t lower fsat on V3D 7.x
v3dv: make nir helpers receive nir compiler options from caller
v3dv: don’t lower fsat on V3D 7.x
v3d: skip tlb loads when emitting clears with a draw call
v3d: rename job->clear to job->clear_tlb
Ian Romanick (33):
intel/brw: Fix optimize_extract_to_float for i2f of unsigned extract
intel/brw: Avoid optimize_extract_to_float when it will just be undone later
intel/elk: Fix optimize_extract_to_float for i2f of unsigned extract
nir/algebraic: Optimize some extract_* expressions
spirv: Use fp16 fp_fast_math settings when lowering fp16 asin and acos
intel/brw: Remove dsign optimization
intel/elk: Remove dsign optimization
intel/brw: Use fs_inst::resize_sources in brw_fs_opt_algebraic
intel/brw: Add support for fcsel opcodes
intel/brw: Handle fsign optimization in a NIR algebraic pass
intel/brw: Update CSEL source type validation
intel/brw: Combine constants and constant propagation for CSEL
intel/brw: Algebraic optimizations for CSEL
intel/brw: Implement more strictly correct fsign lowering
intel/brw: Use range analysis to optimize fsign
nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64
nir/search: Fix is_16_bits for vectors
nir/search: Refactor is_16_bits
nir/algebraic: More patterns to generate iadd3
nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers
intel/brw: Temporarily disable result=float16 matrix configs
intel/brw: Major rework of lower_cmat_load_store
intel/brw/xe2+: Catch invalid uses of writes_accumulator earlier
intel/brw/xe2+: Adjust size_read() for DPAS
intel/brw/xe2+: Scale size_written by reg_unit for DPAS
intel/brw/xe2+: Adjust DPAS lowering to DP4A to accommodate larger GRF and SIMD16
intel/brw/xe2+: Allow vec16 for cooperative matrix
nir: dpas_intel second source can have different number of components
intel/brw/xe2+: Add LNL cooperative matrix configurations
intel/tools: Advertise I915_PARAM_HAS_EXEC_TIMELINE_FENCES
intel/brw: Test corner case CSE of ADD3 instructions
intel/brw: Don’t propagate saturate to an instruction that writes flags
intel/elk: Don’t propagate saturate to an instruction that writes flags
Icenowy Zheng (7):
llvmpipe: add shader cache support for ORCJIT implementation
gallivm: orcjit: use a mutex to protect symbol looking up
util: detect LoongArch architecture
gallivm: add LoongArch support to the mattrs setting code
llvmpipe: add LoongArch support in ORCJIT
gallivm: orcjit: keep the ownership of tm for LPJit
gallivm: orcjit: use atexit to release LPJit singleton at exit
Italo Nicola (1):
nir: add {load,store}_global_etna intrinsics
Iván Briano (21):
compiler: reorder FLOAT_CONTROLS enums
nir: track some float controls bits per instruction
spirv: gather some float controls bits per instruction
nir: check inf/nan/sz preserve per-instruction
nir/algebraic: support float controls conditions per instruction
nir/algebraic: move float control conditions to be per instruction
vtn: support float controls2
anv: enable VK_KHR_shader_float_controls2
anv: check requirements for VK_IMAGE_USAGE_FRAGMENT_SHADING_RATE
anv: fix casting to graphics_pipeline_base
anv: consolidate DestroyPipeline for graphics and graphics_lib
intel/brw: fix subgroup size of geometry stages for lnl+
anv: check cmd_buffer is on a transfer queue more properly
intel/brw: add fetch_viewport_index function
intel/brw: always read LAYER/VIEWPORT from the FS payload
vulkan/runtime: pColorAttachmentInputIndices is allowed to be NULL
vulkan/properties: handle LayeredApiPropertiesListKHR
anv: enable VK_KHR_maintenance7
anv: get scratch surface from the correct pool
anv: set MOCS for protected memory when needed
intel/rt: fix terminateOnFirstHit handling
JCWasmx86 (1):
meson: Fix invalid kwarg name
Jeremy Gebben (1):
radv: Return hang status from radv_check_gpu_hangs()
Jesse Natalie (14):
nir_opt_algebraic: Add a couple optimizations for lowered unpack(pack())
wgl: Delete pixelformat support query
wgl: Fix flag check for GDI compat
nir_range_analysis: Use fmin/fmax to fix NAN handling
d3d12: Use GetResourceAllocationInfo instead of GetCopyableFootprints for residency sizes
nir: Remove assert-only variable by inlining its single use
zink: Add ASSERTED to assert-only local variable
mesa: Add ASSERTED to assert-only local variable
subprojects: Use depth=1 in the git wrap files
blake3: fix Windows ARM64 build and detect ARM64EC as ARM64
ci/windows: Disable zlib in LLVM
ci/windows: Specify numpy < 2.0 to prevent breaking changes
microsoft/clc: Split struct copies before vars_to_ssa in pre-inline optimizations
meson: Add an error message for llvmpipe without llvm draw support
Jessica Clarke (3):
Revert “meson: Do not require libdrm for DRI2 on hurd”
Revert “meson: fix with_dri2 definition for GNU Hurd”
meson: egl: Build egl_dri2 driver even for plain DRI
Jianxun Zhang (43):
intel/isl: Allow multi-sample on depth aux usage (xe2)
isl: Add a heading 4KB to MCS surface (xe2)
isl: Add AUX MCS encoding into aux modes (xe2)
blorp: Scaledown rectangle of MSAA fast clear (xe2)
blorp: Fix offset when ambiguating MCS buffer (xe2)
isl: Clone from isl_gfx12.* files (xe2)
isl: Update isl_gfx20 code (xe2)
isl: Add isl_gfx20 into build (xe2)
isl: Add dispatching in isl.c (xe2)
isl: Implement a part of WA_22018390030 (xe2)
isl: Remove code for Xe2 from isl_gfx12.c
isl: Update render CMF mapping (xe2)
isl: Don’t set clear values or their address (xe2)
blorp: Get fast clear rectangle of non-MSAA surfaces (xe2)
blorp: Pass down fast clear color value (xe2)
intel/genxml,blorp,common: Update 3DSTATE_PS command (xe2)
iris: Update aux state for color fast clears (xe2)
iris: Limit FCV_CCS_E to platforms that enable it
anv: Don’t enable compression with modifiers (xe2)
iris: Add more restrictions on compression (Xe2)
anv: Don’t enable compression on external bos (xe2)
iris: Disable PAT-based compression on depth surfaces (xe2)
anv: Disable PAT-based compression on depth images (xe2)
iris: Update synchronization of fast clear (xe2)
iris: Workaround: Don’t allocate compressed bo from cache (xe2)
isl: Remove restriction of CCS_E support on formats (xe2)
blorp: Don’t convert ccs_e formats for copy (xe2)
isl: Initialize the last usage in isl_encode_aux_mode[] (xe2)
anv: Update synchronization of fast clear (xe2)
iris: Disable predraw resolve (xe2)
blorp: Ensure MSAA fast clear in correct modes (xe2)
intel/dev: Select a compressed PAT entry (xe2)
isl: Add some formats not covered in CMF table (xe2)
anv: Disable tracking fast clear and aux state (xe2)
anv: Fix Vulkan CTS failure related to MCS (xe2)
anv: Support arbitrary fast-clear value on all layouts (xe2)
anv: Disable tracking of clear color on color attachment
intel/common: Ensure SIMD16 for fast-clear kernel (xe2)
intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2)
anv: Fix assertion failures on BMG (xe2)
iris: Fix an assertion failure with compressed format
anv: Disable compression on legacy modifiers (xe2)
anv: Disable legacy CCS setup in binding (xe2)
Job Noorman (33):
ir3: simplify cat5 parsing
ir3: add encoding for isam.v
ir3: use isam.v for multi-component SSBO loads
ir3: add encoding of ldib/stib offsets
ir3: lower SSBO access imm offsets
nir/opt_offsets: add callback for max base offset
nir/opt_offsets: add option to allow offset wrapping
nir/opt_offsets: add load/store_ssbo_ir3
ir3: use nir_opt_offsets for SSBO accesses
ir3: optimize SSBO offset shifts for nir_opt_offsets
ir3: remove spilled splits in shared RA
ir3: set wrmask for spilled splits in shared RA
ir3: print sharedness/halfness of merge set regs
ir3: print intervals when dumping merge sets
ir3: print dst_offset of spill.macro
ir3: debug print limit pressure and post-spill max pressure
ir3: set current instruction before all validation asserts
ir3: fix crash in try_evict_regs with src reg
ir3: fix handling of early clobbers in calc_min_limit_pressure
ir3: set offset on splits created while spilling
ir3: correctly set wrmask for reload.macro
ir3: don’t remove intervals for non-killed tex prefetch sources
ir3: don’t remove collects early while spilling
ir3: expose instruction indexing helper for merge sets
ir3: make indexing instructions optional in ir3_merge_regs
ir3: index instructions before fixing up merge sets after spilling
ir3: move liveness recalculation inside ir3_ra_shared
ir3: restore interval_offset after liveness recalculation in shared RA
ir3: add ir3_cursor/ir3_builder helpers
ir3: refactor ir3_spill.c to use the ir3_cursor/ir3_builder API
ir3: only add live-in phis for top-level intervals while spilling
ir3: print rounding mode for cov
ir3: set rounding mode for all floating point conversions
Jordan Justen (33):
blorp: Update programming for XY_FAST_COLOR_BLT on xe2
intel/genxml: Add XY_FAST_COLOR_BLT for xe2
intel/genxml: Update 3DSTATE_BTD for xe2
intel/dev: Allow setting FORCE_PROBE for intel PCI IDs
intel/dev: Support INTEL_FORCE_PROBE env-var
docs: Document INTEL_FORCE_PROBE env-var
intel/dev: Add LNL device info
pci_ids/intel: Add LNL PCI IDs (with FORCE_PROBE set)
anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc
intel/brw: Simplify enabling brw_fs_test_dispatch_packing
intel/brw: Allow xe2 in brw_stage_has_packed_dispatch()
intel/brw: Fix SSBO/shared load offset register size for Xe2
anv/grl: Build for xe2
Revert “anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT”
intel/dev/mesa_defs.json: Update LNL WA entries
intel/dev: Add INTEL_PLATFORM_BMG enum, BMG WA info
intel/dev: Add BMG device info
intel/dev: Add BMG PCI IDs (with FORCE_PROBE set)
intel/dev: Silence INTEL_FORCE_PROBE warning for intel_clc
intel/dev: If building the driver, always allow getting device info
Revert “anv/grl: Set INTEL_FORCE_PROBE=* when running intel_clc”
intel/compiler: Don’t set size written in brw_lower_logical_sends.cpp
intel/tools: Fix intel_dev_info –hwconfig switch
isl: Move isl_get_render_compression_format in isl_genX_helpers.h
isl: Implement isl_get_render_compression_format for xe2
intel/brw: Retype some regs to BRW_TYPE_UD for Xe2 indirect accesses
intel/perf/xe: Fix free pointer location in xe_add_config()
intel/dev: Enable LNL PCI IDs without INTEL_FORCE_PROBE
anv/generated_indirect_draws: Adjust xe2 simd32 sends_count_expectation
intel/dev: Disable LNL PCI IDs on Mesa 24.2 (require INTEL_FORCE_PROBE)
intel/brw/validate: Simplify grf span validation check by not using a mask
intel/brw/validate: Update dst grf crossing check for Xe2
intel/brw/validate: Convert access mask to be grf based
Jordan Petridis (1):
Revert “ci: mark microsoft farm as offline”
Jose Maria Casanova Crespo (9):
v3d: fix CLE MMU errors avoiding using last bytes of CL BOs.
v3dv: fix CLE MMU errors avoiding using last bytes of CL BOs.
v3d: Increase alignment to 16k on CL BO on RPi5
v3dv: Increase alignment to 16k on CL BO on RPi5
v3dv: V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction not needed
v3dv/ci: Add more dEQP-VK subgroups that are currently skipped
v3dv: Emit stencil draw clear if needed for GFXH-1461
v3dv: really fix CLE MMU errors on 7.1HW Rpi5
v3d: really fix CLE MMU errors on 7.1HW Rpi5
Josh Simmons (3):
radv: Fix crash when using SQTT and NO_COMPUTE
radv: Add `RADV_PROFILE_PSTATE` envvar
radv: Fix shader mask for SQ_WGP SPM counters
José Expósito (2):
meson: Update proc_macro2 meson.build patch
llvmpipe: Init eglQueryDmaBufModifiersEXT num_modifiers
José Roberto de Souza (87):
intel/perf: Nuke platform_supported
intel/perf: Remove i915_drm.h include from gen_perf.py
intel/perf: Fix the error check of i915_add_config()
intel/perf: Change oa_format to uint64_t
intel/perf: Store pointer intel_device_info to in intel_perf_config
intel/perf: Add intel_perf_free()
intel/perf: Add intel_perf_free_context()
intel/ds: Free perf config and context
intel/ds: Nuke ralloc_ctx and ralloc_cfg
anv: Free intel_perf_config when destroying physical device
hasvk: Free intel_perf_config when destroying physical device
iris: Free intel_perf_config and intel_perf_context
crocus: Free intel_perf_config and intel_perf_context
intel/perf: Add and use a function to return platform OA format
intel/perf: Add function to open perf stream
intel/perf: Fix return of read_oa_samples_until()
anv: Nuke perf_query_pass from anv_execbuf
intel/perf: Replace I915_OA_FORMAT_* usage by platform check
intel/perf: Move code that will be shared by both KMDs
intel/perf: Move i915 specific code from common code
intel/perf: Move i915 specific code to load configurations to i915 file
intel/perf: Allocate sseu in heap memory
intel/perf: Replace drm_i915_perf_record_header by intel_perf_record_header
intel/perf: Add a macro with header + sample length
intel/perf: Add intel_perf_stream_read_samples()
intel/dev: Add LNL stepping mapping
intel/dev: Add BMG stepping mapping
intel: Move slm functions from brw_compiler.h to intel_compute_slm.c/h
intel/common: Implement Xe2 SLM encode
intel/common: Implement preferred SLM encode
intel/dev: Use topology variables to calculate strides in Xe KMD
intel/dev: Add function to get the number of EUs per subslice
intel: Set preferred SLM allocation size >= than SLM size for Xe2
anv: Set maxComputeSharedMemorySize value for Xe2 platforms
intel: Compute the optimal preferred SLM size per subslice
anv: Initialize variable to fix static analyzer warning
intel/genxml/gfx20: Sync POSTSYNC_DATA struct with spec
anv/xe2: Enable compute walker and BTD thread preemption
anv/xe2: Add STATE_COMPUTE_MODE individual masks
anv: Remove block promoting non CPU mapped bos to coherent
intel/isl: Set dummy_aux_address to implement Wa_14019708328
anv: Implement Wa_14019708328
iris: Implement Wa_14019708328
anv: Implement Wa_14019857787
iris: Implement Wa_14019857787
intel/dev: Add compressed PAT entry
anv: Add support for compressed images allocation in Xe2
anv: Give apps the choice of compressed or uncompressed but cpu visible images
iris: Add support for compressed images allocation in Xe2
anv: Fix assert in xe_gem_create()
intel/perf: Change order of if blocks
intel/perf: Add assert to check if allocated enough query fiels
intel/dev: Add engine_class_supported_count to intel_device_info
intel/perf: Add LNL OA XML
intel/perf: Add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC
intel/perf: Extend intel_perf_query_result_read_gt_frequency() to gfx 20
intel: Sync xe_drm.h
intel/perf: Implement function that returns OA format for Xe KMD
intel/perf: Add function to check if OA/perf is supported by Xe KMD
intel/perf: Replace i915_perf_version and i915_query_supported by a feature bitmask
intel/perf: Refactor and add Xe KMD support to add and remove configs
intel/perf: Add Xe KMD perf stream open function
intel/perf: Refactor and add Xe KMD support to enable and disable perf stream
intel/perf: Refactor and add Xe KMD support to change stream metrics id
tool/pps: Add Xe KMD support
intel/perf: Remove i915_drm.h includes from common code
intel/perf: Implement Xe KMD perf stream read
anv: Implement Xe KMD query pools
intel/perf: Enable perf on Xe KMD
intel/perf: Implement intel_perf_query_result_accumulate() for gfx 20+
intel/perf: Add support for LNL OA sample format size
intel/perf: Return LNL OA sample format
intel/perf: Do not add INTEL_PERF_QUERY_FIELD_TYPE_SRM_OA_PEC
intel/perf: Adjust EU count for Xe2+
intel/dev: Replace intel_device_info::apply_hwconfig by a gfx version check
intel: Rename XE_PERF to XE_OBSERVATION
anv: Fix return of PAT index for compressed bos for discrete GPUs
intel/dev: Drop DG1 PAT entries
intel/dev: Add documentation about intel_device_info_pat_entry::mmap
intel/dev: Drop coherency from intel_device_info_pat_entry
intel/dev: Add comment documenting the PAT entries
intel/dev: Use GPU WB PAT for Xe2 writecombining
intel/dev: Drop writeback_incoherent from Xe2
isl: Fix Xe2 protected mask
anv: Propagate protected information to blorp_batch_isl_copy_usage()
intel: Sync xe_drm.h
intel/dev: Support new topology type with SIMD16 EUs
Juan A. Suarez Romero (57):
vc4/ci: update results
vc4/v3d/ci: update expected list
vc4: set src type on storing sample mask
broadcom/compiler: remove unused parameters in vpm read
broadcom/compiler: do not run lowering I/O for FS
v3d/vc4/ci: set full renderer version check
nir,v3d: rename tlb_color_v3d intrinsic
vc4: use tlb_color_brcm intrinsic
.gitignore: add .cache folder
vc4: use IO semantics for location
v3d: use BITSET for the masks
v3d: remove handled cases for devices <= 42
ci: define SNMP base interface on runner
v3d: use screen name in disk cache
v3d,v3dv: add compatibility revision in GPU name
broadcom/ci: update expected results
v3dv/ci: add expected failure
v3dv/ci: fix spurious line in expected
v3dv/ci: add new timeouts
dri: cast constant to uint for bitshift
util: do not access member of a NULL structure
util: use unsigned types when performing bitshift
vulkan: do not access member of a NULL structure
nir: fix overflow when negating maxint in constant expressions
nir: use unsigned types when performing bitshifting
glsl: fix downcasting addresses to wrong object types
egl: do not access member of a NULL structure
mesa: use unsigned types when performing bitshifting
mesa: do not pass NULL pointer to function not expecting NULLs
ci: disable Igalia farm
broadcom/compiler: use unsigned types when performing bitshifting
v3dv: do not access member of a NULL structure
v3dv: do not pass NULL pointer to function not expecting NULLs
v3dv: restrict to channels when encoding border color
v3dv: fix misalignment in descriptor layout structure
v3d: do not access member of a NULL structure
v3d: do not pass NULL pointer to function not expecting NULLs
vc4: use unsigned types when performing bitshifting
vc4: do not access member of a NULL structure
vc4: do not pass NULL pointer to function not expecting NULLs
vc4: do not create 0-bytes variable length arrays
vc4: fix out-of-bounds access to array
Revert “ci: disable Igalia farm”
v3d: use original enabled_mask on setting vertex buffers
broadcom/ci: read 32-bit kernel from arm32 path
broadcom/ci: remove arch from hardware name
vc4/ci: run tests in 64-bits
broadcom/ci: run some GL tests in arm32 arch
broadcom/qpu: clean all versions not supported
broadcom: follow version naming convention
broadcom/ci: add more jobs to test with rpi5
broadcom/ci: update traces for rpi4
v3d/ci: update expected list
v3dv: adversise VK_EXT_depth_clamp_zero_one
v3d: expose ARB_depth_clamp in V3D 7.x
v3dv: free temp image created when copying with blit
v3dv: don’t leak cache key
Julian Orth (1):
egl/wayland: ignore unsupported driver configs
Juston Li (8):
venus: refactor out image requirements helpers
venus: extend image cache to vkGetDeviceImageMemoryRequirements
sync protocol for VkRingPriorityInfoMESA
venus: forward nice priority when creating ring
zink: disable cpu_storage for PIPE_USAGE_STREAM
venus: add missing sTypes for vk_set_physical_device_properties_struct
venus: sync protocol for conditionally ignored dyn arrays
anv/android: set ANV_BO_ALLOC_EXTERNAL for imported AHW
Karmjit Mahil (6):
ir3: Don’t set saturation on `flat.b`
zink: Add missing currentExtent special value handling
turnip: Remove workaround for CTS bug zero-sized inline uniform block
mailmap: Add Karmjit Mahil
freedreno/isa: Fix isaspec map for a3xx-ld
tu: Set `TU_ACCESS_CCHE_READ` for transfer ops with read access
Karol Herbst (159):
nir: add SYSTEM_VALUE_BASE_WORKGROUP_ID
nir/divergence_analysis: handle load_base_global_invocation_id
intel/compiler: lower workgoup id to index only for mesh shaders
v3d: call nir_lower_compute_system_values to get rid of base intrinsics
lavapipe: lower base_workgroup_id to zero
mesa/st: lower base invoc and workgroup id
nir: remove global_invocation_id_zero_base
nir: remove workgroup_id_zero_base
nir: document base_global_invocation_id and base_workgroup_id
core/kernel: skip validating unique kernel signatures
rusticl/program: Arc the stored KernelInfo
rust/program: remove Program::kernels
nouveau: fix potential double-free in nouveau_drm_screen_create
nir: fix nir_shader_get_function_for_name for functions without names.
rusticl: use stream uploader for cb0 if prefered
rusticl/kernel: properly handle grid and offsets being usize
rusticl: lower huge grids
rusticl: add RUSTICL_MAX_WORK_GROUPS
rusticl/event: use Weak refs for dependencies
rusticl/icd: remove CLObject
rusticl/spirv: enable more caps
Revert “rusticl/event: use Weak refs for dependencies”
event: break long dependency chains on drop
rusticl/device: add DeviceCaps and move timestamp stuff into it
rusticl/device/caps: move enough for has_images
rusticl/device: properly handle devices with no support for images
rusticl/mesa/context: flush context before destruction
rusticl: merge rusticl_nir and rusticl_mesa_bindings_inline_wrapper targets
rusticl: move mesa_version_string out of the inline wrapper
rusticl: bump bindgen req to 0.65
rusticl: bump meson req to 1.4
rusticl: make use of new `output_inline_wrapper` meson.rust.bindgen feature
nir/lower_cl_images: set binding also for samplers
nouveau: import nvif/ioctl.h file from libdrm_nouveau
gallium/vl: stub vl_video_buffer_create_as_resource
gallium/vl: remove stubs which are defined in mesa_util
meson: centralize galliumvl_stub handling
rusticl: link against libgalliumvl_stub
wgl: link against libgalliumvl_stub
gallium/drivers: do not link against libgalliumvl directly
rusticl/event: fix deadlock when calling clGetEventProfilingInfo inside callbacks
iris: fix PIPE_RESOURCE_PARAM_STRIDE for buffers
rusticl/icd: make sure returned function pointers are of the right type
rusticl/kernel/launch: fix mapping usize types to GPU pointer sizes
rusticl/kernel/launch: remove useless upload of the input
rusticl/kernel: move most of the code in launch inside the closure
rusticl/kernel/launch: move allocation of resources vec
rusticl/kernel/launch: rework how the printf buffer is allocated
rusticl/kernel/launch: get rid of Arc clones for global resources
rusticl/kernel/launch: add helper to bind global buffers
broadcom/compiler: handle load_workgroup_size
v3d: add support for load_workgroup_size
rusticl/spirv: do not pass a NULL pointer to slice::from_raw_parts
rusticl/memory: copies might overlap for host ptrs
gallium: reduce pipe_resource.usage to 4 bits
gallium: properly type pipe_resource.usage with the enum
gallium: properly type fields of pipe_resource.usage
nir_lower_mem_access_bit_sizes: support unaligned store_scratch
nir: add global_atomic_2x32 variants to nir_get_io_offset_src_number
broadcom/compiler: support global load/store intrinsics
broadcom/compiler: use nir_lower_mem_access_bit_sizes for memory lowering
broadcom/compiler: convert 2x32 global operations to scalar variants
broadcom/compiler: only handle load_uniform explicitly in v3d_nir_lower_load_store_bitsize
broadcom/compiler: rework scratch lowering
rusticl/meson: add build root dir to the include dirs of rusticl_c
rusticl: depend on the spirv_info target
util/u_printf: properly handle %%
rusticl/memory: assume minimum image_height of 1
rusticl/memory: fix clFillImage for buffer images
rusticl: add new CL_INVALID_BUFFER_SIZE condition for clCreateBuffer
rusticl: add bsymbolic to linker flags
rusticl/icd: rename all entry points to the actual correct name
radeonsi: set bo_size for user memory allocations
rusticl/queue: gracefully stop the worker thread
rusticl/queue: run rustfmt
nir/lower_alu: support 8 and 16 bit bit_count
nir/opt_sink: add load_kernel_input
gallium: add PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT
rusticl/device: require PIPE_CAP_TEXTURE_SAMPLER_INDEPENDENT for image support
rusticl/mesa/context: handle clear_buffer not set by driver
rusticl/mesa/screen: handle get_timestamp not set by driver
rusticl/kernel/launch: fix global work offsets for 32 bit archs again
broadcom/compiler: add generated v3d_nir_lower_algebraic
broadcom/compiler: handle fp16 conversion ops
broadcom/compiler: fix iu2f32 for 8 and 16 bit inputs
broadcom/compiler: try handling 8/16 bit alu operations
broadcom/compiler: handle up to vec16 load_uniforms
broadcom/compiler: abort on unknown intrinsics
broadcom/compiler: implement load_kernel_input
broadcom/compiler: call nir_lower_64bit_phis
broadcom/compiler: handle variable shared memory
v3d: implement gallium APIs for OpenCL support
v3d: treat SHADER_KERNEL as SHADER_COMPUTE
v3d: lower CL alus
v3d: lower 64 bit ALUs
v3d: support variable shared memory
v3d: fix MAX_GLOBAL_SIZE and MAX_MEM_ALLOC_SIZE
v3d: never replace a mapped bo
rusticl: enable v3d
nir/schedule: add write dep also for shared_atomic
meson: rename with_gallium_opencl to with_gallium_clover
rusticl/program: move binary parsing into its own function
rusticl/program: make binary API not crash on errors
rusticl/program: use blob.h to parse binaries
rusticl/program: update binary format
rusticl/program: use default in more places
Revert “rusticl/queue: run rustfmt”
Revert “rusticl/queue: gracefully stop the worker thread”
rusticl/buffer: harden bound checks against overflows
rusticl/context: move SVM pointer tracking into own type
rusticl/ptr: add a few APIs to TrackedPointers
rusticl/memory: complete rework on how mapping is implemented
rusticl: remove unused interfaces to simplify code
rusticl/mesa: remove ResourceType::Cb0
rusticl/memory: optimize sw_copy when the row_pitch matches the height
rusticl/mesa: make PipeResource repr(transparent)
v3d: support unnormalized coords
rusticl/spirv: support more caps
rusticl/device: fix image_3d_write_supported for embedded
rusticl/device: turn image_3d_write_supported into a cap
rusticl/device: fix advertizement of 3d write images support
rusticl: require PIPE_CAP_IMAGE_STORE_FORMATTED for image support.
rusticl/event: make set_status handle error status properly
rusticl/queue: do not overwrite event error states
rusticl/queue: properly check all dependencies for an error status
rusticl/event: properly implement CL_EXEC_STATUS_ERROR_FOR_EVENTS_IN_WAIT_LIST
rusticl/queue: properly implement in-order queue error checking
rusticl/event: return execution errors when doing a blocking enqueue
rusticl/mesa: handle failures with u_upload_data
rusticl/mesa: set take_ownership to true in set_constant_buffer_stream
rusticl/event: fix outdated comment in call
rusticl/queue: format file
zink: fix OpenCL read_write images
rusticl: support read_write images
spirv: generate info for FunctionParameterAttribute
spirv: initial parsing of function parameter decorations
spirv: handle function parameters passed by value
nak: allow clippy::not_unsafe_ptr_arg_deref lints
clc: force linking of spirvs with mismatching pointer types in signatures
rusticl: fix clippy lint having bounds defined in multiple places
rusticl/program: protect against 0 length in slice::from_raw_parts
rusticl/api: protect against 0 length in slice::from_raw_parts
rusticl/spirv: protect against 0 length in slice::from_raw_parts
nouveau: handle realloc failure inside cli_kref_set
mesa: check for enabled extensions for *UID enums
nouveau/winsys: fix handling of NV_DEVICE_TYPE_IGP
nouveau: use nv_devince_info and fill in PCI and type information
nouveau: add nv_device_uuid
nouveau: implement driver_uuid and device_uuid
nvk: use nv_device_uuid
zink: lower 64 bit find_lsb, ufind_msb and bit_count
zink: lower 8/16 bit alu ops vk spirv doesn’t allow
rusticl/kernel: properly respect device thread limits per dimension
rusticl/memory: Fix memory unmaps after rework
rusticl/image: take pitches into account when allocating memory for maps
rusticl/image: properly sync mappings content for 1Dbuffer images
rusticl/queue: add clSetCommandQueueProperty
util/u_printf: do not double print format string with unused arugments
rusticl/memory: fix sampler argument size check
Kenneth Graunke (63):
isl: Set MOCS to uncached for Gfx12.0 blitter sources/destinations
intel/brw: Delete gfx10 table for align1 3src type encoding
intel/brw: Drop NF type support
intel/brw: Rework BRW_REGISTER_TYPE’s representation semantics
intel/brw: Stop using long BRW_REGISTER_TYPE enum names
intel/brw: Reindent after shortening BRW_REGISTER_TYPE_* to BRW_TYPE_*
intel/brw: Use newer brw_type_is_* shorter names
intel/brw: Replace brw_reg_type_from_bit_size by brw_type_with_size
intel/brw: Replace type_sz and brw_reg_type_to_size with brw_type_size_*
intel/brw: Combine a1/a16 3src type encoding functions
intel/brw: Combine a1/a16 3src type decoding functions
intel/brw: Rename brw_reg_type_to_hw_type to brw_type_encode
intel/brw: Don’t use inst return value when it isn’t needed
intel/brw: Make a helper for finding the largest of two types
intel/brw: Add builder helpers for math functions
intel/brw: Add builder helpers that allocate temporary destinations
intel/brw: Use new builder helpers that allocate a VGRF destination
intel/brw: Print W/UW immediates correctly
intel/brw: Do not create empty basic blocks when removing instructions
intel/brw: Support CSE on more ops
intel/brw: Don’t include unnecessary undefined values in texture results
intel/brw: Add a new VEC() helper.
intel/brw: Use VEC for load_const
intel/brw: Use VEC for gl_FragCoord
intel/brw: Use VEC for TCS/TES/GS input/output loads
intel/brw: Use VEC for FS outputs
intel/brw: Use VEC for output stores
intel/brw: Use VEC for NIR vec*() sources
intel/brw: Use VEC for emit_unzip()
intel/nir: Set src_type on TCS quads workaround store_output
intel/brw: Blockify convergent load_shared on Gfx11-12 as well
intel/brw: Recreate GS output registers after EmitVertex
intel/brw: Skip fs_nir_setup_outputs for compute shaders
intel/brw: Handle scratch address swizzling of constants
intel/brw: Add a idom_tree::dominates(a, b) helper.
intel/brw: Make brw_reg::bits publicly accessible from fs_reg
intel/brw: Update instructions_match() to compare more fields
intel/brw: Drop compiler parameter from try_constant_propagate()
intel/brw: Drop BRW_OPCODE_IF from try_constant_propagate
intel/brw: Refactor try_constant_propagate()
intel/brw: Refactor code to commute immediates into legal positions
intel/brw: Delete SAD2 and SADA2 opcodes
intel/brw: Make VEC() perform a single write to its destination.
intel/brw: Make gl_SubgroupInvocation lane index loading SSA
intel/brw: Skip LOAD_PAYLOADs after every texture instruction if possible
intel/brw: Add a new def analysis pass
intel/brw: Print defs in dump_instructions
intel/brw: Write a new global CSE pass that works on defs
intel/brw: Switch to the new defs-based global CSE pass
intel/brw: Delete old local common subexpression elimination pass
intel/brw: Introduce a new SSA-based copy propagation pass
intel/brw: Use the defs-based copy propagation along with the old one
intel/brw: Make opt_copy_propagation_defs clean up its own trash
intel/brw: Build the scratch header on the fly for pre-LSC systems
intel/brw: Skip discarding the interference graph
intel/brw: Delay liveness calculations in saturate propagation
intel/brw: Make an alu2 builder helper
intel/brw: Make bld.ADD(x, 0) emit no instructions and return x directly
intel/brw: Support CSE of ADD3
intel/brw: Add a lower_csel pass and allow building it for all types
intel/nir: Don’t needlessly split u2f16 for nir_type_uint32
intel/brw: Don’t mix types for unary extended math instructions
intel/brw: Disallow scalar byte to float conversions on DG2+
Kevin Chuang (6):
anv: Properly fetch partial results in vkGetQueryPoolResults
anv: Properly handle cases for different query types in copy_query_results_with_shader
intel/genxml: add task/mesh shader statistics registers
anv: Update pipeline statistics mask for task/mesh shader invocations
anv: implement mesh shader queries
anv: toggle meshShaderQueries based on whether we support mesh_shader or not
Khem Raj (1):
amd: Include missing llvm IR header Module.h
Konstantin (4):
docs: Add documentation about debugging GPU hangs on RADV
ac/debug,radv: Read UMR wave dumps into memory before parsing
radv: Use a struct for the trace_bo layout
radv: Trace indirect dispatch sizes
Konstantin Seurer (59):
radv: Handle all dependencies of CmdWaitEvents2
nir/print: Do not access invalid indices of load_uniform
radv: Fix radv_shader_arena_block list corruption
radv: Remove arenas from capture_replay_arena_vas
radv: Zero initialize capture replay group handles
radv/ci: Add back pipeline library flakes
radv/ci: Document recent flakes
gitlab: Reference hang debugging documenttion
radv: Remove radv_cmd_dirty_dynamic_bits
llvmpipe: Use a second LLVMContext for compiling sample functions
radv: Add locking to radv_replay_shader_arena_block
radv: Replace is_rt_shader with RADV_SHADER_TYPE_RT_PROLOG
radv: Remove uses_dynamic_rt_callable_stack
radv/rt: Track ray_launch_id reads
radv/rt: Track ray_launch_size reads
radv/rt: Remove load_rt_dynamic_callable_stack_base_amd
radv: Return a block from radv_replay_shader_arena_block
ac/llvm: Fix DENORM_FLUSH_TO_ZERO with exact instructions
ac/llvm: Enable helper invocations for vote_all/any
radv/ci: Bring back vkcts-navi21-llvm-valve
khronos-update: Add ANDROID guards to vk_android_native_buffer.h
zink: Always include renderdoc_app.h
zink: Blit using one triangle for nearest filtering
llvmpipe: Lock shader access to sample_functions
llvmpipe: Stop using a sample_functions pointer as cache key
llvmpipe: Only evict cache entries if a fence is available
lavapipe: Always call finish_fence after lvp_execute_cmd_buffer
radv: Clean up pipeline barrier handling
radv: Remove dead access bits
radv/meta: Use READ access for dst_access_flush
radv/rra: Detect BVHs with back edges
radv/rra: Move some code into handle_accel_struct_write
radv/rra: Fix disabling the ray history
radv/rra: Fix reporting the isec invocations
radv/rra: Bump rt_driver_interface_version to 8.0
radv/rra: Reduce the memory requirement of copy_after_build
radv/rra: Rework calculating the ray history size
radv/rra: Enable RADV_RRA_TRACE_COPY_AFTER_BUILD by default
util: Add a helper for querying sparse tile sizes
lavapipe: Do not allocate 0 sized buffers for descriptor sets
gallium: Add a memory range parameter to resource_bind_backing
llvmpipe: Use an anonymous file for memory allocations
lavapipe: Implement sparse buffers and images
lavapipe: Implement shaderResourceResidency
venus: Refactor hiding sparse features and properties
venus: Disable sparse binding on lavapipe
vulkan: Handle group stages in vk_.*_access2_for_pipeline_stage_flags2
vulkan: Add vk_expand_(dst|src)_access_flags2
radv: Use vk_expand_(src|dst)_access_flags2
radv: Remove no-op access flag handling
radv: Remove handling for expanded access flags
radv: Remove write access handling from radv_dst_access_flush
radv: Handle AS access bits like shader storage access bits
radv: Refactor radv_(dst|src)_access_flush
radv: Fix smooth lines with dynamic polygon mode and topology
radv: Always use dynamic line smoothing
nir: Stop using “capture : true” for nir_opt_algebraic
nir: Add FLOAT_CONTROLS_.*_PRESERVE
aco: print s_delay_alu INSTSKIP>3 correctly
Leo Liu (4):
ac/surface: add GFX12 256B tile mode for video
ac/surface/tests: add the test for ADDR3_256B_2D
radeon/vcn: use pipe video buffers for dpb
radeon/vcn: enable dpb to use pipe video buffer with swizzle mode
Lionel Landwerlin (125):
anv: disable dual source blending state if not used in shader
anv: reuse embedded samplers across shaders
anv: simplify multisampling check
anv: fixup indentation
anv: factor out wm_prog_data get in runtime flush
intel/brw: fixup wm_prog_data_barycentric_modes()
intel/fs: decouple alphaToCoverage from per sample dispatch
intel/brw: add min_sample_shading value in wm_prog_data
anv: track sample shading enable & min sample shading
anv: add dirty tracking of fs_msaa_flags in runtime
anv: move 3DSTATE_WM::BarycentricInterpolationMode programming to runtime
anv: move more PS_EXTRA programming to runtime
anv: move 3DSTATE_PS to partial packing
anv: move 3DSTATE_MULTISAMPLE to partial emission
anv: remove fs_msaa_flags from the graphics pipeline
anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples
anv: fixup alloc failure handling in reserved_array_pool
anv: fix leak of custom border colors
anv: avoid requirement to put flush_data as first field
anv: move device initialization as the last step of vkCreateDevice
anv: move empty_vs_input to physical device
anv: VK_EXT_legacy_vertex_attributes
docs: update anv features
anv: fix ycbcr plane indexing with indirect descriptors
intel/hang_replay: use newer API of i915 execbuffer
intel/hang_replay: use hw image param
intel/tools: add README file
brw: add more condition for reducing sampler simdness
intel: move debug identifier out of libintel_dev
brw: drop dependency on libintel_common
anv: fix push constant subgroup_id location
nir/divergence: add missing load_printf_buffer_address
nir: add a base offset for printf indexing
nir: add ptr_bit_size parameter to nir_lower_printf
nir: add a low level printf emission helper
intel/nir: remove unused prototypes
intel/nir: add reloc delta to load_reloc_const_intel intrinsic
intel/compiler: store u_printf_info in prog_data
intel/nir: add printf lowering
anv: add debug shader printf support
intel/clc: enable printfs support
anv: shader printf example
anv: switch to vk_device::mem_cache field for default cache
anv: use weak_ref mode for global pipeline caches
anv: fix shader identifier handling
intel/brw: ensure find_live_channel don’t access arch register without sync
anv: fix utrace compute walker timestamp captures
anv: fix timestamp copies from secondary buffers
anv: move last compute command pointers to the state structure
u_trace: extend tracepoint end_of_pipe bit into flags
anv: optimize POSTSYNC_DATA rewrites in timestamp emissions
intel: fix HW generated local-id with indirect compute walker
brw: use a single virtual opcode to read ARF registers
brw: limit dependencies on SR register
brw: better model READ_ARF_REG opcode
anv: fix Gfx9 fast clears on srgb formats
anv: rewrite Wa_18019816803 tracking to be more like state
anv: factor out some more gpu_memcpy setup
anv: fix pipeline flag fields
anv: expose VK_MESA_image_alignment_control
anv: support setting CFE_STATE::StackIDControl per application
anv: limit aux invalidations to primary command buffers
anv: ensure completion of surface state copies before secondaries
anv: simplify TRTT initialization
anv: reuse setup_execbuf_fence_params for utrace submissions
anv: rework utrace submission
anv: move trtt submissions over to the anv_async_submit
anv: use reserved array pool for legacy custom border colors
anv: make device initialization more asynchronous
mi-builder: rename relocated api
mi-builder: c++ warning fix
mi-builder: make instruction pointer manipulation more obvious
mi-builder: add missing write completion check
mi-builder: add relocated register/memory writes
mi-builder: add a write check parameter
anv: centralize mi_builder setup
anv: use the new relocated write mi-builder api
anv: move more MI_SDI to mi_builder
anv: use default mocs for memory bits only touched by CS
anv: set query mi-builder mocs only once
anv: use new mi-builder write check API to avoid stalls
genxml: add MI_MEM_FENCE for Gfx20
mi-builder: add read/write memory fencing support on Gfx20+
intel/fs: fix lower_simd_width for MOV_INDIRECT
anv: add custom mi write fences
anv: emit conditional after gfx state flushing
anv: factor out STATE_BASE_ADDRESS filling to helper function
anv: predicate emission of STATE_BASE_ADDRESS
anv: reuse device local variable
anv: avoid initalizing TRTT stuff without sparseBinding
anv: fix vkCmdWaitEvents2 handling
anv: don’t apply descriptor array bound checking
brw: add missing break
brw: factor out source extraction for rematerialization
brw: improve rematalization of surface/sampler handles
brw: bound the amount of rematerialized NIR instructions
brw: remove rematerialization assert
brw: remove some brackets
brw: enable rematerialization of non 32bit uniforms
brw: always use new registers for load address increments
brw: annotation send instructions with surface handles generated with exec_all
brw: avoid Wa_1407528679 in uniform cases
brw: blockify load_global_const_block_intel
brw: enable A64 loads source rematerialization
anv: limit vertex fetch invalidation on indirect read
anv: add a protected scratch pool
anv: prepare 2 variants of all shader instructions
anv: allocate compute scratch using the right scratch pool
anv: emit the right shader instruction for protected mode
anv: workaround flaky xfb query results on Gfx11
anv: fix u_trace on < Gfx12.0
intel/ds: remove duplicate arguments
hasvk: move cmd_emit_timestamp initialization to genX
hasvk: pass anv_address to predicate helper
brw: fix uniform rebuild of sources
anv: get rid of the second dynamic state heap
isl: account for protection in base usage checks
anv: properly flag image/imageviews for ISL protection
anv: propagate protected information for blorp operations
anv: fix check on pipeline mode to track buffer writes
vulkan/runtime: allow null/empty debug names
anv: reuse object string for RMV token
anv: add missing MEDIA_STATE_FLUSH for internal shaders
anv/blorp: force CC_VIEWPORT reallocation when programming 3DSTATE_VIEWPORT_STATE_POINTERS_CC
brw/rt: fix ray_object_(direction|origin) for closest-hit shaders
Louis-Francis Ratté-Boulianne (20):
dri_interface: add interface for EGL_EXT_surface_compression
gallium: add interface for fixed-rate surface/texture compression
egl/wayland: factor out common part of DRI image creation
egl: wire up EGL_EXT_surface_compression extension
st/dri2: add support for fixed-rate compression interface
egl/dri2: add support for EGL_EXT_surface_compression
mapi: add EXT_texture_storage_compression extension
mesa/st: add compression parameter to st_texture_create
mesa: implement EXT_texture_storage_compression extension
mesa: implement EXT_EGL_image_storage_compression extension
panfrost: Add AFRC overlay in v10 xml specification
panfrost: add device querying for AFRC support
panfrost: add utils for AFRC fixed-rate support
panfrost: encode component order as an inverted swizzle (v10)
panfrost: add support for AFRC textures
panfrost: add support for AFRC render targets
panfrost: add support for AFRC modifiers
panfrost: add translation between modifier and compression rates
panfrost: add support for fixed-rate compression
panfrost: add PAN_AFRC_RATE env var to force a compression rate
Luc Ma (4):
loader: silence implicit-load zink error by the loader
gallium: properly propagate the usage of resource
gallium: inline trivial needs_pack()
meson: Build pipe-loader when build-tests is true
Lucas Fryzek (7):
llvmpipe: query winsys support for dmabuf mapping
u_gralloc/fallback: Set fd from handle directly
egl/x11/sw: Implement swapbuffers with damage
vulkan/wsi: Update sw x11 wsi to only copy damage regions
egl/x11/sw: Implement shm support
egl/x11: Remove force software check for exporting SBWD
lp: only map dt buffer on import from dmabuf
Lucas Stach (2):
etnaviv: drm: don’t skip flush when there are active PMRs
etnaviv: always flush pending queries on get_query_result
M Henning (2):
nir: Handle texop_*_nv in nir_tex_instr_is_query
nak: Add minimum bindgen requirement
Maaz Mombasawala (2):
svga: Retry DRM_VMW_SYNCCPU ioctl on failure.
svga: Replace shared surface flag and simplify surface creation
Marcin Ślusarz (2):
intel/genxml/xe2: update MESH_CONTROL
anv,intel/compiler/xe2: fill MESH_CONTROL.VPandRTAIndexAutostripEnable
Marek Olšák (174):
ac/gpu_info: set tcc_rb_non_coherent only if number of TCCs != number of RBs
ac/surface: disable DCC for 3D textures on gfx9 to improve performance
ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8
radeonsi: don’t invalidate L2 for internal compute without DCC stores
radeonsi: fix KHR-GL46.texture_lod_bias.texture_lod_bias_all on gfx10-11
radeonsi: validate IO semantics in scan_io_usage
radeonsi: add workarounds for DCC MSAA for gfx9-10
radeonsi: enable DCC for MSAA on gfx10-10.3
radeonsi: check for FMASK correctly in gfx10_get_bin_sizes
amd/ci: 17 piglit failures are fixed for raven
nir: add ACCESS_CP_GE_COHERENT_AMD
nir: add nir_atomic_op_ordered_add_gfx12_amd
nir: add streamout intrinsics for AMD GFX12
nir: add sleep intrinsics for AMD
nir: validate src_type of store_output intrinsics, require bit_size >= 16
nir: add more build helpers
nir: add shader_info::use_aco_amd
nir/lower_tex: support FMASK loads with a 16-bit sample index
nir/lower_image: support FMASK loads with a 16-bit sample index
drm-uapi: update amdgpu_drm.h and drm_fourcc.h for gfx12
amd: import gfx12 addrlib
amd: add gfx12 register definitions
amd: add gfx12 register definitions into the register header generator
amd: add initial common code for gfx12
ac/nir: update ac_nir_lower_resinfo for gfx12
ac/nir,llvm: add GS VGPR changes for gfx12
ac/llvm: use new s_wait instructions and split the existing ones for gfx12
ac/llvm: add new cache flags for gfx12
ac/llvm: add CS SGPR changes for gfx12
ac/llvm: update inline assembly for buffer_load_format_xyzw with TFE for gfx12
ac/nir: add ac_nir_sleep and handle the intrinsics
ac/nir: add gfx12 streamout NIR code
ac/llvm: handle nir_atomic_op_ordered_add_gfx12_amd
ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_gfx12_amd
ac/llvm: add a workaround for nir_intrinsic_load_constant for LLVM on gfx12
ac/surface: add gfx12
ac/surface/tests: add gfx12 tests
radeonsi: add gfx12
util: shift the mask in BITSET_TEST_RANGE_INSIDE_WORD to be relative to b
ac/llvm: improve/simplify/fix load_ssbo
radeonsi: serialize shader disassembly string to fix asm dumps for ACO
radeonsi: fix the size of the query result SSBO
radeonsi: validate the buffer range in si_set_shader_buffer
radeonsi: remove GDS tests
radeonsi: set flags directly instead of having needs_db_flush
radeonsi/gfx11: use a lighter workaround for Navi31 dEQP failures
radeonsi: get NIR options from si_screen instead of calling get_compiler_options
radeonsi: minor simplifications of clear/copy_buffer shaders
radeonsi: simplify the complex clear/copy_buffer shader
radeonsi: use set_work_size for all internal compute dispatches
radeonsi: replace the clear_12bytes_buffer shader with the DMA compute shader
radeonsi: remove slow code from si_msaa_resolve_blit_via_CB
radeonsi/ci: fix caselists for vk-gl-cts/main
radeonsi/ci: update failures for all generations
radeonsi/ci: remove some gfx11 flakes
radeonsi: constify struct pipe_vertex_buffer *
nir/algebraic: eliminate pack+unpack and unpack+pack pairs
ac: move radv_mem_vectorize_callback to common code
ac/llvm: global stores should have no holes in the writemask
radeonsi: call nir_lower_int64 later to fix ACO failure with Tomb Raider
radeonsi: vectorize load/stores and shrink stores
amd: update addrlib
amd: add more gfx11 APUs
amd: enable 32B minimum DCC block size for gfx1151
ac/llvm: fix incorrect parameter type in llvm.amdgcn.s.nop
radeonsi: vectorize loads/store after ABI lowering and optimizations
radeonsi/gfx12: fix the alpha ref value
radeonsi/gfx12: fix incorrect condition for when to do clear_buffer via compute
radeonsi/gfx12: disable CU1 instead of CU0 for GS due to SQTT
radeonsi/gfx12: fix a regression in si_set_mutable_tex_desc_fields
radeonsi/gfx12: fix depth bounds register values
radeonsi/gfx12: fix a regression in si_init_depth_surface
radeonsi: don’t lower UBO/SSBOs to descriptors if they are already lowered
radeonsi: lower NIR resource srcs to descriptors last
ac/descriptors: fix gfx12 regressions
ac/nir/lower_ngg: use global_atomic_amd to fix gfx12 streamout
ac/nir/lower_ngg: use voffset in global_atomic_add for xfb
ac: add gfx12 DCC shared code
radeonsi/gfx12: fix GPU deadlocks due to query result incoherency
radeonsi: assume si_set_ring_buffer is only used by gfx6-10.3
radeonsi: remove cp_to_L2 and L2_to_cp, inline the values
radeonsi: remove RADEON_FLAG_READ_ONLY
radeonsi: allow RADEON_HEAP_BIT_GL2_BYPASS for VRAM
radeonsi: remove leftover comment of non-existent RADEON_FLAG_MALL_NOALLOC
radeonsi/gfx12: add DCC
ac/surface: pass the correct addrlib handle to Addr3GetPossibleSwizzleModes
amd: update addrlib
ac/nir/lower_ngg: don’t use gfx12 xfb defs outside their basic block on gfx11
radeonsi/gfx12: fix stencil corruption
gallium/u_blitter: add option to override fragment shader for util_blitter_blit
radeonsi: don’t declare 3D coordinates in the compute blit if they aren’t needed
radeonsi: use better workgroup sizes for compute blits to improve perf
radeonsi: ignore PIPE_SWIZZLE_1 for 40% VGPR usage reduction for compute blits
radeonsi: remove fp16_rtz from the compute blit
radeonsi: use MIMG D16 (16-bit data) for image instructions in compute blits
radeonsi: optimize unaligned compute blits
radeonsi: fix sample0_only for the compute blit
radeonsi: reject unsupported parameters as the first thing in the compute blit
radeonsi: don’t use si_can_use_compute_blit in the compute blit
radeonsi: don’t fail due to DCC when using the compute blit on compute queues
radeonsi/gfx11: enable MSAA image stores in the compute blit
radeonsi: document better how X/Y flipping in the compute blit works
radeonsi: cosmetic and robustness changes for the compute blit
radeonsi: extend the compute blit to do image clears as well
radeonsi: switch compute image clears to the compute blit shader
radeonsi: rename si_compute_blit “testing” parameter to “fail_if_slow”
radeonsi: rename si_compute_copy_image -> si_compute_copy_image_old
radeonsi: add a new version of si_compute_copy_image using the compute blit
radeonsi: switch the old compute image copy to the new one using the blit
radeonsi: remove the old si_compute_copy_image
radeonsi: convert the compute blit shader hash table to u64 keys
radeonsi: split xy_clamp_to_edge to separate X and Y flags for the compute blit
radeonsi: restructure (rewrite) the compute blit shader
radeonsi: adds flags parameter into si_compute_blit to replace fail_if_slow
radeonsi: change the compute blit to clear/blit multiple pixels per lane
radeonsi: extend NIR compute helpers to allow returning 16-bit results
radeonsi: use MIMG A16 (16-bit image coordinates) in compute blits
radeonsi: print the compute shader blit key for AMD_DEBUG
radeonsi: use shader_info::use_aco_amd to determine whether to use ACO
radeonsi: add use_aco into CS blit shader key
radeonsi: clear color buffers via compute for special tiling cases
radeonsi: add a custom MSAA resolving pixel shader
radeonsi: add fail_if_slow parameter into si_msaa_resolve_blit_via_CB
radeonsi: add a new blit microbenchmark
radeonsi: add decision code to select when to use CB_RESOLVE for performance
radeonsi: add decision code to select when to use compute blit for performance
ac/nir: import the MSAA resolving pixel shader from radeonsi
ac/nir: import the universal compute clear/blit shader
ac/nir: import the dispatch logic for the universal compute clear/blit shader
Revert “radeonsi: fix initialization of occlusion query buffers for disabled RBs”
radeonsi/ci: update gfx10.3 failures
nir/lower_io_to_scalar: add new_component temporary variable
nir/lower_io_to_scalar: don’t create output stores that have no effect
nir: add nir_opt_vectorize_io, vectorizing lowered IO
glsl/linker: vectorize lowered IO
nir: add a NIR option flag nir_io_prefer_scalar_fs_inputs
ac/nir/cdna: allow 16-bit coordinates
ac/nir/cdna: ignore image_descriptor intrinsics
ac/nir/cdna: don’t use image_descriptor intrinsics if the src is a descriptor
mesa: switch remaining shader functions from SHA1 to BLAKE3
radeonsi: replace shader SHA1 hashes with BLAKE3
radeonsi: don’t use CP DMA on GFX940
nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd
ac/nir: remove sleeps from gfx12 streamout code
ac/llvm: remove s_nop from ordered_add_loop_gfx12_amd
ac/llvm: fix inline assembly register constraints for ordered_add_loop_gfx12_amd
as/llvm: add s_nops before the ordered add loop and s_wait_alu workaround
radeonsi: implement nir_intrinsic_load_ssbo_address
radeonsi: expose internal buffer bindings to compute shaders
radeonsi/gfx12: always set BO metadata, not just during export
radeonsi/gfx12: fix compute register settings for global_atomic_ordered_add
ac/surface: finish display DCC for gfx11.5
ac/surface: finish display DCC for gfx12
radeonsi: add fail_if_slow parameter into compute_clear/copy_buffer
radeonsi: use a hash_table and define a shader key for the DMA compute shader
radeonsi: add dwords_per_thread parameter into si_compute_clear_copy_buffer
radeonsi: clear buffers with a 12B clear value by clearing 4 dwords per thread
radeonsi: rewrite the clear/copy_buffer microbenchmark
radeonsi/ci: update gfx11 failures
radeonsi: replace si_shader::scratch_bo with scratch_va, don’t set it on gfx11+
radeonsi: don’t update compute scratch if the compute shader doesn’t use it
ac: add radeon_info::has_scratch_base_registers
radeonsi: lock a mutex when updating scratch_va for compute shaders
util: make util_idalloc_exists private
util: don’t use variable names that can appear in args of idalloc foreach macros
util: add util_idalloc_sparse, solving the excessive virtual memory usage
mesa: switch ID allocation to util_idalloc_sparse to reduce virtual memory usage
nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering
radeonsi/gfx12: fix a GPU hang due to an invalid packet with window rectangles
radeonsi: ensure TC_L2_dirty is set if we don’t sync after internal SSBO blits
radeonsi: fix buffer coherency issues on gfx6-8,12 due to missing PFP->ME sync
radeonsi/gfx12: fix register programming to fix GPU hangs
radeonsi/gfx12: fix VS output corruption with streamout
ac/surface/gfx12: turn off HiZ for pre-production samples
Mark Burton (1):
gallivm: Fix compilation errors when using LLVM 13.
Mark Collins (21):
vdrm: Add fixed VA parameter for mapping memory
tu: Handle VkDeviceMemory BO unmapping in VkUnmapMemory
tu: Implement VK_EXT_map_memory_placed
docs/features: Add VK_EXT_map_memory_placed
tu/shader: Allow LRZ when write pos with explicit early frag test
tu/lrz: Emit GRAS_LRZ_CNTL2 on A7XX
tu/lrz: Use actual CHIP rather than hardcoding A6XX
fd/a7xx: Initialize magic register 8C34 to 0
fd/a7xx: Initialize magic register 8008 to 0
tu: Allow LRZ on A7XX
tu/lrz: Add structure for LRZ FC layout
tu: Update LRZ FC allocation for A7XX layout
tu: Update LRZ FC dirty clear for A7XX
tu: Specify LRZ FC depth clear value on A7XX
tu: Enable LRZ fast-clear for A7XX
fd/a7xx: Document `LRZ_FLIP_BUFFER` event
docs/freedreno: Add documentation on A7XX LRZ
tu: Emit GRAS_LRZ_DEPTH_BUFFER_INFO correctly
tu/kgsl: Spin unti KGSL reports queue timestamp during profiling
tu/kgsl: Fix profiling buffer GPU IOVA
fd/meson: Only build ‘ds’ when system has DRM
Martin Krastev (2):
svga: convert license block to SPDX
svga: update timespan in copyright message
Martin Roukala (né Peres) (9):
ci/b2c: Reduce the length of the kernel cmdline
nvk+zink/ci: rename the ga106 jobs to be more in line with RADV
nvk+zink/ci/ga106: make the expectations codename-specific
nvk+zink/ci: document more flakes in the ga106
turnip/ci: document a missing flake from the a750_vk job
turnip/ci: bump the a750_vk timeout
turnip+zink/ci: add more flakes to the expectations
radv+zink/ci: document recent flakes
radv/ci: add a bunch of flakes
Mary Guillemard (86):
nak: Pass has_mod to all form of src2 requiring it
panvk: Ensure we lower load_base_workgroup_id to 0
panfrost: Skip new failure from VKCTS 1.3.8.x
nvk, nak: Wire up conservative rasterization underestimate
docs/features: Add EXT_conservative_rasterization for NVK
agx: speed-up dce
panvk: Only clear UBOs descriptors when set isn’t present
nouveau: nvidia_header: Add AMPERE_B class generation
nak: Set SPH version to 4 on SM75+
nak: Migrate sph.rs to use SPH headers defintion
bi: Reformat code
midgard: Reformat code
bi: Alloc replacement array once in opt_cse
pan/lib, panvk: Ensure data_size is on 64 bits
panvk: Fix shader destruction when vk_shader_module_to_nir fail
panvk: Remove panvk_lower_blend
panvk: Remove dynarray from panvk_shader
panvk: Keep panvk_shader alive in panvk_pipeline_shader
panvk: Upload shader in panvk_shader
panvk: Upload copy tables in panvk_shader
panvk: Upload render state in panvk_shader
panvk: Move the linking bits to panvk_shader
panvk: Kill panvk_pipeline_shader and use panvk_shader directly
panvk: Link shaders at draw time
panvk: Move compile logic out of shader_create
panvk: Move NIR lower logic out of shader_create
panvk: Move preprocess logic out of shader_create
panvk: Implement vk_shader
panvk: Remove panvk_pipeline
pan/va: Ensure no clash with other defs in disassembler
bi: Make disassembler take a const void*
midgard: Make disassembler take a const void*
bi: Move bi_disasm definitions to their own header
panfrost: Add pan_shader_disassemble
panvk: Implement executable IR reporting
panvk: Advertise VK_KHR_pipeline_executable_properties
panvk: Generate proper device and driver UUIDs
panvk: Advertise VK_EXT_pipeline_creation_cache_control and VK_EXT_pipeline_creation_feedback
panvk: Advertise VK_EXT_shader_module_identifier
panvk: Advertise VK_KHR_pipeline_library and VK_EXT_graphics_pipeline_library
panvk: Enable pipeline library in CI for Mali-G52
docs: Update features.txt to add panvk for BDA extensions
panvk: Advertise VK_KHR_device_group and VK_KHR_device_group_creation
panvk: Reorder extensions by name
panvk: Advertise VK_KHR_maintenance3
panvk: Add missing null check in DestroyCommandPool
panvk: Add missing clean up in blend_shader_cache_init
panvk: Make mempool detect NULL BOs
panvk: Check for maxBufferSize in panvk_CreateBuffer
panvk: Make panvk_kmod_zalloc use correct allocation scope on non-transient
panvk: Ensure to unref transient bo in reset for mempools
panvk: Fix device mempool leaks
panvk: Add more allocation checks in create_device
panvk: Implement CmdDispatchBase
panvk: Enable device_init, null_handle and object_management in CI for Mali-G52
panvk: Advertise shaderModuleIdentifier feature
panvk: Report correct min value for discreteQueuePriorities
panvk: Enable dEQP-VK.info tests in CI for Mali-G52
panvk: Clamp viewport scissor to valid range
panvk: Enable offscreen_viewport tests in CI for Mali-G52
panvk: Skip dispatch on empty workgroup
panvk: Report proper workgroup invocation and size
panvk: Enable compute pipeline in CI for Mali-G52
panvk: Advertise VK_EXT_private_data
panvk: Do not emit blend shader when color_mask is 0
panvk: Run nir_lower_io_to_vector for fragment shader
panvk: Enable glsl.440.linkage in CI for Mali-G52
panvk: Implement and advertise anisotropy support
panvk: Advertise VK_KHR_sampler_mirror_clamp_to_edge
panvk: Enable texture filtering in CI for Mali-G52
pan/kmod: Avoid deadlock on VA allocation failure on panthor
panfrost: Handle context_init errors correctly
panfrost: Handle gracefully resource BO alloc failures
ci/panfrost: Update t760 fails
rusticl: Add panthor when panfrost is present in RUSTICL_ENABLE
bi: Clean up mem_access_size_align_cb
bi: Enable lower_pack_64_4x16
bi: Lower pack_32_4x8_split and pack_32_2x16_split in algebraic
bi: Enable lower_pack pass in compiler
bi: Implement basic 8-bit vec support
panfrost: Rewrite set_global_binding to make resources truly global
panfrost: Do not recreate bo if already mapped
panfrost: Increase address space to 48-bit
panfrost: Fetch available system memory
panvk: Fix image support in vertex jobs
panvk: Pass attrib_buf_idx_offset to desc_copy_info
MastaG (1):
gallivm: Call StringMapIterator from llvm:: scope
Matt Coster (1):
docs: List VK_EXT_debug_utils
Matt Turner (8):
intel: Build float64 shader only for Vulkan
intel/clc: Free parsed_spirv_data
intel/clc: Free disk_cache
intel/brw: Use REG_CLASS_COUNT
intel/elk: Use REG_CLASS_COUNT
docs: Drop references to LIBGL_DRIVERS_PATH
util: Add ATTRIBUTE_OPTIMIZE(flags)
util: Force emission of stack frame in stack unit test
Mauro Rossi (1):
intel/common: fix building error in intel_common.c
Maíra Canal (7):
v3dv: Use errno when logging an error to stderr
drm-uapi: Update v3d_drm.h
broadcom/common: Add maximum number of perf counters to v3d_device_info
v3dv: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information
v3d: Use DRM_IOCTL_V3D_GET_COUNTER to get perfcnt information
broadcom/simulator: Add DRM_V3D_PARAM_MAX_PERF_COUNTERS parameter support
broadcom/simulator: Add DRM_IOCTL_V3D_GET_COUNTER to simulator
Michel Dänzer (4):
wsi/wayland: Dispatch event queue in wsi_wl_swapchain_queue_present
wsi: Call drmSyncobjQuery only once for all images
egl/dri: Use packed pipe_format
dri: Go back to hard-coded list of RGBA formats
Mike Blumenkrantz (162):
glthread: check for invalid primitive modes in DrawElementsBaseVertex
zink: reconstruct features pnext after determining extension support
zink: prune zink_shader::programs under lock
zink: fully wait on all program fences during ctx destroy
kopper: fix bufferage/swapinterval handling for non-window swapchains
zink: slightly better swapinterval failure handling
kopper: don’t set drawable buffer age
zink: handle swapchain currentExtent special value
zink: clean up accidental debug print
dri: rename ‘implicit’ param from earlier series
tu: support VK_EXT_legacy_vertex_attributes
llvmpipe: add KHR-Single-GL45.arrays_of_arrays_gl.AtomicUsage skip
ci: disable lavapipe-vk-asan job
lavapipe: VK_EXT_legacy_vertex_attributes
zink: clamp buffer_indices_hashlist resets to used region
zink: delete GS conditional in update_so_info
zink: use zink_shader_key_optimal unions for pipeline state asserts
zink: use info.fs.uses_sample_qualifier instead of manual scan
zink: simplify confusing return in rewrite_tex_dest
zink: simplify flagging legacy shadow samplers
zink: rename zink_shader variable in create functions
zink: break out shadow sampler scanning
zink: always block the precompile threads when pruning shaders
zink: more effectively synchronize separate shader program precompiles
zink: use zink_shader type directly in zink_create_gfx_shader_state()
zink: split shader create into 2-stage functions
zink: reorder precompile_separate_shader_job() in file
zink: split generated tcs creation into 2-stage functions
zink: move gfx shader init to thread
zink: reorder some code in zink_create_gfx_program()
zink: reorder fencing in zink_create_gfx_program()
zink: split gfx program creation into 2-stage functions
zink: precompile_job() -> gfx_program_precompile_job()
zink: move blocking gfx program init functions to thread
ci: disable g52
egl/x11: disable dri3 with LIBGL_KOPPER_DRI2=1 as expected
zink: add a batch ref for committed sparse resources
u_blitter: stop leaking saved blitter states on no-op blits
freedreno/replay: use inttypes format string for 64bit
zink: delete unused zink_batch struct member
zink: move in_rp to zink_context
zink: move ref_lock from zink_batch to zink_batch_state
zink: move has_work from zink_batch to zink_batch_state
zink: rename last_was_compute -> last_work_was_compute
zink: move last_work_was_compute from zink_batch to zink_context
zink: move work_count from zink_batch to zink_context
zink: move swapchain from zink_batch to zink_context
zink: rename zink_batch::state -> zink_batch::bs
zink: delete all zink_batch uses from zink_query.c
zink: remove zink_batch usage from zink_clear.c
zink: remove all uses of zink_batch from zink_batch.c
zink: remove all zink_batch usage from zink_resource.h
zink: remove all zink_batch usage from zink_draw.cpp
zink: remove all zink_batch usage from zink_render_pass.c
zink: remove all zink_batch usage from zink_context.c
zink: delete zink_batch
zink: zink_batch_state::has_barriers -> has_reordered_work
zink: reset all the has_work flags in the same place
zink: check all has_work flags for flushes
zink: rely on zink_get_cmdbuf() to set has_work flags
zink: flag has_work in a few more places
zink: stop flagging has_work on batch tracking
zink: don’t submit main cmdbuf if has_work is not set
frontends/dri: only release pipe when screen init fails
frontends/dri: always init opencl_func_mutex in InitScreen hooks
zink: use u_minify for sparse calcs
zink: always commit full miptails
zink: refcount miptails
zink: clean up semaphore arrays on batch state destroy
zink: add a batch array for tracked semaphores
zink: stop leaking sparse semaphores
zink: rework sparse semaphore waits
ci: bump VVL to snapshot-2024wk19
zink: hook up VK_EXT_legacy_vertex_attributes
zink: set all spirv caps for the vvl vtn pass
ci: bump VVL to v1.3.285
zink: make unassigned io variables unreachable
zink: minor tweaks to shader io assignment
zink: outdent assign_producer_var_io()
zink: outdent assign_consumer_var_io()
zink: pass a struct through io assignment functions
zink: track masks of io locations used during linking
zink: unify io assignment
zink: move ‘reserved’ into io assign struct
zink: split slot map between regular varyings and patch
zink: ci updates
egl/dri2: fix error returns on dri2_initialize_x11_dri3 fail
nir/lower_aaline: fix for scalarized outputs
nir/linking: fix nir_assign_io_var_locations for scalarized dual blend
lavapipe: split out DGC into separate file
lavapipe: plumb print_cmds through NV DGC
lavapipe: lvp_indirect_command_layout -> lvp_indirect_command_layout_nv
zink: remove dgc debug mode
zink: add atomic image ops to the ms deleting pass
build/amd: add amd-use-llvm build option
ir3: flag progress from nir_lower_io_to_scalar
ir3: assert that no further optimizations can be done if !progress
gallium: add drawid_offset to draw_mesh_tasks interface
gallium: stop dropping drawid_offset param with util_draw_indirect
vulkan: Update XML and headers to 1.3.287
zink: add HKP to tiler mode switch
lavapipe: fix mesh+task binding with shader objects
mesa/st: fix zombie shader handling for non-current programs
zink: null check pipe loader config before use
zink: split out msaa replication
zink: implement msaa replication with dynamic rendering
radeonsi: enable compute pbo blits
ci: kill filament trace globally
zink: add a driver workaround to disable 2D_VIEW_COMPATIBLE+sparse
zink: free sparse page for miptail on uncommit
zink: remove adreno from broken_cache_semantics driver workaround
egl: deduplicate MESA_image_dma_buf_export enablement
egl: only enable MESA_image_dma_buf_export with PIPE_CAP_DMABUF
lavapipe: maint7
st/pbo: fix MESA_COMPUTE_PBO=spec crash on shutdown
st/pbo_compute: special case stencil extraction from Z24S8
mesa/st: use compute pbo download for readpixels
ci: bump vvl to v1.3.289
zink: add an a750 skip
zink: enable compute pbos for turnip
aux/tc: update docs to indicate replaced buffers have multiple pipe_resources
zink: don’t lower fpow
zink: propagate valid buffer range to real buffer when mapping staging
zink: track the “real” buffer range from replacement buffers
zink: modify some buffer mapping behavior for buffer replacement srcs
zink: move all driverID checks to a helper function
zink: hook up maintenance7
zink: use maint7 to capture venus driver and more accurately use workarounds
mesa/st: load state params for feedback draws with allow_st_finalize_nir_twice
egl/x11/sw: fix partial image uploads
egl/x11/sw: plug in swap_buffers_with_damage handling
winsys/radeon: take the full winsys struct in radeon_get_drm_value()
winsys/radeon: wrap fd access with util function
winsys/radeon: switch to rendernode when card node doesn’t work
winsys/radeon: revert recent changes
glx: directly link to gallium
egl: link with libgallium directly
gbm: link directly with libgallium
loader: delete loader_open_driver()
loader/dri3: check xfixes version in loader_dri3_open()
loader/dri3: avoid killing the xcb connection if dri3 not found
loader/glx: move multibuffers check to loader
egl: use loader’s multibuffer check to deduplicate lots of code
vl/dri3: use loader’s dri3 init code and delete everything else
zink: move image aoa access to nir pass
zink: use PIPE_CAP_NIR_SAMPLERS_AS_DEREF
gallium: install gallium-$version.so to libdir
ci: prune dri from LD_LIBRARY_PATH
dril: rework config creation
llvmpipe: handle vma allocation failure
llvmpipe: only use vma allocations on linux
dri: fix kmsro define
Revert “vl/dri3: use loader’s dri3 init code and delete everything else”
glx: include src/gallium for apple
dri: link with libloader
kopper: check swapchain size after possible loader image resize
pipe-loader: fix driconf memory management
dril: always take the egl init path
egl: fix zink init
dri: fix kms_swrast screen fail
egl/wayland: bail on zink init in non-sw mode if extension check fails
zink: fix partial update handling
Mike Lothian (2):
radeonsi,aco: Run ac_nir_lower_global_access pass
ac/llvm: Remove global access ops handling
Mingcong Bai (2):
meson: set default drivers for ppc, ppc64
meson: set default Vulkan drivers for ppc, ppc64
Mohamed Ahmed (4):
nil: Add a nil_image::compressed bit
nil: Add some helpers for DRM format modifiers
nil: Support creating images with DRM modifiers
nvk: enable rendering to DRM_FORMAT_MOD_LINEAR images
Mykhailo Skorokhodov (2):
egl/wayland: Fix sRGB format look up for config
ci/lima: expect fail of window_8888_colorspace_srgb on wayland
Nanley Chery (29):
intel/isl: Add and use _isl_surf_info_supports_ccs
intel/isl: Reduce halign for disabled CCS on XeHP
intel/isl: Update quote for XeHP’s CCS halign rule
intel/isl: Allow sampling from 3D HIZ_CCS_WT
intel/blorp: Factor bpb into the fast-clear rect
intel/blorp: Allow gfx12 fast-clears without CCS surf
intel/isl: Add and use ISL_DRM_CC_PLANE_PITCH_B
anv: Refactor modifier plane layout queries
intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_PITCH_SCALEDOWN
intel/aux_map: Add and use INTEL_AUX_MAP_META_ALIGNMENT_B
intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_SIZE_SCALEDOWN
intel/isl: Add and use ISL_MAIN_TO_CCS_SIZE_RATIO_XE
intel/isl: Add and use multi-engine surf usage bits
iris: Simplify bo import in memobj_create_from_handle
intel/isl: Assert alignments of surface addresses
anv: Rely on the primary surf usage to disable aux
anv,hasvk: Drop anv_get_isl_format_with_usage
anv: Support multiple aspects in anv_formats_ccs_e_compatible
anv: Rely more on ISL_SURF_USAGE_DISABLE_AUX_BIT
anv: Restrict CCS ISL surface creation to gfx9-11
iris: Add and use comp_ctrl_surf_offset on gfx12
intel/isl: Drop support for the gfx12 CCS ISL surf
intel/isl: Add and use isl_drm_modifier_needs_display_layout
iris,anv: Disable gfx12.0 fast-clears with unaligned pitch
intel/isl: Consolidate some tiling checks for CCS
intel/isl: Require display flag for 512B pitch alignment
intel/isl: Pad the pitch on gfx12.0 for fast-clears
anv+zink/ci: Change sparse test result from crash to fail
intel/isl: Enable Tile4 for CPB surfaces
Natanael Copa (1):
nir/opt_varyings: reduce stack usage
Neha Bhende (2):
svga: Retrieve stride info from hwtnl->cmd.vdecl for swtnl draws
dri: fix macro name check to detect svga driver
Oskar Viljasaar (8):
vulkan/properties: support Android in the property generator
v3dv: constify arguments of vendor/device id getters
v3dv: Use common runtime vk_properties
vulkan/properties: Document RENAMED_PROPERTIES in the property generator
anv: Move completely over to common runtime GetPhysicalDeviceProperties2
hasvk: switch to use runtime physical device properties infrastructure
vulkan: add a property struct setter function
venus: Use common physical device properties
Patrick Lerda (8):
gallium/auxiliary/vl: fix typo which negatively impacts the src_stride initialization
clover: fix pipe_box update regression
clover: fix memory leak related to optimize
r600: fix vertex state update clover regression
mesa/main: fix stack overflow related to the new mipmap code
radeonsi: fix assert triggered on gfx6 after the tessellation update
clover: fix meson opencl-spirv option
st/pbo_compute: fix async->nir memory leak
Paulo Zanoni (31):
isl: add ISL_TILING_64_XE2 to isl_tiling_to_name()
anv/sparse: add the MSAA block shape tables
anv/sparse: we can’t do multi-sampled depth/stencil sparse images
anv/sparse: properly reject sample counts we don’t support
anv/sparse: reject all sample flags that non-sparse doesn’t support
anv/sparse: fix block_size_B when the image is multi-sampled
anv/sparse: exclude Xe2’s Tile64’s non-standard block shapes
anv/sparse: flush the tile cache when resolving sparse images
anv/sparse: enable MSAA for Sparse when applicable
anv: check for VK_RENDERING_SUSPENDING_BIT once at CmdEndRendering
anv+zink/ci: add failures related to multi-sampled sparse binding
anv/sparse: assert a format can’t be standard and non-standard
anv/xe: fix declaration of memory flags for integrated non-LLC platforms
anv/sparse: reject 1D sparse residency images
anv/sparse: fix the image property sizes for multi-sampled images
anv/sparse: fix reporting of VK_SPARSE_IMAGE_FORMAT_SINGLE_MIPTAIL_BIT
intel/isl: pass struct isl_tile_info to choose_image_alignment_el()
anv/sparse: dump info about opaque binds when DEBUG_SPARSE
anv/sparse: fix TR-TT page table bo size and flags
anv/sparse: remove obsolete linear tiling code path
anv/sparse: unify and rework tile size calculation
anv/sparse: use ANV_SPARSE_BLOCK_SIZE instead of tile_size when possible
anv: properly store the engine_class_supported_count values
anv: LNL+ doesn’t need the special flush for sparse
anv: reimplement the anv_fake_nonlocal_memory workaround
iris: fix iris_xe_wait_exec_queue_idle() on release builds
anv/trtt: fix the process of picking device->trtt.queue
anv/xe: try harder when the vm_bind ioctl fails
anv: don’t expose the compressed memory types when DEBUG_NO_CCS
anv: disable CCS for Source2 games on Xe2
intel: fix compute SLM sizes on Xe2 and newer
Pavel Ondračka (13):
r300: replace constant size field with usemask
r300: move dead constants pass earlier for vertex shaders
r300: switch to a new constant remap table format
r300: compact scalar uniforms into empty slots
r300: better packing for immediates
r300/ci: fails list update
r300: fix cycles counting for KIL
r300: fix writemask rewrite when converting to omod
r300: fix for ouput modifier and DDX/DDX
r300: fix RC_OMOD_DIV_2 modifier
r300: missing whitespace in shader stats
r300: vectorization tweaks for R300/R400
r300: bias presubtract fix
Philipp Zabel (7):
etnaviv: drm: Suppress get-param error message for non-existent core
etnaviv: drm: Stop after model query failure
etnaviv: Pass npu to etna_screen_create in a separate parameter
etnaviv: Add a separate NPU pipe
etnaviv: Allow collecing both GPU and NPU specs
etnaviv/nn: Pipe through input/accumulation buffer depth from hwdb
etnaviv: update headers from rnndb
Pierre-Eric Pelloux-Prayer (34):
radeonsi/sqtt: use ac_sqtt_get_shader_mask for spm counters
radeonsi/sqtt: cleanup si_sqtt_add_code_object a bit
radeonsi/sqtt: support sqtt buffer auto-resizing
radeonsi: add new si_shader_binary_upload_at method
radeonsi/sqtt: use si_shader_binary_upload_at to reupload shaders
radeonsi: allocate sqtt and spm buffers in GTT
radeonsi: add testmemperf mem bandwidth test
radv/sqtt: use radeon_check_space before emit_spm_*
radeonsi: use the common SQTT implementation
radeonsi/sqtt: add AMD_THREAD_TRACE_INSTRUCTION_TIMING
ac/sqtt: make VA helpers static
ac/llvm: implement WA in nir to llvm
radeonsi: store the total binary size in si_shader
radeonsi: handle DBG(TEX) after tc_compatible_htile is set
radeonsi/tests: don’t match gfx10_3 baseline for gfx10 family
radeonsi/tests: add a shortcut to re-run only failing tests
ac/surface: reject modifiers with retile_dcc and bpe != 32
radeonsi: add gfx11 workaround for upgraded_depth
ac/nir: don’t use the compute blit for PIPE_FORMAT_R5G6B5_UNORM
radeonsi/tests: update tests baseline
radeonsi/tests: clarify the output when results changes
radeonsi: fix buffer_size in si_compute_shorten_ubyte_buffer
Revert “ac, radeonsi: remove has_syncobj, has_fence_to_handle”
ac/info: remove has_syncobj
winsys/radeon: fill lds properties
radeonsi: fix crash in si_update_tess_io_layout_state for gfx8 and earlier
radeonsi/tests: correctly parse the family name
radeonsi: fix ac_create_shadowing_ib_preamble parameter
radeonsi, radv: bump libdrm_amdgpu version requirement
ci: bump Fedora and Android libdrm2 to 2.4.122
radeonsi: fix si_get_dmabuf_modifier_planes for gfx12
frontends/dri: add error logs to dri2_create_image_from_fd
amd: use a valid size for ac_pm4_state allocation
egl,gbm,glx: fix log message spam
Qiang Yu (8):
glsl: respect GL_EXT_shader_image_load_formatted when image is embedded in a struct
radeonsi: add missing nir_intrinsic_bindless_image_descriptor_amd
nir: fix lower array to vec metadata preserve
nir: fix clip cull distance lowering metadata preserve
nir: add filter parameter to nir_lower_array_deref_of_vec
nir: nir_vectorize_tess_levels support indirect access
nir: consider more deref types when fixup deref
glsl: fix indirect tess factor access for compact_arrays=false drivers
Rebecca Mckeever (9):
panvk: Add jm and bifrost dirs
panvk: Add push_uniform/constant helpers
panvk: Make helper functions panvk_cmd_buffer agnostic
panvk: Move panvk_descriptor_state to bifrost subdir
panvk: Move vkCmdDraw* functions to their own file
panvk: Move vkCmdDispatch* functions to their own file
panvk: Move vkCmd*Event functions to their own file
panvk: Add Valhall DescriptorSetLayout implementation
panvk: Add Valhall Descriptor{Set,Pool} implementations
Renato Pereyra (2):
anv: Attempt to compile all pipelines even after errors
intel/perf: Move sysmacros.h include from header to implementation
Rhys Perry (95):
aco/tests: add tests for hidden breaks/continues
aco/tests: add tests for divergent merge phi with undef
nir/dead_cf: stop reindexing blocks for each non-block cf node
aco/stats: fix s_waitcnt parsing
aco/stats: don’t use VS counter pre-GFX10
aco/waitcnt: fix DS/VMEM ordered writes when mixed
aco: make wait_imm indexable
aco/waitcnt: add target_info
aco/waitcnt: refactor for indexable wait_imm
aco/stats: refactor for indexable wait_imm
aco: add wait_imm::unpack and wait_imm::max
radv: keep track of unaligned dynamic vertex access
aco: form hard clauses in VS prologs
aco: copy VS prolog constants after loads
aco: support VS prologs with unaligned access
aco/util: improve small_vec assertion
radv: advertise VK_EXT_legacy_vertex_attributes
aco: don’t count certain pseudo towards VMEM_STORE_CLAUSE_MAX_GRAB_DIST
aco/tests: support GFX12
aco: add SFPU/ValuPseudoScalarTrans instr class
aco: add GFX11.5+ opcodes
aco: support GFX12 in assembler
aco/tests: add GFX12 assembler tests
aco: don’t change prefetch mode on GFX11.5+
aco/gfx12: disable s_cmpk optimization
aco: add GFX12 wait counters
aco/waitcnt: support GFX12 in waitcnt pass
aco/stats: support GFX12 in collect_preasm_stats()
aco: update VS prolog waitcnt for GFX12
aco/lower_phis: create loop header phis for non-boolean loop exit phis
aco: create lcssa phis for continue_or_break loops when necessary
aco: use scalar phi lowering for lcssa workaround
aco: remove nir_to_aco
aco/lower_phis: don’t create boolean loop header phis in some situations
radv: malloc graphics pipeline stages
aco: support GFX12 in insert_NOPs
aco/gfx12: implement subgroup shader clock
aco/gfx12: implement workgroup barrier
aco/gfx12: sign-extend s_getpc_b64
aco/gfx12: don’t create v_fmac_legacy_f32
aco/gfx12: use ttmp9/ttmp7 for workgroup id
radv/gfx12: don’t add workgroup id shader args
aco/gfx12: remove MIMG vector affinity
aco/gfx12: decrease max_nsa_vgprs for VSAMPLE
aco/gfx12: disallow SCC and most constants for BUF SOFFSET
aco: fix fddx/y with uniform inf/nan input
meson: remove –depfile for aco_tests
ac/llvm: implement load_subgroup_id
aco/gfx12: implement load_subgroup_id
ac/nir: skip subgroup_id/local_invocation_index lowering for gfx12
aco/gfx12: fix s_wait_event immediate
aco: don’t combine vgpr into writelane src0
aco: implement nir_atomic_op_ordered_add_gfx12_amd
aco: implement nir_intrinsic_nop_amd and nir_intrinsic_sleep_amd
ac/nir: support lowering of sub-dword push constants
radv: lower sub-dword push constants
ac/llvm: remove support for sub-dword push constants
aco: remove support for sub-dword push constants
aco/gfx6: set glc for buffer_store_byte/short
aco: inline store_vmem_mubuf/emit_single_mubuf_store
aco: use ac_hw_cache_flags
aco: use GFX12 scope/temporal-hint
ac: stop using radeon_info for ac_get_hw_cache_flags
aco: use ac_get_hw_cache_flags()
aco: remove some missing label resets
nir/opt_loop: rematerialize derefs instead of creating phis
nir/opt_loop: fix formatting
aco: insert s_nop before discard early exit sendmsg(dealloc_vgpr)
radv: lower push constants in NIR
ac/llvm: remove push constants
aco: remove push constants
aco/insert_exec_mask: ensure top mask is not a temporary at loop exits
vtn: ensure TCS control barriers have a large enough memory scope
aco: use 1.5x vgprs for gfx1151 and gfx12
aco: skip continue_or_break LCSSA phis when not needed
aco: use s_pack_ll_b32_b16 for pack_32_2x16_split
aco: combine extracts into s_pack_ll_b32_b16
aco: use s_pack_*_b32_b16 more in p_insert/p_extract lowering
aco: turn split(vec()) into p_parallelcopy instead of p_create_vector
aco: add missing isConstant()/isTemp() checks
aco: fix follow_operand with combined label_extract and label_split
aco: use alignment information in visit_load_constant()
aco: fix wmma raw hazard
aco: replace constant v_bfrev_b32 with v_mov_b32 to create vopd
aco/gfx11: don’t use v_bfrev_b32 with wave64
glsl: always lower non-TCS outputs to temporaries
gallium: remove PIPE_CAP_SHADER_CAN_READ_OUTPUTS
nir/linking_helpers: remove special case for read mesh outputs
nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars
nir/linking_helpers: remove nested IF
radv: remove unecessary nir_remove_unused_varyings cleanup passes
aco/gfx11.5: workaround export priority issue
aco: fix validation of v_s_ opcodes
docs: update ACO_DEBUG documentation for scheduler options
docs: update ACO_DEBUG documentation for perfwarn
Rob Clark (63):
tu: Add missing error path cleanup
tu: Fix a6xx lineWidthGranularity
freedreno/ir3: Skip DAG validation on release builds
llvmpipe: Fix build error with clang-18
freedreno/ci: Switch a618_piglit to deqp-runner
vulkan/android: Add helper to probe AHB support
vulkan: Don’t request Ycbcr conversion for rgb
vulkan: Add helper to resolve Android external format
tu: Skip YUV conversion for RGB formats
tu: Support VkExternalFormatANDROID
freedreno/ci: Remove some skips
freedreno/ci: Remove some obsolete skips
freedreno/ci: Refactor out common a6xx skips list
freedreno/ci: Skip unsupported legacy gl stuff
freedreno/ci: Skip max-texture-size
freedreno/ci: Add a common skips file to a618_piglit.
freedreno/ci: Skip built-in-functions VS/GS tests
freedreno/ci: Skip some slow tests
freedreno/ci: Increase a630/a618 piglit fraction
freedreno/ir3: Fix ldg/stg offset
egl/android: Fix sRGB visuals
docs/features: Add missing AHB for tu
tu: Don’t advertise AHB handle time on non-android
freedreno: Namespace DEFINE_CAST()
virgl: Update headers
loader: Add better support for virtgpu nctx driver loading
freedreno/loader: Switch over to probe_nctx
vulkan/android: Fix suggestedYcbcrModel with !mapper4
tu: Fix imageview + ahb
vulkan/android: Fix YcbcrRange for !mapper4
ir3: Add some more missing progress accumulation
gallium/tc: Add optional buffer replacement limit
freedreno: Use buffer replacement limit
gallium/tc: Allow replacement if replacing valid_range
freedreno/drm: Add rd dumper support
st/mesa/pbo: Set src type on image_store
freedreno: Handle non-null cb with null buffer
u_blitter+d3d12: Move stencil fallback clear to caller
freedreno/a6xx: Implement S8 support
freedreno: Implement stencil blit fallback
freedreno: Use LINEAR for staging resources
freedreno/a6xx: Drop 16b packed image formats
freedreno/bc: Rework flush order
freedreno/a6xx: Tweak blitter traces
freedreno/a6xx: Skip blitter for L/A conversions
freedreno/a6xx: Add more format swizzles
freedreno/a6xx: Allow blit based transfers
freedreno: Enable the X1-85
tu: Fix issues with 16k (or larger) page sizes
freedreno/drm/virtio: Fix issues with 16k (or larger) page sizes
freedreno/a6xx: Implement reg stomper support
freedreno/a7xx: Fix GRAS_UNKNOWN_80F4 writes
freedreno/cffdec: Fix a7xx CP_EVENT_WRITE decoding
tu/drm/virtio: Add missing a7xx case
freedreno/drm: Handle a7xx case
freedreno: Move GENX/CALLX magic to common
freedreno: Extract out common UBWC helper
freedreno: Extract out shared LRZFC layout helpers
freedreno/a6xx: Allocate lrcfc when needed for direction tracking
freedreno/a6xx: Refactor CP_EVENT_WRITE emit
freedreno/a6xx: Rework CCU_CNTL emit for a7xx
freedreno/a6xx: Initial a7xx support
gallium: Add option to not add version to libgallium filename
Robert Mader (3):
egl: Implement EGL_EXT_config_select_group
egl: Implement EGL_MESA_x11_native_visual_id
egl/x11: Allow all RGB visuals to match 32-bit RGBA EGLConfigs
Rohan Garg (21):
anv: formatting fix when printing pipe controls
anv: allocate space for generated indirect draw id’s using the temporary allocation helper
intel/brw: update Xe2 max SIMD message sizes
Revert “iris: slow clear higher miplevels on single sampled 8bpp resources that have TILE64”
intel/eu/xe2+: Fix src1 length bits of SEND instruction with UGM target.
intel/brw: Advertise fp64 atomic add’s when we have 64 bit float support and a LSC
intel/brw: We no longer have atomic fmin/fmax ops for fp64 in xe2
intel/genxml: add the new state byte stride instruction
intel/genxml: update 3DSTATE_CPSIZE_CONTROL_BUFFER for xe2+
isl: enable compression for CPS buffers on xe2+
intel/genxml: update CFE_STATE for LNL
intel/genxml: Update XY_BLOCK_COPY_BLT
intel/genxml: update MI_SEMAPHORE_WAIT for Xe2
intel/genxml: Update STATE_COMPUTE_MODE for Xe2
anv: 3D stencil surfaces have fewer layers for higher miplevels
isl: disable CCS for 3D depth/stencil surfaces when WA is applicable
isl: Enable volumetric STC_CCS,HiZ+CCS on gfx12.0
intel/genxml: Add RESOURCE_BARRIER for xe2
intel/compiler: fix shuffle generation on LNL
anv: flag WSI images as scanout images for ISL
anv: reuse existing macro to query for flushes
Roland Scheidegger (1):
lavapipe: add option to enable snorm blending
Romain Naour (1):
glxext: don’t try zink if not enabled in mesa
Roman Stratiienko (11):
vulkan/android: Add basic u_gralloc support
vulkan/android: Add common vkGetSwapchainGrallocUsage{2}ANDROID
vulkan/android: Add android buffer classification to vk_image
vulkan/android: Add common helpers for the ANB extension
vulkan/android: Add common helpers for the AHB extension
vulkan/android: Add common vkGetAndroidHardwareBufferPropertiesANDROID
turnip/android: Migrate to common ANB code
v3dv/android: Migrate ANB and AHB to use common helpers
u_gralloc/fallback: Extract modifier from QCOM native_handle
turnip/android: Use DETECT_OS_ANDROID in tu_device
turnip/android: Use DETECT_OS_ANDROID in freedreno_rd_output
Romaric Jodin (1):
intel/brw: allocate large table in the heap instead of the stack
Ruijing Dong (14):
radeonsi/vcn: add vcn5 encoding interface change
radeonsi/vcn: add vcn5.0 for h264 enc only
radeonsi/vcn: add hevc support for vcn5
radeonsi/vcn: enable decoding in vcn5.
radeonsi/vcn: correct tile_size_bytes_minus1
radeonsi/vcn: add cdef modes for vcn5 encoding
radeonsi/vcn: apply cdef mode to vcn5
radeonsi/vcn: share functions between vcn4/vcn5
frontends/va: parsing uniform_tile_spacing flag
radeonsi/vcn: add header files for vcn5 av1 tile
radeonsi/vcn: enable av1 encoding in vcn5
radeonsi/vcn: enable roi feature for vcn5
radeonsi/vcn: remove tile_config_flag
radesonsi/vcn: update vcn4 tile processing logic
Ryan Neph (7):
venus: reclaim signal semaphore feedback resources for wasteful clients
venus: sync headers for VK_EXT_external_memory_acquire_unmodified
venus: enable VK_EXT_external_memory_acquire_unmodified
venus: factor image memory barrier fixes to common implementation
venus: refactor image memory barrier fix storage and conventions
venus: skip barrier fixes as early as possible
venus: chain VkExternalMemoryAcquireUnmodifiedEXT for wsi ownership transfers
Rémi Bernon (2):
zink: Add VKAPI_PTR specifier to zink_stub_function_not_loaded.
zink: Add VKAPI_PTR specifier to generated stub functions.
Sagar Ghuge (8):
intel/compiler: Fix destination type for CMP/CMPN
intel/disasm: Fix cache load/store disassembly for URB messages
iris: Load 32-bit MMIO PREDICATE register from buffer
intel/compiler: No need to re-type the destination register
intel/fs: Adjust destination register size for untyped atomic on Xe2+
intel/fs: Adjust destination register size for global atomic on Xe2+
intel/compiler: Don’t use half float param for sample_b
intel/compiler: Add indirect mov lowering pass
Samuel Pitoiset (399):
radv: fix image format properties with fragment shading rate usage
docs: Add an alternative way to debug GPU hangs with RADV
radv/rt: add radv_ray_tracing_state_key
radv/rt: pass radv_ray_tracing_state_key to radv_rt_pipeline_compile()
radv/rt: rework the helper that hashes a ray tracing pipeline
radv/ci: add more flakes
radv: simplify DB_Z_INFO.NUM_SAMPLES with null ds target on GFX11
radv: remove bogus VkShaderCreateInfoEXT::flags being 0 assert for compute
radv: simplify radv_emit_primitive_restart_enable()
radv: inline radv_get_pa_su_sc_mode_cntl() in radv_emit_culling()
radv: remove useless DB_Z_INFO.NUM_SAMPLES when emitting the MSAA state
radv: pre-compute VGT_TF_PARAM.DISTRIBUTION_MODE
radv: use the bound GS copy shader when emitting shader objects
radv: add GS copy shader BO to the cmdbuf BO list at bind time
radv: add RT prolog BO to the cmdbuf BO list at bind time
radv: add shaders BO to the cmdbuf BO list at bind time
radv: emit compute pipelines directly from the cmdbuf
radv: precompute compute/task shader register values
radv: clear unwritten color attachments for monolithic PS earlier
radv: compact SPI_SHADER_COL_FORMAT as late as possible
radv: rename col_format_non_compacted to spi_shader_col_format
radv: store cb_shader_mask for fragment shaders and epilogs
radv: add a new dirty state for emitting the color output state
radv/ci: document a recent regression on GFX6-8
radv: split cmdbuf dirty flags into dirty/dirty_dynamic
radv: precompute existing legacy GS register values later
radv: precompute fragment shader register values
radv: precompute mesh shader register values
radv: precompute legacy GS register values
radv: precompute vertex shader register values
radv: precompute DB_SHADER_CONTROL for fragment shaders later
vulkan: Update XML and headers to 1.3.284
aco: add support for remapping color attachments
radv: implement VK_KHR_dynamic_rendering_local_read
radv: advertise VK_KHR_dynamic_rendering_local_read
radv: add a new mechanism for tracking registers per cmdbuf
radv: move common registers between VS/GS and NGG
radv: precompute NGG register values
radv: remove unused parameter to radv_pipeline_emit_pm4()
radv: stop recomputing the last VGT API stage when emitting graphics shaders
radv: do not emit non-context registers to radv_pipeline::ctx_cs
radv: track and bind more VRS states from the graphics pipeline
radeonsi: remove the _unused parameter in all radeon_xxx macros
radv: remove gfx10_emit_ge_pc_alloc()
radv: do not emit VGT_GS_OUT_PRIM_TYPE to ctx_cs on GFX11
radv: simplify radv_emit_hw_ngg() slightly
radv: simplify radv_emit_hw_vs() slightly
radv: simplify radv_emit_hw_gs() slightly
radv: fix the late scissor workaround for GFX9 since a recent refactoring
radv: make radv_conv_gl_prim_to_gs_out() a non-static function
radv: emit graphics pipelines directly from the cmdbuf
radv: add graphics shaders context registers that need to be tracked
radv: add more radeon_opt_set_xxx variants
radv: track all graphics shaders context registers
radv: simplify radv_emit_ps_inputs() slightly
radv: stop using radv_physical_device for radeon helpers
radv: introduce radeon_set_reg_seq()
radv: remove redundant radeon_set_perfctr_reg() helper
radv: rename radeon perfctr uconfig helpers
radv: add a helper to configure ring buffer descriptors
radv: only enable VK_MESA_image_alignment_control on GFX9-11.5
radv: reject unsupported buffer formats earlier
ac,radv,radeonsi: add a helper to translate buffer numformat
ac,radv,radeonsi: add a helper to translate buffer dataformat
radv: simplify radv_emit_default_sample_locations()
radv: pass radv_physical_device to radv_emit_default_sample_locations()
radv: use float instead of double for viewport zscale/ztranslate
radv: add more helpers to emit viewports
radeonsi: refactor si_translate_border_color()
ac,radv,radeonsi: introduce a helper to build a sampler descriptor
radv: stop checking the output value of radv_translate_tex_numformat
radv: use PIPE_FORMAT in radv_translate_tex_numformat()
ac,radv,radeonsi: add a function to translate tex numformat
radv: use PIPE_FORMAT in radv_translate_colorswap()
ac,radv,radeonsi: add a function to translate colorswap
radv: use PIPE_FORMAT in radv_translate_dbformat()
ac,radv,radeonsi: add a function to translate db format
ac,radv,radeonsi: add a function to get the color format endian swap
radv: allow 3d views with VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
radv: simplify creating gfx10 texture descriptors for sliced 3d/2d view of 3d
radv: remove redundant check for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 on GFX6-10
radv: stop checking the return value of ac_get_cb_number_type()
radv: add radv_is_colorbuffer_format_blendable()
amd/common: move some format related helpers to ac_formats.c
ac,radv,radeonsi: add helper to know if a format is supported by CB
ac,radv,radeonsi: add helper to know if a format is supported by DB
ac,radv,radeonsi: introduce a helper to build a FMASK descriptor
radeonsi: reject some texture formats but only on GFX8/CARRIZO
radv: mark some formats as unsupported on GFX8/CARRIZO
radv: set image view descriptors as buffer for non-graphics GPU
ac,radv,radeonsi: add a helper to get the tile mode index
radv: tidy up swizzle in radv_set_mutable_tex_desc_fields()
radv: tidy up custom pitch for gfx10.3 in radv_set_mutable_tex_desc_fields()
radv: tidy up meta_va in radv_set_mutable_tex_desc_fields()
radv: set ITERATE_256 for GFX10+ in radv_set_mutable_tex_desc_fields()
radv: stop clearing unnecessary bitfields in radv_set_mutable_tex_desc_fields()
ac,radv,radeonsi: add a helper to set mutable tex desc fields
ac,radv,radeonsi: add a function for getting border color swizzle
radv: only set ALPHA_IS_ON_MSB if the image has DCC on GFX6-9
radv: adjust determining if alpha should be on MSB
ac,radv,radeonsi: add function to determine if alpha should be on MSB
ac,radv,radeonsi: add a common helper for translating swizzle
amd/common: only pass gfx_level to ac_get_gfx10_format_table()
amd/common: add new helpers to build buffer descriptors
radv: use new common helpers for building buffer descriptor
aco: use new common helpers for building buffer descriptors
radv: remove unused radv_translate_buffer_dataformat()
radv: remove useless check about FIXED formats
radv: simplify radv_is_vertex_buffer_format_supported()
radv: replace vk_to_non_srgb_format() by vk_format_no_srgb()
radv: fix setting a custom pitch for CB on GFX10_3+
radv: replace db_{z,stencil}_{read,write}_base by db_{depth,stencil}_base
radv: tidy up radv_initialise_ds_surface()
radv: separate non-mutable vs mutable fields for ds surface
amd/common: add a function to initialize ds surface
radv: use the common helper for initializing DS surfaces
radeonsi: use the common helper for initializing DS surfaces
radv: only enable DB_STENCIL_INFO.ITERATE_FLUSH when necessary
radv: do not check image usage for ITERATE256 with TC-compat HTILE
ac,radv,radeonsi: add function to get the number of ZPLANES
ac,radv,radeonsi: a function that sets mutable DS surface fields
ac,radv,radeonsi: move ZRANGE_PRECISION to mutable DS fields
amd/common: add ac_gpu_info::has_etc_support
radv: use PIPE_FORMAT in radv_translate_tex_dataformat()
ac,radv,radeonsi: add a function to translate texture data format
amd/common: add a helper to set the third word of buffer descriptor
ac,radv: add a stride parameter to ac_build_attr_ring_descriptor()
radeonsi: use common build buffer descriptor helpers
radv: align DCC control settings to RadeonSI for GFX8
amd/common: add a function to initialize cb surface
radv: use the common helper for initializing CB surfaces
radeonsi: use the common helper for initializing CB surfaces
ac,radv,radeonsi: a function that sets mutable CB surface fields
radv: apply the workaround for no PS inpputs and LDS on GFX11 only
radv: apply the SQ_THREAD_TRACE_WPTR workaround on GFX11 only
radv: fix flushing DB meta cache on GFX11.5
radv: only emit streamout enable for legacy streamout
amd/common: define SDMA v7.0 for GFX12
ac,radv,radeonsi: add ac_gpu_info::has_tc_compatible_htile
radv: update NUM_THREAD_FULL bitfields
radv: enable GS_FAST_LAUNCH_2 by default on GFX12
radv: do not emit non-existent registers on GFX12
radv: update configuring sample locations on GFX12
radv: update configuring viewport/scissor on GFX12
radv: update configuring PS states on GFX12
radv: update configuring NGG states on GFX12
radv: update configuring VGT states on GFX12
radv: update configuring DB states on GFX12
radv: update configuring rasterization states on GFX12
radv: update configuring some CB states on GFX12
radv: update configuring occlusion query state on GFX12
radv: update configuring MSAA state on GFX12
radv: update configuring GFX preamble on GFX12
radv: update configuring tess rings on GFX12
radv: update binning settings on GFX12
radv: update emitting discard rectangles on GFX12
radv: update shader input arguments for GS stage on GFX12
aco: adjust loading local invocation ID for GS on GFX12
radv: do not emulate clear state for shadowed regs on GFX12
radv: update cache flush emission on GFX12
radv: update emitting stipple line on GFX12
radv: disallow merging multiple draws into one wave on GFX12
radv: emit SQ_NON_EVENT packets after drawing with streamout on GFX12
radv: update configuring the number of patch control points on GFX12
radv: update configuring VGT_SHADER_STAGES_EN on GFX12
radv: enable GE_CNTL.DIS_PG_SIZE_ADJUST_FOR_STRIP on GFX12
radv: update NUM_THREAD_FULL bitfields on GFX12
radv: update global graphics shader pointers on GFX12
radv: update SDMA resource type on GFX12
radv: update VS input VGPRs on GFX12
radv: do not enable HTILE for depth/stencil storage images
radv: allow STORAGE for depth formats
radv: fix configuring the number of patch control points on GFX6
radv: configure DB_Z_INFO.NUM_SAMPLES on GFX12
radv: configure DB_RENDER_CONTROL to zero on GFX12
radv: do not enable MEM_ORDERED on GFX12
radv: update configuring the attribute ring on GFX12
radv: do not flush L2 metadata on GFX12
radv: mark all images coherent with TC L2 on GFX12
radv: update configuring SPI_SHADER_PGM_LO_LS on GFX12
radv: update configuring SPI_SHADER_PGM_LO_ES on GFX12
radv: update configuring SPI_SHADER_PGM_RSRC4_{HS,GS,PS} on GFX12
radv: update configuring GE_CNTL.PRIM_GRP_SIZE_GFX11 on GFX12
radv: update configuring SPI_PS_IN_CONTROL on GFX12
radv: configure PA_SC_HISZ_CONTROL on GFX12
radv: configure SPI_SHADER_GS_OUT_CONFIG_PS on GFX12
radv: update configuring GS_VGPR_COMP_CNT on GFX12
radv: do not set DX10_CLAMP on GFX12
radv: fix VRS subpass attachments with mipmaps
ac,radeonsi: set COLOR_SW_MODE for mutable CB surfaces on GFX12
radv: configure PA_SC_SAMPLE_PROPERTIES on GFX12
radv: update number of input VGPRs for VS on GFX12
radv: update configuring color buffers on GFX12
radv: update configuring depth stencil buffers on GFX12
radv: update configuring PA_SC_WINDOW_SCISSOR on GFX12
radv: do not emit SPI_SHADER_PGM_RSRC3_GS on GFX12
radv: fix configuring NGG registers on GFX12
radv: do not set VGT_PRIMITIVEID_EN.PRIMITIVEID_EN on GFX12
radv: cleanup radv_precompute_registers_hw_{ngg,fs}
radv: assert that GDS/GDS OA buffers can’t be created on GFX12
radv: only set valid bitfields for CB/DS surfaces address
radv: only emit VGT_GS_MAX_PRIMS_PER_SUBGROUP on GFX9
radv: only emit SQ_PERFCOUNTER_MASK on GFX7-9
radv: do not set VGT_SHADER_STAGES_EN.DYNAMIC_HS on GFX9
radv: only emit SPI_SHADER_PGM_SRC3_GS on GFX7+
radv: only emit CB_COLOR0_DCC_CONTROL on GFX8
radv: use pipe_format when building image view descriptors
ac,radv,radeonsi: add a function to build texture descriptors
amd/common: add MIN_LOD for texture descriptors on GFX12
Revert “radv/ci: Bring back vkcts-navi21-llvm-valve”
radv: update configuring depth clamp enable on GFX12
radv: update configuring COVERAGE_TO_SHADER_SELECT on GFX12
radv: fix emitting VGT_PRIMITIVEID_RESET in the GFX preamble on GFX12
radv: only set valid bitfields for CB/DS surfaces address on GFX12
radv: add a helper to get image VA
ac,radeonsi import PM4 state from RadeonSI
ac,radeonsi: add a function to initialize compute preambles
radv: initialize compute preambles with the common helper
radv: fix creating unlinked shaders with ESO when nextStage is 0
radv: pass a radv_shader to radv_get_compute_pipeline_metadata()
radv: don’t assume that TC_ACTION_ENA invalidates L1 cache on gfx9
ac,radv: add a helper for SQTT control register
ac,radv,radeonsi: add more parameters to ac_sqtt
amd: allow to emit privileged config registers in PM4
amd: mark more registers that need RESET_FILTER_CAM in PM4
amd: add a common implementation for SQTT using PM4
radv: emit more consecutive registers for SQTT on GFX8-9
radv: use the common SQTT implementation
radv: update VGT_TESS_DISTRIBUTION.ACCUM_ISOLINE value
radv: do not set registers set by CLEAR_STATE in the preamble on GFX10-11.5
radv: emit SPI_GS_THROTTLE_CNTL1 when the attr ring is emitted
radv: fix incorrect buffer_list advance for multi-planar descriptors
radv: use BDA in the DGC prepare shader
radv: remove dynamic uniform/storage buffers support with DGC
radv: do not use nir_pkt3() when the packet len is constant with DGC
radv: add new macros for emiting packets in DGC
radv: remove redundant nir_builder param in some DGC helpers
radv: add a helper to load the pipeline VA for DGC
radv: store a pointer to the logical device in dgc_cmdbuf
radv: allow VK_NV_device_generated_commands_{compute} with LLVM
radv: always save/restore all shader objects for internal operations
radv: update configuring WALK_ALIGN8_PRIM_FITS_ST on GFX12
ac/surface: add NBC view support on GFX12
radv: declare a new user SGPR for the streamout state buffer on GFX12
radv/nir: lower nir_intrinsic_load_xfb_state_address_gfx12_amd
radv: implement streamout on GFX12
radv: force using indirect descriptor sets for indirect compute pipelines
radv: emit indirect sets for indirect compute pipelines with DGC
radv: fix emitting indirect descriptor sets in the DGC prepare shader
radv: cleanup getting AC_UD_TASK_RING_ENTRY for mesh shader
radv: use radv_shader_info::user_data_0 for task shaders
radv: remove dead mesh shader code for indirect draws
radv: remove useless masking in radv_cs_emit_indirect_mesh_draw_packet()
radv: remove useless draw_id to radv_emit_userdata_task()
radv: add the DGC preprocess BO to the cmdbuf BO list
radv/amdgpu: allow cs_execute_ib() to pass a VA instead of a BO
radv/amdgpu: use the non-IB path for dumping CS with external IBs
ac/parse_ib: dump PKT3_DISPATCH_{TASKMESH_GFX,TASKMESH_DIRECT_ACE}
radv/amdgpu: fix chaining CS with external IBs on compute queue
radv: add a helper to execute a DGC IB
radv: add support for computing the DGC ACE IB size
radv: prepare for DISPATCH_TASKMESH_GFX emission in the DGC shader
radv: prepare for DISPATCH_TASKMESH_DIRECT_ACE emission in the DGC shader
radv: refactor some DGC helpers in preparation for the ACE IB
radv: add a helper to pad DGC IB
radv: add support for preparing the ACE IB in DGC
radv: add support for executing the DGC ACE IB
radv: fix incorrect cache flushes before decompressing DCC on compute
radv: improve clarity of DGC offset computations
radv: pre-compute the base upload offset in radv_prepare_dgc()
radv: add a helper that determines if DGC uses task shaders
radv: split allocating and emitting push constants with DGC
radv: rework emitting push constants with DGC
radv: reserve space for push constants in the DGC ACE IB
radv: adjust the base upload offset when DGC uses task shaders
radv: emit push constant for task shaders with DGC
radv: disable conditional rendering with DGC and task shaders
radv: fix a synchronization issue with non-preprocessed DGC with task shader
radv: enable task shaders support with NV DGC
radv: suspend user conditional rendering when DGC has task shaders
radv: rename radv_get_user_sgpr() to radv_get_user_sgpr_info()
radv: add radv_get_user_sgpr{_loc}() helpers
radv: use radv_get_user_sgpr_loc() for the GS copy shader too
radv: remove unused parameter to dgc_emit_draw_mesh_tasks_ace()
radv: do not emit compute userdata for empty dispatches
radv: cleanup using vtx_base_sgpr for userdata with DGC
radv: use radv_dgc_with_task_shader() more
radv: move radv_CmdPreprocessGeneratedCommandsNV() to radv_cmd_buffer.c
radv: use the graphics pipeline from the DGC info
radv: use radv_get_user_sgpr() more in DGC
vulkan: Update XML and headers to 1.3.289
radv: advertise VK_KHR_maintenance7
ci: bump vkd3d-proton to 3d46c082906c77544385d10801e4c0184f0385d9
radv: remove unused parameter to radv_pipeline_import_retained_shaders()
radv: simplify importing libraries with retained shaders
radv: remove unused get_vs_output_info() function
radv: remove unnecessary radv_pipeline_has_ngg() function
radv: move radv_hash_shaders() to radv_graphics_pipeline.c
radv: simplify determining when the rasterization primitive is unknown
radv: simplify determining when a VS prolog is needed
radv: stop passing a pipeline to some graphics related helpers
radv: rework generating all graphics state for compiling pipelines
radv: remove radv_descriptor_set_layout::shader_stages
radv: use blake3 for hashing descriptor set layouts
radv: use blake3 for hashing pipeline layouts
radv: disable VK_EXT_sampler_filter_minmax on TAHITI and VERDE
ac,radeonsi: add ac_is_reduction_mode_supported()
radv: use ac_is_reduction_mode_supported()
radv: fix marking RADV_DYNAMIC_COLOR_ATTACHMENT_MAP as dirty
nir/gather_info: handle uses_fbfetch_output for sparse image loads
nir/gather_info: handle uses_fbfetch_output for texture operations
radv: destroy the perf counter BO in radv_device_finish_perf_counter()
radv: add radv_device_init_perf_counter()
radv: add helpers for init/deinit device memory cache
radv: add helpers for init/deinit RGP
radv: simplify keeping shader info for GPU hangs debugging
radv: add radv_device_init_trap_handler()
radv: add helpers for init/deinit device fault detection
radv: add radv_device_init_rmv()
radv: regroup all tools initialization in one helper
radv: use zero allocation for the device queues
radv/meta: remove non-valuable comments
radv/meta: remove unnecessary blit2d_dst_temps struct
radv/meta: remove redundant check for hw resolve pipelines
radv/meta: remove unused number of rectangles for internal operations
radv/meta: remove useless checks for NULL handles before destroying
radv/meta: add a helper to create compute pipeline
radv/meta: add a helper to create pipeline layout
radv/meta: add a helper to create descriptor set layout
zink/ci: skip arb_shader_image_load_store also on NAVI31/VANGOGH
zink/ci: remove redundant arb_shader_image_load_store skips on POLARIS10
radv: do not expose ImageFloat32AtomicMinMax on GFX11_5
radv: fix programming DB_RENDER_CONTROL for NULL depth/stencil on GFX11_5
radv: expose BufferFloat32AtomicMinMax on GFX11_5
radv: disable SPM trace on GFX11_5
ac/rgp: assume GFX11_5 use the same SQTT/RGP versions as GFX11
radv: allow to capture with RGP on GFX11_5
radv/meta: fix potential race condition when creating the copy VRS pipeline
radv/meta: rework creating the VRS copy HTILE pipeline
radv/meta: remove the depth resummarize operation
radv/meta: avoid potential NULL deref with the gfx depth decompress pipeline
radv/meta: move locking around the gfx depth decompress pipeline
radv/meta: remove unused parameter to radv_get_depth_pipeline()
radv/meta: rework creating the gfx depth decompress pipeline
radv/meta: create the compute depth decompress pipeline on-demand
radv/meta: cleanup creating the compute depth decompress pipeline
radv/meta: separate creating the fill/copy pipelines
radv/meta: create the fill/copy pipelines on-demand
radv/meta: cleanup radv_device_init_meta_blit_{color,depth,stencil]()
radv/meta: move the locking around creating blit pipelines
radv/meta: cleanup meta_emit_blit()
radv/meta: rework creating blit pipelines
radv/meta: create fmask expand layouts regardless on-demand
radv/meta: rework creating FMASK expand pipelines
radv/meta: create fmask copy layouts regardless on-demand
radv/meta: rework creating copy expand pipelines
radv/meta: fix potential race condition when creating DCC retile pipelines
radv/meta: fix potential memleak when creating DCC retile pipelines
radv/meta: rework creating DCC retile pipelines
radv/meta: remove useless memset when destroying DCC retile state
radv/meta: rework creating GFX depth/stencil resolve pipelines
radv/meta: rework creating GFX color resolve pipelines
radv/meta: rework creating compute color resolve pipelines
radv/meta: rework creating compute depth/stencil resolve pipelines
radv/meta: cleanup creating HW resolve pipelines
radv/meta: rework creating HW resolve pipelines
radv/meta: rework creating DCC decompress compute pipelines
radv/meta: rework creating clear HTILE mask pipeline
radv/meta: create clear HTILE mask pipeline on-demand when needed
radv/meta: create DCC comp-to-single pipelines on-demand when needed
radv/meta: add a helper to create itob pipelines
radv/meta: create itob pipelines on-demand when needed
radv/meta: add a helper to create btoi pipelines
radv/meta create btoi pipelines on-demand when needed
radv/meta: add a helper to create btoi r32g32b32 pipeline
radv/meta: create btoi r32g32b32 pipeline on-demand when needed
radv/meta: update the helper that creates itoi pipelines
radv/meta: create itoi pipelines on-demand when needed
radv/meta: add a helper to create itoi r32g32b32 pipeline
radv/meta: create itoi r32g32b32 pipelines on-demand when needed
radv/meta: update the helper that creates clear pipelines
radv/meta: create clear pipeliones on-demand when needed
radv/meta: add a helper to create clear r32g32b32 pipeline
radv/meta: create clear r32g32b32 pipelines on-demand when needed
radv: fix shaders cache corruption with indirect pipeline binds
radv/meta: stop checking that creating NIR shaders failed
radv/meta: remove unnecessary goto
radv/meta: stop creating similar pipeline layouts for depth decompress
radv/meta: create the layouts for blit pipelines on-demand
radv/meta: create the layouts for FS resolve pipelines on-demand
radv/meta: create the layouts for depth decompress on-demand
radv/meta: create the layouts for FMASK copy on-demand
radv/meta: create the layouts for FMASK expand on-demand
radv/meta: create the layouts for compute resolve on-demand
radv/meta: create the louts for DCC comp-to-single clear on-demand
radv/meta: rework getting clear color pipelines
radv/meta: create the layout for clear color on-demand
radv/meta: rework getting depth stencil clear pipelines
radv/meta: create the layout for clear depth/stencil on-demand
Saroj Kumar (2):
mesa: Add functions to print blake3
mesa: replace shader_info::source_sha1
Sathishkumar S (3):
util/format: add planar3 y8_u8_v8_440 pipe format
frontends/va,gallium/vl: add support for yuv440 format
radeonsi/vcn: enable yuv440 jpeg decode
Sebastian Wick (1):
vulkan/wsi/wayland: refactor wsi_wl_swapchain_wait_for_present
Sergi Blanch Torne (21):
mr-label-maker: specialize CI labels
ci: kernel stored in a different s3 bucket
ci: identify and label S3 buckets
ci: disable Collabora’s farm due to maintance
Revert “ci: disable Collabora’s farm due to maintance”
ci: fix stress counter in run’n’monitor
ci: disable Collabora’s farm due to maintenance
Uprev Piglit to cf8daaf5ba90fc9b8a0e144355026e2a14c79944
Revert “ci: disable Collabora’s farm due to maintenance”
ci: disable Collabora’s farm due to runners maintenance
Revert “ci: disable Collabora’s farm due to runners maintenance”
ci: continue stress run’n’monitor
ci: Fix parse GitLab pipeline url
ci: run_n_monitor, collect and summarize
ci: disable Collabora’s farm due to maintenance
Revert “ci: disable Collabora’s farm due to maintenance”
ci: run_n_monitor, arguments review and unicode
ci: run_n_monitor, pretty duration with padding
ci: run_n_monitor, listing job names with a padding
ci: run_n_monitor, sort by name when listing jobs
ci: fix run_n_monitor single execution
Sil Vilerino (5):
d3d12: Fix static analysis issues due to bad parenthesis closing
nir: Mark variable as ASSERTED to fix unused variable warning treated as error
d3d12: Video Encode - Fix inputs for older OS support query cap
d3d12: Add missing case for CQP in d3d12_video_encoder_disable_rc_qualitylevels
Revert “d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported” This reverts commit d6bb4ddc638f3ee37fbbe066c631dad80aaeb2d3. Fixes: d6bb4ddc638 (“d3d12: Video Encode - Remove PIPE_VIDEO_PROFILE_MPEG4_AVC_BASELINE as not supported”)
Simon Ser (1):
glapi: fix param type in TexGenxOES
Sushma Venkatesh Reddy (4):
drm-uapi: Sync i915_drm.h with a78313bb206e
anv/drirc: add option to provide low latency hint
anv: Fix I915_PARAM_HAS_CONTEXT_FREQ_HINT check
intel/clflush: Utilize clflushopt in intel_invalidate_range
Sviatoslav Peleshko (5):
anv: Fix descriptor sampler offsets assignment
anv,driconf: Add fake non device local memory WA for Total War: Warhammer 3
intel/brw: Actually retype integer sources of sampler message payload
intel/elk: Actually retype integer sources of sampler message payload
mesa: Fix PopAttrib not restoring states that changed on deeper stack level
Tapani Pälli (14):
iris: change stream uploader default size to 2MB
anv: skip gfx push constants alloc optimization on gfx9/11
iris: ForceZeroRTAIndexEnable if last geom stage does not write layer id
vulkan/runtime: add a subpass bit for legacy dithering
anv: VK_EXT_legacy_dithering support
docs/features: add VK_EXT_legacy_dithering
ci: update failures list with angle for jsl, tgl
anv/android: enable emulated astc for applications
anv: implement WA 14018283232
mesa: remove some conditions in mipmap code
isl: fix condition for enabling sampler route to lsc
isl/iris/anv: provide drirc toggle intel_sampler_route_to_lsc
anv: move some pc was to batch_emit_pipe_control_write
anv: fix a cmd_buffer reference in simple shader
Tatsuyuki Ishi (5):
radv: Remove radv_queue::device again
vk_entrypoints_gen: Add missing ATTR_WEAK for instance and physdev entrypoints
vk_entrypoints_gen: Rework ATTR_WEAK to unify Unix and MinGW
vk_entrypoints_gen: Apply hidden visibility to generated symbols
vk_cmd_queue_gen: Exclude CmdDispatchGraphAMDX
Thomas H.P. Andersen (2):
nvk: advertise EXT_depth_range_unrestricted
nvk/upload_queue: fix the _fill method
Tim Huang (2):
amd: add GFX v11.5.2 support
amd/vpelib: support VPE IP v6.1.3
Timothy Arceri (36):
glsl: wrap nir_opt_loop in NIR_PASS()
glsl: use hash table when serializing resource data
glsl: move geom input array sizing to nir linker
lima: drop unrequired opt from standalone compiler
glsl: remove unused detect_recursion_linked()
lima: remove the standalone compiler
glsl: add support for glsl es 310/320 to standalone compiler
nir: clarify and update loop conditional instruction
nir: more aggressively remove in loop during partial unroll
nir: support more loop unrolling for logical operators
nir: add merge loop terminators optimisation
nir: add test for opt_loop_merge_terminators
nir: correctly track current loop in nir_opt_loop()
nir: test opt_loop_merge_terminators() skips unhandled loops
nir: add additional opt_loop_merge() test of deref handling
glsl: drop dump-builder support from standalone compiler
glsl: remove Par-linking from the standalone linker
glsl: remove do_function_inlining()
glsl: make glsl_to_nir() more generic
glsl: remove unused symbol table functionality
glsl: remove out of date TODO
glsl: move call to create explicit ifc layout out of glsl_to_nir
glsl: drop glsl ir optimisation from the standalone compiler
glsl: make warning tests pass linking
glsl/mesa: remove UniformHash field
glsl/standalone: init EmptyUniformLocations
glsl/tests: fix test_gl_lower_mediump
mesa: remove _mesa_get_log_file() wrapper
util/mesa: move mesa/main log code to util
mesa: add unreachable to _mesa_shader_stage_to_subroutine_prefix()
glsl: set how_declared to hidden for compiler temps
glsl: fix cross validate globals
glsl: remove out of date comment
nir: set disallow_undef_to_nan for legacy ARB asm programs
glsl: fix glsl to nir support for lower precision builtins
glsl: always copy bindless sampler packing constructors to a temp
Timur Kristóf (25):
ac/nir/esgs: Slightly refactor emitting IO loads and stores.
ac/nir/tess: Slightly refactor emitting LS outputs.
ac/nir: Add helper macros for emitting IO code.
ac/nir/esgs: Implement packed 16-bit ES->GS I/O using helper macros.
ac/nir/tess: Implement packed 16-bit LS->HS I/O using helper macros.
ac/nir/tess: Implement packed 16-bit HS->TES I/O using helper macros.
aco: Add missing nir_builder include.
ac/nir: Move some helpers to new file.
ac/nir: Add helper for pre-rasterization output info.
ac/nir/ngg: Use new pre-rasterization output info helper.
ac/nir/legacy: Use new pre-rasterization output info helper.
nir: Add nir_opt_load_store_update_alignments.
radv: Add TES num_linked_patch_inputs.
radv: Add shader stats for inputs and outputs.
radv: Fix TCS -> TES I/O linking typo of VARYING_SLOT vs. BIT.
nir/opt_varyings: Print FS VEC4 type when debugging relocate_slot.
nir/opt_varyings: Don’t promote flat inputs when moving post-dominator.
ac/nir/tess: Adjust TCS->TES output mapping for linked shaders.
radv: Properly link TCS->TES IO again.
nir/lower_io: Add option to implement mediump as 32-bit.
radv: Ignore mediump IO flag.
ac/nir/tess: Only write tess factors that the TES reads.
ac/nir/tess: Fix per-patch output LDS mapping.
ac/nir/tess: Fix per-patch output VRAM mapping.
radv: Use number of TES inputs for TCS-TES linking.
Tomeu Vizoso (2):
etnaviv/nn: Make parallel jobs disabled by default
etnaviv: handle missing alu conversion opcodes
Turo Lamminen (1):
radv: Optimize memcpy in write_image_descriptor
Tvrtko Ursulin (1):
intel/hang_replay: fix batch address
Valentine Burley (40):
docs: Update VK_EXT_legacy_vertex_attributes entries
tu: Add missing VK_EXT_legacy_vertex_attributes feature
tu: Change commas to semicolons in VK_EXT_map_memory_placed features
drm-shim: Stub syncobj reset ioctl
tu: Expose VK_EXT_nested_command_buffer
freedreno/devices: Fix indentation for Adreno A32
freedreno/ci: Update expectations
wsi: Guard DRM-dependent function implementations with HAVE_LIBDRM
tu: Add support for VkBindMemoryStatusKHR
tu: Add support for NULL index buffer
tu: Add support for version 2 of all descriptor binding commands
tu: Advertise VK_KHR_maintenance6
tu: Move event related related code to tu_event.cc/h
tu: Handle all dependencies of CmdWaitEvents2
mr-label-maker: Update nouveau directories
mr-label-maker: Separate freedreno and turnip labels
tu: Handle the new sync2 flags
tu: Remove declaration of unused update_stencil_mask function
tu: Switch to vk_ycbcr_conversion
tu: Use vk_sampler
tu: Use device->vk.enabled_features instead of iterating twice
tu: Move sampler related code to tu_sampler.cc/h
tu: Drop tu_init_sampler helper function
tu: Advertise VK_KHR_shader_float_controls2
tu: Use the common version of vkGetBufferMemoryRequirements2
tu: Move buffer related code to tu_buffer.cc/h
tu: Use the common version of vkQueueBindSparse
tu: Use vk_buffer_view
tu: Drop tu_buffer_view_init helper function
tu: Move buffer view related code to tu_buffer_view.cc/h
tu: Rename tu_query.cc/h to tu_query_pool.cc/h
tu: Use the common versions of vkBegin/EndQuery()
tu: Use vk_query_pool
tu: Don’t disable 2 10-bit formats
freedreno,tu,ir3: Move threadsize_base and max_waves to fd_dev_info
freedreno/ci: Use the common a6xx-skips on a750
tu: Enable VK_KHR_shader_subgroup_uniform_control_flow
tu/kgsl: Remove unused variable
vulkan/wsi: Refactor can_present_on_device
tu: Always report that we can present on kgsl
Vignesh Raman (3):
virtio/ci: separate hiden jobs to -inc.yml files
ci: add farm variable for devices in collabora farm
ci/lava: add farm in structured log files
Vinson Lee (2):
panvk: Remove duplicate variable src_idx
panvk: Fix assert
Vlad Schiller (2):
pvr: Handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO
pvr: Handle VK_STRUCTURE_TYPE_IMAGE_FORMAT_LIST_CREATE_INFO
WANG Xuerui (2):
meson: Force use of LLVM ORCJIT for hosts without MCJIT support
meson: Additionally probe -mtls-dialect=desc for TLSDESC support
Weifeng Liu (1):
anv/anroid: Query gralloc for tiling mode
X512 (2):
egl/haiku: fix double free of BBitmap
egl/haiku: fix synchronization problems, add missing header
Yiwei Zhang (31):
venus: avoid client allocators for ring internals
venus: silence a stack array false alarm
venus: workaround excessive dma-buf import failure on turnip
venus: fix to destroy all pipeline handles on early error paths
meson: disallow Venus debug + LTO build via GCC
turnip: msm: clean up iova on error path
turnip: msm: fix racy gem close for re-imported dma-buf
venus: drop the workaround for excessive dma-buf import oom on turnip
turnip: virtio: fix error path in virtio_bo_init
turnip: virtio: fix iova leak upon found already imported dmabuf
turnip: virtio: fix racy gem close for re-imported dma-buf
vulkan: cast to avoid -Wswitch for Android struct beyond VkStructureType
venus: directly use vk drm and pci props in renderer info
venus: move custom props fill from GPDP2 to props init
venus: move props sanitization to a separate helper
venus: define VN_SET_VK_PROPS(_EXT) to simplify vk props init
vulkan: drop redundant core props query and copy helpers
venus: drop internal memory pools
venus: allow non-wsi image alias path to passthrough upon bind memory
ci/venus: skip a timeout test
anv: use os_get_option instead of getenv
venus: defer qfb buffer init upon query being used
venus: refactor vn_android_image_from_anb
venus: refactor to add vn_android_image_from_anb_internal
venus: support VK_ANDROID_NATIVE_BUFFER_SPEC_VERSION 8
vulkan: properly ignore unsupported feature structs
venus: tentative fix for test flakiness from invalid ring wait
venus: simplify cached mem type emulation
venus: clarify wsi image ownership
venus: fix a race condition between gem close and gem handle tracking
Revert “meson: disallow Venus debug + LTO build via GCC”
Yogesh Mohan Marimuthu (4):
radeonsi: remove si_query_hw_ops table and call func directly
radeonsi: use reseults_end instead of unprepared to init query buffer
radeonsi: rename query_hw_ops to hw_query_ops match sw
radeonsi: add more comments in si_query.c
Yonggang Luo (2):
util: Rename DETECT_OS_UNIX to DETECT_OS_POSIX
gallivm: add lp_context_ref for combine usage of LLVMContextSetOpaquePointers
Yukari Chiba (7):
llvmpipe: add gallivm_add_global_mapping
llvmpipe: make unnamed global have internal linkage
util: detect RISC-V architecture
gallivm: add riscv support to the mattrs setting code
llvmpipe: add function name to gallivm_jit_function
llvmpipe/tests: add a new test for multiple symbols for orc jit testing
llvmpipe: add an implementation with llvm orcjit
Yusuf Khan (7):
nouveau: Fix crash when destination or source screen fences are null
nouveau/headers: Make nvk_cl**** turn to nv_push_cl****
nvk: remove NVK_MME_COPY_QUERIES
zink/query: begin time elapsed queries even if we arent in a rp
nvc0/vbo: wrap draw_vbo for multidraw performance
nv50/vbo: wrap draw_vbo to avoid ovehead from multidraw
aux/draw: Use the draw info we get passed in instead of our own
Zach Battleman (2):
intel/brw: update comment to accurately reflect intended behavior
intel/brw: update Wa_1805992985 to use workarounds mechanism
Zack Middleton (2):
gles1: fix GL_OES_vertex_array_object
gles1: fix glBufferSubData()
Zan Dobersek (14):
fdperf: use snprintf instead of asprintf
fdperf: select_counter() should work with a countable value
fdperf: prettify logic around the reserved CP counter
fdperf: improve reads of counter values
fdperf: simplify counter value output
freedreno: add a7xx perfcounter support
tu: fix ZPASS_DONE interference between occlusion queries and autotuner
tu: avoid memory polling in occlusion query endings using ZPASS_DONE
tu: use either the 16-bit or 32-bit descriptor
ir3_nir_opt_preamble: handle 8-bit preamble loads and stores
ir3: rework TYPE_S8 as TYPE_U8_32
tu: support KHR_8bit_storage
tu: add format feature flag checks for VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT
freedreno/drm: add mesautil dependency
bbhtt (1):
nvk: Clean up unused header from libdrm_nouveau
chiachih (9):
amd/vpelib: Resolve mismatch with shader
amd/vpelib: Remove linear_0_125 TF
amd/vpelib: Remove gamma cached table
amd/vpelib: Remove support for non-linear FP16
amd/vpelib: adding blend gamma bypass
amd/vpelib: Remove checks for pitch alignment
amd/vpelib: Fix Color Adjustment Failing Test Cases
amd/vpelib: Fix blndgam bypass flag assignment
amd/vpelib: Bypass de/regam on HLG
msizanoen (1):
egl/wayland: Fix direct scanout with EGL_EXT_present_opaque
nyanmisaka (1):
frontends/va: add support for A2RGB10/X2RGB10/A2BGR10/X2BGR10
tarsin (4):
turnip: Change tu_image to use common initialization helpers
turnip: Convert tu_device_memory to use vk_device_memory
turnip: Split tu_image_init to use layout setting logic separately
turnip: Support AHardwareBuffer