Mesa 23.3.0 Release Notes / 2023-11-29¶
Mesa 23.3.0 is a new development release. People who are concerned with stability and reliability should stick with a previous release or wait for Mesa 23.3.1.
Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. Some drivers don’t support all the features required in OpenGL 4.6. OpenGL 4.6 is only available if requested at context creation. Compatibility contexts may report a lower version depending on each driver.
Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by the apiVersion property of the VkPhysicalDeviceProperties struct depends on the particular driver being used.
SHA256 checksum¶
50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5 mesa-23.3.0.tar.xz
New drivers¶
NVK: A Vulkan driver for Nvidia hardware
New features¶
VK_EXT_pipeline_robustness on ANV
VK_KHR_maintenance5 on RADV
OpenGL ES 3.1 on Asahi
GL_ARB_compute_shader on Asahi
GL_ARB_shader_atomic_counters on Asahi
GL_ARB_shader_image_load_store on Asahi
GL_ARB_shader_image_size on Asahi
GL_ARB_shader_storage_buffer_object on Asahi
GL_ARB_sample_shading on Asahi
GL_OES_sample_variables on Asahi
GL_OES_shader_multisample_interpolation on Asahi
GL_OES_gpu_shader5 on Asahi
EGL_ANDROID_blob_cache works when disk caching is disabled
VK_KHR_cooperative_matrix on RADV/GFX11+
Bug fixes¶
crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1)
mesa: vertex attrib regression
[RADV] War Thunder has some grass flickering.
radv: satisfactory broken shader
RADV problem with R7 M440 in some games
gpu driver crashes when opening ingame map playing dead space 2023
[anv] Valheim water misrendering
EGL/v3d: EGL applications under a X compositor doesn’t work
RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games
radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled
anv: DIRT5 gfx11_generated_draws_spv_source triggers “assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));”
panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers
[RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox.
LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380
[radv] Crypt not rendering properly
Leaks of DescriptorSet debug names
[Tracing flake] Missing geometry in trace@freedreno-a630@freedoom@freedoom-phase2-gl-high.trace
Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates
RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2
radv: Regression with UE5 test
SIGSEGV with MESA_VK_TRACE=rgp and compute only queue
[ANV] Corruptions in Battlefield 4
anv regression w/ commit e488773b29d97 (“anv: Fast clear depth/stencil surface in vkCmdClearAttachments”)
ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4
Zink + Venus: driver can’t handle INVALID<->LINEAR!
Anv: Particles have black square artifacts on Counter Strike 2 on Skylake
Lords of the Fallen 2023 Red Eye mode crashing game and desktop
[radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer
[BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode
Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24
H264 to H264 transcode output corruption with gst-vaapi
opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover
[R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970
Ubuntu 23.10 build error with rusticl_opencl_bindings.rs
Rusticl fails to build
ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range
tu: Wolfenstein: The New Order misrenders on a740
DRI_PRIME fails with ACO only radeonsi
nir_to_tgsi: Incorrect handling of indirect array access
ANV gen9 32 bit vulkan asserts on many cts tests
GPU hang observed while launching 3DMark Wildlife Unlimited on MTL
ac/gpu_info: Query maximum submitted IBs from the kernel
RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games
GPU page faults reported while playing Talos Principle 2 (demo)
No CCS_E scanout on tgl+ with ANV
anv: Modifier tests assert-fail on TGL+
ci: zink-tu jobs no longer included in manual pipelines
[ANV][A770] GravityMark segfaults and buffer allocation errors
etnaviv: gc2000 gles2 regression
ci_run_n_monitor: pipeline finding unreliable
nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
anv: jsl timeline semaphores flaky
anv: OOB access in vkDestroyDevice?
nvk: Implement VK_EXT_primitive_topology_list_restart
nvk: Implement VK_EXT_image_sliced_view_of_3d
nvk: Implement VK_KHR_workgroup_memory_explicit_layout
util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit
r300/r400 regression; can’t compile `if/then` in shaders
iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers
nvk: Implement VK_EXT_depth_bias_control
ICL/zink: gpu hang on ‘piglit.object namespace pollution.framebuffer with gldrawpixels’
[R600] Wolfenstein: The New Order text glitch on menu
need extension to request image/texture not use data dependent compression
rusticl: segfault in clCreateKernel on AMD Instinct MI100
!25587 broke xserver
GPU Hang in Deep Rock Galactic on DG2
intel: Wrong length for 3DSTATE_3D_MODE on gfx125
[radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO
traces job reference images missing again sometimes
Vulkan Texture/Polygon Glitches in Games
freedreno: dmabuf modify query ignores format
virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer
Turnip build error on termux
failiure in amd llvm helper
failiure in amd llvm helper
radv_amdgpu_cs_submit: Assertion `chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type’ failed.
hasvk: subgroups regression
radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected)
aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes
anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow
[RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x
[ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression
radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect
Doom Eternal freezing on NAVI31 with current git
iris CTS blend test fail with MSAA config on DG2
anv: 32bit mesa asserts
RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics
anv: missing barrier handling on video engines
radv: Star Wars The Old Republic hang when DCC is enabled
radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings
radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen
ANV: fp64 shader leaked
v3d: noop drm-shim raises some warnings
freedreno: crashdec/etc chip_id support
intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead
ci build issues with builtin types
freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes
RadeonSI: glClear() causes clear texture for some frames on RX580
radeonsi: corruption when seeking video decoded with vaapi in mpv
Zink/HasVK regression bisected to “gallium: move vertex stride to CSO”
[radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete.
radv: images don’t always have extents in RGP
shader_test causing a crash in compiler
D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required.
Crash in st_ReadPixels
[regression] intel build issue on i386
[ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
zink on hasvk regression: Assertion `(dyn)->vi_binding_strides[first_binding + i] == (strides[i])’ failed.
Penumbra: Overture hangs on new game loading screen
[r300, RV516] Some deqp-gles2@performance@shader@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550
v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it
radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
h264 encoding picture showed randomly repeated frames.
Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1
zink: no uniform buffer objects support for v3dv?
v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support
[ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry
[Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
VAAPI on VCN: bad stream may crash whole gfx system
Crash after GPU reset
Bifrost PanVK should not be in CI
[Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory
mesa: new glcts fails
tu: GPL support is broken
lavapipe: ycbcr regression
aco: Assertion when compiling CP2077 shader
anv: flakiness on tgl+ with samplemask handling
[RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
r300: optionally convert MULs into output modifier for the following MUL or DOT instructions
r300: better 1-x presubtract pattern matching
gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
wsi: deadlocks when DISPLAY is changed
hasvk: Incompatible with minigbm/gralloc4 on Android
VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
lavapipe/llvmpipe: shader unregister crash
[ANV] [DG2/A380] Corruption in Borderlands 3
blorp regression on dg2
decouple -Dshader-cache= from EGL_ANDROID_blob_cache
radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
[radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
ubsan + gtest build fails
glCopyTexSubImage2D is very slow on Intel
NVE4 (GeForce 710) fails to get vdpau in mesa git
[RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
nouveau prevents hardware acceleration with Chromium (Wayland)
Corrupt text rendering in Blender
DRI2 gallium frontend is using bad format type
regression - MR 23089 - Hellblade RT crashing
Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
Issue with clang-format
Follow-up from “Draft: intel: Disable color fast-clears for blorp_copy”
nightly VA-API build: new timeout
r600: retire the SB optimizer
ci: do not download perfetto on-fly in build jobs
Shared Memory Leak With Qt OpenGL Applications
OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
975a8ecc881873744d851ab0ef45ad7698eaa0ef “frontends/va: use resources instead of views” cause radeonsi can’t play video.
zink: reduce pipeline hash size
Rusticl,radeonsi: ac_rtld error(2): too much LDS
aco, radv Rage 2 menu corruption - bisected
radv, aco: World War Z character texture regression on 7900xtx
android: De-stage drm_gralloc support from mesa3d
Cyberpunk screen goes black at game launch on integrated Gfx
lavapipe/llvmpipe: regressions since descriptor rewrite
intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
ci: HW job logs have spam at the end
kernel crash seen on AMD Raven device
crocus: regression crashing in doubles/ubo tests
turnip: object management CTS crashes
a618: multiple assertions with different kernel config on u_vector_add
[anv] Death Stranding crashes
Can no longer build Clover without llvmspirvlib
[radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format
Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
[AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow
Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
r300: calculate some cycles estimate for shader-db
intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
error message when encoding via VAAPI AMD
gpu hangs on dg2 with mesh shading enabled on vkcts
radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
robustness2 raygen tests intermittently fail in Intel Mesa CI
ci/ci_run_n_monitor.py: KeyError: ‘clang-format’
glthread: huge performance regression
DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
rusticl: fails to build for iris + radeonsi
Changes¶
Adam Jackson (3):
egl: Implement EGL_EXT_explicit_device
mesa: Implement and advertise GL_MESA_sampler_objects
docs: Mention ‘meson devenv’ in the pre-install test instructions
Aditya Swarup (6):
isl: enable Tile64 for 3D images
intel/isl: Unittest for linear to Ytile conversion
intel/isl: Convert linear texture to Tile4 format
intel/isl: Convert Tile4 texture to linear format
intel/isl: Linear to Tile-4 conversion unittest
Revert “iris: Disable tiled memcpy for Tile4”
Alba Mendez (1):
meson: support installation tags
Alejandro Piñeiro (61):
v3dv: re-enable sync_fd import/export on the simulator
broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build
broadcom/cle: update the packet definitions for new generation v71
broadcom/common: add some common v71 helpers
broadcom/qpu: add comments on waddr not used on V3D 7.x
broadcom/qpu: set V3D 7.x names for some waddr aliasing
broadcom/compiler: rename small_imm to small_imm_b
broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig
broadcom/qpu: add v71 signal map
broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr
broadcom/qpu: add raddr on v3d_qpu_input
broadcom/qpu: defining shift/mask for raddr_c/d
broadcom/commmon: add has_accumulators field on v3d_device_info
broadcom/qpu: add qpu_writes_rf0_implicitly helper
broadcom/qpu: add pack/unpack support for v71
broadcom/compiler: phys index depends on hw version
broadcom/compiler: don’t favor/select accum registers for hw not supporting it
broadcom/vir: implement is_no_op_mov for v71
broadcom/compiler: update vir_to_qpu::set_src for v71
broadcom/qpu_schedule: add process_raddr_deps
broadcom/qpu: update disasm_raddr for v71
broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71
broadcom/compiler: add support for varyings on nir to vir generation for v71
broadcom/compiler: payload_w is loaded on rf3 for v71
broadcom/qpu_schedule: update write deps for v71
broadcom/compiler: update register classes to not include accumulators on v71
broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71
broadcom/compiler: update one TMUWT restriction for v71
broadcom/compiler: update ldunif/ldvary comment for v71
broadcom/compiler: update payload registers handling when computing live intervals
broadcom/qpu: new packing/conversion v71 instructions
v3dv/meson: add v71 hw generation
v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71
v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71
v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71
v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
v3dv/cmd_buffer: just don’t fill up early-z fields for CFG_BITS for v71
v3dv: default vertex attribute values are gen dependant
v3dv/pipeline: default vertex attributes values are not needed for v71
v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71
v3dv: no specific separate_segments flag for V3D 7.1
v3dv: add support for TFU jobs in v71
v3d: add v71 hw generation
v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1
v3d: emit CLEAR_RENDER_TARGETS for v71
v3d: just don’t fill up early-z fields for CFG_BITS for v71
v3d: emit CLIPPER_XY_SCALING for v71
v3d: no specific separate_segments flag for V3D 7.1
v3d: default vertex attributes values are not needed for v71
v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
v3d: handle new texture state transfer functions in v71
v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields
v3d: setup render pass color clears for any format bpp in v71
v3d: GFX-1461 does not affect V3D 7.x
v3d: don’t convert floating point border colors in v71
v3d: handle Z clipping in v71
v3d: add support for TFU blit in v71
v3dv: implement depthBounds support for v71
doc/features: update after last v3d changes
Alex Denes (1):
virgl: link VA driver with build-id
Alexander Orzechowski (1):
radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts
Alyssa Rosenzweig (431):
zink: Switch to register intrinsics
gallium/trace: Collect enums from multiple files
gallium,util: Move blend enums to util/
gallium,util: Move util_blend_dst_alpha_to_one
util/blend: Add helpers for normalizing inverts
vulkan: Add helpers for blend enum translation
lvp: Use common blend/logicop translation
nir/lower_blend: Use util enums
panfrost: Convert to PIPE_BLEND enums internally
gallium: Remove pipe->compiler BLEND enum translation
compiler: Remove blend enums duplicating util
nir/legacy: Fix fneg(load_reg) case
nir/legacy: Fix handling of fsat(fabs)
ntt: Switch to new-style registers and modifiers
ir3: Convert to register intrinsics
nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics
nir: Devendor load_sample_mask
nir: Promote tess_coord_r600 to tess_coord_xy
nir: Add nir_lower_tess_coord_z pass
r600: Use nir_lower_tess_coord_xy
ir3: Use nir_lower_tess_coord_z
nir: Initialize workgroup_size in builder_init_simple_shader
v3dv: Rely on nir_builder setting workgroup size
radv: Rely on workgroup_size initialization
panfrost: Fix transform feedback on v9
r600/sfn: Remove nir_register unit tests
panfrost: Lower vertex_id for XFB
panfrost: Fix transform feedback on v9 harder
asahi: Augment fake drm_asahi_params_global
asahi: Use nir_builder_at more
asahi: Remove unused #define
asahi: Refactor PBE upload routine
asahi: Extract shader_initialize helper
asahi: Serialize NIR in memory
asahi: Identify background/EOT counts
asahi,agx: Set coherency bit for clustered targets
ail: Page-align layers for writable images
asahi: Mark writeable images as such
asahi: Reallocate to set the writeable image flag
asahi: Add agx_batch_track_image helper
asahi: Add texture/image indexing lowering pass
asahi: Upload at most the max texture state registers
asahi: Upload image descriptors
asahi: Make clear the non-sRGBness of EOT images
asahi: Don’t restrict sampler views
asahi: Forbid 2D Linear with images
agx: Add try_coalesce_with helper
agx: Try to allocate phis compatibly with sources
agx: Try to allocate phi sources with phis
agx: Try to allocate phi sources with loop phis
agx: Vectorize 16-bit parallel copies
agx: Reduce un/packs with mem access lowering
agx: Fix bogus assert
asahi: Augment PBE descriptor for software access
asahi: Extend PBE packing for image support
asahi: Use nir_lower_robust_access
agx: Legalize image LODs to be 16-bit
agx: Lower image size to txs
agx: Generalize texture/PBE packing
agx: Add image write instruction
agx: Model texture bindless base
agx: Handle bindless properly for txs lowering
agx: Pack bindless textures
agx: Translate texture bindless handles
agx: Translate image_store from NIR
agx: Handle frag side effects without render targets
agx: Wait for outstanding stores before barriers
agx: Implement image barriers
agx: Handle early_fragment_tests
agx: Add interleave opcode
agx: Extract coords_for_buffer_texture helper
agx: Extract texture_descriptor_ptr_for_* helpers
agx: Lower image atomics
agx: Lower buffer images
asahi,agx: Fix txf sampler
agx: Add image_load opcode
agx: Extract texture write mask handling
agx: Implement image_load
agx: Emit global memory barriers for images
agx: Don’t emit silly barriers
agx: Implement fence_*_to_tex_agx intrinsics
agx: Add simple image fencing pass
agx: Require tag writes with side effects
agx: Plumb in coverage mask
asahi: Extract sampler_view_for_surface
asahi: Introduce concept of spilled render targets
asahi: Add agx_tilebuffer_spills query
asahi: Do not support masking with spilled RTs
asahi: Ignore spilled render targets in EOT shaders
asahi: Ignore spilled render targets with partial renders
asahi: Extract some tilebuffer lowering code
asahi: Lower tilebuffer access for spilled RTs
asahi: Lower multisample image stores
asahi: Permit meta shaders to use preambles
asahi: Ignore spilled render targets for background load
asahi: Offset clear colour uniform by 4
asahi: Execute preambles for background programs
asahi: Advertise Z16_UNORM
ir2: Switch to nir_legacy
intel/fs: Don’t read reg.base_offset
panfrost: Remove unused helpers
nir: Remove nir_lower_locals_to_regs
nir: Rename lower_locals_to_reg_intrinsics back
nir: Remove register arrays
asahi: Don’t depend on glibc to decode
pan/bi: Remove leftover include
nir/trivialize: Handle more RaW hazards
panfrost: Disable blending for no-op logic ops
nir/lower_blend: Fix 32-bit logicops
nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP
clang-format: Ignore original panfrost commit
nir/schedule: Assume no old-style registers
gallium/u_simple_shaders: Optimize out ffloors
gallium/u_transfer_helper: Remove dead forward decl
nir/loop_analyze: Drop unused inverse_comparison
nir/passthrough_gs: Drop unused array_size_for_prim
panfrost: Add missing static inline annotation
pan/decode: Drop unused debug function
pan/mdg: Add missing static inline annotation
panfrost: Drop unused decode_position for samples
panfrost: Only define pan_blitter_get_blend_shaders for midgard
panfrost: Add missing inline
panfrost: Gate overdraw_alpha on Bifrost+
nir: Rename scoped_barrier -> barrier
nir: Remove lower_to_source_mods
nir: Remove lower_vec_to_movs
nir: Remove reg_intrinsics parameter to convert_from_ssa
nir: Remove register load/store builders
r600/sfn: Stop referencing legacy functionality
r600/sfn: Ignore instruction write masks
nouveau/codegen: Drop writemask check
vc4,broadcom/compiler: Drop write_mask handling
zink: Collapse is_ssa check
nir: Add {…} before case
nir/from_ssa: Drop legacy reg support
nir/schedule: Drop nir_schedule_dest_pressure
nir: Drop NIR reg create/destroy
nir: Remove nir_index_local_regs and callers
nir/schedule: Drop more nir_register handling
nir: Remove nir_foreach_register
nir: remove nir_{src,dest}_for_reg
ntt: Drop nir_register reference
nir/print: Assume SSA
nir/clone: Assume SSA
nir/serialize: Drop legacy NIR
nir/validate: Assume SSA
nir: Remove impl->{registers,reg_alloc}
nir: Remove nir_alu_dest::saturate
treewide: Drop is_ssa asserts
nir: Collapse some SSA checks
treewide: Remove more is_ssa asserts
nir: Remove reg-only dest manipulation
nir: Remove stale todo
nir/print: Drop legacy NIR
nir: Drop nir_alu_src::{negate,abs}
treewide: sed out more is_ssa
pan/mdg: Assume SSA
treewide: Drop some is_ssa if’s
nir: Drop trivial reg handling
aco: Remove is_ssa check
intel: Collapse is_ssa checks
llvmpipe: Assume SSA
ir3: Collapse is_ssa checks
lima: Collapse is_ssa checks
radeonsi: Collapse SSA check
nir/gather_ssa_types: Collapse SSA checks
nir/worklist: Assume SSA
nir/range_analysis: Assume SSA
treewide: Collapse more SSA checks
nir/instr_set: Assume SSA
nir: Collapse more SSA checks
nir: Remove def_is_register
nir: Do not init dests
nir: Initialize source as a NULL SSA def
nir: Collapse more SSA checks
nir: Remove nir_{src,dest}::is_ssa
nir: Drop nir_register
nir/from_ssa: Remove pointless union
ir3: Drop write_mask handling
rogue: Stop reading write masks
etnaviv: Don’t use alu->dest.write_mask
etnaviv: What if we just didn’t have a compiler?
intel/vec4: Don’t use legacy write mask
ntt: Evaluate write_mask check
nir: Remove nir_alu_dest::write_mask
nir: Remove nir_foreach_def
lima: Clean up after deleting asserts
nir: Remove no-op remove_def_cb
nir: Drop no-op all_srcs_are_ssa
nir: Simplify alu_instr_is_copy
nir: Add load_coefficients_agx intrinsic
agx: Implement nir_intrinsic_load_coefficients_agx
agx: Allow more varying slots
agx: Set lower_fisnormal
agx: Forcibly vectorize pointcoord coeffs
agx: Add interpolateAtOffset lowering pass
agx: Lower flat shading in NIR
asahi: Stub num_dies
asahi: Move a bunch of helpers to common
agx: Lower 8-bit ALU
agx: Handle 8-bit vecs
asahi,agx: Respect no16 even for I/O
agx: Don’t lower load_local_invocation_index
agx/dce: Use the helper
agx: Fix atomics with no destination
agx: Fix shader info with sample mask writes
agx: Do not move bindless handles
agx: Put else instructions in the right block
agx: Use unconditional else instruction
agx: Optimize out pointless else instructions
agx: Fix length bit confusion
agx: Require an immediate for `nest`
agx: Use compressed fadd/fmul encodings
agx: Optimize swaps of 2x16 channels
agx: Optimize logical_end removal
agx: Fix AGX_MESA_DEBUG=demand
agx: Maintain ctx->max_reg while assigning regs
agx: Allow 64-bit memory regs
agx: Fix accounting for phis
agx: Set phi sources in predecessors
agx: Stop setting registers after the shader
agx: Use agx_replace_src
agx: Assert invariant stated in the comment
agx: Don’t use ssa_to_reg across blocks
agx: Don’t reuse ssa_to_reg across blocks
agx: Remove unused allocation
agx: Stop setting forwarding bit
agx: Handle blocks with no predecessors
agx: Lower f2u8/f2i8
agx: Handle conversions to 8-bit
agx: Fix uadd_sat packing
agx: Fix 64-bit immediate moves
agx: Lower f2f16_rtz
agx: Handle f2f16_rtne like f2f16
agx: Handle <32-bit local memory access
agx: Do not allow creating vec8
asahi: Legalize compression before blitting
nir: Drop “SSA” from NIR language
agx: Stop passing nir_dest around
agx: Remove agx_nir_ssa_index
pan/mdg: Don’t reference nir_dest
pan/bi: Don’t reference nir_dest
asahi: Do not reference nir_dest
panfrost: Do not reference nir_dest
zink: Do not reference nir_dest
ir3: Do not reference nir_dest
dxil: Do not reference nir_dest
nir: Drop nir_dest_init
panfrost: Pack stride at CSO create time on v9
lvp,nir/lower_input_attachments: Use nir_trim_vector
broadcom/compiler: Use nir_trim_vector explicitly
nir: Assert that nir_ssa_for_src components matches
nir: Add nir_shader_intrinsics_pass
nir: Lower fquantize2f16
agx: Lower fquantize2f16
nir/lower_helper_writes: Consider bindless images
nir/passthrough_gs: Correctly set vertices_in
nir/passthrough_gs: Fix array size
nir/print: Print access qualifiers for intrinsics
nir/lower_gs_intrinsics: Remove end primitive for points
panfrost/ci: Disable T720
nir: Add load_sysval_agx intrinsic
agx: Fix extraneous bits with b2b32
agx: Use more barriers
asahi: Copy CSO stride
agx: Assert vertex_id, instance_id are VS-only
asahi: Keep drawoverhead from OOMing itself
agx: Don’t blow up when lowering textures twice
agx/lower_vbo: Handle nonzero component
agx: Allow loop headers without later preds
agx: Handle b2i8
agx: Convert 8-bit comparisons
agx: Implement imul_high
asahi: Advertise OpenGL ES 3.1!
asahi/decode: Turn assert into error
asahi: Report local_size from compiler
asahi: Use local_size from compiler directly
asahi: Pass layer stride in pixels, not elements
agx: Clear sample count after lowering MSAA
agx: Clear image_array after lowering
asahi: Preserve atomic ops when rewriting image to bindless
agx: Use 16-bit reg for pixel_coord
asahi: Generalize query logic
asahi: Simplify occlusion query batch tracking
asahi: Refactor agx_get_query_result
asahi: Only touch batch->occlusion_queries for occlusion
asahi: Sync when beginning a query
asahi: Add non-occlusion query tracking
asahi: Add get_query_address helper
agx/fence_images: Use intrinsics_pass
agx: Do not fence write-only images
asahi: Add missing LOD source for agx_meta’s txfs
agx: Do some texture lowering early
agx: Add helper returning if a descriptor crawl is needed
nir,asahi: Remove texture_base_agx
asahi: Move UBO lowering into GL driver
asahi: Add sysval tables for each shader stage
asahi: Split out per-stage sysvals
asahi: Collapse grid_info
asahi: Extract agx_upload_textures
asahi: Upload a single draw_uniforms per draw
asahi: Add real per-stage dirty flags
asahi: Extract sampler upload
asahi: Put unuploaded uniforms on the batch
asahi: Decouple sysval lowering from uniform assignment
asahi: Use finer dirty tracking for blend constant
asahi: Use proper dirty tracking for VBOs
asahi: Dirty track VBOs + blend const separately
asahi: Dirty the shader stage when the shader changes
asahi: Fix shader stage dirtying
treewide: Use nir_shader_intrinsic_pass sometimes
treewide: Also handle struct nir_builder form
nir/lower_shader_calls: Fix warning with clang
nir: Add nir_before/after_impl cursors
treewide: Use nir_before/after_impl in easy cases
treewide: Use nir_before/after_impl for more elaborate cases
radv: Use before/after_cf_list for entrypoints
ci: Disable known broken Bifrost Vulkan job
ci: Disable WHL jobs
nir/opt_if: Simplify if’s with general conditions
asahi: Fixes for clang-warnings
agx: Fix jmp_exec_none encoding
agx/validate: Print to stderr
agx: Annotate opcodes with a scheduling class
agx: Add schedule-specialized get_sr variants
agx: Include schedule class in the opcode info
agx: Schedule for register pressure
agx: Lower pack_32_4x8_split
asahi: Force translucency for ignored render targets
agx: Remove logical_end instructions
agx: Lower pseudo-ops later
agx: Expand nest
agx: Lower nest later
agx: Split nest instruction into begin_cf + break
agx: Add break_if_*cmp instructions
agx: Add agx_first/last_instr helpers
agx: Use agx_first_instr
agx: Detect conditional breaks
agx: Omit push_exec at top level
agx: Omit while_icmp without continue
agx: Add helper to determine if a NIR loop uses continue
agx: Only use nest by 1 for loops w/o continue
agx: Add pseudo-instructions for icmp/fcmp
agx: Generate unfused comparison pseudo ops
agx: Fuse conditions into if’s
agx: Fuse compares into selects
agx: Add unit test for if_cmp fusing
agx: Add unit test for cmp+sel fusing
asahi: Translate cube array dimension
ail: Force page-alignment for layered attachments
agx: Handle cube arrays when clamping arrays
agx: Lower coordinates for cube map array images
agx: Run opt_idiv_const after lowering texture
asahi: Forbid linear 1D Array images
asahi: Handle linear 1D Arrays
asahi: Conditionally expose cube arrays
gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag
asahi: Skip LOD bias lowering for GLES
nir: Add nir_function_instructions_pass helper
nir: Add NIR_OP_IS_DERIVATIVE property
nir: Hoist nir_op_is_derivative
nir/opt_preamble: Use nir_op_is_derivative
nir/opt_gcm: Use nir_op_is_derivative more
nir/gather_info: Use nir_op_is_derivative
nir/opt_sink: Sink load_constant_agx
nir/opt_sink: Sink load_local_pixel_agx
nir/opt_sink: Sink frag coord instructions
nir/opt_sink: Do not move derivatives
nir/opt_sink: Move ALU with constant sources
nir/opt_sink: Also consider load_preamble as const
agx: Enable sinking ALU
treewide: Drop nir_ssa_for_src users
treewide: Remove remaining nir_ssa_for_src
nir: Remove nir_ssa_for_src
asahi: Clamp index buffer extent to what’s read
agx: Align the reg file for 256-bit vectors
agx: Hoist sample_mask/zs_emit
agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED
agx: Augment if/else/while_cmp with a target
agx: Add jumps to block ends
agx: Add agx_prev_block helper
agx: Insert jmp_exec_none instructions
nir: Add layer_id_written_agx sysval
nir: Support arrays in block_image_store_agx
agx/nir_lower_texture: Allow disabling layer clamping
agx: Pack block image store dim correctly
agx: Handle layered block image stores
agx: Add pass to lower layer ID writes
asahi: Add helper to get layer id in internal program
asahi,agx: Select layered rendering outputs
agx: Support packed layered rendering writes
agx/tilebuffer: Support layered layouts
agx/lower_tilebuffer: Support spilled layered RTs
asahi: Use layered layouts
asahi: Expose VS_LAYER_VIEWPORT behind a flag
asahi: Account for layering for attachment views
asahi: Assume LAYER is flat-shaded
asahi: Add pass to predicate layer ID reads
asahi: Predicate layer ID reads
asahi: Write to cubes/etc attachments as 2D array
asahi: Use a 2D Array texture for array render targets
asahi: Generate layered EOT programs
asahi: Handle layered background programs
lima/pp: Do not use union undefined behaviour
nir: Add trivial nir_src_* getters
nir: Use set_parent_instr internally
nir: Use getters for nir_src::parent_*
nir: Assert the nir_src union is used safely
nir: Use a tagged pointer for nir_src parents
nir: Add ACCESS_CAN_SPECULATE
ir3: Set CAN_SPECULATE before opt_preamble
ir3: Model cost of phi nodes for opt_preamble
nir/opt_preamble: Walk cf_list manually
nir/opt_preamble: Preserve IR when replacing phis
nir/opt_preamble: Unify foreach_use logic
nir/opt_preamble: Move phis for movable if’s
nir/opt_preamble: Respect ACCESS_CAN_SPECULATE
freedreno/ci: Minetest
r600/sfn: Handle load_global_constant
nir/opt_phi_precision: Work with libraries
nir/legalize_16bit_sampler_srcs: Use instr_pass
nir/print: Handle KERNEL
nir/lower_io: Use load_global_constant for OpenCL
nir/opt_algebraic: Reduce int64
nir/opt_algebraic: Optimize LLVM booleans
nir/trivialize_registers: Handle obscure load hazard
hasvk: Support builiding on non-Intel
crocus: Support building on non-Intel
meson: Add vulkan-drivers=all option
meson: Add gallium-drivers=all option
agx: Fix fragment side effects scheduling
Amber (7):
ir3: make wave_granularity configurable
turnip: Add support for devices not supporting double thread size.
turnip: make sampler_minmax support configurable.
freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low
ir3: handle non-uniform case for atomic image/ssbo intrinsics
freedreno: Add support for devices not supporting double thread size.
turnip: Add debug option to allow non-conforming features.
Andrew Randrianasulu (1):
nv50/ir: Remove few nvc0 specific defines from nv50-specific header.
Antonio Gomes (9):
rusticl/kernel: Removing unnecessary clone in kernel launch
rusticl/kernel: Add CsoWrapper
rusticl/compiler: Add NirPrintfInfo
rusticl: Move Cso to Program
rusticl/compiler: Remove unnecessary functions
rusticl: Move NirKernelBuild to ProgramDevBuild
rusticl/program: New helper functions to NirKernelBuild
rusticl/core: Delete KernelDevState and KernelDevStateInner
rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader)
Asahi Lina (29):
docs/tgsi: Specify that depth texture fetches are replicated
asahi: Add synctvb debug flag
asahi: Add smalltile debug option
asahi: Add nomsaa debug flag
asahi: decode: Add a params argument to pass through
asahi: Add extra CDM header block for G14X
asahi: wrap: Handle freeing shmems
asahi: decode: Refactor to always copy GPU mem to local buffers
asahi: decode: Add a function to construct decode_params from a chip_id
asahi: Add a shared library interface for decode
asahi: Add a noshadow debug flag
asahi: Do not overallocate BOs by more than 2x
asahi: Fix race in BO stats accounting
asahi: Always use resource size, not BO size
asahi: Print info about shadowed resources
asahi: Impose limits on resource shadowing
asahi: Force linear for SHARED buffers with no/implicit modifier
asahi: Enable explicit coherency for G14D (multi-die)
asahi: Handle non-written RTs correctly
asahi: Fix incorrect BO bitmap reallocations
asahi: Allocate staging resources as staging
asahi: cmdbuf: Identify call/ret bits
asahi: decode: Implement VDM call/ret
asahi: decode: Do not assert on buffer overruns
asahi: Fix VDM pipeline field width
asahi: Add scaffolding for supporting driconf options
asahi: Add and support the no_fp16 driconf flag
driconf: Disable fp16 for browsers
asahi: Allow no16 flag for disk cache
Bas Nieuwenhuizen (16):
aco: fix nir_op_vec8/16 with 16-bit elements.
aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
nir: Fix 16-component nir_replicate.
radv: Expose VK_EXT_external_memory_acquire_unmodified.
util/perf: Add gpuvis integration.
egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto.
vulkan: Add CPU tracing for vkWaitForFences.
docs: Add documentation for gpuvis.
vulkan: Add trace points for more Vulkan waiting functions.
radv: Use a double jump to limit nops in DGC for dynamic sequence count.
nir: Add AMD cooperative matrix intrinsics.
aco: Add WMMA instructions.
aco: Make RA understand WMMA instructions.
radv: Don’t transparently use wave32 with cooperative matrices.
radv: Add cooperative matrix lowering.
radv: Expose VK_KHR_cooperative_matrix.
Benjamin Cheng (10):
radv/video: use app provided hevc scaling list order
radv/video: copy from correct H264 scaling lists
anv/video: copy from correct H264 scaling lists
vulkan/video: add helper to derive H264 scaling lists
radv/video: use vk_video_derive_h264_scaling_list
anv/video: use vk_video_derive_h264_scaling_list
util/vl: extract gallium vl scanning data to shared code
radv/video: send h264 scaling list in raster order
anv/video: send h264 scaling list in raster order
radv/video: find SPS with pps_seq_parameter_set_id
Benjamin Lee (1):
nvk: Fix segfault when opening DRI device file returns error
Biswapriyo Nath (1):
radv/video: Match function definitions to declarations
Boris Brezillon (1):
panfrost: Flag the right shader when updating images
Boyuan Zhang (3):
virgl: Add vp9 picture desc
virgl: Implement vp9 hardware decode
radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0
Caio Oliveira (134):
nir: Use instructions_pass() for nir_fixup_deref_modes()
meson: Ensure that LLVMSPIRVLib is not required for Clover
nir: Let nir_fixup_deref_modes() fix deref_casts when possible
nir: Add nir_opt_reuse_constants()
radv: Use nir_opt_reuse_constants()
compiler/types: Use ralloc for the key in array_types
compiler/types: Use smaller keys for array_types table
compiler/types: Extract get_explicit_matrix_instance() function
compiler/types: Use smaller keys for explicit_matrix_types table
anv/tests: Refactor state_pool_test_helper to not use macros for parametrization
anv/tests: Link a single anv_tests binary using gtest
anv/tests: Propagate failures to gtest
hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization
hasvk/tests: Link a single hasvk_tests binary using gtest
hasvk/tests: Propagate failures to gtest
util: Add convenience macros for linear allocator
compiler/types: Use right hash for function types
compiler/types: Don’t duplicate empty string
compiler/types: Constify a couple of pointers in glsl_type
compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions
compiler/types: Move GLSL specific builtin structs into glsl/
glsl: Add missing glsl_types initialization to test_optpass
glsl: Don’t create struct type builtins
compiler/types: Add extra level of macro to builtin_macros
compiler/types: Use designated initializer syntax to specify builtins
compiler/types: Move local cache details to implementation file
compiler/types: Add a mem_ctx for the glsl_type_cache
compiler/types: Use type cache mem_ctx for hash tables
compiler/types: Don’t store a mem_ctx per type
compiler/types: Simplify clearing the glsl_type_cache
compiler/types: Move static asserts about glsl_type to a central place
compiler/types: Store builtin types directly as data
compiler/types: Use a linear (arena) allocator for glsl_types
compiler/types: Make struct glsl_type visible to C code
compiler/types: Add workaround to use builtin_type_macros.h in C
compiler/types: Move builtin type initialization to C
glsl: Annotate _mesa_glsl_error() with PRINTFLIKE
compiler/types: Fix array name dimension flipping for unsized arrays
compiler/types: Use Python to generate code for builtin types
compiler/types: Use glsl_get_type_name() to access the type name
compiler/types: Change glsl_type::name to be an uintptr_t
compiler/types: Use a string table for builtin type names
intel/compiler/xe2: Account for reg_unit() in TCS intrinsics
intel/compiler/xe2: Account for reg_unit() in TES intrinsics
intel/fs/xe2+: Update BS payload setup for Xe2 reg size.
intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size.
compiler: Use a meson dependency for libcompiler
meson: Remove unnecessary inc_compiler mentions
rusticl: Ensure NIR generated headers will be available
clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV
clover: Only compile/depend libclspirv and libclnir when using SPIR-V support
compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support
intel/compiler: Don’t allocate memory for SIMD select error handling
microsoft/compiler: Fix printf formatting string issues
util: Add more PRINTFLIKE and MALLOCLIKE annotations
util: Remove ralloc_parent from linear_header
util: Use linear parent to (r)allocated extra nodes
util: Remove size from linear_parent creation
util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors
util: Use an opaque type for linear context
util: Remove usages of linear_realloc()
util: Remove linear_realloc()
util: Remove size information from child allocations
util: Remove per-buffer header in linear alloc for release mode
util: Add a few basic tests for linear_alloc
util: Fix bookkeeping of linear node sizes
intel/compiler: Don’t store stage name and abbrev
intel/compiler/xe2: URB fence uses LSC now
intel/compiler/xe2: Fix URB writes in TCS
intel/compiler/xe2: Update TCS ICP handle code to support SIMD16
compiler/types: Add support for Cooperative Matrix types
nir: Add new intrinsics for Cooperative Matrix
nir: Handle cooperative matrix in various passes
spirv: Expose some memory related functions in vtn_private.h
spirv: Let vtn_ssa_value hold references to variables
spirv: Implement SPV_KHR_cooperative_matrix
compiler/types: Remove private related declarations
compiler/types: Remove use of new/delete
compiler/types: Remove use of references
compiler/types: Remove use of auto
compiler/types: Use C compatible cast syntax
compiler/types: Spell struct and enum in type names
compiler/types: Add void parameter to ensure these are valid C prototypes
intel/fs: Tweak default case of fs_inst::size_read()
compiler/types: Move the C++ inline functions in glsl_type out of the struct body
compiler/types: Move C declarations into glsl_types.h
compiler/types: Flip wrapping of base_type checks
compiler/types: Flip wrapping of various type identification checks
compiler/types: Flip wrapping of convenience accessors for vector types
compiler/types: Flip wrapping of basic “get type” functions
rusticl: Add Rust bindings for inline glsl_types functions
util: Add size to ralloc_header in debug mode
util: Add a canary to identify gc_ctx in debug mode
util: Add function print information about a ralloc tree
util: Avoid waste space when linear alloc’ing large sizes
spirv: Expose stage enum conversion in vtn_private.h
spirv: Change spirv2nir to use the shorter shader name abbreviations
spirv: List entry-points in spirv2nir when unsure what to use
spirv: Let spirv2nir find out the shader to use
intel/compiler: Don’t emit calls to validate() in release build
compiler/types: Flip wrapping of “type contains?” predicate functions
compiler/types: Flip wrapping of array related functions
compiler/types: Flip wrapping of cmat related functions
compiler/types: Flip wrapping of CL related functions
compiler/types: Flip wrapping of size related functions
compiler/types: Flip wrapping of struct related functions
compiler/types: Flip wrapping of interface related functions
compiler/types: Flip wrapping of layout related functions
compiler/types: Flip wrapping of record_compare
compiler/types: Flip wrapping of get_instance()
compiler/types: Flip wrapping of texture/sampler/image get instance functions
compiler/types: Flip wrapping of various get instance functions
compiler/types: Flip wrapping of get row/column type helpers
compiler/types: Flip wrapping of remaining non-trivial type getters
compiler/types: Flip wrapping of remaining small data getters
compiler/types: Flip wrapping of numeric type conversion functions
compiler/types: Move remaining code from nir_types to glsl_types
rusticl: Add bindings for glsl_vector_type()
compiler/types: Add more glsl_contains_*() functions and use them in C++
compiler/types: Add glsl_get_mul_type() and use it in C++
compiler/types: Add glsl_type_compare_no_precision() and use it in C++
compiler/types: Add glsl_type_uniform_locations() and use it in C++
compiler/types: Add glsl_get_std430_array_stride() and use it in C++
compiler/types: Add glsl_get_explicit_*() functions and use them in C++
compiler/types: Implement glsl_type::field_type() in terms of existing functions
compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type()
compiler/types: Add remaining type extraction functions and use them in C++
compiler/types: Use C instead of C++ constants for builtin types
compiler/types: Remove usages of C++ members in glsl_types.cpp
compiler/types: Annotate extern “C” only once in glsl_types.cpp
compiler/types: Rename glsl_types.cpp to glsl_types.c
compiler/types: Remove warnings about potential fallthrough
compiler/types: Move comments and reorganize declarations
anv: Fix leak when compiling internal kernels
Carsten Haitzler (2):
kmsro: Add hdlcd DPU
panfrost: Add GPU variant of G57 to the set of known ids
Charles Giessen (1):
panvk: Use 1.0 in ICD Manifest json
Charmaine Lee (8):
svga: set clear_texture to NULL for vgpu9
svga: fix stride used in vertex declaration
svga: fix persistent mapped surface update to constant buffer
svga: restrict use of rawbuf for constant buffer access to GL43 device
svga: fix immediates used in rawbuf for constant buffer
svga: use srv raw buffer for accessing readonly shader buffer
svga: sync resource content from backing resource before image upload
svga: ignore sampler view resource if not used by shaders
Chia-I Wu (38):
radv: fix separate depth/stencil layouts in fb state
radv: fix separate depth/stencil layouts in resolve meta
radv: refactor depth clear in clear meta
radv: fix separate depth/stencil layouts in clear meta
amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear
radv: disable tc-compat htile for layered images on gfx8
amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve
winsys/amdgpu: fix a race between import and destroy
ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
winsys/radeon: fix a race between bo import and destroy
vulkan/runtime: add a helper for ETC2 emulation
radv: use vk_tecompress_etc2 from the runtime
vulkan/runtime: fix image type check for ETC2 emulation
vulkan/runtime: fix a harmless typo for ETC2 emulation
vulkan/runtime, radv: remove 1D support from ETC2 emulation
radv: add radv_is_format_emulated
radv: simplify view format override for emulated formats
radv: hard code format features for emulated formats
mesa: make astc_decoder.glsl vk-compatible
radv, drirc: rename radv_require_{etc2,astc}
anv: remove unused field from anv_image_view
anv: add anv_image_view_{init,finish}
anv: support image views with surface state stream
anv: add anv_push_descriptor_set_{init,finish}
anv: support alternative push descriptor sets
anv: add anv_descriptor_set_write
anv: add anv_cmd_buffer_{save,restore}_state
anv: add anv_is_format_emulated
anv: add a hidden plane for emulated formats
anv: decompress on upload for emulated formats
anv: fix up image views for emulated formats
anv: fix up blit src for emulated formats
anv: advertise emulated formats
anv: add support for vk_require_astc driconf
util: improve BITFIELD_MASK and BITFIELD64_MASK on clang
anv: prep for gen9 astc workaround
anv: add gen9 astc workaround
radv: fix image view extent override for astc
Chris Spencer (9):
radv: initialize result when pipeline cache creation fails
anv/android: Fix importing hardware buffers with planar formats
anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12
anv: Advertise Vulkan 1.3 on Android 13
anv: Don’t reject Android image format if external props not supplied
android: Add explanatory comment to u_gralloc
anv/android: Enable shared presentable image support
anv/video: use correct enum value for max level IDC
radv/video: use correct enum value for max level IDC
Christian Gmeiner (41):
nir/print: print instr pass_flags
etnaviv: move nir texture lowerings into one pass
nir: add enta specific intrinsic used for txs lowering
etnaviv: nir: support intrinsic used for txs lowering
etnaviv: nir: lower nir_texop_txs
ci/etnaviv: update ci expectations
etnaviv: make use of BITFIELD_BIT(..) macro
etnaviv: name the enum used for pass_flags
etnaviv: add is_dead_instruction(..) helper
etnaviv: extend etna_pass_flags with source modifiers
etnaviv: do not clear all pass_flags before RA
etnaviv: nir: look at parent instr in lower_alu(..)
etnaviv: nir: add etna_nir_lower_to_source_mods(..)
etnaviv: nir: switch to etna_nir_lower_to_source_mods(..)
etnaviv: nir: convert to new-style NIR registers
freedreno/regs: remove double assignment of self.current_domain
freedreno/regs: remove not used variable
freedreno/regs: remove dead code
freedreno/regs: python does not need ‘;’
etnaviv: switch to log2f(..)
etnaviv: switch to U_FIXED(..) macro
etnaviv: switch to S_FIXED(..) macro
etnaviv: fix null pointer dereference
etnaviv: switch to float_to_ubyte(..)
ci/etnaviv: update ci expectation
etnaviv: unbreak cmdline compiler
agx/lower_address: Use intrinsics_pass
agx/lower_address: Remove not used has_offset
isaspec: python does not need ‘;’
docs: Move isaspec out of drivers/freedreno
isaspec: Add support for templates
isaspec: encode: Correct used regex
isaspec: Add method to get all instrustions
isaspec: Add support for custom meta information
isaspec: Add BitSetEnumValue object
spirv: Don’t use libclc for rotate
docs: update etnaviv extensions
etnaviv: drm: Be able to mark end of context init
etnaviv: Skip ‘empty’ cmd streams
ci: Bump PyYAML to 6.0.1
etnaviv: Don’t leak disk_cache
Collabora’s Gfx CI Team (2):
Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1
Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0
Cong Liu (2):
r300: Fix out-of-bounds access in ntr_emit_store_output()
virgl:Fix ITEM_CPY macro pointer copy bug
Connor Abbott (83):
afuc: Rework and significantly expand README.rst
tu: Fix vk2tu_*_stage flag type
tu: Fix and simplify execution dependency handling
tu, freedreno/a6xx: Remove has_ccu_flush_bug
ir3: Handle GS stream “mixing” with non-point output primitives
tu: Disable transformFeedbackPreservesProvokingVertex
isaspec: Add “displayname” for altering {NAME} when decoding
isaspec: Add support for “absolute” branches
isaspec: Add support for function and entrypoint labels
isaspec: Add “custom” field type
isaspec: Add callback after decoding an instruction
isaspec: Rename isa_decode() to isa_disasm()
isaspec: Add initial decoding support
afuc: Fix xmov lexer typo
afuc: Convert to isaspec
afuc: Add setbit/clrbit
afuc: Fix writing $00
freedreno/afuc: Initial a7xx support
ir3: Parse (eq) flag
ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD
tu: Add missing last_baryf statistic
freedreno, tu, ir3: Add last_helper statistic
ir3: Gather pixlod status earlier
ir3: Implement helper invocation optimization
vk/graphic_state, tu: Use dynamic blend count from subpass
freedreno/a7xx: Add CP_RESET_CONTEXT_STATE
vk/graphics_state: Fix copying MS locations pipeline state
tu: Remove MSAA draw state
tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states
tu: Merge PC_RASTER_CNTL into RAST draw state
tu: Stop reusing base Vulkan dynamic state enums
tu: Merge depth/stencil draw states
tu: Rename PrimID-related registers
tu, freedreno/a6xx: Don’t use VS for PrimID passthru state
tu: Pull entangled shader state into program config
ir3: Add ir3_find_input_loc() helper
tu: Split up tu6_emit_vpc()
freedreno, ir3, tu: Constify various uses of ir3_shader_variant
ir3: Add helper to determine when variant exceeds safe constlen
tu: Split program draw state into per-shader states
tu: Fix per-view viewport state propagation
tu: Fix tu6_emit_*_fdm size call
tu: Fix assert in FDM state emission
tu: Actually emit patchpoint for viewports with FDM
nir/lower_subgroups: Don’t do multiple lowerings at once
nir/spirv: Add inverse_ballot intrinsic
amd: Use inverse ballot intrinsic if available
tu: Create singleton “empty” shaders
tu: Start tracking shaders independently of pipeline
tu: Move FS-specific pipeline information to the shader
tu: Use shader directly for VS/TCS output size and patch size
tu: Rewrite tessellation modes handling
tu: Rework passing shared consts
tu: Decouple program state from the pipeline
tu: Use pipeline feedback loop flag indirectly
tu: Rewrite remaining pipeline LRZ handling
tu: Don’t reference pipeline for some draw states
tu: Make compute dispatch use the shader
tu: Don’t use pipeline for dynamic draw states
tu: Don’t use pipeline for bandwidth validity
tu: Don’t use pipeline for per_view_viewport
tu: Don’t use pipeline for active stages
tu: Remove pipeline from state
zink: Rework color clamping and conversion
freedreno/fdl: Use A8_UNORM HW format for sampling
tu: Support clearing A8_UNORM
freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx
tu/clear_blit: Fix staging image view layer count
tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount
tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes
tu: Implement vkCmdBindIndexBuffer2KHR
tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR
tu: Implement vkGetRenderingAreaGranularityKHR
tu: Use new buffer usage flags
tu: Support VkPipelineCreateFlags2CreateInfoKHR
tu: Check for DEVICE_LOST in vkGetEventStatus()
tu: Add maintenance5 properties
freedreno/ci: Skip dEQP-VK.info.device_extensions
tu: Expose VK_KHR_maintenance5
freedreno/ci: Remove minetest trace
v3d/ci: Remove minetest trace
ir3/ra: Don’t swap killed sources for early-clobber destination
tu: Fix re-emitting VS param state after it is re-enabled
Corentin Noël (16):
ci: Add locked flag to bindgen-cli installation
virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
ci: disable Collabora’s LAVA lab for maintenance
llvmpipe: make sure to initialize the lp_setup_context slots with the default values
virgl: Cover all the formats defined in the virgl definition
mesa: Ensure that the baselevel will never exceed the maximal supported number
ci: Uprev virglrenderer
freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE
util: Avoid the use of MESA_TRACE_BEGIN/END
util/perf: Remove the tracing categories
util: Remove MESA_TRACE_BEGIN/END
mesa/bufferobj: ensure that very large width+offset are always rejected
frontends/va: Remove wrong use of ProfileToPipe
Daniel Schürmann (9):
nir/opt_move: fix handling of if-condition
aco: append p_logical_end after monolithic RT shaders
aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
aco: don’t optimize cross-lane instructions across p_wqm
aco: make p_wqm a marker instruction without Operands/Definitions
aco: don’t insert a copy when emitting p_wqm
aco: insert a single p_end_wqm after the last derivative calculation
aco/insert_exec_mask: Simplify WQM handling (1/2)
aco/insert_exec_mask: Simplify WQM handling (2/2)
Daniel Stone (23):
dri: Support 1555/4444 formats
egl/dri2: Don’t look up image extension twice
egl/wayland: Always initialise fd_display_gpu
egl/wayland: Add image loader extension for swrast
egl/wayland: Never use DRI2_LOADER extension
egl/wayland: Assume modern DRI interface versions
egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER
egl/drm: Assume modern DRI interface versions
ci: Disable nouveau CI
panfrost/vk: Use correct sampler dimensions for MSAA
ci: Declare stages before jobs
ci/radeonsi: Add new flake
ci/d3d12: Add new flake
ci/intel: Add new skqp flake
ci/zink: Add new zink-lvp flakes
ci/radeonsi: Skip more really slow tests
ci/zink: Add another conversion fail on a618
ci: Move farm-disable rules before anything else
ci: Always set user container jobs to manual
ci: Use container rules for containers
ci: Only look at file changes for MRs
ci: Fix pre-merge pipelines with no code changes
ci: Try really hard to print final result string
Daniel van Vugt (1):
glx: Increment dpy->request before issuing an error that had no request
Danylo Piliaiev (71):
freedreno/cffdec: Decode CP_DRAW_AUTO
freedreno, turnip: Clarify some RB_CCU_CNTL fields
freedreno,turnip: Make number of VSC pipes configurable
freedreno,turnip: Make CS shared memory size configurable
freedreno,turnip: Make VS input attr/binding count configurable
freedreno: Add A605, A608, A610, A612 GPUs definition
turnip: Make multiview support configurable per generation
ir3: Make FS tex prefetch optimization optional
ir3: Use NIR info to enable per sample shading
freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
ir3: Fix FS quad ops returning wrong values from helper invocations
tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
radv: fix unused non-xfb shader outputs not being removed
vulkan/nir: Add common helper to check if output is XFB
radv: Use common nir_vk_is_not_xfb_output
turnip: Use common nir_vk_is_not_xfb_output
freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields
freedreno/registers: Refactor gen_header.py to allow more options
freedreno/registers: Generate python files with reg offsets
freedreno: Add a list of raw magic regs
freedreno: Fully define a730 and a740 device properties
ir3/tests: Use fd_dev_info to infer GPU generation
freedreno/computerator: Fix remaining issues with A7XX
isaspec: Make possible to obtain gpu_id in <expr> blocks
ir3/a7xx: cat5 mode1 has swapped tex/samp ids
ir3/a7xx: Don’t multiply global mem instruction’s offset by 4
ir3/a7xx: insert lock/unlock at the end of every compute shader
ir3/a7xx: Add ccinv instruction
ir3/a7xx: Use ccinv for data synchronization
ir3/a7xx: Disable shared consts for a7xx
tu/common: Generalize TU_GENX macro
tu: Basic a7xx support
freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx
tu/a7xx: Fix geometry shaders
tu/a7xx: Fix tesselation shaders
tu/a7xx: Fix multiview
tu/a7xx: Fix flat shading
tu/a7xx: Fix occlusion query
tu/a7xx: Fix 3d blits after multiview usage
tu/a7xx: Fix CmdDrawIndirectByteCountEXT
tu/a7xx: Disable LRZ
ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset
tu: Exclude SP_UNKNOWN_AE73 from reg stomping
tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen
freedreno/replay: Add limited support for KGSL
freedreno/rddecompiler: Update to handle a7xx
freedreno/replay: Add “print” instr to ir3 asm to be used in replay
freedreno/replay: Add “gpu_print” function for command streams
tu/perfetto: Remove now unnecessary tu_perfetto_util
tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit
tu/kgsl: Fix memory leak of tmp allocations during submissions
tu/kgsl: Support u_trace and perfetto
tu/a7xx: Correctly record timestamps for u_trace
tu/virtio: Fix incorrect call to tu_perfetto_submit
ci: Compile Turnip’s virtio kmd in debian-arm64
freedreno/registers: Refine a7xx push consts registers
ir3,tu: Refactor push consts info plumbing
freedreno: Make possible to specify A7XX feature flags
turnip,ir3: Implement A7XX push consts load via preamble
tu: Add push_consts_per_stage debug option
tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc
tu/kgsl: Fix field order in kgsl_command_object init
tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering
tu: Zero init tu_render_pass and tu_subpass for dynamic rendering
tu: Disable preamble push consts when they are not used
ir3: Fix values of #wrmask not being compatible with ir3 parser
tu: Count a whole push consts range in constlen for PREAMBLE push consts
freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm
freedreno/rddecompiler: Decompile repeated IBs
freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH
tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0
Dave Airlie (163):
ci: remove binding model from the asan skips for lavapipe.
gallivm: fix atomic global temporary storage.
llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
nvk: add missing finish calls
nvk: add some initial wsi framework.
nvk: fix header guards to be less generic.
nvk: add bind buffer memory
nvk: Add initial queue
nvk: add cmd buffer framework
nvk: Reset pushbufs on command buffer reset
nvk: reindent descriptor sets to mesa std.
nvk: add initial descriptor pool framework.
nvk: some boilerplate for descriptor sets
nvk: add descriptor set bo allocation.
nvk: implement buffer address.
nvk: descriptor set freeing fix
nvk: move to new command stream generator.
nvk: port the blit and copy code to new command submission.
nouveau/ws: drop the old push generators.
nvk: link in codegen without gallium bits.
nvk: Initial wiring in of the compiler
nvk: Basic descriptor binding
nouveau/vk: add support for compute classes to generator.
nvk: retrieve gpc/mp counts from kernel.
nvk: add support for preamble and tls allocation.
nvk: add record result to cmd_buffer.
nvk: add command stream upload buffer.
nouveau/winsys: Add m2mf/compute objects
nvk: add some basic format wrapping framework
nvk: add some compute limits
nvk: add basic nve4+ compute support.
nvk: fix empty cmd submission.
nouveau/ws: add a push reset just for references.
nouveau/classes: add 906f header support.
nvk: add initial 8/16 byte clears.
nvk: fix pipeline pushbuf sizing
nvk: increase graphics cpu push buffer
nvk: fix depth emission ordering.
nvk: add some limits/features from binary driver.
nvk: add indexed draw support.
nvk: assign vertex locations according to input attrib index
nvk: lower io to temps to avoid output reads in vertex shaders
nvk: handle NULL to destroy descriptor pool
nvk: add basic primitive restart
nvk: fix copy lower address extraction
nvk: fix multiple pipelines failure allocation case.
nvk: init dev->physical_device earlier.
nvk/winsys: store device ptr into bo instead of ptr
nvk: set the device fd
nil: Fix image align and size constraints
nvk: Report image alignments from NIL
nouveau/winsys: allocate unique object handles across channels.
nvk/nil: don’t ask for compressed image kind
nvk/barrier: handle host bit.
nvk: add compute support for ampere
nvk: add min_lod to spirv caps.
nvk: fix r32_sint format support
nvk: expose EXT_sampler_filter_minmax
nvk: fix transform feedback crash when optimiser removes things.
nvk: merge tess info between tcs/tes.
nvk: introduce an optimisation loop.
nvk: add support for D32_SFLOAT_S8_UINT
nvk/query: fix push buffer size for copy pool results.
nvk: init image fields for requirements
nvk: handle alignments in device memory
nvk/tess: don’t emit patch control points in pipeline
nvk: align geometry clip setting with nvc0
nvk: fix independent color write masks.
nvk: enable rgb32 texel buffer support
nvk: enable EXT_depth_clip_control
nvk: enable EXT_depth_clip_enable
nvk: always sync internal cmd bufs for vma lifetimes.
nouveau/winsys: add support for the vma bind interfaces
nvk: Add support for sparse buffers
nvk: Add support for sparse images
nvk/queue: add support for syncobjs and sparse binds
nvk: Handle pre-turing indirect buffers with sparse
nvk: enable sparse features
nvk: enable a bunch of external fence/semaphore bits
nvk: enable sparse residency buffer on maxwell+
nvk: add new internal bo allocation flag.
docs: add two nvk exts to features.txt
zink: use fprintf instead of printf to align the requirements warnings
nvk: align sampler allocation counts with nvidia.
zink: turn off threaded cpu access if not visible.
nvk: add gart forced cmd pool side buffer.
nvk: add cond render upload buffer.
nvk: enable KHR_shader_clock.
nvk: NOUVEAU_WS_BO_LOCAL is a trap.
gallivm: drop unused info parameter
llvmpipe/fs: drop cbuf 0 since it’s lowered now.
gallivm/nir: avoid using params->info
llvmpipe/fs: move some tgsi checks in nir path to nir code.
llvmpipe/cs: convert to using tgsi->nir
llvmpipe/cs: drop tgsi for compute/mesh/task shader internals.
lavapipe: use vk_buffer common code.
lavapipe: use vk_buffer_range common code.
llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi
llvmpipe/analyse: drop TGSI path.
llvmpipe/fs: start using nir info in some places.
llvmpipe/fs: drop the simple shader logic
llvmpipe/fs: rewrite output finding using nir.
nvk: add build_id linker argument.
nir/gather: add support for fbfetch and bindless image loads.
llvmpipe/cs: further cleanups after tgsi removal.
llvmpipe: move to nir lowering for fquantize2f16
rusticl: don’t store ptrs to nir_variables across opt passes.
llvmpipe: enable f16 paths on aarch64.
clover/llvm: move to modern pass manager.
nir: use a _clone so users calling their variable clone don’t get a warning
nir: rename nir_inline_functions.c to nir_functions.c
nir: use nir_function_instructions_pass in the inliner.
nir: move the libclc lowering over to functions file.
nir/functions: use helper to get function for a name.
nir/functions: put link state into a struct
nir/functions: move linker pass to new helper
nir: add nir function clone
nir: don’t inline linked functions
gallivm/nir: split prepasses out to make per-function work easier.
gallivm: rework translator to allow per-impl work.
spirv/nir: parse function control and store in nir.
nir: add driver_functions option to avoid inlining.
nir: add a function usage tracker
rusticl: use cleanup funcs
gallivm: add support for function calling
llvmpipe/cs: add support for function calls.
llvmpipe: enable driver functions.
radv: don’t emit event code on video queues.
spirv: use a pointer sized int type for opencl event_t
clover: fix parameter arguments since recent translator changes.
radv/video: take db alignment into account when allocating images.
ac,radeonsi: move vcn enc structs to common
ac,radeonsi: move vcn enc av1 default cdf file to common
nir: add a deref slot counter that handles compact
llvmpipe/linear: drop tgsi path.
gallivm: drop tgsi aos paths.
llvmpipe/nir: call gather info to update inputs read properly
llvmpipe/fs: start converting interp/input paths to nir.
llvmpipe/fs: start converting dervied state to nir based.
llvmpipe/linear: convert to using nir for output.
llvmpipe/linear: move to nir inputs
draw/mesh: reset some user state values on mesh draws.
llvmpipe/fs: fix regression in sample mask handling from tgsi removal.
llvmpipe: reset viewport_index_slot in fb bind
llvmpipe/cs: migrate to generic jit texture from pipe code.
llvmpipe/cs: migrate cs image handle to common jit code.
lavapipe: fix some whitespace in advance of other changes.
lavapipe: fix subresource layers asserts
lavapipe: support host image copying on compressed texture formats
llvmpipe: don’t create texture functions for planar textures.
lavapipe: don’t emit blit src/dst for subsampled formats.
llvmpipe: don’t support planar formats for buffers.
lavapipe: convert sampler to use vk base class.
lavapipe: cleanup copy code to use a local region variable.
lavapipe: start introducing planes structure.
lavapipe: allocate image and image view planes.
lavapipe: handle planes in copies
lavapipe: handle planes in get image sub resource
lavapipe: add descriptor sets bindings for planar images
lavapipe: handle planes in texture lowering.
lavapipe: expose planar ycbcr formats and new ycbcr features
lavapipe + docs: update ycbcr extension enables.
intel-clc: avoid using spirv-linker.
David Heidelberg (82):
ci/freedreno: update a530 flakes
ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI
ci: update kernel to 6.3.13
ci/freedreno: add fails introduced by upreving to 6.3.13
Revert “lima/ci: temporarily disable deqp-egl tests due to timeouts”
ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13
ci/lima: EGL testing was disabled when fp16 fail was removed
ci/freedreno: fix unexpectedpass flake on a630
ci/freedreno: add another a530 flakes
ci: add quirk for GitLab assuming changes is always true for scheduled runs
ci/microsoft: when re-enabling Windows Farm, always run the container
ci/freedreno: add a530 flakes, remove one fail which recently started passing
ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook
ci/freedreno: cover all texture gather flakes
ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4
ci/farms: always compare the code against main repository
Revert “ci/farms: always compare the code against main repository”
ci/kernel: add amd patch to prevent crashes when starting X
ci/kdl: remove extra-verbose ls command
ci/nouveau: add 20 minutes timeout to gk20a and align gm20b
ci/freedreno: document another mapbuffer flake on a530
ci/amd: fix timeouting radeonsi-raven-va-full job
docs/ci: default to port 80 for the caching proxy
docs/ci: update to systemd and used version of the trace for testing
docs/ci: remove default nginx config, which we don’t need for proxy
bin/ci: handle errors more gracefully in update_traces_checksum script
ci/freedreno: document another flakes on Adreno 530
ci: add perfetto into mesa git-cache
ci/panfrost: re-enable t760 and t860 traces as a nightly job
CI: Re-enable G52 Vulkan testing
ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1
ci/zink: Add flake seen in the wild
ci/build: limit debian-build-testing to 30 minutes
ci/amd: add glx@glx-visuals-depth flake to raven
ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530
ci/farms: enabled Microsoft job only when conditions are met
ci/deqp: really remove the uncompressed results.csv file
ci/baremetal: do not install curl, it’s already there
ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only
ci/freedreno: document another a530 flake batch
ci: remove LAVA prefix from variables which can be used also elsewhere
ci/zink: drop a630, which we currently have very low amount available
ci/freedreno: the tag belongs to the apq8016 only
ci/freedreno: switch references, the farm-rules takes care about this
ci/freedreno: handle disabling farm properly for each FD/Collabora farm
ci/freedreno: another batch of Adreno 530 flakes
gtest: backport ansi color fix
ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
panfrost/ci: revert Disable T720
ci/piglit: add extra space on top to prevent single quote getting into URL
ci/freedreno: There is only one King of Town.
ci: switch to 6.4 kernel, improving Adreno 660 reliability
ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout
ci/panfrost: add G52 flakes
ci/panfrost: we have enough device, parallelize Vulkan tests
ci/virgl: flakes in functional.draw_buffers_indexed group
ci/freedreno: add another a530 flake
ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake
panvk: architecture isn’t invalid, just unsupported
panvk: catch unsupported arch in the panvk_physical_device_init
Revert “ci: disable a660 jobs”
docs: add LAVA farm informations
ci: disable Google Freedreno farm, currently timeouting on all jobs
Revert “ci: disable Google Freedreno farm, currently timeouting on all jobs”
ci/farms: no need to check RUNNER_TAG for Collabora farm
ci/traces: extend no-output timeout by 5 minutes
ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake
iris: do not mention specifically clover for OpenCL support
ci/freedreno: disable broke cheza (Adreno 630) runners
ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data
ci/bare-metal: drop unused imports, sort, use SPDX license
ci/lima: farm is down, disable for now
ci: do not report failed job when flakes reporting fails
ci/freedreno: re-enable Cheza (Adreno 630) runners
ci/traces: upload only missing trace images
ci/traces: keep images for every job except the performance testing
ci/traces: rename upload function to reflect it works with S3
ci/traces: always export piglit EXTRA_ARGS
ci: ci_marge_queue.py
ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline
ci/freedreno: disable Adreno 660 Vulkan pre-merge
ci/traces: drop the freedoom-phase2-gl-high.trace
David Rosca (70):
radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
frontends/va: Skip processing buffers already converted with EFC
frontends/va: Don’t use EFC with scaling or filtering enabled
radeonsi/vcn: Don’t use chroma in AV1 encode with RGB input
frontends/va: Parse H264 SPS for video signal parameters
frontends/va: Parse HEVC SPS for video signal parameters
frontends/va: Add postproc support for converting to full range
radeonsi/vcn: Set H264 video signal parameters in bitstream
radeonsi/vcn: Set HEVC video signal parameters in bitstream
radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1
radeonsi/vcn: Fix setting color range in AV1 bitstream
gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix
gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv
gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv
frontends/va: Add YUV420 to NV12 postproc conversion
gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer
gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave
frontends/va: Also map VAImageBufferType for reading
frontends/va: Alloc interlaced surface for interlaced pics
frontends/vdpau: Alloc interlaced surface for interlaced pics
radeonsi: Don’t prefer interlaced for video decode
ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test
frontends/va: Don’t allow multi-plane derive without driver support
frontends/va: Init view_resources array in vlVaPut/GetImage
radeonsi: Copy all planes with multi-plane staging textures
radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP
ci/amd: Skip all VAAPI tests that creates too many huge surfaces
radeonsi/vcn: Update rate control when framerate changes with HEVC
frontends/va: Ignore requested size when creating VAEncCodedBufferType
gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer
radeonsi/vcn: Fix leaking fences in decode
gallium/auxiliary/vl: Add BT.709 full csc matrix
frontends/va: Set csc matrix in postproc
gallium/auxiliary/vl: Don’t set csc matrix in video_buffer/rgb_to_yuv_layer
frontends/va: Add BT.709 as supported postproc color standard
Revert “radeonsi/vcn: add an exception of field case for h264 decoding”
gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter
gallium/auxiliary: Fix util_compute_blit half texel offset with scaling
gallium/auxiliary/vl: Map range when updating constants
gallium/auxiliary/vl: Clamp coordinates in compute shaders
gallium/auxiliary/vl: Support chroma sample location in compute shaders
frontends/va: Support chroma sample location in postproc
frontends/va: Flush after unmapping VAImageBufferType
frontends/va: Parse chroma sample location in H264/HEVC SPS
radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream
radeonsi/vcn: Don’t hang GPU when using DCC surface as encoder input
frontends/va: Track surfaces in context
frontends/va: Destroy fences when destroying surface or context
radeonsi/vcn: Implement destroy_fence vfunc
frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
frontends/va: Set default rate control values once when creating encoder
gallium/auxiliary/vl: Add RGB to YUV compute shader
gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader
gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing
gallium/auxiliary/vl: Only map the shader constants buffer in render
frontends/va: Add High Quality preset mode
radeonsi/vcn: Add High Quality encoding preset for AV1
radeonsi: Fix plane size in si_copy_multi_plane_texture
frontends/va: Implement vaMapBuffer2
frontends/va: Fix locking in vlVaBeginPicture
frontends/va: Parse H264 SPS for max_num_reorder_frames
util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes
frontends/va: Fix parsing packed headers without emulation bytes
radeonsi/vcn: Add encode support for H264 B-frames
frontends/va: Map decoder and postproc surfaces for reading
radeonsi: Fix offset for linear surfaces on GFX < 9
gallium/auxiliary/vl: Fix coordinates clamp in compute shaders
gallium/auxiliary: Fix coordinates clamp in util_compute_blit
gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane
util/rbsp: Fill bits twice if reading more than 16 bits
Derek Foreman (2):
vulkan/wsi: Allow binding presentation_timing when software rendering
vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt
Dmitry Baryshkov (3):
gallium: move kmsro definition to the bottom of the file
gallium: unbreak kmsro/freedreno case
tu: Pass real size of prime buffers to allocator
Dmitry Osipenko (3):
util/cache_test: Re-add test for disabled cache
util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT
util/cache_test: Add test for get/put() with disabled cache
Dor Askayo (1):
nouveau: add exported GEM handles to the global list
Dr. David Alan Gilbert (6):
rusticl/core: Add profiling time storage (queued) to event
rusticl: Wire the ‘queued’ profiling time up
rusticl: Wire the ‘submit’ profiling time up
rusticl: Wrap pipe queries
rusticl: Wrap pipe query reads
rusticl: Wire the ‘start’ and ‘end’ profilng times up
Dylan Baker (4):
VERSION: bump to 23.3.0-devel
docs: Update release calendar for 23.2.0-rc1
docs: truncate feature list for 23.3-devel
meson: use a single dependency call for lua
Echo J (5):
nvk: Fix some cast defines
nvk: Add A8B8G8R8_*_PACK32 format support
nvk: Add bufferImageGranularity limit
nvk: Reset offset value in ResetDescriptorPool
nil: Add A4B4G4R4_UNORM format support
Emma Anholt (111):
ci/radv: Clarify when the ANGLE GS failures started happening.
ci: Uprev ANGLE to 0518a3ff4d4e (“Android: Simplify power metrics collection”)
ci/tgl: Improve the info for ANGLE’s MSAA regression on TGL.
ci/tu: Add more crash cases for the multithreading bugs caught on a630.
ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too.
ci/tu: Drop some xfails for !24086
tu: Fix data race in userspace VMA management.
ci/a5xx: Add another GPU hanging piglit test to the skips.
Revert “ci: Disable nouveau CI”
nvk: Avoid strict aliasing warning in the pushbuffer encoding.
nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY.
nvk: Fix unused result warnings in pushbuf resets.
nvk: Remove duplicate (disabled) point sprite setup.
nvk: Fix missing init of the stages to sync against.
nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP.
nouveau/winsys: Fix an undefined use in the error path.
nvk: Quiet a compiler warning.
nvk: Clean up redundant vendor checking for physical device creation.
nvk: Add support for probing as a platform device.
nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell.
freedreno/a5xx: Fix border color structure size.
freedreno/a5xx: Skip emitting unused texture descriptors for images.
freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler.
ci/freedreno: Drop a bunch of stale a530 xfails.
ci/freedreno: Sort another a530 xfail with its friends.
ci/freedreno: Update comments for some a530 xfails.
ci/freedreno: Add some more db820c xfails.
freedreno/devices: Move fibers_per_sp to the common info struct.
freedreno/devices: Set num_sp_cores explicitly for pre-gen6.
freedreno/a6xx: Move pvtmem allocation to ir3_gallium.
freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs.
freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function.
freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly.
freedreno/a5xx: Add private mem support.
freedreno/cffdec: Fix decode on pixel 2 blob’s COMPUTE_CHECKPOINT
ci/freedreno: Add a regression test for decoding a540 blob’s compute shaders.
freedreno: Fix crashdec pre-a6xx.
freedreno/a5xx: Skip SSBO emit when none are enabled.
vulkan/util: Make multialloc succeed with 0 allocations.
turnip: Track the first/last subpass an attachment is used in.
turnip: Skip emitting empty CP_COND_REG_EXEC.
turnip: Save the renderpass’s clear values in the cmdbuf state.
turnip: Move gmem clears and loads to the first subpass that uses them.
turnip: Move sysmem clears to the first subpass that uses them.
ci/freedreno: Skip some tests on a5xx that destabilize other tests.
freedreno/a3-5xx: Don’t try to emit ISAM for SSBO loads.
ci/turnip: Add a660 VK coverage.
disk_cache: Disable the “List” test for RO disk cache.
blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too.
intel/fs: Move defin/defout setup to the start of the loop.
intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup.
intel/fs: Simplify compute_start_end().
ci/freedreno: Add another excessive-constlen UBO skip.
ci/anv: Drop DEQP_VER:vk setting.
ci/anv: Drop “-vk” from the job name.
ci/anv: Add a manual full VK run for TGL.
ci/anv: Add testing on JSL.
freedreno: Build drm subdir before perfcntrs, which uses it.
ci/intel: Add various updates from our nightly runs.
ci/virgl: Disable virgl-iris-traces.
ci/zink: Add a few updates for anv/tgl from the nightly runs.
ci/fastboot: Use a case insensitive match for a fastboot line.
ci/etnaviv: Skip some tests that hang the GPU and knock out other tests.
ci/etnaviv: Drop some gc2k flakes that I think are resolved.
ci/anv: Drop incorrect xfail addition for TGL
ci/anv: Drop the 16bit.scalar.13 skip.
ci/etnaviv: Minor xfail/flake polishing.
ci/etnaviv: Skip a GLES2 test that times out the asan job.
ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts.
ci/docker: Clear the results file before starting a new deqp test run.
ci/crocus: Add a related flake to a known one.
ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake
ci/crocus: Add known piglit flakes
ci/hasvk: Add a bunch of new CTS border color fails.
i915: Re-clang-format and enforce it in CI.
i915: Print the relevant counts vs limits when throwing errors.
i915: Don’t log I915_DEBUG=fs output for blit shaders.
i915: Save fragment program compile error messages in the fragment shader.
i915: Do a test compile at glLinkShader() time.
i915: Make exceeding tex indirect count fatal.
i915: Use nir_group_loads() to reduce texture indirection phases.
ci/crocus: Generalize the drawarrays-vertex-count flakes.
ci/zink: Skip 3-minute-long glx-visuals timeouts.
ci/zink: Skip dmat[34] op tests in general, as well
ci/crocus: Disable flaky unvanquished-ultra trace
nir/print: Decode system values in the variable declarations.
ci/zink: Add a TGL flake that’s showed up in nightlies recently.
ci/radeonsi: Drop an xfail for vangogh.
i915: Make I915_DEBUG=fs log shaders that fail to link due to CF.
nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF.
glsl: Remove lower_discard().
ci/zink: Only test half of piglit pre-merge on anv.
ci: Stop doing internal retries in bare-metal.
ci/bare-metal: Drop the 2 vs 1 exit code from poe_run.
ci/bare-metal: Default our boards to a 20-minute timeout for the whole job.
ci/iris: Drop parallel on kbl piglit to 2.
ci/freedreno: Fold a630_egl into a630_gl.
ci/freedreno: Move skqp testing to a618.
ci/zink: Cut zink-lvp coverage in half.
ci/freedreno: Generalize the implicit_unmap timeouts.
ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time.
glx: Delete support for GLX_OML_swap_method.
ci: drop skip for glx-swap-copy.
dri: Drop a duplicate mesa vs pipe format table.
docs/ci: Drop old instructions for farm disabling
docs/ci: Add some links in the CI docs to how to track job flakes
glsl: Remove int64 div/mod lowering.
llvmpipe: Set nir_lower_dround_even.
nir: Add nir_lower_dsign as 64-bit fsign lowering.
glsl: Retire dround lowering.
ci_run_n_monitor: Always resolve –rev arguments for looking up pipelines.
Eric Engestrom (194):
ci: avoid running hardware jobs if lint fails - now on LAVA too!
ci: avoid running hardware jobs if lint fails - now on Windows too!
ci: replace copy of nouveau rules with reference
ci: drop leftover kernel configs
ci: use !reference for scheduled_pipeline retry rule
ci: add .llvmpipe-manual-rules and use it
ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor
ci: replace llvmpipe_file_list anchor with reference
ci: replace softpipe_file_list anchor with reference
ci: replace lavapipe_file_list anchor with reference
ci: replace iris_file_list anchor with reference
ci: replace radv_file_list anchor with reference
ci: replace radeonsi_file_list anchor with reference
ci: replace virgl_file_list anchor with reference
ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml
ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml
ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml
ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml
ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml
ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml
ci: move amd files rules to src/amd/ci/gitlab-ci.yml
ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml
ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml
ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml
ci: move intel files rules to src/intel/ci/gitlab-ci.yml
ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml
ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml
ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml
ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml
ci: delete install.tar after extracting it to avoid re-uploading it
docs: add release notes for 23.1.4
docs: add sha256sum for 23.1.4
docs: update calendar for 23.1.4
asahi: drop unused include paths
ci/lint: deduplicate formatting check jobs
ci/lint: also print a diff for rust format issues
ci: allow hw jobs even if lint jobs fail for non-Marge pipelines
ci: print rustfmt’s version
ci: print clang-format’s version
bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring –rev to match
lavapipe/ci: use tighter changes: rules
ci: add a 10min job timeout to formatting checks
ci: reduce bare-metal retries of poe_run to only 3 attempts
broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h)
broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h)
broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h)
broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h)
ci: add .core-rules to .gallium-core-rules
ci: drop rule for non-existent src/include/
docs: add release notes for 23.1.5
docs: add sha256sum for 23.1.5
docs: update calendar for 23.1.5
ci: include some timing information in the git cache download script
docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS
docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that
docs/ci: drop mention of build systems variants in the CI
docs/ci: expand the description of test suites
bin: add wrapper to run scripts in a python venv
bin/ci/ci_run_n_monitor: use venv wrapper
bin/ci/gitlab_gql: use venv wrapper
bin/ci/update_traces_checksum: use venv wrapper
bin/pick-ui: use venv wrapper
ci: include mold in x86_64_test-base & rootfs images
ci: use mold to build deqp
zink/ci: set the default timeout for zink jobs to 30min instead of 1h
egl: make _eglFilterConfigArray static
egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper
ci: build nvk
ci: document max image tag length
docs/radv: mark VK_EXT_tooling_info as implemented
docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented
git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it
git-blame-ignore-revs: add radv formatting commit
git-blame-ignore-revs: add pvr formatting commit
meson: fix indentation
docs/v3dv: mark direct display extensions as implemented
ci: reorder vk drivers alphabetically in debian-vulkan job
ci: build hasvk in debian-vulkan job
ci/zink+radv: set a timeout of 2x the normal runtime
amd/ci: drop duplicate test expectations
panfrost: upcast uint8/uint16 before shifting them beyond their range
ci/a530: document piglit flake
docs: add release notes for 23.1.6
docs: add sha256sum for 23.1.6
docs: update calendar for 23.1.6
docs: add one more 23.1.x release
ci: rename *.log to *.txt to work around gitlab bug
ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it
egl: bump extension string length
vc4: drop duplicate .lower_ldexp
zink: fix format in zink_make_{image,texture}_handle_resident()
v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
v3dv: fix copy/pasted type of `sample`
v3dv: fix shader stage name in error message
v3d/qpu: fix type of function argument
ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_*
ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
bin/ci_run_n_monitor: error out if both –project and –pipeline-url are passed
ci: document farm rules
ci/b2c: skip install.tar extraction if the tarball is not present
ci/b2c: don’t allow failures in test script preparation
ci/b2c: assert that install folder is present whether or not the tarball was extracted
ci/amd: split the polaris10 rules into one for each farm
ci: skip containers & build jobs when disabling a farm
docs: add release notes for 23.1.7
docs: add sha256sum for 23.1.7
docs: update calendar for 23.1.7
docs: add one more 23.1.x release
ci: taking igalia farm offline
ci/b2c: drop logic to remove install.tar
ci: drop clover leftover
Revert “ci: taking igalia farm offline”
bin/ci_run_n_monitor: print in which repo we’re looking for the pipeline
bin/ci_run_n_monitor: automatically pick MR pipelines when they exist
ci: remove duplicate fork pipeline in MRs
ci_run_n_monitor: add comment to explain “MR > fork” logic
ci: don’t run everything just because a farm gets re-enabled
ci/windows: centralize definition of windows runners tags
ci/windows: add windows docker runner tags to .windows-docker-vs2019
ci/windows: drop build rules from test jobs
ci: document which image tags need to be bumped when updating piglit
ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64
ci/farm-rules: rename .disable-farm-mr-rules to make it clear it’s only about MRs
ci/farm-rules: re-add “run every container and build job when a farm gets re-enabled”
ci/zink: drop redundant `MESA_LOADER_DRIVER_OVERRIDE: zink`
docs: add release notes for 23.1.8
docs: add sha256sum for 23.1.8
docs: update calendar for 23.1.8
docs: add another 23.1.x
ci: limit build jobs to 30min so that they can retry when they go wrong
docs: drop outdated and redundant note about the minimum meson version
ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm
ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes
docs: add calendar for 23.3
ci: unify container and build jobs rules
docs/meson: drop mention that our meson is ready
ci/docs: drop extra overwritten rules
ci/zink+radv: document flake
docs: document the merging process and what is allowed or not
ci: drop unused shader-db clone + build from alpine image
ci: drop unused shader-db clone + build from fedora image
ci: move shader-db clone/build into its own script
ci/deqp-runner: fix indentation
ci/deqp-runner: restore exit-on-error after getting deqp-runner’s exit code
ci: fix shebang in build-deqp-runner.sh
docs: add release notes for 23.1.9
docs: add sha256sum for 23.1.9
docs: update calendar for 23.1.9
ci: drop unused ephemeral packages in alpine image
docs/ci: rewrite the “farm maintenance ^ other change” rule to mean what we actually meant
ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone
pick-ui: use assignment expressions
pick-ui: use more expressive variable names
pick-ui: add `Backport-to: XX.Y` nomination
v3d/ci: move traces job to wayland
ci: print deqp version in the job log
ci/b2c: move to the shiny new `gfx-ci/ci-tron` repo
ci/b2c: use latest mesa-trigger image
include/dri_interface.h: restore define mistakenly removed in !25587
ci_run_n_monitor: dependency jobs must always be started
util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported
util/xmlconfig: inline datadir
ci/b2c: change artifacts path to match baremetal and LAVA
VERSION: bump for rc1
.pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c
.pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac
.pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288
.pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52
VERSION: bump for 23.3.0-rc2
.pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec
.pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated
.pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb
.pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18
.pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500
.pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated
VERSION: bump for 23.3.0-rc3
.pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1
.pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated
.pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee
.pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33
.pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated
VERSION: bump for 23.3.0-rc4
.pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8
egl/dri2: increase NUM_ATTRIBS to fit all the attributes
.pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d
.pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b
VERSION: bump for 23.3.0-rc5
.pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab
intel/perf: fix regex escaping
intel/ci: fix .hasvk-manual-rules
VERSION: bump for 23.3.0
Revert “VERSION: bump for 23.3.0”
docs: add release notes for 23.3.0
Revert “docs: add release notes for 23.3.0”
Erico Nunes (10):
lima/ppir: don’t optimize loads with different block successors
lima/ppir: convert to nir_legacy
lima/gpir: switch to register intrinsics
egl/drm: fix EGL_EXT_buffer_age with gbm contexts
lima: fix plbu block stride calculation
ci: disable lima LAVA lab for maintance
Revert “ci: disable lima LAVA lab for maintance”
v3dv: allow headless device without display device
Revert “ci/lima: farm is down, disable for now”
v3dv: Rework to remove drm authentication for wsi
Erik Faye-Lund (30):
meson: report with_glvnd in summary
docs: upgrade bootstrap to 5.3.1
docs: expand mobile-menu without js
panfrost: delete stale editorconfig file
docs/panfrost: link to lima
docs/panfrost: use code-blocks with wrapping for long blocks
docs/panfrost: use math-role to denote powers of two
docs: fix linkcheck
docs: update a few links to https
docs: update anchor for link
docs: update link to git-wiki
docs: link to upstream etnaviv
docs: apply some trivial redirects
docs: use doc-role when linking to lists article
docs: keep up with intels ever-moving documentation
docs: mark some redirects as allowed
docs: only link to old docs from html
docs: use html_static_path for static files
ci/etnaviv: update ci expectation
ci/etnaviv: allow failure on failing test
zink: fix wording of warning
ci/etnaviv: move failure to flake
meson: add wayland-protocols from meson wrapdb
util/xmlconfig: add an env-var for overriding drirc search dir
meson: add src/util to the drirc search path
docs/relnotes: remove cruft from end of lines
docs/ci: escape at-symbols
docs/relnotes: escape some at-symbols
bin/gen_release_notes: escape at-symbols
panfrost: use perf_debug instead of open-coding
Faith Ekstrand (809):
nv50/ir: Convert to new-style NIR registers
nv50/ir: Support vector movs
intel/fs: Add support for new-style registers
intel/vec4: Assume get_nir_dest() provides a sane write-mask
intel/vec4: Add support for new-style registers
intel: Switch to intrinsic-based registers
intel/fs: Drop support for nir_register
intel/vec4: Drop support for nir_register
anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus
nir: Properly handle divergence for load_reg
nir/trivialize: Maintain divergence information
nir/trivialize: Trivialize cross-block loads
vc4: Convert to new-style NIR registers
nir/schedule: Support load/store_reg
broadcom/compiler: Convert to new-style NIR registers
intel/fs: Use write masks from store_reg intrinsics
intel/fs: Rework the overlapping mov/vec case
intel/fs: Assume NIR is in SSA form
nir: Add a backend_flags field to nir_tex_instr
intel/fs: Add a parameter to speed up register spilling
nir/builder: Allow tex helpers on image types
nir/builder: Add a nir_txs_deref() helper
vulkan: Add a core vk_buffer_view struct
vulkan: Add a more direct way to use a NIR shader
vulkan: Add a vk_query_pool base object
vulkan: Add common vkCmdBegin/EndQuery wrappers
vulkan/format: Add the remaining 1-plane YCbCr formats
vulkan: Add a core vk_sampler struct
nv50/nir: Lower to scratch AFTER optimization
nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS*
nouveau/nir: Implement support for compact arrays
nouveau/codegen: Handle/indirect goes before sample index
nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images
nouveau/codegen: Don’t convertSurfaceFormat for unknown formats
nv50/ir: Run nir_divergence_analysis before out-of-SSA
anv: Use vk_sampler
anv: Use vk_buffer_view
vulkan: Add init/finish helpers for vk_query_pool
anv: Use vk_query_pool
anv: Use the common versions of vkBegin/EndQuery()
nir/builder: Don’t assume we have compiler options
Revert “mesa, compiler: Move gl_texture_index to glsl_types.h”
Revert “compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h”
vulkan: Use VkBufferUsageFlags2 in vk_buffer
clang-format: Set ColumnLimit to 78
nvk: Implement EnumerateInstanceVersion
nvk: Add stub implementations of VkImage and VkImageView
nvk: Add stub implementation of VkSampler
nvk: Add a stub implementation of VkBuffer
nvk: Implement VkDescriptorSetLayout
nvk: Implement VkPipelineLayout
nvk: Add initial descriptor set lowering
nvk: Implement vkUpdateDescriptorSets
nvk: Expose nvk_descriptor_stride_align_for_type
nvk: Re-format descriptor set layouts
nvk: Re-format pipeline layouts
nvk: Re-format descriptor sets some more
nvk/buffer: Take an offset in nvk_buffer_address
nvk/buffer: Add a push_buffer_ref helper
nvk/copy: Use nvk_buffer_address in CmdCopyBuffer
nvk/image: Add image address helpers
nvk/copy: Use nvk_image_base_address()
nvk: Add an nvk_device_physical helper
nvk: Add a skeleton for pipelines
nvk: Re-arrange nvk_descriptor_set.h a bit
nvk: Reformat nvk_nir_lower_descriptors
nvk: Add a couple descriptor set address helpers
nvk: Move nvk_cmd_pool cast definitions
nvk: Rework whitespace in nvk_cmd_buffer.c
nvk: Add a root descriptor table
nvk: Fetch descriptor set addresses from the root table
nvk: Re-arrange nir_lower_explicit_io a bit
nvk: Lower load_global_constant_offset
nvk: Drop image_view_init
nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable
nvk: Allow R32_UINT
nvk: Mark nvk_push_descriptor_set_ref() inline
nvk: Add a descriptor table data structure
nvk: Copy in the nouveau TIC format table
nvk/image_view: Reformat and fix Create/DestroyImageView
nvk: Add an image descriptor table to the device
nvk: Fill out TIC table entries for image views
nvk: Set b->cursor when lowering image intrinsics
nvk: Unify descriptor loading in lower_descriptors
nvk: Re-format nvk_image_view.h a bit
nvk: Re-format nvk_buffer.c a bit
nvk: Add a stub implementation of buffer views
nvk: Make texture descriptors a bit more acceptable to codegen
nvk: GART os host-cache-coherent
nvk: Reserve a null image descriptor
nvk: Rework descriptor writes
nouveau: Add stubs for an image layout library called NIL
nil: Create images
nil: Add the TIC format table from nouveau
nil: Add a nil_view and code to fill out TIC entries
nvk: Add an nvk_get_format helper
nvk: Use helpers for push_ref
nvk: Align arguments consistently in copy/blit code
nvk: Move Fill/UpdateBuffer to nvk_cmd_copy
Revert “nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable”
nvk: Manually offset for array layers in copy/blit
nvk: Convert to using NIL for image layout
nvk: Re-indent image entrypoints
nvk: Implement VkGetImageSubresourceLyout
nvk: Reset and properly clean up command buffer upload areas
nvk: Rework format features queries
nvk: Add a more competent GetPhysicalDeviceImageFormatProperties
nvk: Support compressed images in copy commands
nvk: Drop vk_sync BO refs after push_submit
nil: Drop miptail support for now
nil: Don’t minify image dimensions when setting up TIC
nil: Refactor TIC image extent setup
nil: Fix image array layer alignments
nvk: Teture pool sizes are maximums not sizes
nvk: Re-format nvk_sampler.c
nvk: Implement samplers
nil: Add a helper for filling out buffer TIC entries
nvk: Move is_storage_image_format to nvk_format.c
nvk: Implement buffer views
nvk: Advertise KHR_dedicated_allocation
nvk: Use the correct root descriptor table size for CmdDispatch
nvk: Add support for dynamic buffers
nvk: Better advertise image format features
nvk: Advertise descriptor array indexing
nvk: Advertise non-zero descriptor set limits
nvk: Use a descriptor type instead of a hand-rolled thing
nvk: Handle cube storage images properly
nvk: Load the requested descriptor size
nvk: Implement push constants
nvk: Properly indent a comment
nvk: Fix descriptor offset alignment
nvk: Use a switch for descriptor types in load_descriptor
nvk: Support inline uniform blocks
nvk: Delete the storage TIC in nvk_image_view_destroy
nvk: Assert that we don’t double-free descriptors
nvk: Initial vkCmdClearImage support
nvk: Unconditionally zero image format properties
nvk: No-op sparse image format properties
nvk: Advertise minUniformBufferOffsetAlignment
nvk: Rework OOM handling for descriptor pools
nvk: Bind immutable samplers on descriptor set creation
nvk: Padd shader BOs by 4K to avoid I-cache overflow
nvk: Include nvk_private.h in everything
nvk: Make image/buffer address helpers const
nouveau/push: Add a P_INLINE_FLOAT helper
nvk: Init WSI after setting up supported_sync_types
nouveau/parser: Fix an integer overflow and a typo
nouveau/parser: Properly dump most arrays used by 3D
nouveau/parser: Better dump float data
nouveau/parser: Handle arrays properly in P_IMMD()
nouveau/push: Make P_IMMD more versatile
nouveau: Null terminate the debug flag list
nouveau: Generate 3D headers
nvk: Add graphics state to command buffers
nvk: Split pipeline binding into helpers
nvk: Switch to vk_pipeline_shader_stage_to_nir
nvk: Don’t free the NIR in nvk_compile_nir
nvk: Add an nvk_shader_address helper
nvk: Free pipeline shader BOs
nvk: Expose pipeline alloc/free functions
nvk: Make shader_upload take an nvk_device
nvk/shader: Assign I/O locations and gather info
nvk/shader: Populate headers for vertex and fragment shaders
nvk: Add a nvk_cmd_buffer_device() helper
nvk: Import 3D context init code from nouveau
nil/format: Add helpers for render formats
nvk: Add boilerplate for Begin/EndRendering
nvk: Misc. additional state setup
nvk: Emit dynamic graphics state
nvk: Implement push constants and descriptors for graphics
nouveau: Add CPU push buffers
nvk: Graphics pipelines
nvk: Implement vkCmdDraw()
nvk: Color attachments clears via image clears
vulkan/meta: Add the start of a meta framework
vulkan/meta: Add an object tracking list
vulkan/meta: Add a concept of rect pipelines
vulkan/meta: Implement attachment clears
vulkan/meta: Implement start-of-rendering clears
vulkan/meta: Add implementations of Clear*Image
nvk: Add an attachment format even for secondaries
nvk: Add an addr field to nvk_buffer
nvk: Expose a bind_vertex_buffer helper
nvk: Use vk_meta for CmdClearAttachments
nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end()
nvk: Enable all the dynamic state features
nouveau: Fix pushbuf ref reset for user command buffers
nvk: add linear image creation support.
nvk: Use max alignment for descriptor pool sizes
nil: Switch to using the new headers for TIC entries
nvk: Use meta for CmdClear*Image
nvk: Zero client memory objects
nvk: Bind texture and sampler header pools for 3D
nvk: Use the new headers for samplers
nvk: Implement nir_intrinsic_load_frag_coord
vulkan/meta_clear: Populate VkRenderingInfo::renderArea
nvk: Don’t assert when there are no attachments
nvk: Track and reference all device memory objects
vulkan: Allow scissors or viewports to be set without counts
nvk/copy: Mape bpp part of nouveau_copy_buffer
nvk: Implement copies for D24_UNORM_S8_UINT images
nvk: Drop sample locations structs
nvk/meta: Save and restore VI state
nvk: Re-initialize dynamic_graphics_state.vi when recycling
nvk: Move the vertex format table into nvk_format.h
nvk: Advertise vertex buffer format featues
nvk: Clean up try_create_physical_device error handling
nouveau/parser: Dump more fields as float
nvk: Depth bounds need fui()
nouveau: Add class information to nouveau_ws_device
nil: Properly depend on nouveau winsys and nvidia-headers
nil: Use nvidia headers for texture format enums
nil: Use the nvidia headers for render target format enums
nil: Use nvidia headers for ZS format enums
nil: Rename rt to czt in the format info struct
nil: Rename rendering to color_target
nil: Re-introduce the format capabilities
nil: Add more format support helpers
nvk: Advertise more format features
nvk: Clear dynamic state dirty after flushing it all
vulkan/meta: Make stencil reference dynamic for clears
nvk: Depth buffers don’t allow Z-tiling
nvk: Disable sparse Z on Maxwell+
nil: Compute PTE kinds and tile modes for images
nouveau: Add a function to allocate a tiled buffer
nvk: Add internal helpers for device memory allocation
nvk: Do internal dedicated allocations for ZS images
nvk: Fix depth/stencil render pass clears
nvk: Fix viewport Z scale
nvk: Enable two-sided stencil
nvk: Flip the front-face setting
nvk: Advertise depth/stencil support
nvk: Don’t destroy NULL descriptor pool BOs
nvk: Call nir_lower_input_attachments
nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly
nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails
nil: Add a PTE kind for Z32_FLOAT
nvk: Add nvk_queue_init/finish() helpers
nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT
nvk: Re-flow a couple function prototypes
nvk: Assert samples == 1
nvk: Allocate descriptors for input attachments
nvk: Wire up early z and post depth coverage
nvk: Save/restore push constants around meta ops
nouveau/parser: Add array and float tags for clear values
nvk: Use hardware clears for attachment clears
nvk: Add image_view_init/finish functions
nvk: Implement vkCmdClear*Image directly
nvk: Use a UINT format to clear non-renderable images
nvk: Don’t advertise tiling on non-power-of-two formats
nvk: Fix max anisotropy
nvk: Assert on CmdExecuteCommands
nvk: VkSamplerCreateInfo::mipLodBias is signed
nvk: Fix border color alpha
nil/format: Depth/stencil formats appear as red
nil: Fix max mip level
nil: Fix nonnormalized coordinates
nvk: Set up clip and cull distances
nvk: Fix dynamic buffer descriptor copies
nvk: Inline nouveau_copy_linear
nvk/copy: Rename push to p
nvk/blit: Rename push to p
nvk/dispatch: Rename push to p
nvk: Drop most buffer tracking
nvk: Rework TLS/SLM and image/sampler table handling
nvk: Invalidate texture header and sampler caches each submit
nvk/sampler: Free descriptor table entries
nvk: Rework nvk_descriptor_table_add/remove
nvk: Implement descriptor table growing
nvk: Zero unused descriptors
nvk: Add some asserts for nv50 compiler image restrictions
nvk: Update to the new command buffer infrastructure
nvk: Split nvk_queue into its own file
nvk: Start every command buffer with a nop
nvk: Initialize fixed draw/default state once
nouveau/parser: Convert to mako
nouveau/parser: Use more idiomatic python
nouveau/parser: Put the dump helpers in C files
nvk: Use f for extension features
nvk: Drop a TODO
nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears
nvk: Increase the graphics pipeline push space
nil: Don’t claim texture support for 2-bit SNORM
nouveau/push: Fix a void pointer arithmetic bug
nouveau/parser: Add more arrays
nouveau/mme: Add basic structures for the Turing+ MME
nouveau/mme: Add isaspec XML for the Turing+ MME
nouveau/mme: Add an assembler and disassembler for the Turring+ MME
nouveau/mme: Add a builder for the Turing+ MME
nouveau/mme: Add a tiny simulator for the Turing+ MME
nouveau/mme: Add an isaspec-based dumper
nouveau/mme: Make the winsys headers C++ safe
nouveau/mme: Add unit tests for the Turing+ MME simulator
nvk: Add MME infrastructure
nvk: Use MME for clears
nouveau/mme: Add helper macros for setting fields
nvk: Use MME for vkCmdDraw[Indexed]()
nvk: Implement vkCmdDraw[Indexed]Indirect()
nvk: Use p for the nouveau_ws_push_buffer in zero_vram
nouveau: Add an nv_push struct
nouveau: Rename the fields of vk_push
nouveau: Move nv_push and helpers to their own header
nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA
nouveau: Move push validate to nv_push.c
nouveau: Move push dumping to nv_push.c
nvk: Use nv_push directly for graphics pipelines
nouveau: Add a nouveau_ws_bo_new_mapped helper
nvk: Use bo_new_mapped for the zero page
nvk: Always allocate empty_push
nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c
nvk: Submit pushbufs directly
nvk: Use a regular BO for the empty push
nvk: Use a regular BO for the queue state push
nvk: Add an nvk_queue_submit_simple helper
nvk: Initialize the queue later in device setup
nvk: Use submit_simple for draw state init
nvk: Use queue_submit_simple for zero_vram
nvk: Break nvk_cmd_pool into its own file
nvk: Use cmd instead of cmd_buffer
nvk: Add BO recycling to the command pool
nvk: Return VkResult from nvk_cmd_buffer_upload_alloc
nvk: memcpy root descriptors for compute instead of doing a DMA
nvk: Fully populate QMDs before uploading
nvk: Constant buffer alignment is actually 64B
nvk: Rework side-band data upload
nvk: Add an nvk_cmd_buffer_push helper
nvk: Add an nvk_cmd_buffer_ref_bo helper
nvk: Allocate upload buffers from the command pool
nvk: Use nvk_cmd_bo for push bufs
nvk: Implement vkCmdExecuteCommands()
nvk: Remove remaining references to nouveau_push.h
nouveau: Use DRM interfaces directly in MME tests
nouveau: Drop nouveau_ws_push
nvk: Re-indent vk_instance.c
nvk: Use vk_object_zalloc/free for descriptor pools/sets
nvk: Fix up whitespace in nvk_descriptor_set.c
nvk: Implement VK_KHR_push_descriptor
nvk: Reference descriptor set layouts in the sets themselves
nvk: Embed a nv_device_info in nvk_physical_device
nvk: Add an nvk_queue_submit wrapper
nvk: Also store the push BO map in nvk_queue_state
nvk: Bring back push sync and dumping
nvk: drop nvk_nir.h
nvk: Add lowering for load_global_constant_bounded
nvk: Properly implement robustBufferAccess
vulkan/meta: Add key types
vulkan/meta: Add a helper for image view types
vulkan/meta: Add a create_sampler helper
vulkan/meta: Fixes for clear
vulkan/meta: Implement vkCmdBlitImage()
nvk: Support load_layer_id
nvk/meta: Save/restore descriptor set 0
nvk: Use meta for doing blits with the 3D hardware
nvk: WFI in pipeline barriers
util/vma: Allow initializing zero-size heaps
nvk: Rework nvk_queue_submit_simple()
nvk: Add a heap data structure
nvk: Return a VkResult from nvk_shader_upload()
nvk: Add a shader heap to nvk_device
nvk: Allocate shaders from a heap
nvk: Rework whitespace in nvk_device_memory.c
nvk: Style fixes in nvk_physical_device.c
nvk: Reset semaphore syncs on wait
nvk/wsi: Style fixes
nvk/wsi: Use the common present implementation
nouveau/parser: Parse all fields in each method
nvk: Add a query pool object
nvk: Implement timestamp queries
nvk: Implement pipeline statistics and occlusion queries
nouveau/mme: Allow ZERO as the destinatio nof mme_load_to
nouveau/mme: Assert on OOB registers
nouveau/mme: Add support for freeing registers
nouveau/mme: Add a couple helpers for working 64-bit addresses
nouveau/mme: Add a helper for MME_DMA_READ_FIFOED
nvk: Use mme_tu104_read_fifoed()
nvk: Implement vkCmdCopyQueryPoolResults()
nvk: Handle large command buffer uploads better
nvk: Use a normal DMA for CmdUpdateBuffer
nouveau/parser: Handle 6F methods
nvk: Use mme_load_addr64()
nvk: Use poll for BO waits
nvk: Events
nvk: Don’t crash if we fail to allocate a push BO
nvk: Stop leaking command pool BOs
nvk: Enable VK_KHR_create_renderpass2
nvk: Advertise VK_KHR_imageless_framebuffer
nvk: Flush the current pushbuf before allocating a new one
nvk: Advertise VK_KHR_separate_depth_stencil_layout
nvk: Tell WSI we don’t support legacy scanout
nouveau: Add PCI information to nv_device_info
nvk: Implement VK_EXT_pci_bus_info
nvk: Bind 3D images as 3D for clears
nvk: Support copies between 3D and 2D images
nil: Add a helper for getting 2D views of 3D images
nvk: Support 2D views of 3D images
nvk: Advertise VK_KHR_maintenance1
nvk: Use 2D array views for 3D storage images
nil: Fix include guards in nil_image.h
nvk: Advertise custom border color features
vulkan: Add a helper for swizzling color values
nvk: Implement VK_EXT_border_color_swizzle
nvk: Advertise VK_EXT_extended_dynamic_state3
nvk: Move more states to dynamic
nvk: Advertise VK_KHR_storage_buffer_storage_class
nvk: Add a helper for pushing descriptors
nouveau/headers: Add generated headers to dependencies
nvk: Implement VK_EXT/KHR_buffer_device_address
nvk: Break the guts of CmdDispatch into a helper
nvk: Implement DispatchIndirect
nouveau/mme: Add a mul64 helper
nvk: Implement CS invocations statistics queries
nil: Use ONE for the anixotropic coarse spread function
nil: Properly support MSAA
nil: Add an offset4d struct and some helpers
nouveau/parser: Sort METHOD_ARRAY_SIZES
nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS
nvk: Stop asserting on MSAA
nvk: Handle zero color attachments better
nvk: Handle multisampled render targets properly
nvk: Support copies of MSAA images
nvk: Use the right view format for stencil texturing
nvk: Pass through a shader key for fragment shaders and MSAA
nvk: Set correct multisample regs for graphics pipelines
nvk: Stop creating a new upload BO every time
nvk: Fill out sample locations on Maxwell B+
vulkan/meta: Bind whole LODs of 3D blit destinations
vulkan/meta: Add a helper for building texture ops
vulkan/meta: Break the guts of blit into a helper
vulkan/meta: Support writing stencil as iterative discard
vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c
vulkan/meta: Add support for MSAA resolves
nvk/meta: Fix restore for descriptor set 0
nvk: Use meta for MSAA resolves
nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy)
nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts
nvk: Advertise MSAA via image format properties
nvk: Advertise VK_KHR_depth_stencil_resolve
nvk: Assert that descriptor buffer access stays in-bounds
nvk: Add a bo size to nvk_descriptor_set
nvk/format: Style fix for VkFormatProperties3KHR
nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers
nvk: Add a devenv ICD json file
nvk: Advertise EXT_vertex_attribute_divisor
nvk: Lower image_size to txs
nvk: Fix a comment
nvk: Add an nvk_buffer_addr_range helper
nvk: Use nvk_buffer_addr_range for buffer descriptors
nvk: Re-order Vulkan 1.0 feature bits
nvk: Enable inheritedQueries
nvk: Enable VK_EXT_provoking_vertex
nvk: Advertise samplerMirrorClampToEdge via 1.2 features
nvk: Advertise VK_KHR_bind_memory2
nvk: Enable KHR_dynamic_rendering
nvk: Advertise KHR_uniform_buffer_standard_layout
nvk: Advertise EXT_index_type_uint8
nvk: Advertise VK_EXT_separate_stencil_usage
nvk: Capitalize NVK in user exposed strings
nvk: Rename grid_size to group_count
nvk: Lower load_num_workgroups ourselves
nvk: Drop block_size from the root descriptor table
nvk: Add a helper for loading resource_index-based descriptors
nvk: Set maxMemoryAllocationCount
nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new()
nouveau/winsys: Add an info to nouveau_ws_device
nouveau/winsys: Move device type into nv_device_info
nouveau/nil: Take an nv_device_info for image functions
nouveau/nil: Use nv_device_info for format queries
nouveau/mme: Invoke SET_OBJECT in the tests
nouveau/mme: Make alu_op_to_str static
nouveau/mme: Move mme_value into its own header
nouveau/mme: Add a mme_reg_alloc struct
nouveau/mme: Add an intermediate MME_ALU_OP enum
nouveau/mme: Add an intermediate MME_CMP_OP enum
nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources
nouveau/mme: Make helpers less Turing specific
nouveau/mme: Break the Turing builder guts into a separate header
nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c
nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c
nouveau/mme: Drop the implicit_imm parameter from mme_alu_to()
nouveau/mme: Move the cf_stack struct to mme_builder.h
nouveau/mme: Prepare the builder for multiple GPU generations
nouveau/mme: Take an nv_device_info in mme_builder_init
Support immediates in MERGE
Add add immediate optimizations
nvk: Add support for contiguous heaps to nvk_heap
nvk: Use a contiguous shader heap pre-Volta
nvk: Disable indirect draw/dispatch and query copy MMEs for now
nvk: Free a couple regs in nvk_mme_build_draw_*()
nvk: Properly align root descriptor tables for pre-Pascal
nvk: Compile all NIR before running codegen
vulkan/meta: Insert a geometry shader when needed
nvk: Use a GS for layerered rendering pre-MaxwellB
nvk: Handle zero-size index and vertex buffers pre-Turing
nvk: Cosmetic clean-ups to Create/DestroyDevice
nil: Only choose a PTE kind for tiled images
nouveau/mme: Fix is_int18 for negative numbers
nouveau/mme: Don’t swap x and y in mme_fermi_merge_to()
nouveau/mme: Take a const nv_device_info in mme_builder_init
nouveau/mme: Unify some of the test framework
nouveau/mme: Add some generic builder tests
nouveau/mme: Add builder tests for SUB
nouveau/mme: Use a uint32_t for size in mme_fermi_bfe()
nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL
nvk/drm: Take a byte offset/range in push_add_push
nvk: Rework nvk_cmd_push a bit
nvk: Add a helper for pushing indirect data
nvk: Make some MME builder names more consistent
nouveau/mme: Don’t allow WaW dependencies in the same Turing instruction
nvk: Reduce register pressure in nvk_mme_build_draw*()
nouveau/push: Add an NV_PUSH_MAX_COUNT #define
nvk: Implement Draw*Indirect on pre-Turing
vulkan/meta: Use the new NIR texture helpers
nvk: Add a build test for MMEs
nvk: Don’t over-size push descriptor sets
nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn’t NVIDIA
nvk: Bump init context batch size
nouveau/mme: Fix nested while instructions on Turing+
nouveau/mme: Add a helper to dump instructions
nvk: Rework extension enables
nvk: Rework features enables
nvk: Advertise shaderImageGatherExtended
nouveau/mme: Add a bfe helper
nouveau/mme: Ensure that zero-initizlied mme_value is ZERO
nvk: De-duplicate MME code for setting draw params
nvk: Clamp viewport clip to max range
nvk: Use the same lock for the submit and the memory objects list
nvk: Advertise ICD/loader interface version 4
nvk: Add instace WSI entrypoints
nouveau/mme: Use ADD for ine with an immediate
nouveau/mme: Fix while loops pre-Turing
nvk: Add begin to mme_scratch
nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW
nouveau/mme: Add a helper for re-allocating registers
nvk: Rework spill helpers and DRAW_COUNT spilling
nvk: Spill DRAW_IDX pre-Turing
nvk: Break the inner MME draw loop into a helper
nvk: Increase the push runout to 512 dwords
nil: Add a nil_image_for_level helper
nil: Add an image_level_as_uncompressed helper
nvk: Implement uncompressed views of compressed images
nvk: Set pointClippingBehavior
nvk: Expose VK_KHR_maintenance2
nvk: Add a separate #define for SSBO alignment
nvk: Set spirv_to_nir_options::min_*_alignment
nvk: Use vk_device_memory
nvk: Implement VK_KHR_map_memory2
nvk: Sort SPIR-V caps
nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+
nvk: Only use view_id for layer in multiview
nvk/heap: Set the right pitch for heap resize copies
nvk: Advertise shaderStorageImageReadWithoutFormat
nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect
nvk: Advertise KHR_spirv_1_4
nvk: s/device/dev in nvk_image.c
nvk: Add helpers for binding image planes
nvk: Take an nvk_image_plane in nouveau_copy_rect_image
nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport
nvk: Use NVIDIA_VENDOR_ID in pdev try_create()
nvk: Use abbreviated names in nvk_device_memory.c
nvk: Add device and driver UUIDs
nvk: Add external memory queries
nvk: Dedicated allocations override internal
nvk: Require dedicated allocations for external images
nouveau/winsys: Add dma-buf import support
nvk: Support dma-buf import
nvk: Support dma-buf export
nvk: Enable external memory extensions
nvk: Reformat nvk_buffer.c
nvk: Add a buffer alignment helper
nvk: Add an addr field to nvk_image_plane
nvk: Use canonical variable names in nvk_physical_device.c
nvk: Use canonical variable names in nvk_shader.c
nvk: Use canonical variable names in nvk_bo_sync.c
nvk: Use canonical variable names in nvk_sampler.c
nvk: Drop nvk_physical_device::instance
nvk: Only advertise EXT_pci_bus_info on discrete GPUs
nouveau: Put PCI info in a pci substruct in nv_device_info
nouveau: Stop using hex for SM numbers
nvk: Set deviceType based on nv_device_info::type
nouveau: Move more stuff into nv_device_info
nouveau: Move gart_size to nv_device_info
nvk: Use nv_device_info for class checks
nvk: Rename nvk_device::ctx to ws_ctx
nvk: Add a ws_dev to nvk_device and use it
nvk: Move the winsys device to nvk_device
nvk: Don’t enumerate pre-Kepler GPUs
nvk: Implement VK_EXT_physical_device_drm
nvk: Require an environment variable for poorly tested hardware
nvk: Use the new core vk_sampler struct
Revert “vulkan: Allow scissors or viewports to be set without counts”
vulkan/meta: Add a get_pipeline_layout helper
vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve
nvk: Bind 3D depth/stencil images as 2D arrays
nvk: Flush more state on VI_BINDINGS_VALID dirty
nvk: Don’t skip zero-size bindings in GetDescriptorSetLayoutSupport
docs: Add a docs page for NVK
docs: Add NVK to features.txt
docs/relnotes: Stick something about NVK in new_features.txt
nouveau: Drop GART size from nv_device_info
nil: Add a nil_image_level_extent_px() helper
nvk: Use the new NIL helper for image level extents for copies
nvk: Improve image format properties and limits
nvk: Rework multi-plane format features a bit
nvk: Use nvk_root_descriptor_offset for drawInfoBase
nvk: Add a root_desc_addr to the root descriptor table
nvk: Add support for variable pointers
nvk: Enable the SPIR-V DeviceGroup capability
nvk: Separate the MME query copy code out a bit
nvk: Implement CopyQueryPoolResults with a compute shader
nvk: Misc. style nits
nvk: Rework memory requirements to handle aspects correctly
nvk: Implement the maintenance5 image layout queries
nvk: Use VkBufferUsageFlags2
nvk: Implement CmdBindIndexBuffer2KHR
nvk: Implement GetRenderingAreaGranularityKHR
nvk: Decorate CmdBegin/EndRendering entrypoints
nouveau: Move shader topology info to nv_device_info
drm-uapi: Import nouveau_drm.h
nouveau/winsys: Use the imported nouveau_drm.h headers
nvk: Use the imported nouveau_drm.h headers
nouveau/shim: Use the imported nouveau_drm.h headers
nouveau/mme: Support the new UAPI
nvk: Use an empty EXEC for the empty submit case
nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND
nvk: Print an error message if VM_BIND support is missing
nvk: Enable the new UAPI
nvk: Use more consistent device variable names
nvk: Call nir_lower_int64
nir/gl: Move glsl_type::sampler_target() into a helper in its one caller
nvk: Remove plane sources from tex instructions
nvk: Use common physical device properties
nv50/ir: Rework conversions for texture array indices
clang-format: Add nir_foreach_reg_*
clang-format: nir_foreach_src is not a foreach macro
clang-format: Set the default ColumnLimit to 0
nir: Re-align a couple enums and add clang-format comments
nir: Don’t clang-format const_value helpers
nir: Don’t clang-format a couple typedefs
nir: Don’t clang-format debug print setup
nir: More manual formatting
nir: Pretty format type mapping helpers
nir: Wrap pass macros in braces
nir: Add a do to the do/while in nir_const_value_t_array()
nir: Add a .clang-format file
nir: clang-format src/compiler/nir/*.[ch]
nvk: Don’t use nir_ssa_for_src()
nir: Drop most instances of nir_ssa_dest_init()
nir: Drop more instances of nir_ssa_dest_init()
nir/clone: Clone nir_def nor nir_dest
nir/serialize: [De]serialize nir_def nor nir_dest
nir: Drop nir_ssa_dest_init()
nir: Drop nir_ssa_dest_init_for_type()
nir: nir_foreach_ssa_def() -> nir_foreach_def()
st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest
dxil: Use nir_foreach_def() instead of nir_foreach_dest()
nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest()
nir: Drop nir_foreach_dest()
intel/vec4: Stop passing around nir_dest
intel/fs: Stop passing around nir_dest and nir_alu_dest
broadcom: Stop using nir_dest directly
vc4: Stop passing around nir_dest
nir,ntt,a2xx,lima: Stop using nir_dest directly
lima: Stop using nir_dest directly
etnaviv: Stop passing around nir_dest
r600/sfn: Stop passing around nir_dest and nir_alu_dest
nv50/ir: Stop passing around nir_dest and nir_alu_dest
nir/gather_types: Stop passing around nir_dest
nir/dce: Stop passing around nir_dest
nir/propagate_invariant: Stop passing around nir_dest
nir/validate: Replace all dest validation with validate_def
nir/print: Replace all dest printing with print_def
nir: Get rid of nir_dest_bit_size()
nir: Get rid of nir_dest_num_components()
nir: Get rid of nir_dest_is_divergent()
nir: Drop nir_alu_dest
nir: Drop nir_dest
util/format: 8-bit interleaved YUV formats are UNORM
gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM
blorp: Use R8G8_UINT for YCRCB_* formats with CCS
anv: Disable CCS_E for ISL_FORMAT_YCRCB_*
vulkan/format: Use correct swizzle for 1-plane YCbCr formats
gallivm: Drop the Vulkan YUV format hacks
nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef
nir s/nir_get_ssa_scalar/nir_get_scalar/
nir: s/live_ssa_def/live_def/
nir: s/nir_instr_ssa_def/nir_instr_def/
nir: Rework nir_scalar_chase_movs a bit
nir: Fix nir_op_mov handling in nir_collect_src_uniforms
nir: Handle nir_op_mov properly in opt_shrink_vectors
nir: Don’t handle nir_op_mov in get_undef_mask in opt_undef
nir: Clean up nir_op_is_vec() and its callers
nir/large_constants: Use nir_component_mask_t
nir/large_constants: Add read/write_const_values helpers
nir/opt_large_constants: Add Small constant handling
spirv: Re-emit constants at their uses
nir: Take a nir_def * in nir_tex_instr_add_src()
nir: Take a nir_def * in nir_phi_instr_add_src()
nir/opt_undef: Don’t rewrite a bcsel to mov
nir: Add a nir_instr_clear_src() helper and use it
nir: Add and use a nir_instr_init_src() helper
nir: Drop nir_if_rewrite_condition()
nir: Drop most uses of nir_instr_rewrite_src_ssa()
nir: Drop nir_instr_rewrite_src_ssa()
nir: Drop most uses if nir_instr_rewrite_src()
nir: Drop nir_instr_rewrite_src()
nir: Drop nir_push_if_src()
nir: Fix metadata in nir_lower_is_helper_invocation
nir: Use nir_shader_intrinsic_pass() a few places
drm-uapi: Sync nouveau_drm.h
nvk: Plumb no_prefetch through to the DRM back-end
nouveau/mme: Fix a compile warning
intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s
intel/isl: Add ICL variants of Yf and Ys tiling
intel/isl: Implement correct tile size calculations for Ys/Yf
intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces
intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64
intel/isl: Indent uncompressed surface code
intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf
intel/isl: Support Yf/Ys tiling in surf_fill_state
intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz
intel/isl: Add initial data-structure support for miptails
intel/isl: Add support for computing offsets with miptails
intel/isl: Support miptails in isl_surf_get_uncompressed_surf
intel/isl: Start using miptails
intel/isl: Disallow CCS on 3D surfaces with miptails
intel/isl: Allow Ys tiling
anv: Align memory VA to support for Ys, Tile64 tiled images
nvk: Clean up includes
nvk: Add include guards to nvk_bo_sync.h
nvk: SPDX everything
nouveau/nil: SPDX everything
nouveau/mme: SPDX everything
nvk: Don’t add a dummy attachment when gl_SampleMask is written
nvk: Set the discard bit for Z/S self-deps
nvk: Invalidate the texture cache in PipelineBarrier
nvk: Lower interp_at_sample to interp_at_offset
nvk: Disable statistics around meta ops
nvk: Clean up viewport math
nvk: Fix depth clipping parameters
nvk: Enable dynamic clip/clamp enable
nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping
r600: Use more auto-generated nir_builder helpers
r600: Use nir_builder helpers for load/store_shared_r600
nvk: Re-order physical device limits
nvk: Advertise maxMemoryAllocationCount = 4096
nvk: Advertise discreteQueuePriorities = 2
nvk: Rip out old UAPI support
nvk/drm: Drop the push_add_push_bo() helper
nvk/drm: Drop the push_add_bo() helper
nvk: Drop command buffer BO tracking
nvk: Drop memory object tracking
nvk: Drop the device-level mutex
nvk: Get rid of the tiled memory allocation helpers
nvk/drm: Restructure nvk_queue_submit_drm_nouveau()
nvk/drm: Split exec as needed for large command buffers
nvk: Don’t store the descriptor pool BO in the set
nvk: Store a 20-bit driver_build_sha in nvk_instance
nvk: Hook up the disk cache
nvk: Re-structure early shader compilation a bit
nvk: Add a default pipeline cache
nvk: Cache NIR shaders
nvk: Init pipelineCacheUUID
drm-uapi: Sync nouveau_drm.h
nvk: Take GETPARAM_EXEC_PUSH_MAX into account
nvk: Handle zero-sized sparse buffers
nvk: Use align() and align64() instead of ALIGN_POT
nouveau: Generate headers for Maxwell B compute
nvk: Add a nvk_cmd_buffer_compute_cls() helper
nvk: Invalidate sampler/texture header caches in BeginCommandBuffer()
nvk: Invalidate SKED caches at the top of command buffers
nvk: Advertise more inline uniform block limits
nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch
nvk: Set max descriptors to 2^20 for most descriptor types
nvk: Reset descriptor pool allocator when all sets are destroyed
nil/format: Use A for alpha blend
nil/format: Advertise R10G10B10A2_UINT texture buffer support
nvk: Disable depth or stencil tests when unbound
nvk: Always emit at least one color attachment
nvk: Improve address space and buffer size limits
nvk: Always set pixel_min/max_Z to CLAMP
nvk: Use nouveau_ws_bo_unmap() instead of munmap()
nvk: Free the disk cache
nvk: Add an nvk_shader_finish() helper
nvk: Handle unbinding images and buffers
nvk: Clean up the disk cache on physical device create fail path
vulkan/wsi: Allow for larger linear images
nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper
nvk: Re-sort device features
nvk: Implement VK_EXT_depth_bias_control
nvk: Advertise VK_KHR_workgroup_memory_explicit_layout
nvk: Implement VK_EXT_image_sliced_view_of_3d
nvk: Advertise VK_EXT_primitive_topology_list_restart
nvk: Advertise VK_EXT_attachment_feedback_loop_layout
features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK
nvk: Re-arrange Vulkan 1.2 features to match the header
nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex
nvk: Enable descriptorIndexing
nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
nir: Add a nir_ssa_def_all_uses_are_fsat() helper
nir: Add convert_alu_types to divergence analysis
nir/lower_tex: Add a lower_txd_clamp option
nir: Add a load_sysval_nv intrinsic
nir: Add NV-specific texture opcodes
nir: Add an load_barycentric_at_offset_nv intrinsic
nir: Add a range to most I/O intrinsics
nir: Add NVIDIA-specific I/O intrinsics
nir/lower_bit_size: Fix subgroup lowering for floats
nir: add deref follower builder for casts.
nir: Handle wildcards with casts in copy_prop_vars
Felix DeGrood (12):
anv: save a shader source uint32_t hash in gfx/compute pipelines
anv: Add Source hash field to VkPipelineExecutableStatisticKHR
iris: save shader source sha1 in ish
mesa: propagate shader source sha1 from gl_shader to nir_shader
intel: use shader source hash in INTEL_MEASURE
intel/compiler: use shader source hash in shader dump code
anv: add fake sparse support
anv: enable fake sparse for Elden Ring
anv: debug messaging for sparse texture usage
anv: fix frame count reporting in INTEL_MEASURE
anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4
anv: remove CS_FLUSH from query regression
Feng Jiang (9):
virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation
meson: Export winsys function symbols for target va
frontends/va: Add slice_count to AV1 slice_parameter
virgl/video: Add definition of virgl_av1_picture_desc
virgl/video: Add support for AV1 decoding
virgl/video: Enable AV1 decoding
meson: Rename dri-vdpau.dyn to dri.dyn
CODEOWNERS: Add @flynnjiang for VirGL video
meson: Move video to separate section in meson configuration summary
Filip Gawin (1):
crocus: Avoid fast-clear with incompatible view
Flora Cui (1):
radeonsi: limit CP DMA to skip holes in sparse bo
Francisco Jerez (29):
intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2.
intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms.
intel/eu/xe2+: Fix encoding of various message descriptors for change in register size.
intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges().
intel/fs/xe2+: Fix payload node live range calculations for change in register size.
intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size.
intel/fs/xe2+: Fixes for increased accumulator register width.
intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size.
intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size
intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions.
intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units.
intel/fs/xe2+: Update encoding of FB write message payload.
intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit.
intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size.
intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size
intel/fs/xe2+: Update GS payload setup for Xe2 reg size.
intel/fs/xe2+: Update TCS payload setup for Xe2 reg size.
intel/fs/xe2+: Update TES payload setup for Xe2 reg size.
intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs.
intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs.
intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU.
intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs.
intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg().
intel/fs/xe2+: Fix URB writes with 0 data components.
intel/fs: Specify number of data components of logical URB writes via control immediate.
intel/fs: Delete manual ‘inst->mlen’ calculations from all uses of logical URB writes.
intel/fs: Delete manual ‘inst->mlen’ calculations from all uses of logical URB reads.
Frank Binns (10):
pvr: clang-format fixes
pvr: skip setting up SPM consts buffer when no const shared regs are used
pvr: cleanup SPM EOT dynarray after upload
pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported
pvr: remove pvr_pbe_get_src_pos()
pvr: fix attachments segfault in pvr_is_stencil_store_load_needed()
pvr: fix allocation size of clear colour consts shared regs buffer
pvr: change a few places to use PVR_DW_TO_BYTES()
pvr: fix setup of load op unresolved msaa mask
pvr: emit PPP state when vis_test dirty bit is set
Friedrich Vock (19):
radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs
radv: Handle VK_SUBOPTIMAL_KHR in trace layers
ac/msgpack: make fixstrs a const char
ac/sqtt,radv: Split internal and API hash in PSO correlations
ac/rgp: Write lds_size metadata
ac/rgp: Add metadata for separate-compiled RT stages
radv/sqtt: Move record filling to helper function
radv/sqtt: Unregister records based on hash
radv/sqtt: Write LDS size metadata in code objects
radv/sqtt: Handle separately-compiled RT pipelines
ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation
nir/load_store_vectorize: Handle intrinsics with constant base
radv/rt: Pre-initialize instance address
radv: Initialize shader freelist on allocation
radv: Fix check in insert_block
radv/rt: Reject hits within 10ULP of previous hits in emulated RT
radv/rra: Recognize LPDDR memory
radv/rmv: Recognize LPDDR memory
vulkan: Don’t use set_foreach_remove when destroying pipeline caches
Ganesh Belgur Ramachandra (5):
radeonsi: stores bottom_edge_rule option in the rasterizer state
radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule
radeonsi: “clear_render_target” shader in nir
radeonsi: “clear_render_target_1d_array” shader in nir
radeonsi: “clear_12bytes_buffer” shader in nir
Georg Lehmann (39):
aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
aco: improve get_gfx11_true16_mask description
aco: combine a & ~b to bfi(b, 0, a)
aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz
aco: fix non constant 16bit bitnz/bitz
ac/nir: handle more special cases in ac_nir_unpack_arg
aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000
nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan
aco: always use rtne for fquantize2f16
nir/opt_if: also rewrite uniform uses for read_invocation
nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select}
nir: unify lower_bitfield_extract with has_bfe
nir: unify lower_find_msb with has_{find_msb_rev,uclz}
aco: fix u2f16 with 32bit input
aco: combine a | ~b to bfi(b, a, -1)
aco: use v_cvt_f32_ubyte for signed casts too
nir: add nir_scalar intrinsic helpers
nir: add nir_scalar_equal
aco: implement some exclusive scans with inclusive scans
aco/gfx11: don’t use bfe for local_invocation_id if the others are always 0
nir/opt_algebraic: remove broken fddx/fddy patterns
aco: simplify masked swizzle dpp selection by removing or_mask first
aco: fix p_extract with v1 dst and s1 operand
aco: implement 64bit div find_lsb
nir: scalarize masked_swizzle_amd created from shuffle_xor
aco/optimizer: check if we can use omod before labeling it
aco/optimizer: copy propagate to output modifier instructions
aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed
nir: make quad intrinsic dst bit size match src0
nir/lower_subgroups: use intrinsic builder more
aco: assume new generations are unsupported by clrx
aco: assume newer generation will use GFX11 wait_imm packing
aco: print final ir instead if printing asm is unsupported
aco/gfx11: optimize dual source export
aco/gfx11: apply clamp/omod to vinterp
aco: support v_fma_f32_dpp as fma_mix
aco/gfx11: support vinterp as fma_mix
aco: add missing scc def for SALU quad broadcast
aco/sched: treat p_dual_src_export_gfx11 like export
George Ouzounoudis (38):
nouveau/codegen: Support compact clip distances with arrayed_io
nouveau/codegen: Handle nir op amul
nouveau/codegen: Fix compact patch varyings in case of NIR
nouveau/codegen: Add capability to pre-specify tessellation domain
nvk: Do not increment instance id across draws
nvk: Add a macro for root descriptor table byte offsets
nvk: Set base vertex state in sequential mme draw
nvk: Support base instance in instanced draw calls
nvk: Switch point rasterization to point sprites
nvk: Support large points
nvk: Compile geometry shaders
nouveau/mme: Keep device info in mme_builder
nvk: Simplify mme build function argument
nvk: Support VK_KHR_shader_draw_parameters
nvk: Support for vertex shader transform feedback
nvk: Support transform feedback indirect draws
nvk: Support transform feedback geometry streams
nvk: Support transform feedback queries
nvk: Support vertex shader transform feedback on Fermi
nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default
nvk: Fix geometry shader active stream mask
nvk: Support geometry shaders
nvk: Basic tessellation shader support
nvk: Assign locations correctly for arrayed IO
nvk: Enable multiview with tessellation shader
nvk: Fix cases where execution mode is specified in the tesc shader.
nvk: Respect tessellation domain origin state
nvk: Lower io to temporaries for tessellation evaluation nir
nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport
nvk: Handle cases of descriptor bindings with variable counts
nvk: Add nir non-uniform optimization pass
nvk: Enable descriptor indexing
nvk: Do not keep redundant info for tessellation domain
nouveau/codegen: Do not keep redundant info for tessellation domain
nvk: Enable dynamic line rasterization mode state
nvk: Fix support for VK_EXT_sample_locations
nvk: Support dynamic state for enabling sample locations
nouveau/codegen: Add a 4th optimization level for MemoryOpts
Gert Wollny (63):
r600/sfn: Switch to register intrinsics
r600/sfn/tests: add simple copy-prop test with register source
r600/sfn: Allow for larger ALU CF’s
r600/sfn: Handle indirect array load/store dependencies better
r600/sfn: Increase LDS fetch schedule priority
r600/sfn: Add peephole optimization to move a dest to the previous op
r600/sfn: reorder the value factory class member declaration a bit
r600/sfn: Add some tests for proper register access
r600/sfn: Print more info if scheduling fails
r600/sfn: remove debug output leftovers
r600/sfn: Fix use of multiple IDX with kcache
r600/sfn: Always check arrays writes before allowing copy propagation
r600/sfn: set block sizes based on chip class
r600/sfn: Fix typo with block type
r600/sfn: override slot count for IfInstr
r600/sfn: Add method to convert to AluGroup directly
r600/sfn: Add flags to check whether a group starts CF and can do that
r600/sfn: make remaining slots a signed value
r600/sfn: on Cayman loading an index register needs only one slot
r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots
r600/sfn: rework checks for ALU CF emission
r600/sfn: Schedule AR uses befor possible groups
r600: Explicitly force new CF in gs copy shader
r600: Assert when backend wants to create a new ALU CF
r600: don’t check possible size of ALU CF
r600: don’t use sb disasm to disassamble copy shader
r600: Force CF when emitting a NOP on R600 in gs copy shader
r600/sfn: Don’t try to propagate to vec4 with more than one use
r600/sfn: Only switch to other CF if no AR uses are pending
r600/sfn: AR loads should depend on all previous non ALU instructions
r600/sfn: Renumber shader blocks in scheduler
r600/sfn: Track whether a register is ALU clause local
r600/sfn: Use clause local registers in RA
r600/sfn: Take source uses into account when switching channels
r600/sfn: take number of dest values into account
r600: retire SB optimizer
r600/sfn: work around injecting extra CF’s to handle hardware bugs
r600: use correct cso pointer for fetch shader
r600/sfn: Make use of four clause local registers
r600/sfn: drop unused ControlFlowInstr type enum
r600/sfn: factor out resource as extra class
r600/sfn: Simplify dependency chain for index loads on EG
r600: print texture resource index mode separately
r600/sfn: Make address split pass obligatory
r600/sfn: rename method resource_base to resource_id
r600/sfn: Add old address to update_indirect_addr
r600/sfn: Sepeate resource and sampler in texture instructions
r600/sfn: get rid of the method to get the index mode
r600/sfn: sort the uniforms of the right shader
r600/sfn: Fix use of scheduled_shader vs shader
virgl: report MIRROR_CLAMP features better
ci: Upref virglrenderer
copyimage: check requested slice early when cube maps are involved
mesa: check numlevels and numlayers when creating a texture view
virgl: Use common clear_texture if host doesn’t support the feature
r600/sfn: don’t remove texture sources by using the enum value
r600: drop egcm_load_index_reg
r600/sfn: Don’t override a chgr pinning during copy propagation
r600/sfn: When simplifying src vec4 pinnings, also check all uses
virgl: Fix logic for reporting PIPE_MIRROR_CLAMP
r600: Add callbacks for get_driver_uuid and get_device_uuid
r600: Link with libgalliumvl, when enabling rusticl this is needed
r600/sfn: Fixup component count only if intrinsic has it
Guilherme Gallo (5):
bin/ci: Ensure that all jobs have nodes in DAG
ci/radeonsi: Update flake list
ci/freedreno: Add a new flake
ci/zink: Found some flakes
ci/anv: Catch some flakes
Hannes Mann (1):
vulkan/wsi/wayland: Fix detection of tearing control protocol
Hans-Kristian Arntzen (2):
wsi/x11: Fix potential deadlock in present ID.
wsi/x11: Don’t allow signal_present_id to rewind.
Helen Koike (21):
ci: re-add EXTRA_LOCAL_PACKAGES to rootfs
ci: add EXTRA_LOCAL_PACKAGES to apt-get install
docs/ci: Add docs for EXTRA_LOCAL_PACKAGES
ci: disable duplicated pipelines triggered by marge
ci: add –project option to ci_run_n_monitor.py
ci/android: remove strace output from cuttlefish-runner.sh
ci: add locked flag to bindgen-cli on x86_64_build.sh
ci: separate hiden jobs to -inc.yml files
ci/ci_run_n_monitor: add docs for multiple targets
ci/ci_run_n_monitor: print stress test results per job
ci/ci_run_n_monitor: simplify with defaultdict
ci/ci_run_n_monitor: merge print_job_status_change with print_job_status
ci/ci_run_n_monitor: make –target mandatory
ci/ci_run_n_monitor: merge enable_job with retry_job
ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline()
ci/ci_run_n_monitor: allow <user>/<project> in –project
ci/ci_run_n_monitor: limit repetitions on –stress
ci/marge_queue: add missing python-dateutils to requirements.txt
ci/ci_run_n_monitor: keep monitoring if a job is still running
ci/marge_queue: add pretty_dutation()
ci/ci_run_n_monitor: print job duration time
Honglei Huang (7):
virgl/video: Add support for mpeg12 decoding
virgl/video: Add support for vc1 decoding
virgl/video: Add support for jpeg decoding
virgl/video: Add support for hevc10bit decoding.
virgl/video: Add more pipe type in virgl formats convert table
virgl/video: Add jpeg buf start code check
virgl: Enable vp9 hardware decode
Hyunjun Ko (3):
anv: use ycbcr_info for P010 format
anv: don’t use cmd_buffer after destroyed.
anv: don’t flush_llc on gen9
Iago Toral Quiroga (100):
nir/trivialize: Move decl_reg to the start of the block
v3dv: stop incrementing UBO indices by one
nir/lower_robustness: drop skip_ubo_0 option
v3dv: fix incorrect key setup
broadcom/compiler: stop asserting on Vulkan environment
broadcom/compiler: use NIR’s lowering for dispatch base
broadcom/compiler: move uniform offset lowering from compiler to GL driver
broadcom/compiler: move vulkan’s point coord lowering to the driver
v3dv: don’t set lower_wpos_pntc for Vulkan
broadcom/compiler: always clamp results from logic ops
broadcom/compiler: drop execution environment from the shader key
v3dv: drop cpu path for buffer to image copies
v3dv: remove unused code
nir/lower_tex: copy backend_flags field when copying a tex instruction
nir/lower_tex: use a callback to check sampler return size packing
squash! v3dv,broadcom/compiler: don’t abuse sampler index
v3dv: assert that only tex instructions with sampler state have a sampler src
v3d: fix texture packing lowering
v3d,v3dv: use fquantize2f16 lowering in NIR
v3dv: be more precise in vkGetImageSubresourceLayout
v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT
v3dv: bump up MAX_UNIFORM_BUFFERS to 16
v3dv: add support for sampling simple 2D linear textures
v3dv: expand sampling from linear image hack to support multi-planar images
v3dv: don’t assume that bound descriptors have been written
v3dv: only handle Android Hardware Buffer on Android
v3dv: we can sample from 1D array too
broadcom/compiler: add a couple of shader key helpers
v3d: compute nir sha1 for uncompiled shader state
v3d: use pre-computed shader sha1 for disk cache
v3d: fix RAM shader cache
v3d: get rid of shader_state pointer in v3d_key
broadcom/simulator: reset CFG7 for compute dispatch in v71
broadcom/common: retrieve V3D revision number
broadcom/compiler: update node/temp translation for v71
broadcom/compiler: implement “reads/writes too soon” checks for v71
broadcom/compiler: implement read stall check for v71
broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper
broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71
broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71
broadcom/qpu: fix packing/unpacking of fmov variants for v71
broadcom/compiler: make vir_write_rX return false on platforms without accums
broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly
broadcom/compiler: only handle accumulator classes if present
broadcom/compiler: don’t assign rf0 to temps across implicit rf0 writes
broadcom/compiler: CS payload registers have changed in v71
broadcom/compiler: don’t schedule rf0 writes right after ldvary
broadcom/compiler: allow instruction merges in v71
broadcom/qpu: add MOV integer packing/unpacking variants
broadcom/qpu: fail packing on unhandled mul pack/unpack
broadcom/compiler: generalize check for shaders using pixel center W
broadcom/compiler: v71 isn’t affected by double-rounding of viewport X,Y coords
broadcom/compiler: update peripheral access restrictions for v71
broadcom/qpu: add packing for fmov on ADD alu
broadcom/compiler: handle rf0 flops storage restriction in v71
broadcom/compiler: enable ldvary pipelining on v71
broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71
broadcom/compiler: don’t assign rf0 to temps that conflict with ldvary
broadcom/compiler: convert mul to add when needed to allow merge
broadcom/compiler: implement small immediates for v71
broadcom/compiler: update thread end restrictions for v7.x
broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x
broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x
broadcom/compiler: start allocating from RF 4 in V7.x
broadcom/compiler: validate restrictions after TLB Z write
broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x
broadcom/compiler: fix up copy propagation for v71
broadcom/compiler: don’t allocate spill base to rf0 in V3D 7.x
broadcom/compiler: improve allocation for final program instructions
broadcom/compiler: don’t assign registers to unused nodes/temps
broadcom/compiler: only assign rf0 as last resort in V3D 7.x
v3dv: expose V3D revision number in device name
v3dv/device: handle new rpi5 device (bcm2712)
v3dv: setup render pass color clears for any format bpp in v71
v3dv: setup TLB clear color for meta operations in v71
v3dv: fix up texture shader state for v71
v3dv: handle new texture state transfer functions in v71
v3dv: implement noop job for v71
v3dv: handle render pass global clear for v71
v3dv: GFX-1461 does not affect V3D 7.x
broadcom/compiler: update thread end restrictions validation for v71
v3dv: handle early Z/S clears for v71
v3dv: handle RTs with no color targets in v71
v3dv: don’t convert floating point border colors in v71
v3dv: handle Z clipping in v71
v3dv: make v3dv_viewport_compute_xform depend on the V3D version
v3dv: fix depth clipping then Z scale is too small in V3D 7.x
v3d/v3dv: fix texture state array stride packing for V3D 7.1.5
v3d,v3dv: support up to 8 render targets in v7.1+
v3d,v3dv: don’t use max internal bpp for tile sizing in V3D 7.x
v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x
v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x
v3dv: fix color write mask for v3d 7.x
v3d,v3dv: fix depth bias for v3d 7.x
v3d,v3dv: fix compute for V3D 7.1.6+
v3dv: expose fullDrawIndexUint32 in V3D 7.x
v3dv: expose depthClamp in V3D 7.x
v3dv: expose scalarBlockLayout on V3D 7.x
v3dv: fix confusing nomenclature about DRM nodes
v3d,v3dv: fix MMU error from hardware prefetch after ldunifa
Ian Douglas Scott (1):
egl/wayland: Don’t segfault if `create_wl_buffer` returns `NULL`
Ian Romanick (38):
intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
intel/fs: Constant fold SHL
intel/fs: Constant fold OR and AND
util/rb-tree: Return the actual first node from rb_tree_search
util/rb-tree: Fix typo in comment
nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers
nir/algebraic: Remove redundant pack / unpack lowering patterns
intel/fs: Completely re-write the combine constants pass
intel/fs: Combine constants for SEL instructions too
intel/fs: Combine constants for integer instructions too
intel/fs: New VGRF packing scheme for constant combining
intel/compiler: Combine control barriers with identical memory semantics
intel/compiler: Don’t evict for workgroup-scope fences
glsl/list: Clean up an inappropriate comment
util/rb-tree: Work around C++’s dislike of offsetof
util/rb-tree: Inline rb_tree_init
intel/fs: Don’t continue fixed point iteration just because liveout changes
intel/fs: Don’t try to copy propagate into a source again after progress is made
intel/fs: Make try_constant_propagate and try_copy_propagate file private
intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate
intel/fs: Don’t loop in try_constant_propagate
intel/fs: Simplify check in can_propagate_from
intel/fs: Make opt_copy_propagation_local file private
intel/fs: Encapsulate per-block ACP in a structure
intel/fs: Use rb_tree to store ACP entries by source
intel/fs: Use rb_tree to store ACP entries by destination
intel/fs: Use rb_tree for copy prop dataflow
intel/fs: Merge copy prop dataflow loops
intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size
intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size
intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode
nir/rematerialize: Rematerialize ALUs used only by compares with zero
intel/compiler/xe2: Handle new URB read messages
intel/compiler/xe2: Handle new URB write messages
intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8
spirv: Track when a shader has a cooperative matrix
intel/fs: Add DP4A to get_lowered_simd_width
nir/split_vars: Don’t split arrays of cooperative matrix types
Igor Torrente (4):
zink: Fix enumerate devices when running compositor
zink: Removes `disable_xcb_surface`
zink: Fix one addicional case when running a compositor
zink: fix for startup crash of weston running on top of zink + venus
Illia Abernikhin (2):
state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside
i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int.
Illia Polishchuk (7):
iris: remove NULL check for already dereferenced pointer earlier
s/Intel: fix/anv: fix: potentially overflowing expression in genX
glx: fix dead code when gc var cannot be null due to earlier check
state_tracker: fix dereference before null check
anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation
zink: move find_sampler_var from zink to nir core
nir: fix invalid sampler search by texture id
Italo Nicola (24):
mesa/main: account for RTT samples when updating framebuffer
mesa/main: allow readpix/teximage to read from implicitly multisampled fbos
panfrost/genxml: fix Surface With Stride descriptor alignment
panfrost/genxml: add Multiplanar Surface descriptor
panfrost: refactor (un)packing of surface descriptors
pan/decode: decode Multiplanar Surface descriptors
panfrost: prepare pan_image_view for multiplanar formats
panfrost: prepare the driver to support YUYV and variants
panfrost: advertise support for YUYV and variants
panfrost: mandate proper alignment requirement depending format and arch
panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler
gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420
gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420
pan/bi: add support for I420 and YV12 sampling
gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8
panfrost: fix invalid memory access in get_equation_str()
pan/decode: handle more than one panfrost_device
panfrost/ci: updated CI expectations
egl: reenable partial redraw with a warning when using gallium hud
pan/genxml: add Width/Height fields to v9+ Plane descriptor
panfrost: rename _needs_multiplanar_descriptor to _is_yuv
panfrost: prepare v9+ to support YUV sampling
panfrost: use centered YUV chroma siting
panfrost: advertise YUV formats for valhall
Iván Briano (23):
anv: ensure CFE_STATE is emitted for ray tracing pipelines
iris: ensure mesh is disabled on context init
anv: ensure mesh is disabled on context init
anv: implement Wa_14019750404
intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir
anv,hasvk: respect provoking vertex setting on geometry shaders
anv: fix missing 3DSTATE_SBE_CLIP emission
anv: ensure pipelines have all state
anv: tell blorp to do mesh stuff only if it’s enabled
blorp: fix hangs with mesh enabled
anv: use a simpler MUE layout for fast linked libraries
anv: track what kind of pipeline a fragment shader may be used with
intel/fs: read viewport and layer from the FS payload
intel/fs: handle URB setup for fast linked mesh pipelines
anv: enable VK_EXT_mesh_shader where supported
intel/fs: use ffsll so we don’t explode on 32 bits
vulkan/runtime: add internal parameter to vk_spirv_to_nir
nir/lower_int64: respect rounding mode when casting to float
intel/compiler: round f2f16 correctly for RTNE case
util: add double_to_float16 helpers
nir: round f2f16{_rtne/_rtz} correctly for constant expressions
anv: advertise VK_KHR_global_priority_queue
anv: use the right vertexOffset on CmdDrawMultiIndexed
Jani Nikula (1):
docs/vulkan: fixup some typos
Janne Grunau (4):
asahi: toggle more barrier bits after transform feedback
asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
asahi,agx: Upload constant buffers immediately
asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful()
Jesse Natalie (2):
nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics
d3d12: Fix multidimensional array ordering
Jianxun Zhang (1):
intel/common: Only set op mask on instructions in decoder
Jonathan Marek (2):
freedreno: move redump.h to common code + cleanup
tu: add a TU_DEBUG=rd option for cmdstream dumping
Jordan Justen (73):
isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
genxml/hsw: Add additional MOCS field enumerations
genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW
isl/dev: Add uncached MOCS value
isl: Set MOCS to uncached for MTL stream-out
intel/isl: Use intel_needs_workaround() for MTL CCS WA
intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID
intel/dev: Add LNL platform enum
intel/dev: Support xe2 device init (for intel_device_info_test)
intel/tools: Use ‘env bash’ to find bash executable
intel/decoder: Fix xml filename when verx10 % 10 is not 0
intel/decoder: Add intel_spec_load_common()
intel/decoder: Make intel_spec_load_filename() have separate dir and name strings
intel/genxml: Align “Texture Coordinate Mode” naming
intel/genxml: Split some genxml sorting code into a intel_genxml module
intel/genxml: Convert gen_bits_header to use ElementTree
intel/genxml: Convert gen_pack_header to use ElementTree
intel/genxml: Add GenXml class into intel_genxml module
intel/genxml: Add filter_engines() to GenXml class
intel/genxml: Move sorting & writing into GenXml class
intel/genxml: Don’t rewrite sorted xml if the contents didn’t change
intel/genxml: Add final newline to output when saving xml
intel/genxml: Update xml with gen_sort_tags.py output
intel/dev: Use RPL-U name on RPL-U devices
intel/dev: Add more RPL PCI IDs
anvil,hasvk: Rename need_clflush to need_flush
intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c
anvil,hasvk: Replace intel_clflush_range with intel_flush_range
intel/common: Add intel_flush_range_no_fence
anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers
util/u_cpu_detect: Drop unused has_tsc
util/u_cpu_detect: Detect clflushopt support
meson: Check for the __builtin_ia32_clflushopt function
intel/clflush: Add support for clflushopt instruction
intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES
intel/genxml: Ignore tail leading/trailing whitespace in node_validator()
intel/genxml: Fix comparing xml when node counts differ
intel/dev: Update device string for MTL PCI ID 0x7d55
intel/genxml: Support importing from another genxml file
intel/genxml: Add support for excluding items when importing
intel/genxml: Add all xml files as pack dependencies
intel/genxml: Add GenXml.optimize_xml_import()
intel/genxml: Drop assertion to allow for importing
intel/genxml: Add GenXml.add_xml_imports method
intel/genxml: Add GenXml.flatten_xml() method
intel/genxml: Add genxml_import.py script
intel/decoder: ralloc_steal() values from spec context for fields and enums
intel/decoder: Implement support for importing genxml
intel/genxml: Start Xe2 support
intel/genxml: Auto-import genxml files using genxml_import.py
intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set
intel/compiler/fs: Support Xe2 reg size in assign_curb_setup
intel/compiler: Update opt_split_sends() for Xe2 reg size
intel/compiler: Update emit_rt_lsc_fence() for Xe2
intel/compiler: Update lower_trace_ray_logical_send() for Xe2
intel/compiler: Update ray-tracing intrinsic lowering for Xe2
intel/compiler: Update RT stack_id access for Xe2
intel/fs: Update SSBO & shared uniform block loads for Xe2
intel/genxml: Build with gen20.xml
intel/isl: Build for Xe2
iris: Build for Xe2
anv/blorp: Use anv_genX to set device->blorp.exec
anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT
anv: Build for Xe2
anv: Print warning that Xe2 is not supported rather than failing
intel/compiler: Add enum xe2_lsc_cache_store
intel/compiler: Use enum xe2_lsc_cache_store on xe2
intel/compiler: Add enum xe2_lsc_cache_load
intel/compiler: Use enum xe2_lsc_cache_load on xe2
anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch()
anv/batch: Assert that extend_cb is non-NULL if the batch is out of space
intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs
Jose Maria Casanova Crespo (2):
vc4: mark buffers as initialized at vc4_texture_subdata
vc4: Fix mask RGBA validation at YUV blit
José Expósito (3):
zink: Fix crash on zink_create_screen error path
zink: fix dereference before NULL check
zink: allow software rendering only if selected
José Roberto de Souza (51):
anv: Use workaround framework to Wa_14016118574
intel/aux_map: Nuke format_enum
intel/aux_map: Use get_aux_entry() in remove_mapping()
intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT
intel/aux_map: Rename some variables to improve readability
intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask()
intel/aux_map: Convert l1_entry_addr_out to canonical
intel/aux_map: Drop magic sub table size number
intel/aux_map: Add function and macro to return l2 and l1 table masks
anv: Add gem_create_userptr() to KMD backend
anv: Replace handle by anv_bo in the gem_close()
anv: Add support for userptr in Xe KMD
intel: Sync xe_drm.h
intel/dev/xe: Add support for small-bar setups
anv: Request Xe KMD to place BOs to CPU visible VRAM when required
iris: Request Xe KMD to place BOs to CPU visible VRAM when required
iris/xe: Call iris_lost_context_state() when batch engine is replaced
intel/dev: Port intel_dev_info tool to Xe KMD
iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code
intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h
intel/common: Move functions inside of C++ ifdef
intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext()
iris: Move i915_gem_set_domain() call to i915 backend
iris: Move iris_bufmgr_bo_close() to kmd backend
iris: Add gem_create_userptr() to KMD backend
iris: Add support for userptr in Xe KMD
intel/genxml/gen125: Add missing fields in MI_MATH
iris: Set MI_MATH MOCS field
anv: Set MI_MATH MOCS field
intel/tests/mi_builder: Set MI_MATH MOCS field
intel/genxml/gen125: Set MI_MATH MOCS field as non-zero
anv: Nuke unused READ_ONCE() from anv_batch_chain.c
anv: Remove VkAllocationCallbacks parameter from reloc functions
anv: Return earlier in anv_reloc_list functions
intel: Sync xe_drm.h and rename engine to exec_queue
anv: Override vendorID for Hogwarts Legacy
intel/isl: Remove unknown workaround
intel/isl: Remove Wa_22011186057
anv: Update Wa_16014390852 for MTL
intel: Sync xe_drm.h
anv: Move i915 specific gem_set_caching to backend
anv: Move i915 specific code from common anv_gem.c
anv: Move bo_alloc_flags_to_bo_flags() to backend
anv: Move i915 handling of imported bos bo_flags
anv: Remove i915_drm.h include from common code
iris: Lock bufmgr->lock before call vma_free() in error path
iris: Nuke useless flags from iris_fine_fence_new()
intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms
intel: Sync xe_drm.h
anv: Switch Xe KMD vm bind to sync
anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo()
Juan A. Suarez Romero (7):
broadcom/ci: update expected results
vc4/ci: update expected results
v3d/shim: include new ioctl parameters
v3dv/ci: update expected list
broadcom: add performance counters for V3D 7.x
broadcom/simulator: add per-hw version calls
v3d/vc4/ci: add new fails/timeout
Julia Tatz (10):
gallium/dri: fix dri2_from_names
aux/trace: skip multi-line comments in enums2names
aux/trace: deduplicate enum dump macro work
aux/trace: move trace_sample_view logic
aux/trace: fix set_hw_atomic_buffers method name
aux/trace: add screen video methods
aux/trace: add context video methods
aux/trace: wrap video_codec & video_buffer
aux/trace: unwrap refrence frames in picture_desc
aux/trace: trace video_buffer method return vals
Julia Zhang (1):
radeonsi: modify algorithm of skipping holes of sparse bo
Julian Hagemeister (1):
Gallium: Fix shared memory segment leak
Juston Li (10):
zink: remove venus from renderpass optimizations
venus: sync protocol for VK_EXT_vertex_input_dynamic_state
venus: implement VK_EXT_vertex_input_dynamic_state
venus: set lvp queries as saturate on overflow
venus: add helper function to get cmd handle
venus: refactor out common cmd feedback functions
venus: support deferred query feedback recording
venus: track/recycle appended query feedback cmds
venus: append query feedback at submission time
venus: switch to unconditionally deferred query feedback
Kai Wasserbäch (3):
fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer
fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/
fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result]
Karmjit Mahil (29):
pvr: Remove mrt setup from SPM EOT
pvr: Compile SPM EOT shader
pvr: Use the SPM EOT on barrier stores
pvr: Remove some magic numbers and increments from km stream
pvr: Restructure `rogue_kmd_stream.xml`
pvr: Submit PR commands
pvr: Use the correct size for the unified store allocation
pvr: Allow query stage for barrier sub cmds
pvr: Fix occlusion query unaccounted for user fences
pvr: Fix writing query availability write out
pvr: Fix packing issue with max_{x,y}_clip
pvr: Fix csb relocation status assert on `pvr_csb_finish()`
pvr: Fix `for` loop itarator usage
pvr: Fix dynamic desc offset storage
pvr: Fix cubemap layer stride
pvr: Use the render passes’ attachments array to setup ISP state
pvr: Adjust EOT PBE state to account for the iview’s base array layer
pvr: Fix MRT index in PBE state
pvr: Fix pbe_emit assert
pvr: Fix OOB access of pbe_{cs,reg}_words
pvr: Order tile buffer EOT emits to be last
pvr: Fix subpass sample count on ds attachment only
pvr: Refactor subpass ds and sample count setup
pvr: Fix SPM load shader sample rate
pvr: Fix PPP_SCREEN sizes
vulkan: Add `vk_subpass_dependency_is_fb_local()` helper
tu: Use common `vk_subpass_dependency_is_fb_local()`
pvr: Don’t merge subpasses on framebuffer-global dependancy
pvr: Only setup the bgobj to load if we have a load_op
Karol Herbst (213):
nvc0: initial Ada enablement
rusticl/mesa: make svm_migrate optional
llvmpipe: enable system SVM
nvc0: fix num_gprs for Volta+
rusticl: fix warnings with newer rustc
gm107/ir: fix SULDP for loads without a known format
nv50/ir/nir: fix txq emission on MS textures
nv50/ir/nir: Fix zero source handling of tex instructions.
rusticl/kernel: only handle function_temp memory before lowering printf
meson,ci: bump meson req for rusticl to 1.2
rusticl/nir: add helper functions we need for a NIR_PASS macro
rusticl/nir: add a nir_pass macro
rusticl/nir: use the new nir_pass macro
rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late
rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late
rusticl/kernel: move things around in lower_and_optimize_nir
rusticl/kernel: get rid of initial function_temp type lowering
rusticl/kernel: mark can_remove_var as unsafe and document it
n50/compute: submit initial compute state in nv50_screen_create
nvk: add vulkan skeleton
nouveau/winsys: add the new winsys implementation
nvk: use winsys lib
nvk: fix nvk_buffer include guards
nouveau/headers: add script to sync in-tree headers with open-gpu-doc
nouveau/headers: initial sync of headers
nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy
nvk: advertize memory heaps and types
nouveau/ws: reorganize a little
nouveau/ws: dup the fd
nouveau/ws: add a field for the SM version
nvk: set nonCoherentAtomSize as the CTS divides with this value
nouveau/ws: add bo API
nvk: add basic device memory support
nouveau/headers: add nvtypes.h
nouveau/headers: typedef Nv void types
nouveau/headers: add host classes
nouveau/ws: add context support
nouveau/ws: add a cmd buffer
novueau/bo: refcount it
novueau/bo: add nouveau_ws_bo_wait
nvk: allocate a GPU context for each VkDevice
nvk: add nvk_bo_sync
nvk: add nvk_CmdPipelineBarrier2 stub
nvk: impl nvk_CmdCopyBuffer
nouveau/ws: fix setting push bo domains
nouveau/ws: PUSH_IMMD only works with 16 bit values
nouveau/ws: set GPU object class
nouveau/ws: bind 2D class
nvk: use fermi class definitions
nvk: add basic support for images
nvk: simple format table
nvk: add support for blits
nvk: report maxMipLevels as 1
nvk: optimize blit command buffer gen
nvk: implement CmdFillBuffer
nvk: implement CmdUpdateBuffer
nvk: implement CmdCopyBuffer2
nvk: advertise VK_KHR_copy_commands2
nvk: implicitly reset the command buffer
nouveau/ws: handle 0inc inside nvk_push_val as well
nvk: reduce pitch even further in CmdFillBuffer
nvk: support multiple miplevels
nvk: support array blits over multiple layers
nvk: tiling prep work for VK_EXT_image_2d_view_of_3d
nouveau/ws: make sure we don’t submit nonsense
nouveau/ws: assert on broken channel
nvk/blit: assert that formats are supported
nouveau/headers: Generate parser functions
nouveau/ws: initial debugging options for command submissions
nouveau/ws: depend on generated class header files
nouveau/ws: get rid of libdrm
nouveau/ws: use new NVIF interface to query oclasses
nvk: set deviceName
nouveau/headers: add path for 3D headers
nouveau/headers: initial 3D headers import
nouveau/ws: allocate 3D subchan
nouveau/ws: allocate copy subchan as well
nouveau/ws: add API to query if the context was killed
nouveau/ws: add a bo unmap helper function
nvk: clean up bo mappings
nouveau/ws: bound check nouveau_ws_push_append
nouveau/ws: rework refing push buffer bos
nouveau/ws: push chaining
nvk: fix OOB read inside nvk_get_va_format
nvk: alloc a zero page and use it for vertex runouts
nvk: fix zero page refing
nvk: support exporting buffers
nvk: fix some class version checks
nvk: properly align shaders pre Turing
nvk: rework QMD handling to support pre Turing
nvk: align desc root table
nvk: Use SET_PIPELINE_PROGRAM pre-Volta
nvk: properly align slm size
nvk: use remaps for image copies
nvk: reduce pitch for FillBuffer
nvk: bind more subchans in init_context_state
nvk: support pre Maxwell Texture Headers
nvk/device: fix order of error handling
nvk: allocate VAB memory area
nvk: wire up M2MF for Fermi
nouveau/mme: add test for BEQ with magic exit offset
nouveau/mme: add a macro exit helper
nvk: Add a macro to set MMIO registers via falcons
nouveau/winsys: fix SM value for Ada
nvk: fix num_gprs for Volta+
nvk: replace mp with tpc
nvk: properly calculate SLM region by taking per arch limits into account
nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled
nvk: enable fp helper invocations loads on more gens
nv50/ir: use own info struct for sys vals
nv50/ir: convert system values to gl_system_value
nouveau/mme: fix OOB access inside while_ine builder test
nouveau/mme: fix OOB inside tu104 simulator
clc: use CLANG_RESOURCE_DIR for clang’s resource path
nv50: fix code uploads bigger than 0x10000 bytes
nouveau: take glsl_type ref unconditionally
rusticl/kernel: optimize nir between lowering io and explicit types
nv50: limit max code uploads to 0x8000
zink: fix source type in load/store scratch
zink: fix global stores
zink: update some compute caps
rusticl: add debug option to sync every event
rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned
ci: disable a660 jobs
nir: make workgroup_id 32 bit only
nir: make num_workgroups 32 bit only
ac: drop 64 bit handling for cl workgroup intrinsics
gallivm/nir: drop 64 bit handling for cl workgroup intrinsics
intel/compiler: drop 64 bit handling for cl workgroup intrinsics
panfrost: drop 64 bit handling for cl workgroup intrinsics
rusticl: reduce global_invocation_id_zero_base to 32 bit
panfrost: drop pan_nir_lower_64bit_intrin
rusticl/disk_cache: fix stack corruption
rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe
rusticl/event: disable profiling for devices without timestamps
rusticl/queue: properly implement clCreateCommandQueueWithProperties
rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported
asahi: fetch available system memory
asahi: lower hadd
asahi: handle kernels
asahi: handle load_workgroup_size
asahi: handle load_global_invocation_id_zero_base
asahi: implement get_compute_state_info
asahi: implement set_global_binding
asahi: implement clear_buffer
asahi: gracefully handle allocating linear images
asahi: handle images in is_format_supported
rusticl/memory: fallback if allocating linear images fails
rusticl: enable asahi
rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS
docs/features: cl_khr_3d_image_writes needs driver support
rusticl/mesa: fix `set_constant_buffer` when passing an empty buffer
rusticl/kernel: skip adding global id offsets if not used
meson/rusticl: add sha1_h
rusticl/mesa/context: fix clear_sampler_views
nir: add nir_lower_alu_vec8_16_srcs pass
zink: lower vec8/16
rusticl/mesa: create COMPUTE_ONLY contexts
rusticl: fix clippys bool_to_int_with_if
rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill
nir/load_libclc: fix libclc memory leak
rusticl/kernel: Fix creation from programs not built for every device
ci: add half-life 2 freedreno flake
zink: implement get_compute_state_info
zink: copy has_variable_shared_mem cs property
zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state
zink: refactor spec constant handling
zink: variable shared mem support
zink: support more nir opcodes
zink: make spirv_builder_emit_*op compatible with spec constants
zink: support samplers with unnormalized_coords
zink: implement remaining pack ops via bitcast
zink: fix RA textures
zink: fix load/store scratch offsets
rusticl/mesa/screen,device: add driver_name
rusticl: enable zink
pipe-loader: allow to load multiple zink devices
rusticl: bump rustc version to 1.66
rusticl/mesa/nir: mark more methods as mut
rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync
rusticl/mesa: mark PipeResource as Send and Sync
rusticl/mesa: mark PipeTransfer as Send
rusticl/cl: mark _cl_image_desc as Send and Sync
rusticl/queue: get rid of pointless Option around our worker thread handle
rusticl/queue: make it Sync
rusticl/kernel: get rid of Arcs in KernelDevStateVariant
rusticl/memory: use get_mut instead of lock in drop
zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS
rusticl/api: remove cl_closure macro
zink: implement load_global_constant
zink: properly emit PhysicalStorageBufferAddresses cap
nir/lower_mem_access_bit_sizes: fix invalid shift bit_size
rusticl/device: restrict 1Dbuffer images for RGB and RGBx
rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images
rusticl/format: disable all sRGB formats
asahi: flush denorms on exact fmin/fmax
zink: wrap shared memory blocks in a struct
zink: properly alias shared memory
zink: fix zink_destroy_screen for early screen creation fails
docs/features: remove empty lines confusing mesamatrix
rusticl/device: restrict image_buffer_size
rusticl/device: restrict param_max_size further
rusticl/mem: properly set pipe_image_view::access
zink: lower fisnormal as it requires the Kernel Cap
radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4
rusticl/queue: Only take a weak ref to the last Event
rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user
zink: deallocate global_bindings array
rusticl/mesa/screen: do not derefence the entire pipe_screen struct
nvc0: implement PIPE_CAP_TIMER_RESOLUTION
rusticl/queue: do not send empty lists of event to worker queue
rusticl/queue: fix implicit flushing of queue dependencies
Kenneth Graunke (21):
iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
intel/compiler: Fix sparse cube map array coordinate lowering
intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER
intel/fs: Account for payload GRFs when calculating register pressure
intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions()
intel/fs: Index scheduler mode string table by mode enum
intel/fs: Make helpers for saving/restoring instruction order
intel/fs: Pick the lowest register pressure schedule when spilling
intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER
iris: Check prog[] instead of uncompiled[] for BLORP state skipping
nir: Fix function parameter indentation in nir_opt_barriers.c
nir: Add an optimization pass to reduce barrier modes
nir: Reduce the scope of shared memory barriers
lavapipe: Don’t delete control barriers
virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers
dxil: Set UAV_FENCE_THREAD_GROUP any time global isn’t required
glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers
anv: Use nir_opt_barrier_modes() to drop unnecessary barriers
mesa: Fix zeroing of new ParameterValues array entries when growing
intel/fs: Fix Xe2 URB read/lowering with per-slot offsets
anv: Add support for a transfer queue on Alchemist
Kevron Rees (1):
Force vk vendor for spider-man remastered
Konrad Dybcio (5):
freedreno: Set magic writes per-GPU, using existing data
freedreno: Include speedbin fallback in 740 chipid to fix probing
freedreno: Include speedbin fallback in 730 chipid to fix probing
freedreno: Include speedbin fallback in 690 chipid to fix probing
freedreno: Add Adreno 643
Konstantin Seurer (95):
radv: Stop using the misleading round_up_u* functions
radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS
radv/meta_buffer: Rename size_minus16 to max_offset
llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
nir/tests: Refactor boilerplate into a common header
nir/tests: Use a single binary
draw: Do not restart the primitive_id at 0
gallivm: Fix subsampled format sampling under Vulkan
gallivm: Ignore nir_tex_src_plane
lavapipe: Remove dummy sampler ycbcr conversion
lavapipe: Store immutable_samplers as lvp_sampler array
lavapipe: Fix binding immutable samplers with desc buffers
lavapipe: Implement samplerYcbcrConversion
lavapipe: Advertise samplerYcbcrConversion
llvmpipe: Zero extend vectors in widen_to_simd_width
vulkan: Add a generated vk_properties struct
radv: Use common physical device properties
clang-format: Disable formatting by default
lavapipe: Use common physical device properties
nir/from_ssa: Don’t insert store_reg instructions before phis
gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa
lavapipe/ci: Remove descriptor_indexing fails
radv/rt: Rename shader_pc and next_shader
radv/rt: Rename traversal_shader to traversal_shader_addr
nir/opt_large_constants: Handle small float arrays
bin: Update spirv sources
vulkan: Allow beta extensions for physical device features
vulkan: Allow beta extensions for physical device properties
vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX
nir: Add shader enqueue data structures and handling
spirv: Update headers and grammer JSON
spirv: Implement SPV_AMDX_shader_enqueue
lavapipe: Add lvp_pipeline_type
lavapipe: Implement exec graph pipelines
lavapipe: Implement AMDX_shader_enqueue commands
lavapipe: Advertise AMDX_shader_enqueue
radv: Add internal_nodes_offset to scratch_layout
radv: Remove leaf_args::dst_offset
radv/rt: Remove some dead code
radv/rt: Do not apply stack_ptr for non-recursive stages
radv/rt: Add and use radv_build_traversal
radv/rt: Insert rt_return_amd before lowering shader calls
radv/rt: Split stage initialization and hashing
aco: Do not fixup registers if there are no shader calls
radv: Stop updating the stack_size in insert_rt_case
lavapipe: Lock around CSO destroys
vulkan/wsi/x11: Implement capture hotkey using the keymap
venus: Use the common GetPhysicalDeviceFeatures2 implementation
nir/lower_shader_calls: Limit the remat chain length
lavapipe: Avoid lowering shaders twice
lavapipe: Fix the locking around cso destruction
aco/validate: Handle p_wqm like p_parallelcopy
aco: Use bytes() instead of size() in emit_wqm
aco: Unify demote and demote_if selection
radv: Only generate debug info if required
aco/lower_to_cssa: Fix typo
radv: Don’t use the depth image view for depth bias emission
radv/rt: Store NIR shaders separately
radv/rt: Add monolithic raygen lowering
radv/rt: Enable monolithic pipelines
radv/ci: Document new flake
vulkan/properties: Handle unsized arrays properly
radv: Remove dead radix_sort_vk_get_memory_requirements call
radv/radix_sort: Vendor the radix sort dispatch code
radv: Perform multiple sorts in parallel
radv/ci: Improve ray tracing skips
ac/llvm: Fix typed loads with 16bit formats
ac/llvm: Use the correct return type for uadd_carry and usub_borrow
ac/llvm: Use float types for float atomics
radv: Don’t advertise features requiring PS epilogs with LLVM
radv: Update navi21 llvm fails
radv/rt: Handle stages without nir properly
radv: Remove ray tracing shader module identifier skips
radv/bvh: Treat instances with mask == 0 as inactive
radv/ray_queries: Skip cull_mask handling if it is FF
radv/rt: Skip cull_mask handling if it is FF
aco/spill: Make sure that offset stays in bounds
nir: Add nir_cf_node_cf_tree_prev
nir: Add nir_foreach_block_in_cf_node_reverse
nir: Add nir_rematerialize_deref_in_use_blocks
nir/lcssa: Fix rematerializing derefs
nir/deref: Layer rematerialization helpers
lavapipe/ci: Fix asan expectations
hasvk: Use the common GetPhysicalDeviceFeatures2 implementation
vulkan: Remove vk_get_physical_device_core_1_*_feature_ext
radv/bvh/ploc: Load child bounds from LDS
radv: Merge the sync_data and header initialization
radv: Do not sync after radv_update_buffer_cp
zink: Initialize primitive types to an invalid value
nir/passthrough_gs: Support edge flags with points
zink: Enable edge flags with points
mesa: Fix glBegin/End when LINE_LOOP is not supported
llvmpipe: Compile a nop texture function for unsupported configurations
radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions
radv/sqtt: Fix tracing acceleration structure commands
Lang Yu (5):
amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property
radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size
radeonsi: use wave size to determine index stride
amd/common: add missing stuff for gfx11.5
amd/radeonsi: add missing stuff for gfx11.5
Leandro Ribeiro (13):
egl: rewrite outdated comment in _eglFindDevice()
egl: remove unused parameter from _eglAddDRMDevice()
egl: simplify _eglAddDRMDevice()
egl: make explicit that we don’t support render nodes for software EGLDevice
egl: move is_render_node flag to platform_wayland
loader: rename loader_open_render_node() to loader_open_render_node_platform_device()
loader: add driver list as parameter in loader_open_render_node_platform_device()
pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd()
dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension
kmsro: try to use only compatible render-capable devices
loader: add loader_is_device_render_capable()
egl/drm: get compatible render-only device fd for kms-only device
egl: error out if we can’t find an EGLDevice in _eglFindDevice()
Leo Liu (4):
radeonsi: add AV1 profile to supported profile
radeonsi/vcn: fix the incorrect dt_size
Revert “frontends/va: Also map VAImageBufferType for reading”
ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG
Lina Versace (14):
docs: Add row for VK_KHR_maintenance5
intel/pci_ids: Consistently use lowercase
venus: Sync protocol for VK_EXT_graphics_pipeline_library
venus: Erase pViewports and pScissors in fewer cases
venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing
venus: Fix subpass attachments
venus: Drop incorrectly-used always-true pipeline vars
venus: Use VkImageAspectFlags in vn_subpass
venus: Add enum vn_pipeline_type
venus: Renames for VkGraphicsPipelineCreateInfo fixes
venus: Refactor pipeline fixup into two stages
venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library
venus: Enable VK_EXT_graphics_pipeline_library behind debug flag
venus: Fix -Wmaybe-uninitialized
LingMan (22):
rusticl/memory: fix potential use-after-free in clEnqueueSVMFree
rusticl: Rename XyzCB aliases to FuncXyzCB
rusticl: add structs to hold the C callbacks
rusticl: use CreateContextCB
rusticl: use DeleteContextCB
rusticl: use EventCB
rusticl: use MemCB
rusticl: use ProgramCB
rusticl: use SVMFreeCb
rusticl: Make EventSig take ownership of its environment
rusticl: add a safe abstraction to execute a DeleteContextCB
rusticl: add a safe abstraction to execute an EventCB
rusticl: add a safe abstraction to execute a MemCB
rusticl: add a safe abstraction to execute an SVMFreeCb
rusticl: add a safe abstraction to execute a CreateContextCB
rusticl: add a safe abstraction to execute a ProgramCB
rusticl/api: drop a few include paths
rusticl: mark the fields of callback structs private
rusticl: drop an `#[allow(dead_code)]` marker
rusticl/core: don’t take a lock while dropping `Context`
rusticl: Show an error message if the build is attempted with an outdated bindgen version
rusticl: Show an error message if the version of bindgen can’t be detected
Lionel Landwerlin (169):
anv: hide exec_flags selection inside the i915 backend
isl: add a tool to query surface parameters
intel/fs: fix missing predicate on SEL instruction
intel/compiler: rework input parameters
ci/a530: switch a few tests to flakes to unblock CI
vulkan: bump header register to 1.3.258
intel/fs: don’t try to rebuild sequences of non ssa values
intel/vec4: fix log_data pointer
intel/fs: consider UNDEF as non-partial write
intel/fs: add more UNDEFs around SEND messages
isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields
anv: simplify buffer address+size loads from descriptor buffer
intel/fs: add support for sparse accesses
intel/nir: handle image_sparse_load in storage format lowering
intel/nir: add lower for sparse images & textures
anv: wire image sparse loads
blorp: switch blorp_update_clear_color to early return
blorp: update and move fast clear PIPE_CONTROLs to drivers
anv: fix 3DSTATE_RASTER::APIMode field setting
anv: enable EDS3 ConservativeRasterizationMode
vulkan: skip non required extension structures
vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer
anv: enable INTEL_DEBUG=nofc
anv: fake non intel vendorID for Death Stranding
hasvk: fix null descriptor handling with A64 messages
anv: remove descriptor array bounds checking
hasvk: remove descriptor array bounds checking
anv/hasvk: track robustness per pipeline stage
anv: implement VK_EXT_pipeline_robustness
intel/fs: track more steps with INTEL_DEBUG=optimizer
intel/fs: add variable for output of debug backend optimizer
intel/decoder: constify some input parameters
blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER
anv: emit 3DSTATE_GS only once per pipeline
intel/decoder: add options to decode surfaces/samplers
anv: get rid of genX(emit_multisample)
anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c
anv: don’t try to access dynamic buffers from surface states
iris: ensure stalling pipe control before fast clear
intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
intel/compiler: fix dynamic alpha-to-coverage handling
intel/fs: implement dynamic interpolation mode for dynamic persample shaders
intel/fs: move lower of non-uniform at_sample barycentric to NIR
zink+anv: add regression testing with pipeline libraries
anv: implement vkCmdBindIndexBuffer2KHR
anv: handle new VkBufferViewUsageCreateInfoKHR
anv: add vkGetRenderingAreaGranularityKHR()
anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR
anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support
anv: deal with new pipeline flags
anv: enable KHR_maintenance5
anv: add missing ISL storage usage
genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER
genxml/gfx12: rename Tiled Resource Mode
isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented
intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces
isl: disable Yf/Ys/Tile64 tilings for 1D images
isl: add a usage flag to request 2D/3D compatible views
isl: disallow TileYs/Yf on 3D storage images on Gfx9/11
intel/isl: Add a max_miptail_levels field to isl_tile_info
isl: make isl_surf_get_uncompressed_surf robust to argument accesses
isl: add Gfx12/12.5 restriction on 3D surfaces & compression
isl: disallow miptails on planar formats
isl: disable miptails on gfx12 with yuv formats
isl: disable CCS on Ys/Yf
blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling
intel/aux_map: correctly program tiling mode for Ys
isl: reorder tiling selection
anv: enable standard Y tiles
isl/tilememcpy_test: add multiple tile testing
anv: rename total_batch_size
anv: reuse cmd_buffer::total_batch_size
intel/measure: track batch buffer sizes
intel/nir: rerun lower_tex if it lowers something
intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL
hasvk: add state cache invalidation back before fast clears
blorp: remove unused variable
anv: remove ReorderMode from pipeline 3DSTATE_GS emission
anv: change anv_batch_emit_merge to also do packing
intel/anv: batch stats util
intel/decoder: implement accumulated prints
anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state
anv: rename files to represent their usage
anv: categorize partial/final pipeline instruction
anv: split 3DSTATE_TE packing between static & dynamic parts
anv: split 3DSTATE_VFG emission
anv: add a flag tracking occlusion query count change
anv: split pipeline programming into instructions
vulkan/runtime: add helper to name dirty states
anv: add new low level emission & dirty state tracking
anv: remove unused state emission
anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit
docs: update Anv documentation about dynamic state emission
anv: create individual logical engines on i915 when possible
anv: Copy/Clear MSAA images over companion RCS while we are on compute
pps-producer: add ability to select device with DRI_PRIME
anv: remove aux checking asserts
anv: bound image usages to the associated queue family
anv: fix 3DSTATE_VFG emission
anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled
anv: ensure mesh pipeline have all pre-rasterization stages disabled
anv: ensure partially packed instructions are emitted in the pipeline
anv: fix missing 3DSTATE_SBE_MESH emission
anv: fix utrace timestamp buffer copies
anv: add a memcpy compute internal kernel
anv: add simple shader support without a command buffer
anv: move simple shaders code to its own object
anv: move utrace flush out of backends
anv: enable utrace timestamp buffer copies on compute engine
intel: don’t assume Linux minor dev node
intel/ds: lock submissions to u_trace_context
util/u_trace: count number of tracepoints
intel/ds: track number of tracepoint timestamp copies
anv/utrace: trace CPU on timestamp buffer readiness
intel/ds: avoid dropping traces when running out of shared memory
anv/iris: widen Wa_14015946265 to Gfx11+
anv: add missing workaround for 3DSTATE_LINE_STIPPLE
iris: add missing workaround for 3DSTATE_LINE_STIPPLE
intel/fs: handle ishl in surface/sampler rematerialization
intel/fs: handle add3 in surface/sampler rematerialization
intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization
anv: fix internal compute copy shader build
anv: reduce working temporary memory for BVH builds
anv: move bo_pool allocation flags to init caller
anv: use buffer pools for BVH build buffers
intel/ds: track acceleration RT commands
anv: fix index buffer size programming
anv: implement INTEL_DEBUG=reemit
anv: add missing workaround handling in simple shader
anv: fix a couple of missing input for 3DSTATE_RASTER programming
anv: flag 3DSTATE_RASTER as dirty after simple shader primitive
vulkan: bump headers/registry to 1.3.267
anv: rename primary in container in ExecuteCommands()
anv: add support for VK_EXT_nested_command_buffer
anv: simplify push descriptors
anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5
Revert “intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL”
anv: update batch chaining to Gfx9 commands
anv: workaround Gfx11 with optimized state emission
u_trace: generate tracepoint index parameter in perfetto callbacks
u_trace: generate tracepoint name array in perfetto header
intel/ds: provide names for different events of a timeline’s row
anv: reuse local variable for gfx state
anv: track render targets & render area changes separately
anv: don’t uninitialize bvh_bo_pool is not initialized
anv: uninitialize queues before utrace
anv: move generation shader return instruction to last draw lane
anv: fix generated draws gl_DrawID with more than 8192 indirect draws
anv: extract out draw call generation
anv: identify internal shader in NIR
anv: avoid MI commands to copy draw indirect count
anv: move generation batch fields to a sub-struct
util/glsl2spirv: add ability to pass defines
anv: factor out host/gpu internal shaders interfaces
anv: index indirect data buffer with absolute offset
anv: add ring buffer mode to generated draw optimization
anv: merge gfx9/11 indirect draw generation shaders
anv: document the draw indirect optimization ring mode
anv: fixup 32bit build of internal shaders
anv: fix uninitialized use of compute initialization batch
intel/fs: fix dynamic interpolation mode selection
anv/meson: add missing dependency on the interface header
anv: fix corner case of mutable descriptor pool creation
isl: disable MCS compression on R9G9B9E5
intel/fs: rerun divergence analysis prior to convert_from_ssa
intel/nir/rt: fix reportIntersection() hitT handling
anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders
anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC
intel/perf: fix querying of configurations
Louis-Francis Ratté-Boulianne (15):
panfrost: Fix error in comment
panfrost: Add methods to determine slice and body alignment
panfrost: Add method to get size of AFBC subblocks
panfrost: Precalculate stride and nr of blocks for AFBC layouts
panfrost: Add panfrost_batch_write_bo
panfrost: Make panfrost_resource_create_with_modifier public
panfrost: Split out internal of `panfrost_launch_grid`
panfrost: Add infrastructure for internal AFBC compute shaders
panfrost: Add method to get size of AFBC superblocks valid data
panfrost: Add support for AFBC packing
panfrost: Legalize resource when attaching to a batch
panfrost: Don’t force constant modifier after converting
panfrost: Add debug flag to force packing of AFBC textures on upload
panfrost: Add some debug utility methods for resources
panfrost: Add env variable for max AFBC packing ratio
Lucas Stach (33):
ci/etnaviv: update ci expectation
etnaviv: move resource seqnos to level
etnaviv: flush destination before executing blit
etnaviv: optimize resource copies by skipping clean levels
etnaviv: add helper to mark resource level as flushed
etnaviv: add helper to mark resource level as changed
etnaviv: add helper to transfer resource level age to another
etnaviv: add helper to get TS validity
etnaviv: add helper to set TS validity
etnaviv: move TS meta into etna_resource_level
etnaviv: add tile status buffer status into TS metadata
etnaviv: optimize sampler source update
etnaviv: allow sampler TS even if the resource is flushed
etnaviv: keep blit destination tile status valid if possible
etnaviv: optimize render resource update
etnaviv: optimize transfers when whole resource level is discarded
etnaviv: split etna_copy_resource_box levels parameter in src/dst
etnaviv: don’t allocate full resource as transfer staging
etnaviv: check for valid TS as condition to create the staging resource
etnaviv: reword comment about staging resource usage
etnaviv: remove huge outdated comment
etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause
etnaviv: remove superfluous braces
etnaviv: remove always true assert in etna_transfer_unmap
etnaviv: remove bogus comment about replacing resource storage
etnaviv: initialize VIVS_GL_BUG_FIXES
etnaviv: fix read staging buffer leak
Revert “ci/etnaviv: allow failure on failing test”
mesa: enable NV_texture_barrier in GLES2+ (again)
etnaviv: use correct blit box sizes when copying resource
etnaviv: zero shared TS metadata block
Revert “etnaviv: use correct blit box sizes when copying resource”
mesa: add GL_APPLE_sync support
Luigi Santivetti (1):
pvr: do not claim support for ASTC texture compression
M Henning (31):
nv50/ir: Drop nir_jump_return handling
nv50/ir: Remove ArgumentMovesPass
nv50/ir: Remove Function.stackPtr
nv50/ir: Remove dead loop from assignSlot
nv50/ir: Remove SpillSlot
nvc0: Keep nir directly in nvc0_program
nv50: Keep nir directly in nv50_program
nouveau: Delete nv50_ir_from_tgsi.cpp
nouveau: Drop tgsi support from nv50_ir_prog_info
nouveau: Drop ConverterCommon::Subroutine
nouveau: Drop BuildUtil::DataArray
nouveau: Drop BuildUtil::Location
nouveau: Delete the nouveau_compiler tool
nv/codegen: Call nir_shader_gather_info
nv/codegen: Implement nir_op_fquantize2f16
nvk: Remove reference to genUserClip
nv/codegen: Use nir_lower_clip
nv50_ir_from_nir: Use nir’s lower_fpow
nv/codegen: Delete OP_POW
nv/codegen: Fix an uninitialized variable warning
nv/codegen: Delete OP_WRSV
nv/codegen: Delete OP_EXP, OP_LOG
nv/codegen: Remove fragCoord variable.
nv/codegen: Merge from_common into from_nir
nv/codegen: Remove unused clipVertexOutput var
nv50_ir_ra: Delete unused functions
nv/codegen: Delete unused OP_CONSTRAINT
nv/codegen: Delete periodicMask32
nv/codegen: Remove Function::buildDefSets
nv/codegen: Change copy-constructor call to assign
nv/codegen: Delete copy and assign
Maaz Mombasawala (2):
svga: Make surfaces shareable at creation.
svga: Unify gmr and mob surface pool managers
Marcin Ślusarz (16):
iris: avoid duplicating validation entries
hasvk: remove dead code & comments related to mesh shading
anv: drop support for VK_NV_mesh_shader
intel/compiler: remove NV_mesh_shader support
intel/compiler: remove redundant code
anv: drop unused function
anv: merge cases leading to the same code
intel/compiler/mesh: compactify MUE layout
intel/compiler,anv: put some vertex and primitive data in headers
intel/compiler: load debug mesh compaction options once
intel/compiler/test: fix crashes when TEST_DEBUG is set
intel/compiler: add lsc_msg_desc_wcmask
intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2
intel/compiler/mesh: fix position of output URB handle for xe2
intel/compiler/mesh: implement IO for xe2
intel/compiler: mask GS URB handles at thread payload construction
Marek Olšák (125):
Revert “ac/nir/ngg: Follow intrinsic sources when analyzing before culling.”
glthread: determine global locking once every 64 batches to fix get_time perf
mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
freedreno,lima,zink: update CI fixes and flakes
util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag
radeonsi: fix a CDNA regression breaking compute
glthread: sync for VDPAU sync functions
radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets
radeonsi: restructure the loop for non-indexed multi draws
radeonsi: cosmetic changes to radeon_opt_* macros
radeonsi: handle draw user SGPRs as tracked registers
radeonsi: update obsolete comments about compiler queues
radeonsi: remove si_compute.h, move the contents into si_pipe.h
radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp
radeonsi: move si_emit_spi_map into si_state_shaders.cpp
radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states
radeonsi: remove splitting IBs that use too much memory
radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf
radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer
radeonsi: remove the draw counter with primitive restart from the HUD
radeonsi: always inline si_prefetch_shaders
radeonsi: specialize si_draw_rectangle using a C++ template
radeonsi: add index parameter into si_atom::emit
radeonsi: split direct pm4 emission from si_pm4_emit
radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state
radeonsi: merge pm4 state and atom emit loops into one
radeonsi: add a simple version of si_pm4_emit_state for non-shader states
radeonsi: handle deferred cache flushes as a state (si_atom)
radeonsi: remove render condition logic from si_draw by reordering atoms
radeonsi: abort when failing to upload descriptors instead of skipping draws
radeonsi: rename shader_pointers state -> gfx_shader_pointers
radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers
radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom
radeonsi/ci: update gfx11 failures
radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state
radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param
radeonsi: enable shader culling by default because it helps Viewperf
radeonsi: rewrite how occlusion query precision is determined for performance
radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly
radeon_winsys: move allow_context_lost from cs_create to ctx_create
winsys/amdgpu: rework how SW reset status is generated and reported
radeon_winsys: add a ctx_set_sw_reset_status callback
radeonsi: don’t abort for descriptor failures, let the winsys handle it
radeonsi: don’t use threadID.yz/blockID.yz for copy_image if those are always 0
radeonsi: don’t use threadID.yz/blockID.yz for compute_blit if they’re always 0
nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant
nir/algebraic: collapse ALU opcodes sourcing NaN
ac/gpu_info: add the /dev/dri/ filename into radeon_info
Revert “ac: don’t call ac_query_pci_bus_info from ac_query_gpu_info”
ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY
ac: document ac_shader_args::gs_vtx_offset
ac: minor updates to packet documentation and definitions
ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts
ac: improve the IB parser
ac: update gfx11 shadowed register tables
ac: add a standalone IB parser program
ac/surface: trivial non-functional changes
ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch
radeonsi: allow setting any index in radeon_set_sh_reg_idx
radeonsi: rename uses_subgroup_info to uses_tg_size
radeonsi: improve the heuristic when to use Wave32 for compute shaders
radeonsi: simplify/merge emit_shader_ngg functions
radeonsi: don’t pass gl_Layer to PS for blit shaders
radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits
radeonsi: fix templated si_draw_rectangle callback for Navi14
nir: replace undef only used by ALU opcodes with 0 or NaN
nir: remove nir_op_unpack_64 handling from nir_opt_undef
ac/llvm: don’t convert undef to 0 because nir_opt_undef does it now
meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1
gallivm: fix build with LLVM 18
amd/llvm: fix build with LLVM 18
radeonsi: fix compute-only contexts
ac/llvm: replace removed amdgcn.ldexp for LLVM 18
ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11
ac/llvm: set !fpmath 3.0 for llvm.sqrt
ac/gpu_info: don’t align IBs to the GL2 cache line size
ac/llvm: fix flat PS input corruption
amd: rename GFX110x to NAVI31-33
ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments
ac/gpu_info: pad IBs according to ib_size_alignment
winsys/amdgpu: pad gfx and compute IBs with a single NOP packet
Revert “radeonsi: specialize si_draw_rectangle using a C++ template”
radeonsi/ci: update navi10 results
gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements
gallium/util: add more tests for compute-only contexts
radeonsi: add another aux context for uploading shaders
radeonsi: upload shaders via a staging buffer so as not to map VRAM directly
ac/surface: don’t require exact pitch for gfx6-8 tiled imports
Revert “ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG”
Revert “radv/amdgpu: fix alignment of command buffers”
Revert “radv: fix alignment of DGC command buffers”
Revert “winsys/amdgpu: pad gfx and compute IBs with a single NOP packet”
Revert “ac/gpu_info: pad IBs according to ib_size_alignment”
Revert “ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments”
nir: sort variables by location in nir_lower_io_passes to work around a bug
nir: recompute IO bases after DCE in nir_lower_io_passes
nir: add dual-slot input information into load_input intrinsics
nir: take dual slot input info into account when computing IO driver locations
nir: gather dual slot input information
nir: expose reusable linking helpers for cloning uniform loads
nir: handle nir_var_mem_ubo in nir_clone_uniform_variable
ac/gpu_info: split ib_alignment as ip[type].ib_alignment
ac/gpu_info: move ib_pad_dw_mask into ip[]
ac/gpu_info: drop the hack unifying all IB alignments
ac/gpu_info: conservatively decrease IB alignment and padding to 256B
ac/gpu_info: set gfx and compute IB padding to only 8 dwords
winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop
winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1
winsys/amdgpu: pad gfx and compute IBs with only 1 NOP
ac/gpu_info: don’t allow register shadowing with SR-IOV due to bad performance
radeonsi: disable register shadowing without SR-IOV to fix bad performance
winsys/amdgpu: don’t send CP_GFX_SHADOW chunk if shadow address is not set
radeonsi/ci: update gfx1100 results
nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags
nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc.
mesa: don’t pass Infs to the shader via gl_Fog.scale
radeonsi/ci: update the runner for new build scripts
radeonsi/ci: enable GTF tests in the runner
radeonsi/ci: enable GLES CTS in the runner
radeonsi/ci: update failures and flakes
amd/common: update DCC for gfx11.5
radeonsi: initialize perfetto in the right place
radeonsi/gfx11: don’t set OREO_MODE to fix rare corruption
nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO
Marek Vasut (1):
etnaviv: Fully replicate back stencil config
Mark Collins (10):
tu/a7xx: Adapt r3d blits for A7xx
freedreno/rnn: Remove %n usage in fprintf
freedreno: Only add drm/computerator when system_has_kms_drm
freedreno/decode: Support building replay for multiple KMDs
freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB
meson: Warn about side-effects from DRM for FD KMDs
meson: Update libarchive to v3.7.2-2
freedreno/common: Add max_sets property to A6xxGPUInfo
tu: Support higher descriptor set count for A7XX
tu,util/driconf: Add option to not reserve descriptor set
Mark Janes (1):
intel: allow reduced memory usage for INTEL_MEASURE
Martin Roukala (né Peres) (22):
radv/ci: drop the auto-reboot-on-hang for vkcts-navi10
radv/ci: use the default kernel on vkcts-navi10
zink/ci: automatically reboot when hitting a kernel BUG on vangogh
zink/ci: document more flakes seen on vangogh
radv/ci: move vkcts-navi10 testing to KWS
radv/ci: add more tests to the navi10 vkcts flake list
radv/ci: increase the parallelism of the vkcts-navi21 job
radv/ci: add more tests to the navi21 vkcts flake list
radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes
radv/ci/vkcts-navi21: document more flakes
radv/ci/vkcts-navi10: catch all the line-related flakes
radv/ci: update the vkcts gfx1100 flake/fail lists
radv/ci: add a manual job to run vkcts on navi31
radv/ci: add a manual job for vkd3d-proton on navi31
ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake
ci/vkcts-navi21: mark more of the RT handles checks as flakes
ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs
zink/ci: remove 19 tests from the zink-radv-polaris10-fails list
ci/b2c: switch containers to a back-up ahead of valve-infra renaming
zink/ci: remove 42 tests from the zink-radv-polaris10-fails list
radv/ci: tighten the vkcts-navi21 timeouts
zink/ci: tighten the zink-radv-vangogh timeouts
Martin Stransky (1):
llvmpipe: fix UAF in lp_scene_is_resource_referenced.
Mary (6):
nouveau/mme: Add initial Fermi definition
nouveau/mme: Add Fermi builder
nouveau/mme: Add Fermi simulator
nouveau/mme: Add Fermi hardware tests
agx: Move nir_lower_fragcolor out of agx_preprocess_nir
agx: Ensure to lower 1D image load/store to 2D
Mary Guillemard (4):
nir: Add NVIDIA-specific geometry shader opcodes
venus: skip bind sparse info when checking for feedback query
zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C
venus: Do not submit batch manually when no feedback is required
Matt Coster (21):
pvr: Pad rogue_regarray_cache_key union members to avoid UB
pvr: Clean up extension tables
pvr: Refactor pvr_GetPhysicalDeviceProperties2()
docs: Fixup imagination/pvr extension support
pvr: Add VK_KHR_get_display_properties2
pvr: Add VK_KHR_get_memory_requirements2
pvr: Add VK_KHR_get_surface_capabilities2
pvr: Print VkStructureType name on pvr_debug_ignored_stype()
pvr: Add VK_KHR_copy_commands2
pvr: Don’t override commands copied to new buffer when extending cs
pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable
pvr: Zero tail of cs buffers after linking when dumping cs
pvr: Cleanup comments in pvr_physical_device_get_supported_*()
pvr: Don’t rely on GNU void pointer arithmetic
pvr: Force compile error on GNU void pointer arithmetic
pvr: Switch to common pipeline cache implementation
pvr: Use vk_sampler base
pvr: Clean up & fix sampler border color support
pvr: Don’t pass pvr_physical_device when only device info is needed
pvr: Minor refactor of pvr_device.c
pvr: Use common physical device properties
Matt Turner (10):
Revert “intel/fs: only avoid SIMD32 if strictly inferior in throughput”
intel: Rearrange for next commit
intel: Consider with_intel_clc in with_any_intel
intel: Only build blorp if drivers are enabled
intel: Only build ds if drivers are enabled
intel: Only build perf if drivers or tools are enabled
intel: Allow using intel_clc from the system
intel: Limit Intel Vulkan RT to x86_64
r600: Add missing dep on git_sha1.h
util: Include stdint.h in libdrm.h
Mauro Rossi (7):
nouveau/ws: fix building error in nouveau_ws_push_dump()
vulkan/meta: fix gnu-empty-initializer build error
nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE
anv/android: remove numFds check
hasvk/android: remove numFds check
Android.mk: filter out cflags to build with Android 14 bundled clang
Android.mk: disable android-libbacktrace to build with Android 14
Mike Blumenkrantz (293):
ci: bump VVL to 1.3.257
zink: set pipeline dynamic state count after all dynamic states are set
zink: set feedback attachments on batch init
zink: be even dumber about buffer refs when replacing storage
zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
zink: fix the fix for separate shader program refcounting
kopper: handle pixmap creation failure more gracefully
glxsw: check geometry of drawables on creation
kopper: move pixmap param for drawable creation to info struct
glx/dri3: split out modifier check
glx/sw: check for modifier support in the kopper path
kopper: pass modifier availability to drawable creation
kopper: determine modifier support per-drawable
zink: don’t clobber descriptor mode on multiple screen creation
nir: fix slot calculations for compact variables with location_frac
lavapipe: use the component offset directly for xfb
nir: add a helper for calculating variable slots
radv: bump max xfb output to 128
ir3: bump max xfb output to 128
gallium: bump PIPE_MAX_SO_OUTPUTS to 128
zink: add feedback loop exts to optimal profile
glsl: only explicitly check GS components in PSIZ injection with output variables
lavapipe: statically allocate fb attachment array
lavapipe: zero fb attachment array at rp start
lavapipe: don’t check geometry for fb attachments
lavapipe: be slightly more permissive for bad apps (and cts) with dynrender
lavapipe: VK_EXT_host_image_copy
zink: better handle separate shader dsl creation when no bindings exist
zink: force image barriers after dmabuf import
ci: bump VVL to 1.3.261
zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
zink: unset line stipple ds3 state flags when stipple not available
nir/lower_io_to_scalar: fix 64bit io splitting
nir/linking_helpers: force type matching in does_varying_match
nir/print: print location names for (some) tess slots
nir/print: always group variables by type when printing
zink: add batch refs for transient images
zink: fix zs resolve attachment indexing
zink: don’t add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
zink: don’t append msrtss to dynamic render if not supported
zink: set msrtss depth resolve mode when enabled
zink: hook up VK_KHR_workgroup_memory_explicit_layout
zink: propagate have_workgroup_memory_explicit_layout to ntv
zink: use SPV_KHR_workgroup_memory_explicit_layout when available
zink: add more locking for pipeline cache
zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT
aux/trace: fix winsys handle dumping
zink: generated tcs is on the tes, not the vs
zink: apply ZINK_DEBUG=noopt to linked separate shaders
gallivm: handle A8_UNORM image stores
llvmpipe: enable A8_UNORM for shader images
llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED
lavapipe: GetRenderingAreaGranularityKHR
llvmpipe: block weird uses of subsampled formats in buffers
llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
lavapipe: fix BindVertexBuffers2 buffer size handling
lavapipe: fix resolves where src image has a layer offset
lavapipe: block yuv formats from getting blit feature flags
lavapipe: BindIndexBuffer2
lavapipe: GetDeviceImageSubresourceLayoutKHR
lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops
lavapipe: maintenance5
zink: fix xfb buffer array sizing to use buffer limit, not output
zink: move ZINK_DEBUG=nir printing to just before compile
draw: fix so debug offset printing
zink: reindex ssa defs before dumping debug shaders
lavapipe: zero-init pipe_sampler_state
zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
zink: fix big tcs output io
zink: don’t try to replace separate shader prog in noopt mode
zink: pre-convert mode in fixup_io_locations
zink: add a special separate shader i/o mode for legacy variables
nir: minor fixes for io_to_scalar
nir/lower_io: add a new doubles-only 64bit lowering option
nir: add a filter cb to lower_io_to_scalar
d3d10umd: use cso_context to set vertex buffers and elements
virgl: move virgl_vertex_elements_state to header
virgl: fix some indentation
nouveau: calloc vertex csos
gallium: move vertex stride to CSO
zink: fix null config screen creation
zink: fix crash in lower_pv_mode_gs_store
u/draw: skip zero-sized indirect draws
lavapipe: handle VkPipelineCreateFlagBits2KHR
lavapipe: handle VkBufferUsageFlags2KHR
zink: ci updates
zink: track start/stop of a couple query types
zink: require EDS1 for CWE usage
zink: unset primgen suspended flag when ending a primgen query
zink: rework rast-discard for primgen queries
zink: rip out some awkward parts of the old non-cwe path
zink: drop CWE requirement for renderpass tracking with primgen queries
nir/zink: fix gs emulation xfb_info sizing
zink: move fragcolor lowering further along the compile process
zink: add a mode param to find_var_with_location_frac
zink: use lowered io (kinda) for i/o vars
zink: stop lowering indirect derefs
ntt: handle interp intrinsics as derefs
zink: delete split_blocks pass
zink: delete lower_64bit_vertex_attribs pass
zink: fix clip/cull dist xfb inlining
zink: delete all the extra gross xfb handling
zink: stop using pipe_stream_output
zink: remove pipe_stream_output from function params
zink: ci updates
aux/trace: print bindless handles as pointers
zink: remove unused param from create_ici
zink: split create_ici to init and eval
zink: add maintenance extensions to profile
zink: use maintenance5
zink: use real A8_UNORM when possible
vk/graphics: fix CWE handling with DS3
Revert “vk/wsi/x11: handle geometry updating more asynchronously”
r600: store the mask of buffers used by a vertex state
r600: better tracking for vertex buffer emission
zink: wait on async fence during ctx program removal
zink: handle patch variable locations for separate shaders better
zink: don’t start multiple cache jobs for the same program
zink: use the “set” optimal key for prog last_variant_hash for consistency
zink: sanitize optimal keys
zink: copy some cs shader properties to the program struct
zink: handle global atomic intrinsics
zink: use Aligned with global load/store ops
zink: fix rewrite_read_as_0 filtering
rusticl: fixes for zink shader images
zink: pass KERNEL shaders through successfully
zink: add a618 flake
zink: break out ds3 state resetting
zink: be consistent with ds3 state resetting for blits
zink: fix optimal_keys warning message
zink: force-reset unordered flags for buffer barriers on non-matching batch access
zink: reset unordered flags for image barriers on non-matching batch access
zink: make image barrier init functions void return
zink: simplify some image barrier conditionals
zink: remove sync TODO
zink: add lavapipe flake
ci: disable nouveau shaderdb
egl/dri3: only set driver_name if not already set
egl: call dri3_x11_connect() for zink
egl: bind dri2_set_WL_bind_wayland_display for zink when necessary
zink: be more precise about flagging rp changes around unordered u_blitter
zink: don’t block reordering during ref updates in unordered blits
lavapipe: update vbo indices before propagating stride
lavapipe: fix pipeline stride propagation
zink: fix linear modifier dmabuf imports
zink: polaris ci updates
aux/tc: handle stride mismatch during rp-optimized subdata
zink: always add a per-prog ref for gpl libs
zink: use a pointer to simplify submit struct mechanics
zink: make zink_resource_image_barrier2_init public
zink: add a third submitinfo (unused for now)
zink: make submitinfo handling easier to manage with enum
zink: add another submitinfo for fd semaphore waits
zink: add a screen cache for fd semaphores
zink: add a util for getting cached fd semaphores
zink: hook up cached fd semaphore usage for batch signal/waits
zink: handle implicit sync for dmabufs
zink: handle multi-plane implicit sync
zink: ci updates
zink: set is_xfb=false for all i/o variables
zink: reorder bindless io lowering
zink: fix typing on bindless io lowering
zink: delete some bindless io lowering code
zink: use nir_io_semantics::num_slots for indirect var creation
zink: simplify an arrayed io check during variable creation
zink: use explicit stride from types instead of copying old_var stride
zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing
zink: use explicit sizing for builtins when creating variables
zink: create new vars without copying existing ones
zink: add a new linker pass to handle mismatched i/o components
zink: use right function to get src_type in eliminate_io_wrmasks
zink: re-rework i/o variable handling to make having variables entirely optional
ci: bump VVL to 1.3.263
zink: simplify redundant is_buffer check
zink: use VkFormatProperties3
lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT
lavapipe: don’t advertise UNDEFINED layout for HIC
zink: hook up VK_EXT_host_image_copy
zink: move mem type detection up in file
zink: disable HIC without resizable BAR
zink: add a fixup method for extra driver props
zink: fix some off-by-one indentation
zink: use some return codes for check_ici errors
zink: check/use suboptimal HIC during ici init
zink: use HIC for image subdata when possible
zink: slightly refactor psiz deletion during linking
zink: delete all psiz=1.0 stores if maintenance5 is present
nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
zink: add ZINK_DEBUG=quiet
zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip
zink: set optimal_keys for turnip jobs
aux/tc: fix staging buffer sizing for texture_subdata
aux/tc: fix address calc for segmented texture subdata
zink: ci updates
lavapipe: KHR_map_memory2
zink: slightly refactor pipeline compile selection
zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED
zink: remove an intermediate variable in pipeline compile selection
zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path
zink: pass a stage mask to pipeline create functions
glsl: check for xfb setting xfb info
zink: don’t warn about missing scalarBlockLayout on v3dv
aux/tc: fix renderpass tracking fb state clobber scenario
vk/enum2str: add more max enum vendors
aux/tc: fix rp info handling around tc_sync calls
aux/tc: don’t use pipe_buffer_create_with_data() for rp-optimized subdata
zink: flag db maps as unsynchronized
lavapipe: clamp cache uuid size
lavapipe: EXT_load_store_op_none
tu: handle unused color attachments without crashing
zink: use much bigger dummy surfaces
zink: propagate rp_tc_info_updated across unordered blits
zink: use null attachments for null attachments with dynamic render
egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque
egl/wayland: split out wl drm extension init
egl/wayland: use more registry listeners to better handle device init
egl/wayland: enable WL_bind_wayland_display for zink
zink: delete injected pointsize during shader creation
zink: require maintenance5 for shobj
zink: delete a non-maintenance5 workaround for shobj use
lavapipe: set separate_shaders for shader objects
zink: set workgroup_memory_explicit_layout for shader validation
zink: add a ZINK_DEBUG=validation alias
zink: fix semaphore signal ordering
zink: move swapchain fence to swapchain object
zink: avoid UAF on wayland async present with to-be-retired swapchain
zink: always trace_screen_unwrap in acquire
lavapipe: fix variable descriptor count support handling
lavapipe: always set independent blend
lavapipe: more vertex stride fixups
lavapipe: set default viewport and scissor count for cmdbufs
lavapipe: set default min sample shading to 1
glx: XFree visual info
radv: fix external handle type queries for dmabuf/fd
zink: fix crashing in image rebinds
zink: move push descriptor disable to driver workarounds
zink: move v3dv scalarBlockLayout workaround
zink: fix end-of-batch barrier pipeline stages
zink: guarantee egl syncobj lifetime
aux/trace: dump enum names for map usage
gallium: add PIPE_MAP_NONE
Revert “egl/wayland: Add image loader extension for swrast”
egl/wayland: don’t block in swrast when updating buffers for zink
egl/wayland: return sooner from swrast_update_buffers() if zink
zink: don’t check submit count for unflushed usage
egl: don’t set ForceSoftware for all zink loading
zink: error at handle export on missing EXT_image_drm_format_modifier
gbm: delete some zink handling
zink: apply ZINK_DEBUG=quiet to all missing feature warnings
zink: set ZINK_DEBUG=quiet for polaris jobs
lavapipe: don’t block begin/end cmdbuf pipeline barriers
ci: add a630 trace flakes
zink: shrink vectors during optimization
zink: always clamp shader stage in descriptor handling
zink: add set_global_binding
zink: eliminate samplers from no-sampler CL texops
zink: add some checks to determine whether queue is init on screen destroy
zink: don’t destroy any simple_mtx_t objects during screen destroy
zink: don’t destroy uninitialized disk cache thread
zink: reorder glsl_type_singleton_init_or_ref call
zink: use screen destructor for creation fails
zink: fix readback_present locking
zink: add automatic swapchain readback using heuristics
lavapipe: VK_EXT_nested_command_buffer
zink: ignore unacquired swapchain images during end-of-frame flush
nir/lower_fragcolor: preserve location_frac
zink: update pointer for GPL pipeline cache entry formats
zink: fix legacy depth texture rewriting for single component reads
egl: unify dri2_egl_display creation
egl: init dri3 version info during screen creation
egl/glx: don’t load non-sw zink without dri3 support
egl: add automatic zink fallback loading between hw and sw drivers
glx: add automatic zink fallback loading between hw and sw drivers
ci: don’t set GALLIUM_DRIVER for zink
egl/wayland: only add more registry listeners for hardware devices
zink: only increment image_rebind_counter on image export if binds exist
zink: check for sampler view existence during zink_rebind_all_images()
zink: use weston for anv ci
zink: blow up broken xservers more reliably
zink: delete some dead modifier handling
ci: skip implicit modifier piglits for zink
zink: don’t block large vram allocations
zink: add copy box locking
zink: emit SpvCapabilitySampleRateShading with SampleId
zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem
zink: clamp resolve extents to src/dst geometry
zink: only emit xfb execution mode for last vertex stage
aux/u_transfer_helper: set rendertarget bind for msaa staging resource
zink: unset explicit_xfb_buffer for non-xfb shaders
mesa/st/texture: match width+height for texture downloads of cube textures
zink: add more locking for compute pipelines
radv: correctly return oom from the device when failing to create a cs
zink: check for cbuf0 writes before setting A2C
Mohamed Ahmed (19):
vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c
vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c
vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c
nvk: implement vkGetDeviceImageMemoryRequirementsKHR()
nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR()
nvk: implement vkGetDeviceBufferMemoryRequirementsKHR()
nvk: advertise VK_KHR_maintenance4
nvk: advertise DemoteToHelperInvocation
nvk: Enable multiplane images and image views
nouveau/nvk: Add YCbCr sampler NIR lowering pass
nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c
nouveau/nvk: Create helper function for sampler creation
nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT
nouveau/nvk: Enable VK_KHR_sampler_ycbcr
util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats
vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly
nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only
nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only
nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM
Nanley Chery (33):
iris: Remap DRM_FORMAT_MOD_INVALID more often during import
anv: Don’t support ASTC images with modifiers
intel: Add and use isl_drm_modifier_get_plane_count
anv: Handle explicit surface layout of DG2_RC_CCS
anv: Reduce accesses of isl_mod_info->aux_usage
iris: Reduce accesses of mod_info->aux_usage
crocus: Delete modifier with aux code
hasvk: Delete modifier with aux code
iris: Swap stencil and modifier aux assignment order
intel: Describe modifier compression with booleans
intel/isl: Move the Tile4 modifier score case down
intel/isl: Add a score for DG2_RC_CCS
intel/blorp: Ambiguate after CCS resolves on gfx7-8
iris: Reorder render_aux_usage parameters
iris: Pass the render format to prepare_render
iris: Create BLORP surfaces after resource preparation
iris: Handle clear color compatibility in prepare_render
iris: Sample more texture view fast-clears on gfx11+
iris: Fix aux usage tracking in prepare_render
iris: Fix iris_copy_region calls involving FCV_CCS_E
iris: Drop get_copy_region_aux_settings
iris: Inline iris_can_sample_mcs_with_clear
anv: Initialize the clear color more often for FCV
intel: Return a bool from intel_aux_map_add_mapping
anv: Move scope of CCS binding determination
anv: Allocate space for aux-map CCS in image bindings
anv: Wrap aux surface image binding queries
anv: Refactor CCS disabling at image bind time
anv: Place images into the aux-map when safe to do so
anv: Loosen anv_bo_allows_aux_map
anv: Meet CCS alignment reqs with dedicated allocs
anv: Delete implicit CCS code
intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS
Neal Gompa (1):
asahi: Fix 32-bit x86 build with correct data type for overflow error message
Neha Bhende (1):
ntt: lower indirect tesslevels in ntt
Paul Gofman (2):
driconf: add a workaround for Captain Lycop: Invasion of the Heters
driconf: add a workaround for Rainbow Six Extraction
Paulo Zanoni (15):
anv: rename the vm_bind vfuncs
anv: add a new vm_bind vfunc
anv/xe: make vm_binds async
anv/xe: return failure in case waiting for the vm_bind syncobj fails
anv: remove misleading comment about batch_len
iris: assert bufmgr->bo_deps_lock is held
iris: avoid stack overflow in iris_bo_wait_syncobj()
iris: assert(bo->deps) after realloc()
intel/isl: add ISL_SURF_USAGE_SPARSE_BIT
intel/isl: simplify the check for maximum surface size
anv/sparse: add the initial code for Sparse Resources
anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind
anv/sparse: add INTEL_DEBUG=sparse
anv: enable sparse resources by default
vulkan: fix potential memory leak in create_rect_list_pipeline()
Pavel Ondračka (44):
r300: update RV370 failures
r300: check for index overflow when translating from TGSI
r300: source register index is always unsigned
r300: bump the RC_MAX_INDEX_BITS
r300: normal instruction can’t have presubtract op
r300: add a helper for checking number of temporary sources
r300: cycles estimate for shader-db
r300: fix cycles calculation
r300: don’t abort on flow control when using draw for vs
r300: add dEQP baseline for RV370 with forced swtcl
r300: copy ntt to r300 compiler
r300: add lower_sqrt to nir option
r300: remove unused intrinsics in ntr
r300: remove irrelevant opcodes in ntr
r300: remove unused integer support in ntr
r300: remove ntr_tgsi_usage_mask
r300: remove more unused 64-bit pieces from ntr
r300: simplify vectorization rules
r300: remove more ntr unused helpers
r300: remove the unneeded ntr_lower_vec_to_reg callback
r300: remove unneeded 64bit and atomic lowering passes
r300: remove unused ntr default settings
r300: remove ntr default options
r300: simplify ntr_emit_load_ubo
r300: simplify ntr_emit_load_input
r300: remove some virglrenderer specifics from ntr
r300: simplify ntr_setup_uniforms
r300: simplify ntr_output_decl
r300: simplify ntr_try_store_in_tgsi_output
r300: remove some unsupported texture opcodes
r300: remove unused barrier code from ntr
r300: simplify ntr_get_gl_varying_semantic
r300: remove the nrt main optimization loop
r300: reorder for easier presubtract 1-x pattern recognition
r300: exit early in presubtract is not supported
r300: implement bias presubtract
r300: convert x * 2 into x + x for presubtract
r300: move power of two multipliers down
r300: there is no limitation on presubtract source file
r300: use w channel for scalar opcodes if possible
r300: reduce number of iterations for vertex shader loops
r300: enable nir_move_vec_src_uses_to_dest
nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output
nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources
Philipp Zabel (1):
etnaviv: fix segfault after compile failure
Pierre-Eric Pelloux-Prayer (18):
radeonsi/sdma: use multiple commands if required
radv/sdma: use multiple commands if required
radv/sdma: use correct limits for gfx10.3
glx: drop the ‘libGL’ log prefix
loader: refactor DRI_PRIME handling code
loader: extend DRI_PRIME to support =N
loader: add DRI_PRIME_DEBUG env var
device_select_layer: support DRI_PRIME=n
docs: update DRI_PRIME documentation
device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE
st/mesa: check renderbuffer before using it
radeonsi: emit framebuffer state after allocating cmask
amd/common: update addrlib for gfx11.5
amd/common: add registers for gfx11.5
ac/nir: extract must_wait_attr_ring helper
amd, radeonsi: Add code to enable gfx11.5
mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode
radeonsi: check sctx->tess_rings is valid before using it
Piotr Kocia (2):
nir: Remove dead nir_const_value variables
glsl: ir_function_param_visitor::visit_enter always true condition
Qiang Yu (77):
aco,radv: replace tess_input_vertices shader info param
radeonsi: aco does not pass LS outputs to HS by arg
radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco
radeonsi: init aco shader info for merged LS/HS
radeonsi: simplify si_build_wrapper_function
radeonsi: move vertex shader vb desc input sgpr args to last
radeonsi: remove param type check in wrapper function
radeonsi: refine si_llvm_ls_build_end
radeonsi: refine si_llvm_es_build_end
radeonsi: aco compile support merged mono shader
radeonsi: calculate lds size for merged shaders
radeonsi: enable aco compile for mono merged LS/HS
radeonsi: enable aco compile for mono merged ES/GS
aco: extract aco_compile_shader_part from aco_compile_ps_epilog
aco: add p_end_with_regs pseudo instruction
aco: move jump to epilog out of ic_merged_wave_info
aco: add tcs end regs for epilog usage
aco: allow tcs with epilog to keep nir store output instruction
aco: add pending_lds_access option for insert waitcnt
aco: add tcs epilog generation for radeonsi
aco: don’t emit s_endpgm for tcs with epilog
aco: skip scratch init when no scratch arg provide
aco,radeonsi: save const addr to symbol
ac/nir/tess: move tess factor output out of control flow
aco: use semantic location as io temp index
radeonsi: add exec_size to shader binary
radeonsi: support upload multi part shader binary
radeonsi: share si_get_tcs_out_patch_stride with aco
radeonsi: fill part mode tcs aco shader info
radeonsi: extract si_llvm_build_shader_part
radeonsi: remove separate_prolog arg from prolog/epilog build
radeonsi: add si_get_tcs_epilog_args
radeonsi: change si_fill_aco_options args
radeonsi: add si_aco_build_shader_part
radeonsi: part mode standalone tcs support aco compile
radeonsi: remove unused arg of get_tcs_tes_buffer_address
aco: simplify setup_tcs_info
aco: pass sw_stage when setup_isel_context
aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog
aco: add vs prolog instruction selection for radeonsi
aco: add aco compile interface for radeonsi vs prolog
aco: do not fix_exports when program is prolog
radeonsi: fill aco_shader_info->is_monolithic
radeonsi: remove is_monolithic from vs prolog key
radeonsi: extract si_get_vs_prolog_args to be shared with aco
radeonsi: fix aco options has_ls_vgpr_init_bug setup
radeonsi: add vs prolog aco build
radeonsi: set vs has prolog aco shader info
radeonsi: enable aco compile for part mode standalone vs
aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately
ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args
aco: do not eliminate final exec write when p_end_with_regs block
aco: remove p_end_with_regs from needs_exact()
aco: add ps prolog generation for radeonsi
aco: handle ps outputs from radeonsi
aco: add create_fs_end_for_epilog for radeonsi
aco,radv: remove unused ps epilog info fields
aco,radv: rename ps epilog info inputs to colors
aco: simplify export_fs_mrt_color
aco,radv: add radeonsi spec ps epilog code
aco: compact ps expilog color export for radeonsi
aco,radv,radeonsi: pass spi ps input ena and addr
aco: do not fix_exports when program has epilog
aco: fix assertion fail when program contains empty block
aco: create exit block for p_end_with_regs to branch to
aco: wait memory ops done before go to next shader part
radeonsi: reduce sgpr count for scratch_offset when aco
radeonsi: init spi_ps_input_addr for part mode ps
radeonsi: extract si_prolog_get_internal_binding_slot
radeonsi: extract si_get_ps_prolog_args to be shared with aco
ac,radeonsi: remove unused ps prolog key fields
radeonsi: add ps prolog shader part build
radeonsi: extract si_get_ps_epilog_args to be shared with aco
radeonsi: fill aco shader info for ps part
radeonsi: add ps epilog shader part build
radeonsi: enable aco compile for part mode ps
radeonsi: disable disk cache when use aco
Rebecca Mckeever (32):
vulkan/runtime: Add helper functions for VK_EXT_host_image_copy
nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base
nouveau/codegen: Set lower_device_index_to_zero
nvk: Convert system values for gl_PointCoord and PointCoord into inputs
nvk: Add base_group to root descriptor table
nvk: Lower base_workgroup_id
nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch
nvk: Advertise KHR_device_group
nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table
nvk: Add A4B4G4R4 formats to nil_format_info table
nvk: Advertise EXT_4444_formats
nvk: Enable shadow sampling
nvk: Implement VK_EXT_non_seamless_cube_map
nouveau/nil: Add macros for ufixed
nvk: Implement VK_EXT_image_view_min_lod
nvk: Update mutable descriptor struct type
nvk: Replace asserts with conditional that sets type_list = NULL
nvk: Implement nvk_GetDescriptorSetLayoutSupport
nvk: Enable VK_KHR_maintenance3
nvk: Advertise VK_EXT_mutable_descriptor_type
nvk: Set image index to zero for NULL nvk_buffer_view
nvk: Advertise VK_EXT_image_robustness
nvk: Advertise VK_EXT_robustness2
nvk: Add view_index to root descriptor table
nvk: Lower nir_intrinsic_load_view_index
nvk: Add draw support for multiview
nvk: Add query support for multiview
nvk: Add input attachments support for multiview
nvk: Advertise VK_KHR_multiview
nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering
nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros
nvk: Move code inside view mask loops to a helper function
Rhys Perry (89):
ac/llvm: fix AC_TM_CHECK_IR
radv: fix radv_get_ballot_bit_size with CS
ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
radv: add conformant_trunc_coord to cache UUID
radv: don’t unset TRUNC_COORD if conformant_trunc_coord=true
ac/nir: always round cube array layers
nir/unsigned_upper_bound: fix phi(bcsel)
nir/tests: add test for unsigned_upper_bound with loop header phis
nir/opt_dead_cf: remove nodes after a jump earlier
nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if
aco: insert s_nop before VGPR deallocation
nir/lower_shader_calls: vectorize stack access for all shaders
radv: workaround WWZ exporting index=1 through location=1
radv: correctly skip MRT output NaN fixup for meta shaders
radv: don’t set vertex_attribute_strides on GFX8+
radv/ci: skip some mesh shader tests on GFX1100
aco: summarize register demand after handling branches
aco: don’t create sendmsg(dealloc_vgprs) if scratch is used
radv: disable 64-bit color attachments
radv: fix 128bpp comp-to-single clears
radv: support 128bpp comp-to-single with all colors
radv/gfx11: re-enable 0001/1110 clear values
nir/lower_shader_calls: fix align_offset
nir/opt_load_store_vectorize: support scratch access
radv: vectorize RT stack access
radv: vectorize scratch access
aco: fix p_bpermute_gfx6 with input at non-zero byte
aco: fix p_bpermute_gfx6’s exec save/restore with wave32
aco: clarify bpermute pseudo opcode names
aco: add adjust_bpermute_dst helper
aco/spill: skip p_branch in process_block
aco/spill: add all live-in to merge block spill candidates
nir/lower_system_values change num_workgroups to uint32_t
radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions
radv: use shortcut_1d_workgroup_id
aco: remove fast path in insert_exec_mask’s process_instructions
aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
aco: check logical_phi_info at p_logical_end when eliminating exec writes
aco: remove unused p_logical_end check when optimizing branching sequence
radv: disable mesh dispatch XYZ_DIM when possible
nir/deref: remove rematerialize_deref_in_block cache
aco: reset prefetch in the correct block after removing the exit
aco/waitcnt: replace wait_cnt::*_cnt with booleans
aco/waitcnt: add print helpers
nir/lower_int64: fix find_lsb(0)
nir/algebraic: optimize u2u32(a >> 32)
aco/optimizer_postRA: don’t combine DPP across exec on GFX8/9
aco: don’t combine DPP into v_cmpx
aco: disable zero offset optimization for strict WQM coords
nir/constant_folding: remove zero texel offset
aco: remove zero offset optimization
aco: shrink DPP8_instruction
aco: add fetch_inactive field to DPP instructions
nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd
aco: disable FI for quad/masked swizzle
aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt
aco: only mitigate VcmpxExecWARHazard when necessary
aco: fix s_setreg hazards
aco: consider exec_hi in reads_exec()
aco: resolve all possible hazards at the end of shader parts
aco/tests: test that hazards are resolved at the end of shader parts
radv: skip zero-sized memcpy
ac/nir: fix out-of-bounds access in ac_nir_export_position
radv: fix signed integer overflow
Revert “radv: pre-init surface info”
nir: improve ms_cross_invocation_output_access with local_invocation_id
aco,nir: add export_row_amd intrinsic
ac/nir: add row parameter to helpers
ac/nir: remove dead code
ac/nir: refactor mesh vertex/primitive export
ac/nir: implement mesh shader gs_fast_launch=2
ac/nir: optimize mesh shader local_invocation_index
radv: implement mesh shader gs_fast_launch=2
ac/nir: add emit_ms_outputs helper
ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms
ac/nir: implement mesh shader multi-row export
radv: implement mesh shader multi-row export
radv: enable mesh shader gs_fast_launch=2 and multi-row export
nir/serialize: fix signed integer overflow
nir/lower_shader_calls: skip zero-sized qsort
util: skip zero-sized SHA1Update
radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements
radv: skip radv_remove_varyings for mesh shaders
radv: disable gs_fast_launch=2 by default
docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default
radv: add radv_disable_trunc_coord option
radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK
ac/nir: fix partial mesh shader output writes on GFX11
Rob Clark (60):
freedreno: move virtgpu msm_proto.h to common
freedreno/drm/virtio: Remove unused header
tu/msm: staticify a couple things
tu/knl: Remove some random const’ness
drm-uapi: Update virtgpu header
freedreno: Update virtgpu proto
freedreno/drm/virtio: Use global_faults
tu: close submitqueues before device_finish()
tu/drm: Factor out shared helpers
tu/drm: Add missing error path cleanup
tu/drm: Split out helper for iova alloc
tu: Add virtgpu support
util: Decouple disk cache from EGL_ANDROID_blob_cache
docs: Followup to !24636
tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE
ir3+tu: Simplify ir3_find_sysval_regid callers
freedreno/a6xx: Drop unused screen args
freedreno/a6xx: Re-work fd6_emit_shader
freedreno/a6xx: Re-write the function-of-doom
freedreno: Implement ATI_meminfo
freedreno/a6xx: ARB_post_depth_coverage
freedreno/a6xx: ARB_sample_locations
freedreno/a6xx: ARB_texture_filter_minmax
freedreno/a6xx: EXT_demote_to_helper_invocation
freedreno/a6xx: EXT_shader_image_load_formatted
freedreno/a6xx: EXT_depth_bounds_test
freedreno/a6xx: Use pipe_blit_info::sample0_only
freedreno/a6xx: Handle PIPE_BIND_BLENDABLE
freedreno/a6xx: ARB_shader_viewport_layer_array
tu: Fix heap size
freedreno: Fix crash with debug msgs enabled
freedreno/layout: Handle 565/etc MSAA special case
freedreno/decode: Fix printing chip-id
freedreno/a6xx: Add L8_SRGB
freedreno: Add reformatting commits to .git-blame-ignore-revs
freedreno/fence: Hold a strong ref to batch
freedreno/decode: Lookup device info
freedreno/decode: Use info->chip to decode
freedreno/decode: Remove gpu_id
freedreno: Indentation fix
freedreno: Use explicit QCOM_TILED3 modifier
freedreno/a6xx: Remove dummy packet for globals
freedreno: Fix streamout offset_buf dirtiness
freedreno: Fix user const buffer dirtiness
freedreno/batch: Move query_buf allocation
freedreno: Add private-BO tracking
freedreno: Add missing indirect_draw_count tracking
freedreno: Move/add some attach_bo()
freedreno: Add attach-bo debugging
freedreno: Rework supported-modifiers handling
mesa: Introduce MESA_texture_const_bandwidth
mesa: Implement MESA_texture_const_bandwidth
freedreno: Add PIPE_CAP_HAS_CONST_BW support
panfrost: Add PIPE_CAP_HAS_CONST_BW support
iris: Add PIPE_CAP_HAS_CONST_BW support
radeonsi: Add PIPE_CAP_HAS_CONST_BW support
tu/msm: Fix timeline semaphore support
tu/virtio: Fix timeline semaphore support
freedreno/drm: Fix race in zombie import
freedreno: Always attach bo to submit
Robert Foss (9):
egl: Expose access to DeviceList
egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList()
egl: Refresh DeviceList during eglInitialize()
egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2()
egl/android: Use EGL DeviceList instead drmGetDevices2()
egl: Rename _eglAddDevice() to _eglFindDevice()
egl: Rename _eglAddDevice() to _eglFindDevice()
egl: Fix attrib_list[0] == EGL_NONE check
egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay()
Robert Mader (6):
egl/wayland: wait for compositor to release shm buffers
iris: Support parameter queries for main planes
util: Add new helpers for pipe resources
panfrost: Support parameter queries for main planes
vc4/resource: Support offset query for multi-planar planes
v3d/resource: Support offset query for multi-planar planes
Rohan Garg (33):
iris: migrate WA 14013910100 to use the WA framework
iris: migrate WA 14016118574 to use the WA framework
iris: fix iris for WA 16013000631
intel/perf: add perf query support for Intel Raptorlake
intel/genxml: set a default value for “Pixel Position Offset Enable” in genxml
anv: use the WA infrastructure where possible when generating state
anv: use the correct GFX_VERx10 macro for WA
anv,iris: program the maximum number of threads on compute queue init
anv: drop CFE state validation checks
iris: track reset signalling instead of replacing the context
iris: allow for a unsynchronized device reset query
anv: partially revert 2e8b1f6d
anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+
anv: use the pre defined _3DPRIMITIVE_DIRECT macro
anv: drop dead ifdef
iris: use the correct WA macros and lineage numbers
anv: use the lineage number for WA
crocus: add a __gen_get_batch_address declaration
crocus: fix GFX_VERx10 macro
blorp: drop undefined macro
iris: migrate preemption streamwout wa to WA infra
intel/genxml: update PIPE_CONTROL instruction for dg2
anv: define clear color localy within can_fast_clear_color_att
intel/compiler: Adjust CS payload registers for new register width on Xe2+
intel/compiler: Adjust fence message lengths for new register width on Xe2+
intel/compiler: Adjust barrier emission for Xe2+
intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec
anv: ensure that FCV_CCS_E fast clears are properly tracked
anv: enable FCV for Gen12.5
anv: fix debug string for PC flush
anv: cleanup includes
anv: turn off non zero fast clears for CCS_E
anv: selectively enable FCV optimization for DG2
Roland Scheidegger (1):
lavapipe: further limit accurate_a0 hack
Roman Stratiienko (22):
egl: android: Remove legacy name-based shared buffers support
util: Add NONNULL macro
android: Introduce the Android buffer info abstraction
android: Fix num_planes assignment in u_gralloc_fallback
v3dv/android: Use u_gralloc code
v3dv/android: Enable shared presentable image support
v3dv: Migrate to vk_device_memory
v3dv/android: Skip swapchain binding
v3dv: Rely on the internal tiled flag instead of the common vk structure
v3dv/android: Add a helper function to support explicit layouts
v3dv/android: Rework Android native buffer importing logic
v3dv: Use format stored in vk_image and vk_image_view after init
v3dv: Split v3dv_image_init to use layout setting logic separately
v3dv/android: Add AHardwareBuffer support
v3dv: Enable VK API v1.2 for Android
panvk: Add Android ICD loader entry point
u_gralloc: Remove inline modifiers from the functions
u_gralloc: Remove usage of NONNULL macro
Revert “util: Add NONNULL macro”
u_gralloc: Add a function that returns gralloc type
dri: Remove __driDriverExtensions leftovers
v3d: Don’t implicitly clear the content of the imported buffer
Ruijing Dong (2):
frontends/va: checking va version for av1enc support
radeonsi/vcn: change max_poc to fixed value for hevc encoder.
Ryan Neph (1):
vulkan/android: add missed STACK_ARRAY_FINISH()
Sagar Ghuge (34):
intel/compiler: Look at 2 register worth of data instead of 4
isl: Disable MCS compression just on ACM platform
intel: Add env variable to add break point on/before draw
anv: Add GPU breakpoint before/after specific draw call
iris: Add GPU breakpoint before/after draw call
blorp: Implement blorp hooks to emit breakpoint
docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT
intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers
anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
anv: Add helper to create companion RCS command buffer
anv: Split out End/Destroy/Reset cmd buffer code into helper
anv: Handle companion RCS in end/destory/reset code path
intel: Add helper to create/destroy i915 VM
intel: Pass virtual memory address space ID while creating context
anv: Create companion RCS engine
anv: Move compute specfic bits under compute queue init
anv: Execute RCS init batch on companion RCS context/engine
anv: Setup companion RCS command buffer submission
anv: Execute an empty batch to sync main and companion RCS batch
anv: Add secondary companion RCS cmd buffer to primary
anv: Skip layout transition on the compute queue
anv: Extract batch print code to anv_print_batch helper
iris: Enable always flush cache with DEBUG_STALL option
intel/genxml: Add STATE_COMPUTE_MODE instruction
anv: Program and emit STATE_COMPUTE_MODE
anv: Enable barrier handling on video engines
isl: Use 16-bit instead of 8-bits for surface format info fields
anv: Handle end of pipe with MI_FLUSH_DW on transfer queue
anv: Enable transfer queue only on ACM+ platforms
blorp: Use the correct miptail start LOD for surfaces
anv: Write timestamp using MI_FLUSH_DW on blitter
anv: Flush data cache while clearing depth using HIZ_CCS_WT
Saleemkhan Jamadar (1):
radeonsi/vcn: set jpeg reg version for gfx 1150
Samuel Holland (3):
Android.mk: Allow building only Vulkan drivers
Android.mk: Explicitly enable/disable LLVM support
Android.mk: Only link LLVM for radeonsi, not amd_vk
Samuel Pitoiset (299):
radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV
radv: make radv_get_pa_su_sc_mode_cntl() static
zink/ci: update list of expected failures for NAVI10
radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL
radv: remove unused param in radv_pipeline_emit_vgt_gs_out()
radv: pass a shaders array for computing ia_multi_vgt_param
radv: bind the pre-compiled PS epilog to the cmdbuf state
radv: stop using an array of binaries when compiling a compute shader
radv: add radv_compile_cs() to compile a compute shader
radv: remove the pipeline dependency for creating a GS copy shader
radv: add a helper to compute the ESGS itemsize
radv: use the number of GS linked inputs to compute the ESGS itemsize
radv: determine ES info for VS/TES with GS earlier
radv: determine as_ls earlier by using the next stage
radv: simplify getting next VS stage for VS prologs
radv: use next_stage for determining the stage to lower NGG
radv/amdgpu: fix dumping CS with the chained IBs path
radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump()
radv: pass submit info to radv_check_gpu_hangs()
radv: initialize stage/next_stage earlier
radv: set next_stage to MESA_SHADER_NONE if there is no FS
radv: rework considering force VRS without relying on graphics pipeline
radv: stop passing radv_graphics_pipeline to radv_fill_shader_info()
radv: move removing all varyings when the FS is a noop
radv: rename graphics pipeline linking helpers
radv: simplify lowering NGG GS intrinsics
radv: rework determining the NGG stage without a graphics pipeline
radv: cleanup pipeline compute emit helpers
radv: rename radv_pipeline_stage to radv_shader_stage
radv: rename NGG query state to be more generic
radv: declare the shader query user SGPR for emulating GS counters
radv: enable pipelinestat query emulation for legacy GS
radv: simplify the NGG vs legacy pipelinestat query path
radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET
radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd
radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2
radv: track whether inputs/outputs are linked per shader stage
radv: add support for VS/TES as ES without shaders IO linking
radv: use next_stage to determine if the layer should be exported
radv: use next stage to determine if primID/clip dist should be exported
radv: compute the legacy GS info earlier
radv: stop copying some NIR info fields from TES to TCS
radv: stop lowering patch vertices for TES
radv: do not always copy the number of tess patches to TES
radv: initialize tcs.tes_{patch}_inputs_read to a default value
radv: prevent linking TCS<->TES when TES is NULL
radv: use a packed user SGPR for the TES state
radv: stop checking if patch control points is dynamic everywhere
radv: copy the number of TCS vertices out to TES shader info
radv: add support for dynamic TCS vertices out for TES
radv: remove radv_shader_info::tes::num_linked_patch_inputs
amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code
radv: allow to use fixed IO locations for VS<->TCS<->TES without linking
aco: add aco_shader_info::tcs::has_epilog
aco: add infra for compiling TCS epilogs
radv,aco: move has_epilog to radv_shader_info
radv: assume a TCS needs an epilog unless it’s linked with a TES
radv: do not write tess factors in main TCS when it has an epilog
radv: track if TES reads tess factors differently
radv: declare new argument for the TCS epilog PC
radv: add radv_tcs_epilog_key
radv: add infra for creating TCS epilogs
radv: add support for a TCS epilogs cache in the device
radv: add support for emitting TCS epilogs in cmdbuf
radv: remove unnecessary check in radv_pipeline_nir_to_asm()
radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm()
radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile()
radv: add a struct for the retained shaders and GPL
radv: add radv_graphics_shaders_compile() to compile graphics shaders
radv: remove redundant check in radv_cmd_buffer_after_draw()
radv: track if patch control points is dynamic from the cmdbuf state
radv: re-emit binning state if the framebuffer is dirty
radv: track if vertex binding stride is dynamic from the cmdbuf state
vulkan: bump header register to 1.3.261
vulkan/runtime: add common implementation for GetImageSubresourceLayout()
vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR
radv: use the RT prolog scratch size directly for tracing rays
radv: add a helper to get the maximum number of scratch waves per shader
radv: update the number of scratch waves for RT prolog at bind time
radv: update cmdbuf scratch size info when shaders are bound
vulkan: add init/finish helpers for vk_buffer_view
radv: use vk_buffer_view
radv: use vk_sampler
radv: use common vkCmdBegin/EndQuery wrappers
radv: use vk_query
zink: fix setting VkShaderCreateInfoEXT::nextStage
radv/rt: fix capture/replay support
vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR()
radv: implement vkCmdBindIndexBuffer2KHR()
radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2()
radv/rmv: remove unused pipeline create flags when logging pipelines
radv: store pipeline create flags to radv_pipeline::create_flags
radv: add support for VkPipelineCreateFlags2CreateInfoKHR
radv: add support for VkBufferUsageFlags2CreateInfoKHR
radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers
radv: implement radv_Get{Device}ImageSubresourceLayout2KHR()
radv: advertise VK_KHR_maintenance5
radv: remove useless NULL for pipeline layout during shader info pass
radv: introduce radv_shader_layout for per-stage descriptor layout
radv: stop passing redundant stage to radv_shader_stage_init()
radv: re-introduce radv_pipeline_stage_init()
radv: add support for loading the LSHS vertex stride from a SGPR
radv: use the number of VS outputs for computing the tessellation info
vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits
radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE
radv: fix emitting TCS epilogs for GFX6-9
radv: remove radv_cmd_buffer::cached_vertex_formats
radv: remove unused param from radv_pipeline_init_multisample_state()
radv: simplify declaring VS specific input SGPRs
radv: stop copying if VS or TES uses the InvocationID built-in
Revert “radv/amdgpu: workaround a kernel bug when replacing sparse mappings”
Revert “radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build”
radv/amdgpu: allow to execute external IBs on the compute queue
radv/amdgpu: add support for submitting external IBs with the chained path
zink/ci: update list of expected failures for NAVI10
radv: use the maximum possible workgroup size for TCS epilogs
radv: stop declaring the scratch offset argument for TCS epilogs
radv: declare shader arguments for TCS epilogs
radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key
aco: fix jumping from main TCS to epilog on GFX9+
aco: adjust TCS epilogs for RADV
aco: allow SGPRs operands with p_jump_to_epilog
aco: implement create_tcs_jump_to_epilog()
radv: track the pipeline bind point for indirect commands layout
radv: prepare radv_get_sequence_size() for DGC compute
radv: prepare radv_prepare_dgc() for DGC compute
radv: implement NV_device_generated_commands_compute
radv: allow DGC on the compute queue
radv: advertise NV_device_generated_commands_compute
aco: rework printing shader stages
radv: fix the per-patch data offset when TES isn’t linked with TCS
radv: stop declaring unused SGPR arguments for PS epilogs
radv: add radv_shader_info::is_monolithic
radv: use info->uses_view_index directly when declaring shader arguments
radv: do not inline push constants for non-monolithic shaders
radv: force indirect descriptor sets for non-monolithic shaders
radv: always declare some arguments for non-monolithic VS/TCS shaders
radv: add a new shader argument for non-monolithic shaders PC
ac: allow to mark shader arguments as preserved
radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+
aco: disable shared VGPRs for non-monolithic shaders on GFX9+
aco: ensure to initialize exec manually for VS as LS on GFX9+
aco: add support for compiling VS+TCS separately on GFX9+
radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders
radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+
aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+
aco: add support for compiling {VS,TES}+GS separately on GFX9+
radv,aco: remove unused clip/cull distances variables
radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state()
radv: small cleanups in radv_emit_patch_control_points()
radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+
radv: remove the pipeline dependency for emitting VGT_GS_MODE
aco: fix emitting TCS epilogs end on GFX9+
radv: re-order IO slot layout for stages that aren’t linked
amd/ci: update list of failures/flakes for glcts-vangogh-valve
ci: uprev vkd3d-proton
ci: uprev Fossilize
ci: add comment explaining which image tags to update for Fossilize
radv: preserve shader argument for separate compilation of NGG shaders
aco: flag blocks with long-jump as export_end for separate compilation
aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders
aco: allow separate compilation of NGG shaders
zink/ci: add zink-radv-polaris10-valve
radv/ci: re-enable vkcts-polaris10-valve
radv: fix capturing indirect dispatches with SQTT
radv/ci: re-enable vkd3d-polaris10-valve
ci: do not fail vkd3d-proton job when the expectations match
radv/amdgpu: fix executing secondaries without IB2
radv/amdgpu: do not copy the original chain link for IBs
radv: avoid emitting SQTT markers for DGC calls
radv: add support for DGC with SQTT
zink/ci: merge GLCTS testing with GLESx for RADV
zink/ci: merge piglit testing with deqp-runner for RADV
radv: fix interactions with primitives generated queries and pipeline stats
radv: skip DGC calls when the indirect sequence count is zero with a predicate
radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches
radv: adjust next stage for VS prologs and merged shaders compiled separately
radv: adjust emitted prolog regs for merged shaders compiled separately
radv: do not use pre-compiled prologs when VS is compiled separately
radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders
radv: fix enabling DGCC
radv: fix emitting SQTT userdata when CAM is needed
radv: fix capturing RGP on RDNA3 with more than one Shader Engine
zink/ci: update list of expected failures for POLARIS10/NAVI10
radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+
radv: disable unsupported hw shader stages for RGP on GFX11+
radv: fix instruction timing on GFX11
ac/rgp: use correct API stage string for mesh/task shaders
radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
radv: emit relocation for mesh/task shaders
issue_templates/Bug Report: fix outdated URL for GFXReconstruct
ac,radv,radeonsi: rework SPM counters configuration and share it
ac/perfcounter: add new SQ_WGP block for GFX11+
ac/spm: add SPM counters configuration for GFX11
radv: enable the PKT3 CAM bit for some SPM register writes
radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values
ac/spm: remove useless SPM block setting for GFX9 and older GPUs
ac/spm: add SPM block definition for GFX10-GFX10.3
ac/gpu_info: init num_cu_per_sh from the kernel
ac/perfcounter: set the number of instances of GL1C to 4
ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C
ac/spm: fix checking if the counter instance is valid
ac/spm: rework how segment muxsel RAM are filled
ac/spm: initialize and set instance mapping for counters
radv: reserve more CS space in SQTT/SPM paths
ac/spm: use block flags to initialize instance mapping
ac/spm: select correct segment type for per-SE blocks
radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers
ac/spm: fix number of instances of GL2C
ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters
ac,radv,radeonsi: prepare support for multi-instance SPM generic counters
ac/spm: move the counter instance to ac_spm_counter_create_info
ac/spm: enable support for multi-instance counters
radv: fix checking if RGP is enabled with others tracing tools
radv: fix missing ISA with RGP and GPL
ac/perfcounter: add SG_WQP group for GFX11
ac/perfcounter: add GFX11 groups
drirc: remove Path of Exile workarounds
radv: remove drirc workarounds for Path Of Exile
radv: remove absolute_depth_bias workaround
ac/gpu_info: define AMD_MAX_WGP
ac/spm: add new segment types for GFX11
ac/spm: add support for GFX11
radv: add SPM support for GFX11
radv: enable cache counters for RGP on GFX11
ci: update to vulkan-cts-1.3.6.3
radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs
nir: rename atomic_add_gs_invocation_count_amd to make it more generic
ac/nir: add lowering for mesh shader queries
ac/nir: add lowering for task shader queries
radv: add GDS counters offset for mesh/task queries
radv: adjust lowering of intrinsic queries for mesh/task shaders
radv: enable lowering of mesh/task shader queries when enabled
radv: declare shader_query_state for mesh/task shaders
radv: stop skip emitting CB states when there is no color attachment
radv: re-enable DCC with mipmaps on GFX11
radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue
radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble
radv: fix alignment of DGC command buffers
radv/ci: update list of expected failures on PITCAIRN
radv/ci: update list of flakes for NAVI10/VEGA10
radv/amdgpu: fix alignment of command buffers
radv: enable DCC for MSAA images on GFX11
zink/ci: update list of expectations for zink-anv-tgl
zink/ci: bump zink-anv-tgl-full timeout to 1h45m
radv/ci: rename GFX1100 lists to NAVI31
radv: fix emulated geometry shader primitives/invocations queries
radv/ci: remove duplicate skipped tests for RAVEN/STONEY
radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs
radv: fix synchronization with emulated GS primitives/invocations queries
radv/ci: remove no longer existing test for VANGOGH
radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10
radv: always write the sample positions when a new descriptor BO is created
radv: fill the scratch BO in radv_fill_shader_rings()
radv: fix gang submissions with chaining
radv: fix re-emitting streamout descriptors for NGG streamout
radv: fix IB alignment
zink: use warn_missing_feature for missing modifier support
radv: fix destroying GDS/OA BOs
radv: allocate only 1 GDS OA counter for gfx10 NGG streamout
ac/nir: only consider overflow for valid feedback buffers
radv/ci: update list of expected failures on RAVEN
radv/ci: update list of flakes for VANGOGH
radv/ci: update list of flakes for STONEY
radv: disable primitive restart for non-indexed draws on GFX11
radv: enable radv_disable_aniso_single_level=true for Zink too
amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11
radv: mark GDS as needed for XFB queries with NGG streamout on GFX11
radv: skip GDS allocation for NGG streamout on GFX11
zink/ci: remove expected failures that are skipped for RADV
ci: update CTS to vulkan-cts-1.3.7.0
ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers
ci: bump DEQP_FRACTION for some jobs
radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5
radv: initialize video decoder for GFX11.5
ac/gpu_info: query the maximum number of IBs per submit from the kernel
Revert “radv: fix finding shaders by PC”
radv: fix missing predicate bit for WRITE_DATA helper
ac/gpu_info: fix querying the maximum number of IBs per ring
radv: remove outdated RADV_DEBUG=vmfaults support
amd: update amdgpu_drm.h
amd: add has_gpuvm_fault_query
radv/amdgpu: add support quering the last GPUVM fault
radv: query and report the last GPUVM fault with RADV_DEBUG=hang
radv: report the last GPUVM fault when a device lost is detected
ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues
radv: fix a synchronization issue with primitives generated query on RDNA1-2
radv: bind the non-dynamic graphics state from the pipeline unconditionally
radv: fix compute shader invocations query on compute queue on GFX6
radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE
nir: fix inserting the break instruction for partial loop unrolling
radv: fix registering queues for RGP with compute only
radv: set radv_zero_vram=true for Unreal Engine 4/5
radv: fix a descriptor leak with debug names and host base descriptor set
radv: add a missing async compute workaround for Tonga/Iceland
radv: disable TC-compatible HTILE on Tonga and Iceland
radv: set radv_invariant_geom=true for War Thunder
radv: do not set OREO_MODE to fix rare corruption on GFX11
Saroj Kumar (4):
radeonsi: Add perfetto support in radeonsi
radeonsi: Add u_trace init code in radeonsi
radeonsi: Add tracepoints in radeonsi driver
radeonsi: fixes compilaton error when perfetto is disabled
Sathishkumar S (2):
radeonsi/vcn: support variable number of bs_bufs
radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines
Semjon Kravtsenko (1):
glx: Assign unique serial number to GLXBadFBConfig error
Seppo Yli-Olli (1):
zink: Fix SyntaxWarning in zink_extensions script
Sergi Blanch Torne (7):
Introduce ci-kdl builder and launcher.
Integrate ci-kdl in the building process and launch process.
ci: disable Collabora’s LAVA lab for maintance
Revert “ci: disable Collabora’s LAVA lab for maintance”
Revert “ci: disable Collabora’s LAVA lab for maintance”
ci: disable Collabora’s LAVA lab for maintance
Revert “ci: disable Collabora’s LAVA lab for maintance”
Sid Pranjale (1):
nvk: Enable VK_EXT_load_store_op_none
Sil Vilerino (20):
util: Blake3 - Identify arm64ec as aarch64 instead of x64
d3d12: Fix Map/Unmap of YUV resources
d3d12: Fix H264 interlaced decode
d3d12: Video Decode - Remove unnecessary copy for texture array case
util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler
d3d12: Extend video screen AV1 encode tile support checking
aux/tc: Add ASSERTED to unreferenced release build variable
d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process
frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface
d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding
d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects
d3d12: Do not destroy codec when destroying video buffer
d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support
d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter
d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting
frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols
d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS
d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility
d3d12: d3d12_video_buffer_create_impl - Fix resource importing
Simon Ser (7):
wayland: enable use of wayland-protocols as a subproject
vulkan/wsi/wayland: add support for IMMEDIATE
vulkan/wsi/wayland: fix unset present_mode
radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL
egl: extract EGLDevice setup in dedicated function
egl: move dri2_setup_device() after dri2_setup_extensions()
egl: ensure a render node is passed to _eglFindDevice()
Simon Zeni (1):
nouveau/winsys: use mmap instead of mmap64 in nouveau_bo
SoroushIMG (1):
pvr: fix mipmap size calculation for bc formats
Sviatoslav Peleshko (9):
dri: Use RGB internal formats for RGBX formats
intel/isl: Don’t over-allocate CLEAR_COLOR size to use whole cache line
anv: Do fast clear color initialization more delicately
zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize
intel/fs: Check if the whole ubo load range is in the push const range
zink: Store zink_vertex_elements_hw_state::b.strides by binding id
intel/fs: Fix “packed word exception” condition for register regioning
intel/eu/validate: Validate “packed word exception” stricter
nir/loop_analyze: Fix inverted condition handling in iterations calculation
Sylvain Munaut (9):
egl/dri2: Add a couple of missing mutex release in error path
mesa: Enable ARB_texture_border_clamp in GL Core
include: Fix the PFN declarations to be pointers as they should
glx: Add missing MesaGLInteropGLXFlushObjects
glx: Export the MESA GL Interop functions through glXGetProcAddress
egl: Export the MESA GL Interop functions through eglGetProcAddress
glx: Remove MESA_depth_float_bit from enum
glx: Advertise GLX_MESA_gl_interop extension if support present
egl: Advertise EGL_MESA_gl_interop extension if support present
Tapani Pälli (34):
intel/blorp: add a new flag to communicate PSS sync need
anv: implement required PSS sync for Wa_18019816803
iris: implement required PSS sync for Wa_18019816803
vulkan/runtime: change assert to match specification needs
anv: remove assert, size is asserted in the runtime
anv: refactor batch_set_preemption to use batch_emit_pipe_control
anv: implement a dummy depth flush for Wa_14016712196
iris: implement a dummy depth flush for Wa_14016712196
mesa: fix some TexParameter and SamplerParameter cases
mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads
ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests
anv: implement Wa_14018912822
iris: implement Wa_14018912822
driconf: use lower_depth_range_rate for The Spirit and The Mouse
mesa: disable snorm readpix clamping with EXT_render_snorm
iris: modify Wa_14014414195 to use intel_needs_workaround
mesa: some cleanups for texparam extension checks
iris: avoid issues with undefined clip distance
crocus: avoid issues with undefined clip distance
anv: refactor to fix pipe control debugging
anv: fix a leak of fp64_nir shader
iris: use intel_needs_workaround for Wa_14014414195 part 2
iris: correct dst alpha blend factor in Wa_14018912822
iris/anv: move Wa_14018912822 as a drirc workaround
iris: flush data cache when flushing HDC on GFX < 12
anv: HDC flush is available only for GFX_VER 12+
iris: HDC flush is available only for GFX_VER 12+
intel/genxml: remove HDC from gen11.xml, it is not available
mesa/st: ignore StencilSampling if stencil not part of the format
intel/dev: expand existing fix for all gfx12 with small EU count
egl: fix leaking drmDevicePtr in _eglFindDevice
iris: add data cache flush for pre hiz op
anv/drirc: add option to disable FCV optimization
drirc: Set limit_trig_input_range option for Valheim
Tatsuyuki Ishi (8):
radv/amdgpu: Remove unused bo_list variable from cs_submit.
radv/winsys: Remove unused struct radv_winsys_bo_list.
radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
radv: Fix IB size for RADV_DEBUG=hang.
radv: Fix dumping vertex descriptors with RADV_DEBUG=hang.
radv/amdgpu: Use rwlock to protect access to virtual BOs.
zink: Fix missing sparse buffer bind synchronization.
zink: Fix waiting for texture commit semaphores.
Thomas H.P. Andersen (65):
tgsi: remove unused tgsi_shader_info.num_tokens
tgsi: remove unused tgsi_shader_info.array_max
tgsi: remove unused tgsi_shader_info.num_memory_instructions
tgsi: remove unused tgsi_shader_info.colors_read
tgsi: remove unused tgsi_shader_info.colors_written
tgsi: remove unused tgsi_shader_info.reads_position
tgsi: remove unused tgsi_shader_info.reads_samplemask
svga: remove unused struct field
tgsi: remove unused tgsi_shader_info.reads_tess_factors
tgsi: remove unused tgsi_shader_info fields
tgsi: remove unused tgsi_shader_info fields
tgsi: remove unused tgsi_shader_info.uses_drawid
tgsi: remove unused tgsi_shader_info fields
tgsi: remove unused tgsi_shader_info.uses_subgroup_info
tgsi: remove unused tgsi_shader_info.writes_primid
tgsi: remove unused tgsi_shader_info.uses_doubles
tgsi: remove unused tgsi_shader_info.uses_derivatives
tgsi: remove unused tgsi_shader_info.uses_bindless_samplers
tgsi: remove unused tgsi_shader_info.uses_bindless_images
tgsi: remove unused tgsi_shader_info.clipdist_writemask
tgsi: remove unused tgsi_shader_info.culldist_writemask
tgsi: remove unused tgsi_shader_info.images_load
tgsi: remove unused tgsi_shader_info.images_store
tgsi: remove unused tgsi_shader_info.images_atomic
tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load
tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store
tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic
tgsi: remove unused tgsi_shader_info.uses_bindless_image_load
tgsi: remove unused tgsi_shader_info.uses_bindless_image_store
tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic
tgsi: remove unused tgsi_shader_info.indirect_files_read
tgsi: remove unused tgsi_shader_info.indirect_files_written
tgsi: remove unused tgsi_shader_info.const_buffers_indirect
tgsi: remove unused tgsi_shader_info.max_depth
tgsi: drop two unused functions
nvk: use common physical device enumeration
nvk: fix implicit-fallthrough warnings with clang
nvk: delete commented code
nvk: fix mem leaks
nvk: use common descriptor set layout code
nvk: use common pipeline layout code
nvk: advertise KHR_shader_non_semantic_info
nvk: advertise KHR_image_format_list
nvk: advertise EXT_private_data
nvk: advertise KHR_sampler_mirror_clamp_to_edge
nvk: KHR_descriptor_update_template
nvk: CmdPushDescriptorSetWithTemplateKHR
nvk: drop dead assignment
nvk: drop dead assignment
nvk: fix initialization override
nvk: sort extensions
nvk: advertize KHR_relaxed_block_layout
nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
nvk: advertise EXT_image_2d_view_of_3d
nvk: fix maxPushDescriptors
nvk: call correct macro to clear views
nouveau/mme: use fermi enum in fermi builder
nvk: add warning on non-nouveau drm driver
nvk: Implement VK_KHR_draw_indirect_count on Turing+
nvk: set device info before use in nvk_get_device_extensions
nvk: simplify code by using new helpers
nvk: remove duplicated device features
nvk: EXT_conditional_rendering
nvk: advertise VK_EXT_tooling_info
nvk: set optimization level to 3
Thong Thai (3):
radeonsi: enable vcn encoder rgb input support
Update radeon_vcn_enc.c
frontends/va/config: report max width and height for encoding/decoding
Timothy Arceri (27):
glsl: fix validation of ES vertex attribs
nir/opt_copy_prop_vars: don’t clone copies if branch empty
nir/opt_copy_prop_vars: speedup cloning of copy tables
nir/opt_copy_prop_vars: remove var hash entry on kill alias
nir/opt_copy_prop_vars: skip cloning of copies arrays until needed
nir/opt_copy_prop_vars: drop reuse of dynamic arrays
glsl: fix spirv sso validation
glsl: mark structs containing images as bindless
util: add radeonsi workaround for Nowhere Patrol
glsl: fix out params in glsl to nir
glsl_to_nir: add more unhandled function types
nir: replace use of nir_src_copy()
nir: remove unused nir_src_copy()
nir: remove unused param from nir_alu_src_copy()
glsl: remove field from gl_shader_program
glsl: move get_varying_type() declaration earlier
glsl: add nir version of validate_first_and_last_interface_explicit_locations()
glsl: switch to nir validate_first_and_last_interface_explicit_locations()
glsl: remove unused validate_first_and_last_interface_explicit_locations()
nir: fix typo in comment
nir: copy explicit_invariant flag to nir vars
glsl: move interpolation_string() to linker_util
glsl: move is_gl_identifier() to linker_util
nir: add used field to nir variables
glsl: implement cross_validate_outputs_to_inputs() in nir linker
glsl: switch to nir linkers cross_validate_outputs_to_inputs()
glsl: remove now unused varying linker code
Timur Kristóf (39):
aco: Fix subgroup_id intrinsic on GFX10.3+.
ac/nir: Simplify arg unpacking when shift is zero.
ac/nir: Add new pass to lower intrinsics to shader args.
radv: Move radv_select_hw_stage to radv_shader_info.
radv: Use ac_nir_lower_intrinsics_to_args.
radeonsi: Move si_select_hw_stage to si_shader_info.
radeonsi: Use ac_nir_lower_intrinsics_to_args.
aco: Remove subgroup_id and num_subgroups intrinsics.
ac/llvm: Remove subgroup_id and num_subgroups intrinsics.
aco: Refactor select_program to smaller functions.
nir/opt_dead_cf: Remove if branches with undef condition.
ac/nir: Add done arg to ac_nir_export_position.
ac/nir: Slightly refactor how pos0 exports are added when missing.
ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
ac/nir/ngg: Refactor mesh shader primitive export.
ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
ac/nir/ngg: Extract nogs_export_vertex_params function.
ac/gpu_info: Add some SDMA related information.
ac: Clarify SDMA opcode defines.
ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk.
ac: Rename ac_do_parse_ib to parse_pkt3_ib.
ac: Print IP type for IBs.
ac: Add rudimentary implementation of printing SDMA IBs.
radv: Rename SDMA file to radv_sdma.c
radv: Use const device argument in radv_sdma_copy_buffer.
radv: Use const on vi_alpha_is_on_msb arguments.
radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues.
radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument.
radv: Refactor WRITE_DATA helper function.
radv: Use new WRITE_DATA helper in more places.
radv: Add queue family argument to some functions.
radv: Wait for bottom of pipe in ACE gang wait postamble.
radv: Simplify gang CS and semaphore initialization.
radv: Allow gang submit use cases other than task shaders.
radv: Slightly refactor gang semaphore functions.
radv: Add gang follower semaphore functions.
radv: Support SDMA in radv_cs_write_data_head.
radv: Support SDMA in radv_cp_wait_mem.
radv: Support SDMA in si_cs_emit_write_event_eop.
Vignesh Raman (4):
ci: add Vignesh Raman into restricted traces access list
Do explicit cast to suppress clang warnings
ci: enforce -Wimplicit-const-int-float-conversion for clang
ci: Uprev crosvm
Vinson Lee (8):
nvk: Fix assert
lavapipe: Fix struct initialization
intel/decoder: Fix memory leak on error path
nv50: Remove unused value
vk/wsi/x11: Remove dead code
freedreno/replay: Fix implicit-function-declaration error
anv: Fix transfer type assert
broadcom/qpu: Remove duplicate variable opcode
Vitaliy Triang3l Kuzmin (3):
r600/asm: Fix AR force_add_cf setting if a clause is not open
r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause
r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN
Vlad Schiller (15):
pvr: Implement VK_EXT_tooling_info
pvr: Add ‘info’ PVR_DEBUG flag
pvr: Implement VK_KHR_format_feature_flags2
pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag
pvr: Add VK_KHR_driver_properties
pvr: Use correct index when writing query availability data
pvr: Enable VK_EXT_scalar_block_layout
pvr: Enable KHR_image_format_list
pvr: Enable VK_KHR_uniform_buffer_standard_layout
pvr: Implement VK_KHR_external_fence
pvr: Implement VK_KHR_external_semaphore
pvr: Enable VK_KHR_bind_memory2 extension
pvr: Implement VK_EXT_texel_buffer_alignment
pvr: Implement VK_EXT_host_query_reset
pvr: Fix VK_EXT_texel_buffer_alignment
WinLinux1028 (1):
radeonsi: prefix function with si_ to prevent name collision
Xaver Hugl (1):
vulkan wsi: add support for PresentOptionAsyncMayTear
Yiwei Zhang (46):
venus: handle query feedback creation failure
venus: ensure consistency of query overflow behavior
venus: add a missing barrier before copying query feedback
venus: refactor query feedback cmd record
venus: reduce to use 4K mem suballoc align on platforms known to fit
turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
lvp: avoid reading immutable sampler from desc write info
ci/venus: update venus-lavapipe expectations
venus: fix a cmd builder render_pass state leak across reset
venus: fix cmd state leak across implicit reset
venus: log and doc the broken query feedback in suspended render pass
venus: move transient storage from cmd to pool
venus: remove redundant fb tracking from cmd builder
venus: use tracked queue_family_index from the cmd pool
venus: cleanup vn_cmd_begin_render_pass usage
venus: add helpers to track subpass view mask
venus: avoid redundant tracking of render pass
venus: refactor more cmd states into cmd builder
venus: use in_render_pass to skip present_src counting
ci/venus: remove fixed tests that no longer run
ci/venus: reenable pipeline cts
venus: suppress a false logging
venus: add no_sparse debug option to disable sparse resource support
venus: set deviceMemoryReport feature
venus: expose at least one cached memory type
venus: expose KHR_external_fence/sempahore_fd extensions
venus: fix a device memory report leak
vulkan: remove a dup entry from vk_image_usage_to_ahb_usage
vulkan/android: improve vkQueueSignalReleaseImageANDROID
vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
vulkan/android: drop vk_buffer dependency from common AHB impl
venus: use common vk_queue object
venus: use common ANB implementation
venus: use more common vk_queue related implementations
venus: drop device, family, index, flags tracking from vn_queue
venus: fix re-export of imported classic 3d resources
venus: remove redundant bo roundtrip and add more docs
venus: track VkPhysicalDeviceMemoryProperties instead
venus: refactor vn_device_memory to prepare for async alloc
venus: make device memory alloc async
venus: enable Vulkan 1.3 for Android 13 and above
zink: sync queue access for vkQueueWaitIdle
venus: properly expose KHR_external_fence/sempahore_fd
ci/venus: mark more flaky tests after recent cts uprev
venus: fix query feedback batch leak and race upon submission
zink: apply can_do_invalid_linear_modifier to Venus
Yogesh Mohan Marimuthu (12):
gallium: remove start_slot parameter from pipe_context::set_vertex_buffers
ac/surface: add astc block size to bpe_to_format() function
util: move ASTCLutHolder from mesa/main to util
vulkan/formats,zink: move vk_format_from_pipe_format() function
vulkan/runtime: add compute astc decoder helper functions
vulkan add 3D texture support for compute astc decoder
radv: integrate meta astc compute decoder to radv
radeonsi: add more documentation for dpbb debug env variable
docs: remove document for unused variable dfsm from AMD_DEBUG
radeonsi: correct old comment in si_emit_framebuffer_state()
radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx
radeonsi: add radeonsi to GL_RENDERER string
Yonggang Luo (43):
lima: Convert to use nir_foreach_function_impl when possible
freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc
zink: Convert to use nir_foreach_function_impl when possible
lavapipe: Convert to use nir_foreach_function_impl
lavapipe: fixes indent of function lvp_inline_uniforms
microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module
microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible
radeonsi: Convert to use nir_foreach_function_impl
ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling
util: Move pipe_swizzle from p_defines.h to u_formats.h
util: Move PIPE_MASK_* from p_defines.h to u_formats.h
util: Move pipe_color_union from p_defines.h into u_formats.h
util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/
util: Remove include “pipe/*.h” in src/util/* files
util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/
util/meson: Getting mesa util core to be self contained
pvr: decouple vulkan driver and compiler from gallium
freedreno: decouple compiler and vulkan driver from gallium
glx: decouple from gallium
meson: Remove arm_neon_workaround
nouveau/drm-shim: Decouple from gallium
ac/radv: decouple radv vulkan driver and compiler from gallium
etnaviv: decouple drm from gallium
asahi: decouple layout from gallium
compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h
intel/blorp: Use float directly to avoid #include “mesa/main/format_utils.h”
intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus
intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_*
crocus: #include “program/prog_instruction.h” for SWIZZLE_*
intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium
util/treewide: Use alignas(x) instead __attribute__((aligned(x)))
v3dv: Use alignas(8) over 64 bit atomic value
svga: use alignas over struct MKSGuestStatInfoEntry
radv: Fixes mingw linkage error undefined reference to `radv_GetCalibratedTimestampsEXT’
v3d: Use DIV_ROUND_UP instead div_round_up
freedreno: Use shared DIV_ROUND_UP instead div_round_up
sfn: Use 4 instead of ATOMIC_COUNTER_SIZE
intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include “mesa/main/config.h”
d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS
compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include “mesa/main/config.h”
compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp
util: Deduplicate macros between u_math.h and macros.h
nvk: Should use alignment instead of align
Yusuf Khan (4):
nouveau/ws: remove the drm.h header
nvk: implement GetDeviceMemoryCommitment
nvk: support GetImageSparseMemoryRequirements2
nvk: expose KHR_driver_properties
Zhang Ning (1):
Revert “intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR”
antonino (14):
virgl: add ci flake
freedreno: add ci flake
zink: remove unused indices from `nir_load_push_constant` calls
zink/nir: add a zink specific intrinsic for push constants
vulkan/wsi: add `vk_wsi_force_swapchain_to_current_extent` driconf
drirc: enable `vk_wsi_force_swapchain_to_current_extent` for “The Talos Principle”
drirc: enable `vk_wsi_force_swapchain_to_current_extent` for “Serious Sam Fusion”
vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms
vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface
vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects
zink: store bindless var when creating it to avoid creating it again
nir: fix several crashes in `nir_lower_tex`
nir: don’t take the derivative of the array index in `nir_lower_tex`
vulkan: use instance allocator for `object_name` in some objects
cheyang (1):
isaspec : fix isaspec build error in aosp
georgeouzou (1):
nvk: Support VK_EXT_line_rasterization
jazzfool (1):
zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3
lorn10 (1):
docs: Update Clover’s env variable documentation
norablackcat (2):
spirv/nir_to_spirv: add expect assume op codes
rusticl: add cl_khr_expect_assume
timmac-qmc (1):
glsl: fix potential crash with DisableUniformArrayResize
twisted89 (1):
util/driconf: add workarounds for the Chronicles of Riddick
wangra (1):
tu/kgsl: Fix bitfield of DITHER_MODE_MRT6
xurui (1):
glx: There is no need to psc++