Mesa 22.2.0 Release Notes / 2022-09-21

Mesa 22.2.0 is a new development release. People who are concerned with stability and reliability should stick with a previous release or wait for Mesa 22.2.1.

Mesa 22.2.0 implements the OpenGL 4.6 API, but the version reported by glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. Some drivers don’t support all the features required in OpenGL 4.6. OpenGL 4.6 is only available if requested at context creation. Compatibility contexts may report a lower version depending on each driver.

Mesa 22.2.0 implements the Vulkan 1.3 API, but the version reported by the apiVersion property of the VkPhysicalDeviceProperties struct depends on the particular driver being used.

SHA256 checksum

b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c  mesa-22.2.0.tar.xz

New features

  • WGL_ARB_create_context_robustness

  • d3d12 ARB_robust_buffer_access_behavior

  • VK_EXT_robustness2 for lavapipe

  • VK_EXT_image_2d_view_of_3d on RADV

  • zink and d3d12 GL_EXT_memory_object_win32 and GL_EXT_semaphore_win32 support

  • vertexAttributeInstanceRateZeroDivisor support for lavapipe

  • panfrost Valhall support (conformant OpenGL ES 3.1 on Mali-G57)

  • VK_EXT_primitives_generated_query on RADV

  • VK_EXT_non_seamless_cube_map on RADV, ANV, lavapipe

  • VK_EXT_border_color_swizzle on lavapipe, ANV, turnip, RADV

  • VK_EXT_shader_module_identifier on RADV

  • variablePointers on lavapipe

  • VK_EXT_multisampled_render_to_single_sampled on lavapipe

  • VK_EXT_shader_subgroup_vote/ballot on lavapipe

  • EGL_KHR_context_flush_control for all drivers

  • GLX_ARB_context_flush_control for all drivers

  • VK_EXT_attachment_feedback_loop_layout on RADV

Bug fixes

  • turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to “movmsk” reordered with “kill”

  • [radeonsi] Graphical corruption in Valve Source-based games

  • Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO

  • nouveau: corruption on NVAC after switching to the NIR backend

  • zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip

  • Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built

  • Abort due to “Unknown intrinsic: intrinsic store_deref” (firefox, radeonsi)

  • turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635

  • regression from !17949

  • r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled

  • r600/sfn: register allocation fails for a number of piglits and deqp tests

  • DG2 corruption in gnome desktop from F36

  • radeonsi: error allocating sparse buffers.. bug?

  • [BISECTED/radeonsi] Flickering triangles with wayfire cube

  • ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion `fence->signalled’ failed for iris when disabled UTIL_FUTEX_SUPPORTED

  • turnip: gfxbench aztec ruins crashes on zink

  • glsl: textureGatherOffset param broken

  • v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c

  • radeonsi: driver crashes with kwin-effects-cube

  • [Turnip] Final Fantasy X: Incorrect hair rendering

  • turnip: gfxbench car chase crash on zink

  • [r300] - osu! - Shader optimizations cause black and glitchy rendering

  • [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available

  • anv: invalid returns from vkCreateDescriptorPool

  • [Bisected][RadeonSI] Blender crashes on startup

  • turnip: depth/stencil regressions

  • llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages

  • Poor performance/utilization with Splitgate

  • freedreno / mesa 22.1.3 crashes Firefox Nightly

  • [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3

  • dzn: Build failure when it’s the only driver built - 22.2-devel regression

  • d3d12: Unresolved external symbols from Version.lib

  • turnip: Geometry shader never invoked if there is no EmitVertex and gl_Position is not written

  • Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7

  • turnip: GPU fault on clear

  • intel_nullhw.c:179:28: error: passing ‘const void *const’ to parameter of type ‘void *’ discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers]

  • macOS undefined symbol _glapi_tls_Context build error

  • Build on macOS regressed

  • Regression: virgl: The switch to NTT breaks “Total War: Warhammer”

  • macOS’s LIST_ENTRY macro conflicts with ours

  • src/util/disk_cache_os.c:234:21: error: too many arguments provided to function-like macro invocation

  • [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1

  • Unity game segfaults often

  • Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0

  • turnip: dEQP-VK.subgroups.ballot_broadcast.framebuffer.* has failures on older deqp-vk

  • !17062 broke XTS xts5/Xlib9/XDrawImageString(16) with Xephyr using glamor on llvmpipe

  • DG2 full BAR broken by low BAR?

  • crocus: SSBOs regressed

  • Homerun Clash game triggers assert in compiler

  • radv: line rasterization mode affects multisampling

  • turnip: caselist fail

  • vtn: mediump_16bit_alu and compatibility

  • radv: dynamic vertex input failure

  • navi21\radv\amdgpu Halo Infinite screen corruption and then hang

  • ci/freedreno: a618_vk 1/7 timeouts

  • Windows OpenGL32.dll without shared-glapi currently broken

  • anv: KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage fails on TGL

  • perfetto: crash on context teardown / driver unload

  • panfrost shader classification is wrong in some cases

  • kopper: the latest awfulness exposed by piglit glx-multi-window-single-context

  • android: r600 building error after commit 79ca456b

  • Error compiling gallium-nine on i686 using musl libc

  • [anv][regression][bisected][bsw,gen9atom] dEQP-VK.memory.mapping.dedicated_alloc failing on bsw and gen9atom

  • anv: GTF-GL46.gtf32.GL3Tests.packed_pixels.packed_pixels_pbo failure

  • r600: Tesseract does not render properly with the new NIR backend

  • radv: some zink cts fails

  • anv: ICL hiz issue

  • [RADV, bisected] addrlib changes cause Navi CTS regressions

  • r300: Segfault in shaders/closed/steam/kerbal-space-program/1017.shader_test

  • Get rid of debug_assert()

  • nine: docs?

  • d3d12: Gallium d3d12 backend performance is slow

  • [RADV][bisected] after commit 124848bf9efa1cc5d8a3cdfc654e0c03d57540e3 the games which use vulkan API not starting

  • freedreno: skqp instability from GPU hangs

  • ci/a618: intermittent network failure

  • turnip: EXT_primitives_generated_query broken

  • Godot crashes with RX5500m

  • intel: Some Alan Wake shader fails validation due to bad register usage in EOT message

  • r600/sfn: Add support for fmulz

  • radv: failure in image.sample_texture.*_compressed_format*

  • radeonsi: Hang on RX 5500 XT on latest mesa git

  • blob_align assert triggers

  • radv: Enabling support for instance extensions inside vkCreateInstance does not work

  • zink: rendering artifacts in Portal 2

  • a630: AmnesiaTDD trace gpu faults

  • freedreno build issue - FAILED: src/freedreno/decode/crashdec.txt

  • segfaults under X11 since the Ubuntu update to 22

  • anv: renderdoc explosion on icl

  • turnip: EXT_depth_clip_enable broken

  • lava: test result parsing fail

  • turnip: LRZ fast clear

  • [ANV] GTA IV flickering

  • [TGL] 25-100% perf impact on SynMark2 with 2GB dynamic pool size

  • [Crocus] - Minecraft with Forge and JourneyMap - Graphical glitches

  • intel - Minecraft char bar is broken after Mesa upgrade

  • [wine] RAGE: texture problems

  • “Unsupported modifier, resource creation failed.” on stderr

  • [RADV/ACO] Star Citizen Flickeriing Lights

  • anv: query regressions

  • ir3: Infinite loop in scheduler when splitting

  • ci/lava: “Listened to connection” spam

  • ci: docs build fail

  • turnip: 3d copy/clear/… ops should not be counted by occlusion query

  • anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT

  • turnip: wrong border color for Z24S8

  • aco_tests fail with llvm-15

  • Clang 14 issues compiling src/util/u_cpu_detect.c

  • eglExportDMABUFImageQueryMESA crashes on amdgpu gallium/drivers/radeonsi

  • crocus: GL_SELECT regressed on hsw

  • Random texture flickering in Minecraft

  • panfrost: Broken interaction between CRC and tile size

  • panfrost: allow_forward_to_be_killed is too conservative

  • APU downgraded to OpenGL 3.0 since UBO size change in radeonsi

  • [anv, bisected, regression] Pipeline statistics query segfaults

  • radv/NIR: Suboptimal codegen for AccessChain on buffer device addresses

  • Regression - vah264dec + glimagesink causes green taint on entire video with mesa>=22.0.0

  • va: AMD: Wrong buffer size of P010 Luma plane

  • 625b352f14b broke glcts on Navi1x

  • Use marketing_name for the Vulkan device string in RADV

  • Gallium driver crashes due to fd double close in iris driver

  • radv: Khronos ray tracing sample issues

  • r300: copy propagate does not correctly account for control flow

  • r300: Amnesia the Dark Descent slowdown since NIR loop unrolling

  • r300: some unused uniforms are not removed with the NIR varying linker (compared to old glsl varying linker)

  • Mesa currently FTBFS on x32 architecture

  • Runescape randomly crashes. malloc_consolidate(): unaligned fastbin chunk

  • tu: viewport assert

  • d3d12: point coord regression due to NIR IO changes

  • nir_lower_phis_to_regs_block: Assertion `src->src.is_ssa’ failed.

  • Redream emulator showing artifacts

  • Build failure on sparc

  • r600 video playback broken

  • r300: hundreds of tests fail since “clear color buffers using color from a constant buffer”

  • turnip: shader viewport handling

  • Signal 6 (Aborted) when opening GTK3 applications

  • radv: Conditional rendering on compute queue

  • anv: line failure

  • 32 bit build failure

  • d3d12 build regression: d3d12video.h: error: expected ‘;’ at end of member declaration

  • panfrost midgard - major issues with MelonDS emulator - not able to trace properly too

  • Intel/anv: The cloak is flickering in Batman:Arkham Knight

  • regression: “NIR based varying linker” breaks some xwayland/glamor applications with etnaviv

  • pvr: build failure on archs with unsigned long != uint64_t

  • turnip: dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.* fails with angle

  • Nheko misrendering on RK3399 (Mali T860)

  • intel_pps_driver.cc:308:74: error: too many arguments to function ‘void intel_perf_query_result_accumulate_fields(intel_perf_query_result*, const intel_perf_query_info*, const void*, const void*, bool)

  • radv: no nir validation in radv_link_shaders

  • r300: deqp regressions from “glsl: enable the use of the nir based varying linker”

  • [crocus] glmark2 -b jellyfish rendering issues

  • radeonsi-stoney: Failing jobs during a CI bug

  • iris: skqp gles_atlastext flake

  • turnip: use-after-free in shader compilation

  • anv: EXT_primitives_generated_query fails

  • r300: vs predicate logic fails in vs-loop-complex-unroll-with-else-break.shader_test

  • [Intel/Vulkan] White blinking artifacts in Wolfenstein Youngblood

  • radv: Indirect unaligned dispatch

  • [radv] DCC causes artifacts in Senran Kagura Shinovi Versus

  • Radeon RX 5700XT crash when using prusa-slicer

  • glcts proxy_textures_invalid_samples tests assert on iris

  • turnip: gfxbench mh3.1 offscreen crash with ANGLE

  • dEQP-GLES31.functional.shaders.builtin_functions.integer.[i|u]mulextended.* fail

  • Radv leaks a descriptor set layout

  • radeonsi: KHR_debug thread safety

  • [radeon, rv370] Running piglit shaders@glsl-vs-raytrace-bug26691 test causes hard lockup & reboot

  • clover: Assertion `NumContainedTys && “Attempting to get element type of opaque pointer”’ failed

  • wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes causes memory leak in Apex Legends

  • v3d: port v3dv workaround for GFXH-1918

  • freedreno: fragile location packing can break cull/clip distance

  • turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures

  • Panfrost: wrong textures and lighting in “Thief” games in wine (not happening when using Software render)

  • [Bisected] r600: Big performance regression on some games

  • turnip: report corresponding drmFormatModifierPlaneCount for planar format

  • piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression

  • Huge amount of anon_inode:sync_file file descriptor created

  • enable warnings for GNU extensions we don’t want

  • Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro

  • NIR validation failed after nir_opt_if

  • Etnaviv: The triangle is lost during linking together triangle strips with degenerate triangles

  • intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results

  • Aperture Desk Job rendering artifacts Intel TGL

  • intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue

  • r300/rv530: “Black rendering” of Baldur’s Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi

  • radv/aco: xfb bug

  • ‘../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info *, ADDR2_COMPUTE_DCCINFO_OUTPUT *, struct gfx9_meta_equation *): assertion “dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)” failed’

  • Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT

  • Vulkan::Calling vkWaitForFences Timeout

  • turnip: bad gmem rendering with PUBG mobile on angle

  • Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0

  • gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1

  • bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed

  • radv,aco: Regression with global atomics with negative offsets

Changes

Adam Jackson (32):

  • docs: Note EGL enum allocation for EGL_EXT_present_opaque

  • docs: Update supported drivers for 22.x

  • glx/kopper: Wire up a way for SwapBuffers to fail

  • wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes

  • kopper: Grow a swap interval API

  • glx/kopper: Enable GLX_EXT_swap_control etc.

  • egl/kopper: Hook up eglSwapInterval

  • wsi/x11: Hook up KHR_incremental_present

  • dri: Inline dri2InvalidateDrawable into its one caller

  • dri: Move driCopySubBufferExtension into drisw

  • dri: move driQueryRendererIntegerCommon near its one caller

  • dri: Move driC{reate,oncat}Configs near their only caller

  • dri: Move dri{Get,Index}ConfigAttrib near their vtable

  • dri: Remove what’s left of utils.[ch]

  • glx/dri2: Require DRI2 >= 1.3 for working buffer invalidation

  • dri2: Require a loader with working buffer invalidation

  • zink: Print the VkResult if vkCreateInstance fails

  • glx/dri: Fix DRI drawable release at MakeCurrent time

  • mesa: Stub out GL_APPLE_object_purgeable

  • glx: Remove the remains of GLX_MESA_multithread_makecurrent

  • gallium/dri: Remove the memory of GLX_MESA_multithread_makecurrent

  • glx: Remove some can’t-happen checks from protocolDestroyDrawable

  • glx: Remove unused __glXSetupForCommand from __glXIsDirect

  • glx: Remove excess screen parameter from CreateContext

  • glx/dri2: Stop tracking the protocol version in the display state

  • glx: Fix return values from __glXGetDrawableAttribute

  • glx: Fix drawable refcounting for naked Windows

  • kopper: Implement {EGL,GLX}_EXT_buffer_age

  • wsi/x11: Synchronously check for error from xcb_present_pixmap

  • mesa: Remove Purgeable fields from texture/buffer/renderbuffer objects

  • llvmpipe: Indentation fix

  • gallium/dri2: Enable {GLX_ARB,EGL_KHR}_context_flush_control

Alejandro Piñeiro (22):

  • v3dv/pipeline: fix small comment typo

  • v3dv/pipeline_cache: adds check to skip searching for a entry

  • v3dv/pipeline_cache: add on disk cache hit stats

  • v3dv/pipeline: include pipeline layout on the pipeline sha1

  • v3dv/bo: unmap any bo that goes to the bo cache

  • v3dv/format: no need for GetPhysicalDeviceFormatProperties

  • v3dv/format: Add support for VK_KHR_format_feature_flags2

  • nir: get res binding using component 0, instead of asssumig an uint

  • v3dv/pipeline: call nir_lower_explicit_io after first nir optimization loop

  • v3dv/pipeline: expand nir_optimize, drop st_nir_opts

  • v3dv/device: add comment on depthClamp feature

  • broadcom: add no_merge_jobs debug option

  • broadcom: document which debug options are specific for a given driver

  • broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V

  • broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir

  • nir: call nir_metadata_preserve at nir_remove_unused_io_vars

  • v3dv/pipeline: use nir_shader_instructions_pass when lowering layout

  • broadcom/compiler: removed unused function

  • broadcom/compiler: make several passes to return a progress

  • v3d/v3dv: use NIR_PASS(_

  • vc4/v3d: stop adding NORAST when SHADERDB debug option is used

  • vc4/v3d: restore calling debug_get_option_vc4/v3d_debug

Alessandro Astone (5):

  • kopper: Only include XCB headers on supported platforms

  • Android.mk: Cleanup mesa3d-lib

  • Android.mk: Make mesa3d-lib work with absolute path meson outputs

  • Android.mk: Generate the dummy source in local-generated-sources-dir

  • Android.mk: Intermediate output paths may already be absolute

Alexander Kanavin (1):

  • swrast_kms: use swkmsDRI2Extension instead of driDRI2Extension

Alexey Bozhenko (2):

  • intel/compiler: fix singleton pointer coverity warning

  • spirv: fix OpBranchConditional when both branches are the same

Alyssa Rosenzweig (454):

  • panfrost: Add 48-bit and 96-bit tiling routines

  • panfrost: Remove bpp check for tiling

  • panfrost: Only handle 2D in should_linear_convert

  • panfrost: Implement tiled 3D resource access

  • panfrost: Only tile for 2D dimensions

  • panfrost: Allow tiling all texture targets

  • panfrost: Assert alignment in tiling routine

  • panfrost: Document arguments to tiling routines

  • panfrost: Unit test u-interleaved tiling routines

  • docs/panfrost: Document u-interleaved tiling

  • panfrost: Rework dirty tracking for Valhall

  • panfrost: Dirty track batch masks

  • panfrost: Rework PAN_GPU_ID mechanism

  • docs/panfrost: Document drm-shim usage

  • meson: Ignore unused variables in release builds

  • pan/bi: Simplify bi_propagate_pass_flags

  • panvk: Respect line width

  • panvk: Consider primitive restart in index buffer walks

  • pan/bi: Implement fquantize2f16

  • panvk: Handle empty shaders gracefully

  • panvk: Lower shared memory

  • util: Add a generic worklist implementation

  • nir: Use u_worklist to back nir_block_worklist

  • pan/bi: Implement pack_uvec[24]_to_uint

  • panfrost: Add texture features enum to v9.xml

  • pan/blit: Compile blit shaders without IDVS

  • pan/blit: Generalize texture alignment

  • pan/blit: Prepare for Valhall port

  • pan/blit: Support v9 data structures

  • panfrost: Don’t allocate storage for PSIZ on Valhall

  • panfrost: Compile libpanfrost for v9

  • panfrost: Add helper to emit UBOs

  • panfrost: Port uniform/UBO logic to Valhall

  • panfrost: Don’t fix up alpha test on Bifrost

  • panfrost: Use common state emit for compute jobs

  • panfrost: Adapt compute job emit for Valhall

  • panfrost: Specialize rasterizer state for Valhall

  • panfrost: Simplify attribute format expression

  • panfrost: Specialize vertex elements for Valhall

  • panfrost: Specialize ZSA state for Valhall

  • panfrost: Specialize shader descriptors for Valhall

  • panfrost: Use dirty flags to emit v9 descriptors

  • panfrost: Generate Valhall Malloc IDVS jobs

  • panfrost: Compile for v9

  • panfrost: Kick off v9 support in the driver

  • panfrost: Advertise all textures in drm-shim

  • panfrost: Split pan_layout.c from pan_texture.c

  • panfrost: Always use 64-bit SD with strides

  • panfrost: Add afbc_superblock_{size, width, height} helpers

  • panfrost: Use panfrost_afbc_superblock_width

  • panfrost: Don’t pretend to support multiplane AFBC

  • panfrost: Extract panfrost_afbc_is_wide helper

  • panfrost: Use pan_block_size in layout calculation

  • panfrost: Unify panfrost_block_size paths

  • panfrost: Unify paths through image_layout_init

  • panfrost: Remove unused dev argument

  • panfrost: Unit test block size queries

  • panfrost: Simplify how image_layout_init is called

  • panfrost: Add a test for pan_image_layout_init

  • asahi: Workaround broken GLSL compiler

  • agx: Implement some shader-db stats

  • agx: Note that RA proceeds in dominance-order

  • agx: Track write registers more accurately

  • agx: Match order for designated initializers

  • agx: Use correct types for some IR enums

  • agx: Wrap compiler header in extern “C”

  • agx: Add unit test infrastructure

  • agx: Use pseudo ops for mov/not/and/xor/or

  • agx: Implement simple copyprop

  • agx: Dynamically allocate agx_instr->src

  • agx: Add p_split pseudoinstruction

  • agx: Add phi pseudo instruction

  • agx: Add a hash table for vector extracts

  • agx: Add helper to emit combines

  • agx: Add helper to emit splits

  • agx: Use a transfer graph for parallel copies

  • agx: Lower p_split after RA

  • agx: Optimize p_split(kill) specially

  • agx: Emit splits for intrinsics

  • agx: Use extract helper for tex internally

  • agx: Remove else optimization

  • agx: Use a dynarray for predecessors

  • agx: Add predecessor index helper

  • agx: Mark the logical ends of blocks

  • agx: Add agx_after_block_logical helper

  • agx: Introduce worklist infrastructure

  • agx: Adapt liveness analysis for SSA

  • agx: Make DCE dumber

  • agx: Translate phi nodes

  • agx: Copy prop into phi nodes

  • agx: Don’t lower phis in NIR

  • agx: Lower phi nodes to parallel copies

  • agx: Split up RA from post-RA lowering

  • agx: Always use hash table for extracts

  • agx: Unit test parallel copy lowering

  • agx: Remove identity moves

  • agx: Add validation pass

  • agx: Update RA comment

  • agx: Remove has_liveness

  • agx: Remove nir_register support

  • agx: Lower mediump flat shading

  • panvk: Ignore point size for !points

  • panvk: Use correct point size limits

  • panfrost: Only flip point sprites for GL

  • panvk: Streamline no shader RSD case

  • panvk: Gate rasterization on !discard, not fs req

  • panvk: Emit fragment RSDs even with no shader

  • panfrost: Remove ancient unused code

  • panfrost: Call tgsi_to_nir earlier

  • panfrost: Don’t subclass pipe_compute_state

  • panfrost: Use s->info.stage instead of tgsi stage

  • panfrost: Simplify shader key architecture

  • panfrost: Refactor variant selection code

  • panfrost: Refactor variant rebind code

  • panfrost: Lower point sprites on Bifrost

  • panfrost: Lower user clip planes

  • panfrost: Flip point coords in hardware

  • pan/va: Generalize message? check for asm

  • pan/va: Add missing TABLE (SFU) instructions

  • pan/va: Mark NOT instructions as deprecated

  • pan/va: Add VAR_TEX_BUF_GRADIENT instruction

  • pan/va: Add non-IDVS VAR_TEX instructions

  • panvk: Lower global vars to local

  • panvk: Allocate stack for each thread

  • pan/bi: Handle texture offset + index

  • panvk: Lower image deref atomics

  • panvk: Only write depth when depth test enabled

  • pan/decode: Print row strides, not line strides

  • panvk: Remove unused layout structs

  • panvk: Report row_stride in GetImageSubresourceLayout

  • panfrost: Add helpers to work with legacy strides

  • panfrost: Use row stride for explicit layouts

  • panfrost: Rename away from bytes_per_pixel

  • panfrost: Use row stride to calculate slice size

  • panfrost: Adapt get_param for row strides

  • panfrost: Use row_stride even for linear resources

  • lima,panfrost: Use row stride for tiling routines

  • panfrost: Remove line_stride

  • panfrost: Unify row stride and AFBC row stride

  • panfrost: Unit test stride calculations

  • panfrost: Unit test “from legacy” helper

  • pan/bi: Rename bi_block->name to bi_block->index

  • pan/bi: Count blocks

  • pan/bi: Add u_worklist wrapper macros

  • pan/bi: Use bi_worklist in analyze_helper_requirements

  • pan/bi: Use bi_worklist for liveness

  • pan/bi: Use bi_worklist for post-RA liveness

  • pan/bi: Use worklist for scoreboard analysis

  • pan/bi: Use a dynarray for predecessors

  • nir: Don’t set writes_memory for reading XFB

  • panfrost: Relax image check

  • pan/va: Serialize memory stores

  • pan/bi: Align spilled registers on Valhall

  • pan/bi: Add option to test spilling

  • pan/bi: Assert that blend shaders may not spill

  • panfrost: Correct XML for TLS

  • panfrost: Use emit_tls

  • panfrost: Use packed TLS on Valhall

  • panfrost/ci: Smoke test spilling

  • panfrost: Don’t use VARYING_SLOT_TEX0 internally

  • panvk: Don’t use VARYING_SLOT_TEX0 internally

  • pan/bi: Rework varying linking on Valhall

  • panfrost: Increase the maximum tiler heap size

  • mesa/st: Fix building tests on macOS

  • panvk: Stub pipeline cache using the common code

  • panvk: Conform viewport code to Vulkan spec

  • panvk: Call nir_opt_trivial_continues

  • pan/va: Add some whitespace to Valhall disassembly

  • pan/va: Add whitespace after disassembled branches

  • agx: Restore Valve copyright header

  • pan/bi: Don’t analyze td on Valhall

  • pan/bi: Emit LEA_TEX on Valhall

  • pan/bi: Support image loads on Valhall

  • pan/bi: Preserve AXCHG.i32 destination

  • pan/bi: Handle shared atomic exchange on Valhall

  • pan/bi: Handle shared/scratch on Valhall

  • pan/bi: Support atomics on Valhall

  • pan/bi: Ensure the end NOP isn’t eliminated

  • asahi: Fix hangs waiting on the notification queue

  • pan/bi: Update bi_count_write_registers for Valhall

  • pan/va: Mark more source sizes

  • pan/va: Lower split 64-bit sources

  • pan/va: Use 64-bit lowering for texturing

  • pan/bi: +JUMP can’t read same-cycle temp

  • pan/bi: Lower phis to scalar

  • pan/bi: Use value-based interference with LCRA

  • pan/bi: Add source/destination counts

  • pan/bi: Add collect and split instructions

  • pan/bi: Add bi_before_block cursor

  • pan/bi: Lower split/collect before RA

  • pan/bi: Fix mov and pack_32_2x16

  • pan/bi: Expand MAX_DESTS to 4

  • pan/bi: Create COLLECT during isel

  • pan/bi: Lift split/collect cache from AGX

  • pan/bi: Simplfy BLEND emit

  • pan/bi: Emit collect and split

  • pan/va: Use split for 64-bit lowering

  • pan/va: Don’t use bi_word in FAU unit test

  • pan/bi: Scalarize bi_lower_swizzle

  • pan/bi: Scalarize bi_opt_cse

  • pan/bi: Scalarize modifier propagation

  • pan/bi: Scalarize copyprop

  • pan/bi: Remove bi_word and bi_word_node

  • pan/bi: Simplify register precolouring in the IR

  • pan/bi: Remove liveness metadata tracking

  • pan/bi: Don’t propagate discard

  • pan/bi: Optimize split of collect

  • pan/bi: See through splits for var_tex fusion

  • pan/bi: Validate preload constraints are satisfied

  • pan/bi: Validate vector widths

  • pan/va: Handle 2-src blend in lower_split_src

  • pan/va: Remove DISCARD.f32 destination

  • pan/va: Use ^ instead of ` to indicate last-use

  • asahi: Run lower_blend for all fragment shaders

  • asahi: Use z24_in_z32f lowering

  • asahi: Validate depth/stencil formats

  • asahi: Don’t store to unbound render targets

  • asahi: Handle uncompressed Z32F depth buffers

  • asahi: Pass depth/stencil clear values to firmware

  • asahi: Flush on partial clears

  • asahi: Handle 3rd deflake address

  • asahi: Account for every byte of the cmdbuf

  • asahi: Identify more depth pipeline fields

  • asahi: Dump “unknown 4a” structure

  • asahi: Add XML used for visibility tests

  • asahi: Relax Draw “command” check

  • asahi: Note unknown field

  • asahi: Identify depth bias array

  • asahi: Identify “set depth bias index” field

  • asahi: Identify depth bias enable bit

  • asahi: Add a depth bias structure

  • asahi: Implement polygon offset

  • asahi: Identify partial render pipeline

  • asahi: Sync some names with Project Zero

  • asahi: Add segment list pretty printer

  • asahi: Split unknown field in segment list header

  • asahi: Validate segment list length

  • asahi: Compute segment list length

  • asahi: Align segment list unknowns with Metal

  • asahi: Decode resource allocation responses

  • asahi: Decode resource allocation requests

  • asahi: Identify suballocation mechanism

  • asahi: Identify suballocated size

  • asahi: Drop some unknowns

  • asahi: Sync some more unknown fields

  • asahi: Improve render target dumping

  • asahi: Fix pipeline decoding

  • asahi: Handle RECT textures as 2D

  • asahi: Allocate a larger segment list

  • asahi: Guard against encoder overflows

  • asahi: Add wrap library

  • docs/asahi: Add initial driver docs

  • panfrost: Simplify the kmsro create path

  • panfrost: Don’t allocate SHARED as scanout

  • pan/bi: Recoalesce tied operands after spilling

  • pan/bi: Schedule for pressure pre-RA

  • pan/bi: Allow CSEing LEA_BUF_IMM

  • panfrost: Fix crash with GALLIUM_HUD

  • panfrost: Extract afbc_stride_blocks helper

  • panfrost: Extract pan_afbc_row_stride helper

  • panfrost: Fix is_wide return type

  • panfrost: Add pan_afbc_tile_size helper

  • panfrost: Support tiled AFBC in stride helpers

  • panfrost: Align layouts to tiles of superblocks

  • panfrost: Add AFBC stride unit tests

  • panfrost: Add 3D texture layout unit test

  • panfrost: Calculate header_size based on row_stride

  • panfrost: Add a tiled 16x16 layout unit test

  • panfrost: Add 1x1 layout unit tests

  • panfrost: Fix AFBC flags on v6

  • panfrost: Put comment in correct #ifdef

  • panfrost: Handle AFBC Tiled

  • panfrost: Add helper checking tiled AFBC support

  • panfrost: Gate tiled AFBC on GPUs that support it

  • panfrost: Advertise 16x16 tiled AFBC

  • panfrost: Use tiled AFBC textures

  • pan/bi: Fuse result types

  • pan/bi: Test CMP result_type optimization

  • panfrost: Allow FPK on Valhall

  • panfrost: Set overdraw_alpha[01] flags

  • panfrost: Set allow_rotating_primitives

  • pan/bi: Don’t allow spilling coverage mask writes

  • pan/bi: Add VAR_TEX fusing unit test

  • pan/va: Fill in missing src_flat16 enum

  • pan/bi: Make LD_VAR w=format instead of w=vecsize

  • pan/bi: Model Valhall source formats

  • pan/bi: Interpolate varyings at 16-bit

  • pan/bi: Extend bi_scoreboard_state for finer tracking

  • pan/bi: Add slot to bi_instr

  • pan/bi: Preserve flow control for non-psiz variant

  • pan/bi: Add ASSERT_SHADER_EQUAL macro

  • pan/bi: Add a trivial ctx->inputs for unit tests

  • pan/bi: Extract bit_block helper

  • pan/bi: Export bi_block_add_successor

  • pan/bi: Export helper termination analysis

  • pan/bi: Print flow control on instructions

  • pan/va: Add pass to insert flow control

  • pan/va: Do scoreboard analysis

  • pan/va: Terminate helper threads

  • pan/va: Add flow control merging pass

  • pan/va: Assign slots roundrobin

  • pan/va: Respect assigned slots

  • pan/va: Unit test flow control insertion

  • pan/va: Unit test flow control merging

  • pan/bi: Use flow control lowering on Valhall

  • pan/mdg: Scalarize with 64-bit sources

  • pan/mdg: Remove uppercase write masks

  • pan/mdg: Use “<<” instead of “lsl”

  • pan/mdg: Only print masked components of swizzle

  • pan/mdg: Only print 2 sources for ALU

  • pan/mdg: Only print 1 source for moves

  • pan/mdg: Distinguish SSA vs reg when printing IR

  • pan/mdg: Print outmods when printing IR

  • panfrost: Fix XML for AFBC header on v9

  • panfrost: Pass arch instead of dev into afbc_format

  • panfrost: Add pan_afbc_compression_mode on Valhall

  • panfrost: Support rendering to AFBC on Valhall

  • panfrost: Handle texturing from AFBC on Valhall

  • panfrost: Allow 2D AFBC on Valhall

  • panvk: Use vk_image_subresource_*_count for clears

  • pan/bi: Respect swizzles in nir_op_pack_64_2x32_split

  • pan/bi: Allow vec6 for collects

  • pan/bi: Align 64-bit register sources

  • pan/bi: Rework Valhall register alignment

  • pan/bi: Fix clper_xor on Mali-G31

  • panfrost/ci: Mark draw_buffers_indexed.* as flakes

  • panfrost: Use C11 static_assert for enums

  • nir: Add transform feedback system values

  • nir: Export nir_io_add_intrinsic_xfb_info

  • pan/bi: Add transform feedback lowering pass

  • pan/bi: Handle transform feedback intrinsics

  • panfrost: Don’t allow vertex shaders to have side effects

  • panfrost: Wire up transfrom feedback sysvals

  • panfrost: Create transform feedback shaders

  • panfrost: Launch transform feedback shaders

  • pan/va: Do not insert NOPs into empty shaders

  • panfrost: Extract panfrost_get_index_buffer helper

  • panfrost: Don’t calculate min/max indices on v9

  • pan/bi: Extract MUX to CSEL optimization

  • pan/va: Implement more lanes

  • pan/va: Lower MUX to CSEL where possible

  • pan/va: Add MUX lowering tests

  • pan/bi: Implement b2i with MUX

  • pan/va: Record which instructions are signed

  • pan/va: Try negating small constants when lowering

  • pan/va: Try widening small constants

  • pan/va: Unit test constant lowering pass

  • panfrost: Enable Mali-G57

  • docs: Document Mali-G57 conformance

  • panfrost: Fix inverted indirect dispatch condition

  • pan/bi: Implement load_push_constant

  • pan/bi: Don’t reorder push with no_ubo_to_push

  • panfrost: Use push constants for indirect dispatch

  • panfrost: Use push constants for indirect draws

  • panvk: Use push constants for clear

  • panvk: Use push constants for copy shaders

  • panfrost: Deduplicate indirect dispatch structs

  • panfrost: Remove redundant first_tag access

  • panfrost: Remove is_64b assignments

  • panfrost: Add panfrost_query_l2_slices helper

  • pan/perf: Fix performance counters on G57

  • mesa,gallium: Make point coord origin a CAP

  • asahi,agx: Don’t set PIPE_CAP_POINT_COORD_ORIGIN_UPPER_LEFT

  • panvk: Drop support for Midgard

  • panfrost: Allow pixels using discard to be killed

  • panfrost: Inline pan_fbd_has_zs_crc_ext

  • panfrost: Disable CRC at <16x16 tile sizes

  • panvk: Don’t specialize clear shaders for RT

  • panvk: Remove unused pushmaps

  • panvk: Clear Z/S attachments without a shader

  • panvk: Fix stencil clears of combined ZS images

  • panvk: Simplify depth clear preload condition

  • panvk: Use common CmdBeginRenderPass

  • panfrost: Disable QUADS on Valhall

  • panfrost: Fix batch state changes on Valhall

  • panfrost: Add tristate_is_defined helper

  • panfrost: Implement provoking vertices on Valhall

  • pan/va: Move VA_NUM_GENERAL_SLOTS to common

  • pan/va: Simplify insert flow tests

  • pan/va: Workaround quirk of barrier handling

  • pan/va: Unit test barrier handling

  • pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK

  • pan/bi: Constify bi_is_staging_src argument

  • pan/va: Add header guards to valhall_enums.h

  • pan/va: Move va_flow_is_wait_or_none to common

  • pan/va: Include BLEND for va_swap_12

  • pan/va: Use validate_register_pair for BLEND pack

  • pan/va: Mark last register reads

  • pan/va: Unit test va_mark_last

  • pan/bi: Fix LD_BUFFER.i16 definition

  • pan/bi: Align accesses with packed TLS

  • pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32

  • pan/va: Implement sample positions FAU packing

  • pan/va: Rename RSCALE to LDEXP

  • pan/bi: Remove FRSCALE from IR

  • pan/bi: Model MKVEC.v2i8

  • pan/bi: Constant fold MKVEC.v2i8

  • pan/va: Pack MKVEC.v2i8 byte lanes

  • pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8

  • pan/bi: Tune lower_vars_to_scratch

  • panfrost: Enable more FP16 caps on Valhall

  • panfrost: Enable CAP_INDIRECT_TEMP_ADDR on Valhall

  • panfrost: Bump ESSL_FEATURE_LEVEL on Valhall

  • v3d: Drop workaround for u_blitter bug

  • agx: Handle loop { if { loop { .. } } }

  • r600g: Remove streamout-based buffer copy path

  • u_blitter: Remove util_blitter_copy_buffer

  • lima,panfrost: Do not ralloc contexts against the screen

  • pan/va: Handle terminal barriers

  • ttn: Set nir->info.separate_shader

  • pan/bi: Fix unpack_32_2x16 definition

  • pan/va: Add missing <roundmode/> to V2F32_TO_V2F16

  • pan/bi: Implement f2f16{_rtz, _rtne}

  • panfrost: Separate core ID range from core count

  • panfrost: Move bifrost_lanes_per_warp to common

  • panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS

  • panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE

  • panfrost/ci: Disable 0ad trace on T860

  • panfrost: Clear with a quad to avoid flushing

  • panfrost/ci: Disable T760 jobs

  • nir/opt_shrink_vectors: Round to supported vec size

  • panfrost: Revert provoking vertex assertion

  • panfrost: Only key points to point coord origin

  • pan/va: Dump unencodable instructions

  • pan/va: Use invalid_instruction in more places

  • pan/va: Print instructions with pack assert fails

  • pan/va: Handle BIFROST_MESA_DEBUG=nosb

  • panfrost: Make pan_select_max_tile_size O(1)

  • panfrost: Use implementation-specific tile size

  • panfrost: Protect pandecode by a mutex

  • pan/bi: Require ATEST coverage mask input in R60

  • pan/va: Stall after ATEST

  • panfrost: Fix shader_modifies_coverage on Valhall

  • panfrost: Add decoupled early-ZS helpers

  • panfrost: Unit test early-ZS helpers

  • panfrost: Add zsa->zs_always_passes flag

  • panfrost: Use early-ZS helpers

  • util: Enable list_assert in debugoptimized builds

  • util: Gate simple_mtx_assert_locked on !NDEBUG

  • agx: Port minifloat tests to GTest

  • pan/bi: Assert that we use the correct CLPER

  • pan/bi: Rename CLPER_V6.i32 to CLPER_OLD.i32

  • pan/decode: Fix overrun decoding planes

  • nir/lower_idiv: Be less creative about signs

  • agx: Fix packing of samplers in texture instrs

  • panfrost: Don’t segfault on unknown models

  • pan/bi: Don’t reorder image loads across stores

  • pan/bi: Don’t allow ATEST to take a temporary

  • pan/mdg: Print 3 sources for CSEL

  • pan/bi: Fix dual texturing with uniforms

  • pan/bi: Fix out-of-bounds write in va_lower_split_64bit

  • pan/bi: Consider all dests in helper_block_update

  • agx: Fix float copyprop of neg(neg) case

  • panfrost: Respect buffer offset for OpenCL

Andres Calderon Jaramillo (2):

  • nir: Account for YUV range.

  • mesa/st: Account for YUV color space and range.

Andres Gomez (5):

  • radv/ci: update vkd3d-proton results for AMD’s Kabini

  • ci: install a more recent version of Wine in the test-vk image

  • ci: install gfxinfo-mupuf in the test-vk image

  • ci: move b2c artifacts to its own directory

  • ci: add wrapper script for Valve’s traces runner

Anuj Phogat (2):

  • include/uapi: Update drm_fourcc.h from kernel

  • isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP

Arvind Yadav (7):

  • nir: Add a lowering pass for polygon and line smoothing

  • radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing

  • nir: add load_point_coord_maybe_flipped intrinsics for point smoothing

  • nir: Add a lowering pass for point smoothing

  • ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi

  • ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes

  • radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing

Autumn on Tape (5):

  • gallivm: add subgroup shuffle support

  • lavapipe: enable subgroup shuffle operations

  • gallivm: use shufflevector for shuffles when index is constant data

  • gallivm: use VPERMPS (x86/AVX2) for 32-bit 8-element shuffles

  • lavapipe: enable subgroup quad operations

Axel Davy (6):

  • frontend/nine: Skip invalid swvp calls

  • frontend/nine: Fix buffer tracking out of bounds

  • frontend/nine: Fix ATOC handling

  • frontend/nine: Fix cso restore bug

  • frontend/nine: Fix shader multi-use crash

  • frontend/nine: Fix ff position_t fallback when w = 0

Bas Nieuwenhuizen (39):

  • vulkan/wsi/x11: Ensure we have the required number of images for acquire.

  • radv: always enable NGG on GFX11

  • radv: Mark all formats as DCC compatible with each other on gfx11.

  • radv: gfx11 register changes.

  • radv: Adjust for new SWIZZLE_ENABLE.

  • radv: Add gfx11 DCC fast clear support.

  • radv: Disable transform feedback for gfx11.

  • radv: update the initialization of SGPR0/1 registers for HS and GS on GFX11

  • radv: Deal with derefs from opaque types in function parameters.

  • radv: Add performance counter reg write.

  • radv: move ac_perfcounters to physical_device.

  • radv: Add pstate locking for perfcounters.

  • radv: Rename SPM functions.

  • radv: Make inhibiting clockgating not static.

  • radv: Add helper BO for perf counters.

  • radv: Add perf counter lock/unlock commandbuffers.

  • radv: Add a list of performance counters.

  • radv: Implement support for querying performance counters.

  • radv: Implement listing performance counters.

  • radv: Expose VK_KHR_performance_query.

  • radv: Add ability to override the build id for the cache.

  • radv: Add a 32bit memory type.

  • radv: Skip setting empty index buffers to avoid hang

  • radv: Expose function to write vertex descriptors for dgc.

  • radv: Always store stride in the vbo descriptor.

  • radv: Require 32bit memory for indirect buffers.

  • radv: Expose helper for base pa_su_sc_mode_cntl.

  • radv: Make radv_get_vgt_index_size non-static.

  • radv: Add helper to write scissors.

  • radv: Add flushing for DGC.

  • radv: Add DGC meta shader.

  • radv: Implement DGC generated command layout structure.

  • radv: Implement DGC cmdbuffer generation.

  • radv: Implement CmdExecuteGeneratedCommandsNV.

  • radv: Add stub for vkCmdBindPipelineShaderGroupNV.

  • radv: Expose VK_NV_device_generated_commands.

  • vulkan/wsi: Take max extent into consideration for modifier selection.

  • amd/common: Don’t rely on DCN support checks with modifiers.

  • amd/common: Disable DCC retile modifiers on RDNA1

Ben Skeggs (3):

  • nv50/ir: fix OUT.FINAL on ampere

  • nv50/ir: fix scope of memory ops on ampere

  • nv50/ir: fix ALD/AST where address comes from AL2P

Benjamin Cheng (5):

  • radv: fix memory leak of descriptor set layout

  • radv: refactor image binding into struct

  • radv: consolidate descriptor set buffer counts

  • radv: implement disjoint memory for multiplane images

  • radv: expose disjoint image support

Boris Brezillon (159):

  • dzn: Skip binding entries with zero descriptors

  • dzn: Make sure sampler heaps don’t contain more than 2048 samplers

  • microsoft/compiler: Add a dunmmy SV_SampleIndex when needed

  • microsoft/spirv_to_dxil: Allow forcing per-sample shading

  • dzn: Add support for sampleRateShading

  • dzn: Pass the right type to CreateCommandList() in the reset path

  • dzn: refactor error-handling

  • ci/windows: Add a variable to globally disable jobs using windows runners

  • dzn: Use core helpers to fill physical device features/properties

  • dzn: Report actual device limits

  • d3d12: Move the D3D12ResourceState code to the gallium driver

  • ci/windows: Split the windows-related tests

  • dzn: Add support for EXT_vertex_attribute_divisor

  • dzn: Rename a bunch of dzn_descriptor_set helpers

  • dzn: Pass layout objects to various dzn_descriptor_set_ptr_xxx helpers

  • dzn: Split the write desc helpers in two halves

  • dzn: Add support for KHR_descriptor_update_template

  • microsoft/spirv_to_dxil: Add a draw_id field to vertex_runtime_data

  • dzn: Fix exec_buf_stride calculation

  • dzn: Simplify the triangle_fan_rewrite_index shader

  • dzn: Update the draw_id sysval when issuing indirect draws

  • dzn: Advertise KHR_shader_draw_parameters

  • dzn: Pass an image view to dzn_cmd_buffer_clear_attachment()

  • dzn: Set MAX_RTS to D3D12_SIMULTANEOUS_RENDER_TARGET_COUNT

  • dzn: Move all render-related states to a substruct

  • dzn: Inline dzn_cmd_buffer_clear_attachments()

  • dzn: Add support for dynamic rendering

  • dzn: Get rid of the render pass logic

  • dzn: Allow multiDrawIndirect and drawIndirectFirstInstance

  • dzn: Add KHR_draw_indirect_count support

  • dzn: Add missing VKAPI_{ATTR,CALL} specifiers to BeginCommandBuffer()

  • dzn: Pass the dzn_event pointer to _mesa_hash_table_insert()

  • dzn: Fix the STATIC_ASSERT() in dzn_meta_blits_get_context()

  • vulkan/wsi: Make wsi_win32_surface_get_capabilities() return the current extent

  • dzn: Fix dzn_cmd_buffer_collect_queries()

  • dzn: Use the vk_sync_binary abstraction

  • spirv: Fix windows build

  • dzn: Use the correct ICD path on Win32

  • microsoft/spirv_to_dxil: Fix push_constants type declaration

  • microsoft/spirv_to_dxil: Fix the push_constant UBO size calculation

  • dzn: Emit missing transition barriers for rendering attachments

  • ci/windows: Bump spirv-samples commit-ID

  • microsoft/spirv2dxil: Add a debug option

  • microsoft/compiler: Make sure all SV_Position components are written

  • microsoft/spirv_to_dxil: Extract NIR passes out of spirv_to_dxil()

  • microsoft/spirv_to_dxil: Constify the descriptor binding mapping fields

  • microsoft/compiler: Make input_mappings immune to driver_location holes

  • microsoft/spirv_to_dxil: Remove dead IO vars at the end of dxil_spirv_nir_passes()

  • microsoft/spirv_to_dxil: Add a linking helper

  • microsoft/compiler: Allow vulkan users to handle varyings linking

  • microsoft/spirv_to_dxil: vectorize IOs

  • dzn: Fix maxVertexInputAttributes limit

  • dzn: Handle varying linking explicitly

  • dzn: Fix unitialized var access in dzn_meta_compile_shader()

  • dzn: Fix vertex input handling

  • microsoft/spirv_to_dxil: Let spirv_to_dxil() handle the linking

  • microsoft/compiler: Drop the auto_link parameter passed to nir_to_dxil()

  • dzn: Switch to ID3D12Device2

  • dzn: Drop an unused argument passed to dzn_graphics_pipeline_translate_vi()

  • dzn: Use CreatePipelineState()

  • dzn: Enable depth-bounds testing

  • microsoft/compiler: Lower fquantize2f16

  • nir: Fix flat new_var assignment in create_new_io_vars()

  • microsoft/compiler: Pick a type that matches interpolation mode for structs

  • microsoft/compiler: Truncate function names when needed

  • vulkan/util: Fill VkPhysicalDeviceIDProperties::deviceNodeMask

  • spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store}

  • microsoft/compiler: Images are no longer reprensented by uniform vars

  • microsoft/compiler: Set typed_uav_load_additional_formats when appropriate

  • microsoft/compiler: s/assert()/unreachable()/ in emit_intrinsic()

  • microsoft/compiler: Support load_sample_id_no_per_sample

  • dzn: Decorrelate external image info and external image properties

  • dzn: Return empty external properties

  • dzn: Add a dummy GetDeviceGroupPeerMemoryFeatures()

  • dzn: Add a dummy GetImageSparseMemoryRequirements2()

  • dzn: Add dummy {Create,Destroy}SamplerYcbcrConversion() implementations

  • dzn: Check that no export is requested in the memory allocation path

  • dzn: Lower alignment requirements when allocating buffers or single-sample images

  • dzn: Add a dzn_desc_type_has_sampler() helper

  • dzn: Implement GetDescriptorSetLayoutSupport()

  • dzn: Vulkan 1.1 wants maxPerSetDescriptors >= 1024

  • dzn: Prepare support for pipeline variants

  • dzn: Fix primitiveRestart support

  • dzn: Fix triangle-fan emulation

  • dzn: Support dynamic depth testing parameters properly

  • dzn: Support dynamic line width

  • ci/windows: Allow skipping deqp tests if we have to

  • ci/windows: Add dEQP-VK.dynamic_state.* to the test list

  • ci/windows: Skip dEQP-VK.api.command_buffers.record_many_draws*

  • d3d12: Keep access masks up-to-date when removing vars in d3d12_disable_multisampling()

  • microsoft/compiler: Move d3d12_fix_io_uint_type() to dxil_nir.c

  • microsoft/compiler: Fix Layer type

  • microsoft/compiler: Support gl_Layer

  • dzn: Disable rasterization if the last geometry stage doesn’t write the position

  • dzn: Enable support for geometry shaders

  • ci/dzn: Move to a .toml description

  • dzn: Fix the 2DMSArray case in dzn_image_get_dsv_desc()

  • dzn: Fix CmdPushConstants()

  • dzn: Clamp depthBiasConstantFactor when doing the float -> int conversion

  • dzn: Transition resource to RENDER_TARGET/DEPTH_WRITE before clears

  • microsoft/compiler: Fix emit_ubo_var()

  • ci/dzn: Copy testlog.{css,xsl} to the result dir

  • nir/serialize: Put dest last in packed_instr::tex

  • dzn: Advertise anisotropic filtering support

  • dzn: Advertise shaderImageGatherExtended support

  • dzn: Enable dynamic indexing on all kind of descriptors

  • dzn: Enable shader{Clip,Cull}Distance

  • dzn: Enable the depthClamp feature

  • nir/serialize: Support texop >= 16

  • nir/serialize: Silence integer-overflow false positive

  • vulkan: Add a vk_pipeline_hash_shader_stage() helper

  • anv: Use vk_pipeline_hash_shader_stage()

  • v3dv: Fix nir_shader leaks in v3dv_meta_{clear,copy}()

  • v3dv: Use vk_pipeline_hash_shader_stage()

  • tu: Use vk_pipeline_hash_shader_stage()

  • radv: Kill unused fs_m local var in radv_create_shaders()

  • radv: Use vk_pipeline_hash_shader_stage()

  • dzn: Hook-up device-lost detection

  • dzn: Fix indentation

  • dzn: Initialize UUIDs

  • dzn: Drop dzn_pipeline_cache.c

  • dzn: Add missing D3D12_CACHED_PIPELINE_STATE to MAX_COMPUTE_PIPELINE_STATE_STREAM_SIZE

  • dzn: Fix assertion in d3d12_pipeline_state_stream_new_desc()

  • dzn: Use d3d12_compute_pipeline_state_stream_new_desc() in the compute path

  • dzn: Pass the maximum stream size to d3d12_pipeline_state_stream_new_desc()

  • dzn: Drop useless while(link_mask != 0)

  • dzn: Drop unused allocator passed to dzn_graphics_pipeline_create()

  • dzn: Let dzn_pipeline_init() initialize the root signature in the stream

  • dzn: Fix potential nir_shader leak

  • dzn: Use vk_to_mesa_shader_stage()

  • dzn: Save a few indentation levels in graphics_pipeline_compile_shaders()

  • dzn: Don’t delegate binding translation to dxil_spirv_nir_passes()

  • dzn: Hash pipeline layout stages

  • dzn: Add a generic cached blob wrapper

  • dzn: Drop unneeded goto statement in dzn_compute_pipeline_create()

  • dzn: Move the compute shader compilation logic to a sub-function

  • dzn: Cache NIR shaders

  • dzn: Caculate a binding translation hash

  • dzn: Cache DXIL shaders

  • dzn: Cache pipeline info

  • ci/dzn: Add pipeline cache tests to the include list

  • dzn: Cache and pack transition barriers

  • dzn: Don’t call dzn_cmd_buffer_queue_transition_barriers() prematurely

  • dzn: Rename dzn_cmd_buffer_queue_image_subres_range_transition()

  • dzn: Introduce dzn_cmd_buffer_queue_image_range_state_transition()

  • dzn: Allow shaders to read from depth/stencil attachments

  • dzn: Fix copies of block-based formats

  • dzn: Fix unused var warnings

  • dzn: Fix ‘statement with no effect’ warning

  • dzn: Fix printf() format in dxgi_get_factory() error message

  • dzn: Fix {back,front}_wr_uses_ref assignment

  • dzn: Fix -Wswitch warnings

  • dzn: Use the vk_descriptor_set_layout base struct

  • dzn: Use the vk_pipeline_layout base struct

  • dzn: Mark transition barriers as executed when we execute barriers

  • dzn: Query D3D12_FEATURE_D3D12_OPTIONS3 features

  • dzn: Provide a helper to check if 2 formats are compatible

  • dzn: Support native image copies when formats are compatible

  • dzn: Take VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT into account

Boyuan Zhang (9):

  • radeonsi/vcn: add vcn 4.0 encode support

  • radeonsi/vcn: add vcn 4.0 encode fw interface version

  • radeonsi/vcn: add encode context ib for vcn 4.0

  • radeonsi/vcn: add session init ib for vcn 4.0

  • radeonsi/vcn: enable vcn 4.0 encode for gfx11 asic

  • radeonsi/vcn: use calculated max hierarchy depth for hevc enc

  • vl: add cabac init idc for h264 encode

  • frontends/va: get entropy mode info for h264 encode

  • radeonsi/vcn/enc: enable cabac entropy mode

Brian Paul (101):

  • gallivm: assorted clean-ups in lp_bld_nir_aos.c

  • gallivm: whitespace fixes in lp_bld_tgsi.h

  • gallivm: asst. clean-ups in lp_bld_sample.h

  • llvmpipe: asst. clean-ups in lp_linear_fastpath.c

  • llvmpipe: asst. clean-ups in lp_setup.c

  • llvmpipe: asst. clean-ups in lp_state_setup.c

  • gallivm: asst. clean-ups in lp_bld_type.[ch]

  • gallivm: cleanup lp_bld_arit.c

  • gallivm: clean-up in lp_bld_nir.h

  • gallivm: add const qualifier in lp_bld_nir_soa.c

  • llvmpipe: asst. clean-up in lp_state_derived.c

  • llvmpipe: asst. clean-ups in lp_rast_linear.c

  • llvmpipe: fix comment typo in lp_linear_fastpath.c

  • llvmpipe: asst. clean-ups in lp_rast.c

  • llvmpipe: asst. clean-ups in lp_linear.c

  • llvmpipe: asst. clean-ups in lp_scene.h

  • llvmpipe: asst. clean-up and add comments in lp_state_fs.h

  • gallivm: clean-up in lp_bld_sample.c

  • llvmpipe: asst. clean-up in lp_setup_context.h

  • gallivm: asst. clean-ups in lp_bld_sample_soa.c

  • gallivm: clean-ups in lp_bld_tgsi.[ch]

  • llvmpipe: asst. clean-ups in lp_state_fs.c

  • llvmpipe: minor clean-ups in lp_jit.h

  • llvmpipe: asst. clean-ups in lp_state_fs_linear_llvm.c

  • llvmpipe: asst. clean-ups in lp_state_fs_linear.c

  • llvmpipe: asst. clean-ups in lp_setup_tri.c, lp_setup_rect.c

  • llvmpipe: whitespace clean-ups in lp_rast.h

  • llvmpipe: minor whitespace/formatting cleanups in lp_state_setup.h

  • llvmpipe: minor clean-ups in lp_state_fs_analysis.c

  • llvmpipe: misc clean-up in lp_rast_tri.c

  • llvmpipe: asst. clean-ups in lp_bld_blend_aos.c

  • llvmpipe: asst. clean-ups in lp_rast_tri_tmp.h

  • llvmpipe: asst. clean-ups in lp_rast_priv.h

  • gallivm: minor clean-ups in lp_bld_sample_soa.c

  • llvmpipe: misc clean-ups in lp_bld_interp.c

  • gallivm: asst. clean-ups in lp_bld_nir.c

  • llvmpipe: fix comment typo in lp_bld_interp.h

  • llvmpipe: asst. clean-ups in lp_linear_sampler.c

  • llvmpipe: asst. clean-ups in lp_tex_sample.c

  • llvmpipe: asst. clean-ups in lp_scene.c

  • llvmpipe: minor clean-up in lp_state_blend.c

  • llvmpipe: use enum blend types instead of unsigned

  • llvmpipe: s/unsigned/enum pipe_compare_func/

  • gallivm: asst. clean-ups in lp_bld_logic.

  • llvmpipe: asst. clean-ups in lp_state_fs.c

  • draw: s/unsigned/enum pipe_prim_type/

  • draw: asst. clean-up in draw_context.h

  • draw: asst. clean-up in draw_gs.h

  • draw: asst. clean-up in draw_pipe.c

  • draw: asst. clean-up in draw_pipe_validate.c

  • draw: asst. clean-up in draw_private.h

  • draw: asst. clean-up in draw_pt.h

  • draw: asst. clean-up in draw_pt.c

  • draw: asst. clean-up in draw_pt_fetch_shader_emit.c

  • draw: asst. clean-up in draw_pt_fetch_shader_pipeline.c

  • draw: asst. clean-up in draw_pt_util.c

  • draw: asst. clean-up in draw_pt_vsplit.c

  • draw: asst. clean-up in draw_vbuf.h

  • gallivm: use enum types instead of unsigned in lp_bld_sample.[ch]

  • llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c

  • llvmpipe: remove lp_rast_cmd_arg::state field

  • gallivm: s/unsigned/enum pipe_swizzle/

  • llvmpipe: add simple assertion in generate_fragment()

  • util/bitset: add BITSET_SIZE()

  • lavapipe: rework code to compute textures_used, samplers_used

  • llvmpipe: minor code re-org in lp_state_fs_analysis.c

  • nir: add const qualifiers, move some decls in nir_to_tgsi_info.c

  • llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat()

  • gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/

  • llvmpipe: replace GET_A0() macro w/ inline function

  • llvmpipe: add missing tex_info->texture_unit assignment

  • llvmpipe: fix invalid memory used in lp_fs_linear_run

  • llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat()

  • llvmpipe: replace LP_RAST_OP_ #defines with enum type

  • gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32

  • lavapipe: fix logicop, independent blend enable/disable

  • llvmpipe: don’t allow texture/resource swizzles on linear path

  • llvmpipe: initialize a local var to fix compiler warning in release build

  • lavapipe: fix incorrect sv[] array size

  • llvmpipe: fix texcoord analysis regression

  • llvmpipe: asst. clean-up in lp_bld_depth.c

  • llvmpipe: minor white-space clean-up in lp_setup.h

  • llvmpipe: asst. clean-ups in lp_setup_rect.c

  • gallivm: asst. clean-ups in lp_bld_const.c

  • llvmpipe: asst. clean-ups in lp_linear_interp.c

  • llvmpipe: minor optimization in lp_linear_init_interp()

  • llvmpipe: asst. clean-ups in lp_state_setup.c

  • llvmpipe: remove unneeded casts

  • llvmpipe: replace ‘unsigned’ with enum types

  • gallivm: asst. clean-ups in lp_bld_swizzle.c

  • gallivm: simplify shuffling in emit_store_reg()

  • gallivm: fix incorrect memset() in SOA emit_load_const()

  • gallivm: rename ‘var’ param in emit_store_reg(), emit_store_var().

  • llvmpipe: make opaque field a 1-bit field

  • llvmpipe: minor whitespace clean-up in lp_state_fs.h

  • llvmpipe: pass frontfacing param to jit_function() call

  • llvmpipe: asst. clean-up in lp_rast_linear_fallback.c

  • llvmpipe: remove unneeded casts in lp_rast_linear_fallback.c

  • llvmpipe: consolidate code in lp_rast_linear_fallback.c

  • llvmpipe: tighten up some code in lp_rast_linear_fallback.c

  • llvmpipe: add some const qualifiers in lp_rast_linear_fallback.c

Caio Oliveira (1):

  • intel/compiler: In XeHP prefer <1;1,0> regions before compacting

Chad Versace (2):

  • venus: Refactor vn_fix_graphics_pipeline_create_info

  • venus: Don’t encode ignored pTessellationState

Charlie Turner (4):

  • ci: fix shellcheck violations in the test-vk building script

  • ci: add Linux build of apitrace

  • ci: upgrade gfxreconstruct build to v0.9.10

  • ci: upgrade to DXVK v1.10.1

Charmaine Lee (20):

  • gallium/util: Increase the debug_flush map depth to 64

  • draw: fix double free of NIR IR

  • svga: fix shader IR type passed to draw create shader function

  • svga/nir: enable PIPE_CAP_TGSI_TEXCOORD

  • svga: enable PIPE_CAP_TGSI_TEXCOORD for vgpu10 and up

  • svga/tgsi: add translation for DTRUNC

  • svga/nir: enable lower double option

  • svga: workaround for unexpected double swizzle

  • svga: fix clip distance shadow copy writemask

  • svga: add a helper function for common shader creation

  • svga: create svga_shader_info

  • svga: add svga_compile_shader helper function

  • svga: add need_texcoord_semantic to tgsi_add_point_sprite & tgsi_add_aa_point

  • svga: fix aa point

  • svga: fix reference to NULL offset for atomic counter

  • svga: fix shared memory index

  • svga: fix mksstats build

  • mesa/st: fix reference to nir->info after nir_to_tgsi

  • svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context

  • svga: fix invalid component access of domain location

Chia-I Wu (61):

  • turnip: fix drm modifier support with planar formats

  • util: Keep quiet NaNs quiet when converting to half float.

  • anv: advertise rectangularLines only for Gen10+

  • turnip: let modifier takes precedence over TU_DEBUG=noubwc

  • freedreno/fdperf: make refresh rate configurable

  • freedreno/fdperf: support dumping counters

  • turnip: fix sampledImageIntegerSampleCounts

  • turnip: emit VPC_SO_DISABLE in xfb begin/end

  • ir3: fix mem_ctx for ir3_disasm_info::nir

  • turnip: fix off-by-one in border color bitset

  • turnip: fix tu6_pack_border_color for z24

  • turnip: disable UBWC for SNORM formats

  • turnip: if-checks autotune debug macros

  • turnip: base pipeline draw costs on colorWriteMask

  • turnip: include stencil test for drawcall costs

  • turnip: estimate render pass costs

  • turnip: consider render pass costs in autotune

  • ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure

  • turnip: force sample interpolations for sample shading

  • turnip: advertise VK_EXT_border_color_swizzle

  • freedreno/fdl: use XYZW swap for PIPE_FORMAT_X24S8_UINT

  • vulkan/device_select: remove get_pdevice_proc_addr

  • freedreno/drm-shim: add a660

  • turnip: fix an assertion with drm-shim

  • ir3: update ir3_const_state comment

  • ir3: copy req_local_mem for MESA_SHADER_KERNEL

  • ir3: fix output_loc size

  • ir3: remove unused patch_vertices_in

  • ir3: increment constoff right after it is assigned

  • ir3: fix tess param allocation

  • ir3: set UL flag before ir3_lower_subgroups

  • turnip: remove shared_consts from tu_compiled_shaders

  • turnip: shared_consts and push_consts are mutually exclusive

  • turnip: add tu_common.h as the common header

  • turnip: remove includes that are already in tu_common.h

  • turnip: add tu_drm.h

  • turnip: add tu_suballoc.h

  • turnip: update tu_cs.h

  • turnip: add tu_query.h

  • turnip: add tu_image.h

  • turnip: add tu_formats.h

  • turnip: update tu_descriptor_set.h

  • turnip: add tu_shader.h

  • turnip: add tu_pipeline.h

  • turnip: add tu_clear_blit.h

  • turnip: add tu_dynamic_rendering.h

  • turnip: add tu_lrz.h

  • turnip: add tu_pass.h

  • turnip: add tu_wsi.h

  • turnip: update tu_autotune.h

  • turnip: add tu_device.h

  • turnip: add tu_cmd_buffer.h

  • turnip: add tu_android.h

  • turnip: update tu_util.h

  • turnip: move away from tu_private.h

  • turnip: remove tu_private.h

  • turnip: remove headers from libtu_files

  • turnip: use SPDX-License-Identifier

  • turnip: fix a use-after-free in autotune

  • turnip: lower the queue priority to 1

  • ir3: fix predicate splitting in scheduler

Christian Gmeiner (12):

  • isaspec: Extend split_bits(..) to accept a bitsize

  • isaspec: Handle patterns bigger then 64 bit

  • nir: Fix unused-variable compile warnings

  • freedreno: Wait for async compile jobs to finish

  • ir3: Create at least one thread for async shader compile

  • gtest: Fix maybe-uninitialized compiler warning

  • r600: Switch to unreachable(..)

  • svga: Clear query before usage

  • ci: Remove -Wno-error=maybe-uninitialized

  • etnaviv: factor out shader screen related init/deint

  • etnaviv: async shader compile

  • etnaviv: Move nir_shader_compiler_options to compiler

Chuansheng Liu (1):

  • iris,anv: correct the max thread number for DG2+

Clément Guérin (1):

  • venus: implement VK_EXT_physical_device_drm

Connor Abbott (69):

  • tu: Delete unused tu_clear_blit GS handling

  • ir3, freedreno, tu: Make ir3_shader_variant store stream_output

  • ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant

  • ir3: Use ir3_shader_variant::type more often

  • ir3: Allocate disasm_info under variant

  • ir3: Add ir3_shader_variant::compiler

  • tu: Keep original blit shaders separately

  • ir3: Remove ir3_shader_variant::shader

  • ir3: Add functions to serialize variants

  • ir3: Support disabling the pipeline cache

  • ir3: Add ir3_shader_create_variant()

  • tu: Hash pipeline layout contents

  • tu: Rewrite cache UUID based on radv

  • tu: Include turnip debug flags in pipeline cache UUID

  • tu: Implement pipeline caching with shared Vulkan cache

  • tu: Zero-initialize compute driver key

  • tu: Implement VK_EXT_pipeline_creation_feedback

  • tu: Implement VK_EXT_pipeline_creation_cache_control

  • tu: Convert to sync2 entrypoints

  • tu: Expose VK_KHR_synchronization2

  • tu: Implement VK_EXT_image_2d_view_of_3d

  • tu: Enable shaderTessellationAndGeometryPointSize

  • tu: Enable VK_KHR_swapchain_mutable_format

  • tu: Fix partial-write LRZ determination

  • tu: Implement extendedDynamicState2LogicOp

  • tu: Implement VK_EXT_color_write_enable

  • tu: Enable uniform texture buffers with NPOT formats

  • freedreno/a6xx: Support more 3-component formats

  • tu: Fix 3d GMEM store swizzle in texture descriptor

  • tu: Add missing WFI to the 3d GMEM store path

  • tu: Save/restore *_BIN_CONTROL in 3d GMEM store path

  • ir3/sched: Fix could_sched() determination

  • tu: Fix linemode for tessellation with isolines

  • ir3: Fix vectorizer condition for SSBOs

  • freedreno: Document a650-specific CP_REG_WRITE flag

  • freedreno/a6xx: Document border color/swizzle interaction

  • freedreno/a6xx: Fix border color swizzling

  • freedreno/a6xx: Use fdl format swizzle

  • freedreno/fdl: Note border color constraints

  • freedreno/a6xx: Fix border color with Z24UINT_S8UINT

  • tu: Fix stencil border color with has_z24uint_s8uint

  • mesa/st: Fix border color type for stencil sampling

  • nv50/ir/ra: Fix copying compound for moves

  • freedreno/a6xx: VPC_SO_NCOMP is actually VPC_SO_BUFFER_STRIDE

  • tu: Use incoherent CCU write for buffer accesses

  • tu: Support resolving D24S8 -> S8

  • tu: Fix resolving d32s8 into s8 on fast path

  • tu/lrz: Do not use framebuffer when inheriting LRZ

  • tu: Treat CP_WAIT_FOR_ME as a cache invalidate

  • tu: Rewrite tess factor emission

  • tu: Move TU_DONT_CARE_AS_LOAD into attachment_set_ops()

  • tu: Disable GMEM for multiview inside tu_render_pass_gmem_config

  • tu: Move tu_render_pass definition up

  • tu: Remove usage of RenderPassBeginInfo

  • tu: Initial support for dynamic rendering

  • tu: Remove has_subpass_predication

  • tu: Update more state with secondaries

  • tu: Split out some state into a separate struct

  • util/trace: Fix u_trace_*_iterator with no tracepoints

  • tu: Add support for suspending and resuming renderpasses

  • tu: Use common vk_image struct

  • tu: Use common vk_image_view base struct

  • tu: Add debug option to use emulated renderpass support

  • tu: Restore formatting of tu_clear_blit.c

  • tu: Fix sysmem depth attachment clear flushing

  • ir3/spill: Fix extracting from a vector at the end of a block

  • tu/lrz: Fix multiple subpass case with secondaries

  • tu/lrz: Fix multiple depth attachment case with secondaries

  • tu: Don’t preload variable-count descriptors

Constantine Shablya (1):

  • anv: set image_read_without_format NIR option on Vulkan 1.3

Corentin Noël (5):

  • mesa: Make sure to fallback to handling the original choose texture format

  • virgl/ci: Update virglrenderer

  • virgl/ci: Update virglrenderer

  • virgl: Add support for passing the supported number of components from virgl

  • virgl: Only progagate the uniform numbers if the numbers are actually right

Cristian Ciocaltea (4):

  • virgl/ci: Rename virgl-lava-* jobs to virgl-iris-*

  • virgl/ci: Update checksums for some virgl iris traces

  • ci: Add CPU frequency adjustment capability

  • ci: Limit Intel CPU scaling frequency for performance tests

Daniel Schürmann (36):

  • nir/lower_alu_to_scalar: don’t set the nir_builder cursor

  • aco: remove ‘max_waves’ and use ‘num_waves’ to adjust for LDS and workgroup size

  • aco: split num_waves adjustment into separate function

  • aco: adjust num_waves for LDS before scheduling

  • aco/optimizer: prevent any overflow between SGPR and const offset on MUBUF

  • aco/ra: fix live-range splits of phi definitions

  • aco: don’t re-create vectors for load_barycentric_* intrinsics

  • aco/ra: improve split_vector register assignment if the operand is not killed

  • aco/optimizer: re-combine and copy-propagate p_create_vector(p_split_vector)

  • radv: move two NIR passes out of tight optimization loop

  • radv: call nir_lower_vars_to_ssa later

  • aco: fix spilling of phis without temp operands

  • nir/opt_vectorize: add callback for max vectorization width

  • nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback

  • nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions

  • aco: correctly validate v_fma_mixhi_f16 register assignment

  • radv: don’t lower vectorized instructions to 32bit

  • radv: use callback for nir_lower_to_scalar

  • radv: vectorize nir_op_fdiv & nir_op_flrp & nir_op_ineg

  • radv: vectorize nir_op_fabs

  • radv: fix swizzles after nir_opt_algebraic_late

  • radv: reorder some NIR passes

  • nir/opt_shrink_vectors: fix re-using of components for vecN

  • radv/shader_info: fix load_frag_coord and load_sample_pos read masks

  • aco: fix packed 16bit fneg/fsat optimization

  • aco: avoid unnecessary copies in emit_wqm()

  • aco: Avoid live-range splits in Exact mode

  • aco: fix assertion in insert_exec_mask

  • spirv: handle phis decorated with RelaxedPrecision

  • radv: Only create noop-fs for graphics pipelines

  • radv: create RT traversal as separate shader

  • radv/rt: use derefs for the traversal stack

  • radv/rt: use stage ID as handle for general and closestHit shaders

  • radv/rt: Don’t load ClosestHit SBT on every hit, but only once after traversal

  • radv/rt: fix nir_builder cursor in lower_rt_instructions()

  • radv/rt: simplify lower_rt_instructions()

Daniel Stone (10):

  • dzn/ci: Don’t spam conformance warnings

  • CI: Disable Windows jobs

  • ci: Allow lima jobs to be globally disabled

  • ci: Also disable Windows container builds when down

  • ci/windows: Eliminate duplicate script definition

  • ci/zink: Add glx-tfp to skips

  • ci/lava: Sleep before, not after, API calls

  • ci: freedreno farm is down

  • CI: Disable Collabora lab

  • Revert “CI: Disable Collabora lab”

Danylo Piliaiev (53):

  • turnip: Try harder to keep LRZ valid and fix a few edge cases

  • freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC

  • turnip: Skip load/stores for tiles with no geometry

  • turnip: Add debug option to print gmem load/store skip stats

  • turnip: Fix tu_debug_flags values clashing

  • tu: Implement VK_EXT_image_view_min_lod

  • tu: Do not flush ccu in clear/blits during renderpass

  • pps: Open writable renderer node in DrmDevice::create

  • tu: Fix indices of drm_msm_gem_submit_cmd when filling them

  • freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0

  • util/u_trace: Add json output

  • docs/u_trace: document u_trace usage

  • ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS

  • ir3: Force late-z if FS has global store/atomic

  • tu: Fix D32S8 resolves into D32 or S8 in sysmem mode

  • spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark

  • turnip: Allow nested CP_COND_REG_EXEC

  • turnip: Fix out-of-bounds builder->shader_iova access

  • tu/autotune: Prevent division by zero

  • turnip: copy disasm strings for pipeline executables

  • tu: Disable sample counting for 3d blits during occlusion query

  • tu: Reset xfb_used at the end of a renderpass

  • meson: Define _GNU_SOURCE for android host system

  • meson/pps: Check if libdrm exists to compile pps

  • meson/tu: Don’t compile libdrm paths if KGSL is selected

  • tu: Do not expose storage image/buffer features for PACK16 formats

  • tu: Use hw binning or sysmem with QUERY_TYPE_PRIMITIVES_GENERATED

  • tu: Don’t count 3d blits in QUERY_TYPE_PRIMITIVES_GENERATED

  • freedreno: Document rest of GRAS_LRZ_CNTL, clarify UNK_25 event

  • freedreno: document GRAS_UNKNOWN_810A

  • tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear

  • ir3: Use NIR’s info.writes_memory to detect when when to force late-z

  • tu,freedreno: Refactored START/STOP events for pipeline stats

  • tu: Fix prim gen query and pipeline stats query interaction

  • zink: re-enable EXT_primitives_generated_query for Turnip

  • util/u_trace: Fix iteration over config_control

  • turnip: Refactor tracepoints generation to reduce duplication

  • turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints

  • freedreno: Add the rest of tracepoints with start/end to perfetto

  • freedreno: Refactor tracepoints generation to reduce duplication

  • freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints

  • docs/u_trace: Add documentation for *_GPU_TRACEPOINT envvars

  • turnip: Disable LRZ fast-clear for gen1 and gen2

  • tu: Do not dereference descriptorSetLayout in push descriptors tmpl

  • zink/turnip: Add comments to already triaged failures for tu baseline

  • turnip: Simple breadcrumbs implementation to debug hangs

  • tu: WFI after PC_TESSFACTOR_ADDR update

  • freedreno: WFI after PC_TESSFACTOR_ADDR update

  • ir3/ra: Always insert interval for precolored inputs

  • ir3: Never remove GS_HEADER_IR3 sysval input

  • tu: Disable LRZ write when alpha-to-coverage is enabled

  • freedreno: Disable LRZ write when alpha-to-coverage is enabled

  • ir3: Prevent reordering movmsk with kill

Dave Airlie (82):

  • clc: add simple llvm initialise API

  • clc: initialise one more llvm stage

  • u_blitter/stencil: take dstbox x/y into accounts for dst fb width

  • util/stencil: fix stencil fallback blit shader texture types.

  • llvmpipe: add user memory resources to the debug global list.

  • gallium/vl: wrap codec support checks in a common function.

  • gallium/omx: add video codec supported hook for decode paths.

  • meson: add a video codec support option

  • gallium/vl: respect the video codecs configure in meson

  • vulkan/wsi: keep allocate queue families in image, just don’t fill them

  • gallivm: move to new pass manager to handle coroutines change.

  • radv: precalculate hs offchip parameters.

  • radv: precalculate tess ring sizes/offsets.

  • radv/ac: introduce a new common function for hs calcs.

  • ac: port radeonsi tess factor calculations to common code.

  • radeonsi: port tess ring calcs to the common helper.

  • zink: avoid extra submits for empty wait cmdbufs.

  • zink: don’t finish barriers cmd buffer if not used.

  • radv: cache the slab bo pointer in the pipeline.

  • radv: only dirty line stipple when necessary.

  • radv: optimised command buffer reset of vertex bindings.

  • aco: move info pointer to a copy.

  • aco: move to a minimal aco shader info struct.

  • aco/info: reduce the gs ring info to what is needed.

  • aco: remove radv specific streamout info

  • aco: convert vs and so info over to aco structs.

  • aco/info: add some more fields.

  • aco/radv: convert to aco shader info at the radv level.

  • aco: remove radv vs prolog key from aco internals.

  • aco: drop unused radv include

  • ac/radv: drop info pointer from the ac and radv shader structs

  • aco/radv: drop radv_nir_compiler_options from aco.

  • llvmpipe: align scratch size to 64-bit size.

  • meson: add build-id to pipe libraries

  • vl: fix codec checks to disable properly

  • lavapipe: fix depth bias offset flag enables.

  • llvmpipe: flush resources for kms swrast path.

  • llvmpipe: handle timespec overflow on fence waits.

  • nouveau: move codegen into a standalone library.

  • nouveau: move codegen to a common higher level directory.

  • nouveau/codegen: drop all ubytes from codegen.

  • nouveau/codegen: drop gallium headers from the interface.

  • llvmpipe: disable alpha test branch if multisample is enabled.

  • egl/x11: split large put image requests to avoid server destroy

  • egl/x11: add missing put_image cookie cleanups

  • wsi/x11: add xcb_put_image support for larger transfers.

  • glx/drisw: use xcb instead of X to query connection

  • draw/cliptest: add support for guard band + full Z

  • zink: workaround depth sampler border color when z24 is z32

  • aco: refactor the radv binary builder out of the core aco fn.

  • aco/radv: provide a callback from aco shader building to build binary

  • aco/radv: provide a vs prolog callback from aco to radv.

  • aco: drop radv_shader.h include

  • radv: add a dynamic vertex format cache.

  • lavapipe: drop unreachable pNext checks.

  • lavapipe: enable variablePointers

  • kms/dri: add mutex lock around map/unmap

  • crocus: fail query begin if upload allocation fails.

  • lavapipe: state latest conformance tests passed

  • llvmpipe: fix aniso cube map arrays.

  • crocus: fix leak in query code.

  • llvmpipe: make last_fence a screen/rast object not a context one.

  • llvmpipe: keep context list and use to track resource usage.

  • gallivm: use progress from subgroup lowering.

  • gallivm: fix casting around read invocation.

  • llvmpipe: export ARB_shader_ballot

  • lavapipe: expose some subgroup extensions

  • zink: fixup subgroup vote/ballot enables.

  • zink/llvmpipe/ci: skip broken CTS ballot test.

  • util: add reallocarray wrapper for win32

  • llvmpipe: Allocate scene tiles dynamically

  • gallium: add a rasterizer state bit for unrestricted depth values.

  • mesa/st: disable unrestricted fragment depth values for GL/GLES

  • util/pack_color: clamp depth values outside range for unorm formats.

  • llvmpipe/fs: handle unrestricted depth values.

  • lavapipe: enable VK_EXT_depth_range_unrestricted.

  • util/reallocarray: add errno.h include.

  • lavapipe: enable VK_FORMAT_R5G5B5A1_UNORM_PACK16

  • gallivm: fix printf hook for cached shaders.

  • draw: don’t touch info values that aren’t valid.

  • nir_to_tgsi_info: drop const_buffers_declared

  • llvmpipe: finish rendering before flushing frontbuffer resources.

David Heidelberg (58):

  • ci: skqp: patch skqp to report also GL results

  • ci: skqp: upgrade to skqp based on Android CTS 11.0 (r7)

  • ci: skqp: update URL

  • ci: intel: Merge anv and iris into src/intel/ci

  • ci/iris: Enable SKQP on Tiger Lake boards

  • ci: traces: switch to Valve trimmed traces

  • ci: traces: re-enable disabled traces which are now trimmed traces

  • ci: traces: virgl: reenable previously crashing Godot trace

  • ci: uprev piglit 2022-05-10

  • ci/iris: skqp: add default GLES rendertests for TGL

  • ci/iris: skqp: remove flaking atlastext for TGL

  • ci/virgl: fix checksum for valve traces which run on iris

  • ci/freedreno: enable ROR and Nheko traces

  • ci/i915: add entries for RoR and Nheko traces

  • ci/intel: add RoR and Nheko traces and reenable most of Valve traces

  • ci/crocus: add RoR and Nheko traces

  • ci/llvmpipe: add RoR and Nheko traces

  • ci/radeonsi: add RoR and Nheko traces

  • ci/virgl: traces: add RoR and Nheko traces

  • ci/panfrost: add RoR and Nheko traces

  • ci/iris: skqp: remove flaking atlastext for TGL (gl version)

  • ci: traces: temporarily disable nheko trace

  • ci/virgl: traces: temporarily disable nheko trace

  • ci: uprev piglit 2022-06-09

  • ci/etnaviv: sort fails.txt

  • ci/etnaviv: add recent fails and remove 3 unexpected passes

  • ci/etnaviv: disable tests which takes too long

  • ci/etnaviv: add recent flakes for gc2000

  • ci/etnaviv: gc2000: drop flakes from fails

  • ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/freedreno: fix A530 glmark2@ideas:speed=10000 trace

  • ci/llvmpipe: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/virgl: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/virgl: iris: disable Counter Strike 1.6 trace flaking

  • ci/crocus: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces

  • ci/iris: disable Blender Diorama trace (incompatible with perf. traces)

  • ci/freedreno: update restricted traces to the brotli compressed version

  • ci/freedreno: smuggle myself into restricted traces access list

  • ci/freedreno: disable non-existing trace

  • ci: Uprev apitrace to 790380e05854

  • ci: traces: switch to brotli compressed traces

  • ci/freedreno: add more restricted traces

  • ci/freedreno: temporary disable AmnesiaTDD

  • ci/freedreno: disable Stellaris trace

  • ci/freedreno: disable SpecOps trace, each run flaky

  • Revert “ci/freedreno: temporary disable AmnesiaTDD”

  • ci/traces: piglit, be more verbose

  • docs: briefly document Gallium Nine

  • ci: add script for manage Mesa CI

  • ci: Allow disabling the whole of the Igalia farm

  • ci: move apitrace instalation outside of x86_test-vk

  • ci: wine-apitrace bump to 11.1

  • ci/freedreno: 3 pixel change in Raven restricted trace

  • nine: char can be unsigned on non-x86 platforms

  • mailmap: fix order of emails for Axel Davy

  • ci: set reasonable amount of wget retries for each download

Dawn Han (6):

  • Add iub features and properties passthrough

  • Refactor the descriptor enums to be extensible

  • Add iub type to vn_descriptor_type Verified the pNext VkWriteDescriptorSet is not NULL on iub type

  • Add the iub binding count tracking

  • Refactor and add template support for iub

  • venus: enabled iub in physical device

Denis Pauk (1):

  • panvk: Return VK_ERROR_INCOMPATIBLE_DRIVER for Midgard

Dmitry Osipenko (1):

  • virgl: Fix unmapping of blob resources

Dylan Baker (74):

  • VERSION: bump to 22.2-devel for next cycle

  • docs: truncate new_features.txt

  • docs: add release notes for 22.0.2

  • docs: Add sha256sums for 22.0.2

  • docs: update calendar and link releases notes for 22.0.2

  • docs: update calendar for 22.1.0-rc1

  • docs: update calendar for 22.1.0-rc2

  • winsys/radeon: remove unused simple_list.h

  • r300: remove unused simple_list.h

  • gallium/rbug: replace simple_list.h with list.h

  • gallium/driver_trace: remove simple_list.h

  • util/list.h: add a function to move an item in a list

  • gallium/util/u_cache.c: remove trailing whitespace

  • gallium/util/u_cache: replace simple_list.h with list.h

  • gallium/draw: remove trailing whitespace

  • gallium/draw: use list.h instead of simple_list.h

  • llvmpipe: cleanup trailing whitespace

  • llvmpipe: replace uses of simple_list.h with list.h

  • util: remove simple_list

  • docs: Extend calendar entries for 22.0 by 1 releases.

  • docs: update calendar and link releases notes for 22.0.3

  • docs: Extend calendar entries for 22.1 by 1 releases.

  • docs: update calendar for 22.1.0-rc3

  • docs: update calendar for 22.1.0-rc4

  • docs: add release notes for 22.0.3

  • docs: Add sh256sum for mesa 20.0.3

  • docs: update calendar and link releases notes for 22.1.0

  • docs: add release notes for 22.1.0

  • relnotes: Add sha256sum and fix minor formatting issues

  • docs: Add calendar entries for 22.1 release.

  • docs: add release notes for 22.0.4

  • docs: add sha256sum to 22.0.4 notes

  • docs: Extend calendar entries for 22.0 by 1 releases.

  • docs: update calendar and link releases notes for 22.0.4

  • docs: add release notes for 22.1.1

  • docs: Add sh256sums for 22.1.1

  • docs: update calendar and link releases notes for 22.1.1

  • docs: add release notes for 22.0.5

  • docs: update sha256 for 22.0.5

  • docs: update calendar and link releases notes for 22.0.5

  • docs: add release notes for 21.3.9

  • docs Add sha256 sums for 21.3.9

  • docs: link releases notes for 21.3.9

  • docs: add release notes for 22.1.2

  • docs: add sha256sum to 22.1.2 notes

  • docs: update calendar and link releases notes for 22.1.2

  • docs: drop 22.1.4 calendar entry.

  • docs: Add calendar entries for 22.2 release candidates.

  • docs: Extend calendar entries for 22.1 by 3 releases.

  • docs: add release notes for 22.1.3

  • docs: add sah256sum for mesa 22.1.3

  • docs: update calendar and link releases notes for 22.1.3

  • docs: Bump 22.2 branchpoint by two weeks

  • anv: add gfx version 12.5 flushes to CCS path

  • iris|anv: gfx version 12.5 data cache flush is not a workaround

  • VERSION: bump 22.3.0-devel

  • VERSION: bump for 22.2.0-rc1

  • .pick_status.json: Update to 8e6bdb2ed31f27c28d258859ec48e8e67f51ff41

  • .pick_status.json: Update to 0a0205f04552ab21971c431a5f8bb7486d664422

  • .pick_status.json: Update to 1a3b086b06c6c1ff63a4abe3752ca0a7a854dc77

  • .pick_status.json: Update to c67e60ae8fefaeadbe06bf0cbd8ad2ee267a6c0e

  • .pick_status.json: Update to 70891edd9728e7dd73115171fdb03622ce6450ba

  • VERSION: update to 22.2.0

  • Revert “VERSION: update to 22.2.0”

  • VERSION: bump to 22.2.0-rc2

  • .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36

  • .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9

  • .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e

  • .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated

  • VERSION: bump to 22.2.0-rc3

  • .pick_status.json: Update to 0c6fbfca0c91ef012e8ab767a317c07f1f6dc5e6

  • .pick_status.json: Update to 8eac45b27446cd9b9eaeb147af97fff1e09832cb

  • .pick_status.json: Update to baf24dea943202b3a92cad0c9f9648597040955a

  • .pick_status.json: Update to 3d4c36a3bcc51ed441b2667d92291bea30ef7449

Eli Schwartz (1):

  • meson: add various generated header dependencies as order-only deps

Ella Stanforth (1):

  • v3dv: Implement VK_KHR_performance_query

Emil Velikov (1):

  • c11: reinstate the original license and authorship

Emma Anholt (232):

  • nouveau/nir: Don’t try to emit OP_FMA pre-nvc0.

  • Revert “ci: Disable Google’s lab”

  • ci/virgl: Add piglit and GL4.3 testing to the virpipe path.

  • nir: Add lowering for fround_even on r300.

  • nir: Skip fround_even on already-integral values.

  • nir: Avoid generating extra ftruncs for array handling.

  • turnip: Add TU_DEBUG=layout for dumping image layouts.

  • freedreno/ir3: Make sched nodes before adding deps.

  • turnip: Add nir_opt_conditional_discard.

  • freedreno/ir3: Call nir_opt_find_array_copies().

  • turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags.

  • turnip: Don’t disable LRZ in subpasses after the first in the easy case.

  • nir_to_tgsi: Add a workaround for virglrenderer TG4.

  • nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs.

  • nir_to_tgsi: Don’t forget to split 64-bit store_per_vertex_output.

  • nir_to_tgsi: Avoid swizzling from undefined channels in load_output.

  • nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs.

  • tgsi: Emit ureg HW_ATOMIC decls in range order.

  • tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW.

  • nine: Disable optional use of TTN when MUL_ZERO_WINS is available.

  • r600: Fix up some mis-indentation of blocks.

  • ci/r600: Manual run updates.

  • r600: Fix reading back from a temp array immediately after writing on RV770.

  • r600/sb: Avoid causing an exception when getting the reciprocal of 0u.

  • r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled.

  • nouveau/nv50: Print the number of loops in shader-db output.

  • ci/nouveau: Add expectations files for GM206.

  • ci/nouveau: Add nv92 xfails.

  • nouveau/nir: Disable bitfield ops pre-nvc0.

  • nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR.

  • nouveau/nv50: Set the primid sysval flag if it’s in the sysval list, too.

  • nouveau/nir: Fix the inverted sense of usesSampleMaskIn.

  • nouveau/nir: Fix edgeflag input detection.

  • nouveau/nir: Set the input for vertex/instance ID like TGSI does.

  • ci/dzn: Cut the runtime by running 1/3 of the enabled tests.

  • nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit.

  • nir_to_tgsi: Lower FS input array indexing since we don’t declare input arrays.

  • tgsi: Extend array lengths when merging usage_masks.

  • virgl: Switch to nir-to-tgsi by default.

  • nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional.

  • nir_to_tgsi: Drop the txl(lod=0)->tex nir_lower_tex() fixup.

  • nouveau/nir: Move FS output stores to the end of the last block.

  • ci/nouveau: Add MESA_GLES_VERSION_OVERRIDE=3.1 baseline state.

  • nouveau/nir: Add support for pre-GF100 images and ssbos.

  • nouveau/nir: Put the UBO offset indirect into the address reg.

  • nouveau: Enable the NIR backend by default.

  • util: Add some unit tests of the half-float conversions.

  • nir: Eliminate out-of-bounds read/writes in local lowering.

  • nir: Don’t assert on tg4 offset range.

  • svga: Add support for requesting NIR and translating to TGSI.

  • ci/iris: Demote APL deqp to manual-only for now.

  • ci/iris: Add a bunch of APL and KBL flakes recently.

  • mesa/st: Always generate NIR from GLSL, and use nir_to_tgsi for TGSI drivers.

  • mesa/st: Remove st_glsl_to_tgsi.

  • mesa/st: Remove TGSI shader program variant handling.

  • mesa/st: Drop the attrib remapping in st_prepare_vertex_program().

  • mesa/st: Remove TGSI-only shader lowering code.

  • tgsi_scan: Remove unused input_array_last/output_array_last.

  • glsl: Remove UBO reference lowering.

  • mesa: Remove unused Mesa IR PROGRAM_* register files.

  • gallium/tgsi: Remove tgsi_emulate.

  • mesa/st: Remove now unused TGSI paths from disk cache.

  • glsl: Retire the non-NIR GLSL linking paths.

  • gallium: Remove now-unused shader caps.

  • glsl: Remove the unused lower_if_to_cond_assign.

  • glsl: Drop the dead MOD_TO_FLOOR path.

  • mesa/st: Clean up a bit of st_prepare_vertex_program().

  • compiler/glsl: Remove the dead parts of build_program_resource_list().

  • glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code.

  • glsl: Remove unused lower_variable_index_to_cond_assign.

  • svga: Set lower_bitops for vpu9 screen.

  • glsl: Stop lowering ir_quadop_vector.

  • nouveau: disable fences when running under drm-shim.

  • Revert “ci: remove nouveau from shader-db runs”

  • ci/iris: Cut the glk-deqp test coverage in half.

  • glsl: Make all drivers take the GLSLOptimizeConservatively path.

  • turnip: Add a TU_DEBUG=perf debug option.

  • ci/crocus: Manual CI updates after CI was down for a bit.

  • ci/crocus: Merge the piglit runs with the deqp runs.

  • i915g/ci: Add depth-clear-precision-check xfails like everyone else.

  • ci/crosvm: Simplify the CID setup.

  • ci/crosvm: Rename VSOCK_TEMP_DIR -> VM_TEMP_DIR.

  • ci/crosvm: Terminate the previous crosvm after a deqp-runner timeout.

  • freedreno/ir3: Fix 16-bit bit_count.

  • spirv: Use nir_vec_scalars() to simplify matrix transpose.

  • spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump.

  • turnip: Make RelaxedPrecision-decorated ALU ops 16-bit.

  • freedreno/ir3: Add support for 16-bit nir_texop_lod.

  • freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit.

  • mesa/st: Only scalarize for doubles lowering if we’re lowering doubles.

  • glsl,nir: Move i/umulExtended lowering to NIR.

  • panfrost: always print the bad ALU op if we’re failing to translate.

  • ci/turnip: Add missing xfails for a618 full VK run

  • nir_lower_mediump: Drop assertion about not containing movs.

  • freedreno/ir3: Enable load/store vectorization for SSBO access, too.

  • freedreno/ir3: Make the types of tex coord padding match.

  • freedreno/ir3: Fix validation of half-precision image store values.

  • freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts.

  • i915g: Ignore the new SEPARABLE_PROGRAM property.

  • ci/i915: Merge the piglit and deqp runs.

  • ci/i915: Update manual piglit job expectations.

  • turnip: Don’t bother creating tile_load/store_cs for sysmem rendering.

  • turnip: Refactor a bit of tu6_emit_tile_select().

  • turnip: Move binning decisions from FB usage time to FB creation time.

  • turnip: Use fb->binning_possible to decide on conditional tile load/stores.

  • ci/turnip: Add a little forced touch-testing of XFB with no binning requested.

  • turnip: Refactor a bit of repeated code for subpass setup.

  • turnip: Include 3d-based CmdClearAttachments() in binning visibility.

  • turnip: Refactor a bit of subpass attachment processing.

  • turnip: Allow load/store skipping in vkCmdClearAttachments().

  • turnip: Move tile loads back into the draw CS.

  • nouveau/nv30: Make sure fsat is lowered in the VS.

  • gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE

  • glsl: Remove stale lower_instructions comments.

  • glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR.

  • tgsi_exec: Fix inf/nan handling for divide by zero.

  • glsl: Drop the div-to-mul-rcp lowering for floats.

  • nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set.

  • glsl: Drop INT_DIV_TO_MUL_RCP lowering.

  • ci/crocus: Drop g41’s xfail for rgba_half_float_oes.

  • ci/iris: Disable skqp until it can be stabilized.

  • turnip: Reverse the order of walking pipes or tiles on odd rows.

  • mesa/st: Abort the linking on driver link failure.

  • .gitignore: Qualify the path for the ignored build directory.

  • nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning.

  • gallium: Rename MUL_ZERO_WINS to LEGACY_MATH_RULES.

  • nir_to_tgsi: Set LEGACY_MATH_RULES cap for use_legacy_math_rules shaders.

  • r600/sfn: Implement 0*x=0 behavior for use_legacy_math_rules.

  • nouveau/nir: Implement mul_zero_wins behavior for use_legacy_math_rules.

  • tgsi_to_nir: Set use_legacy_math_rules for TGSI_PROPERTY_LEGACY_MATH_RULES.

  • gallium: Rename PIPE_CAP_TGSI_LEGACY_MATH_RULES to drop “TGSI”

  • Revert “nine: Disable optional use of TTN when MUL_ZERO_WINS is available.”

  • iris: Enable PIPE_CAP_LEGACY_MATH_RULES.

  • crocus: Enable PIPE_CAP_LEGACY_MATH_RULES.

  • ci/dzn: Skip a test that intermittently times out at a minute.

  • ci/iris: Disable blender-demo-cube_diorama on APL.

  • ci: Terminate capture-devcoredump before tarring up artifacts.

  • turnip: Document some fields about resolves.

  • ci/bare-metal: Apply autopep8 to our python scripts.

  • ci/bare-metal: Add per-boot-stage timeouts for fastboot and poe.

  • ci/bare-metal: Get rid of servo’s serial feed threads.

  • ci/bare-metal: Add handling of netboot firmwares for servo boards.

  • ci/bare-metal: Add test phase timeouts to all boards.

  • ci: Remove the unused .test-manual

  • ci/vc4: Merge quick_shader in with deqp-gles

  • ci/vc4: Turn on deqp-egl testing by default.

  • vc4: Propagate txf_ms’s dest_type to the lowered txf.

  • freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx.

  • freedreno/a5xx: Set the buffer bit appropriately in XS_CTRL_REG0.

  • ci/freedreno: Turn a530 back on by default and update expectations.

  • ci/bare-metal: Remove “stage: test” from .baremetal-test.

  • ci/bare-metal: Consolidate needs declarations in .baremetal-test-*.

  • ci/bare-metal: Collapse artifacts wget by default.

  • ci/traces: GC unused code for DXVK trace replay.

  • ci/traces: Drop ZINK_USE_LAVAPIPE ICD override.

  • ci/freedreno: Add vulkan+gl integration testing in piglit.

  • ci/turnip: Test traces on turnip using zink.

  • ci/turnip: Trim the a630 VK run a bit.

  • ci: Use “!references” to manage scheduled pipeline rules.

  • ci: Pull out farm disables to a reused rule.

  • ci: Pull common zink frontend rules to a shared rule list.

  • ci/freedreno: Use !references to clean up restricted traces rules.

  • ci: Simplify vulkan rules using !references.

  • ci: Split core GL from core VK-or-GL rules.

  • ci/freedreno: Filter when we run test jobs for VK or GL changes.

  • ci/zink: Reuse anv-rules.

  • ci/zink: Simplify lavapipe rules setup, and clarify what the rules are for.

  • ci: Remove the trailing “when: never”s from rules.

  • ir3: Retire the cp postsched pass now that we do RA in SSA.

  • ci: Uprev deqp to 1.3.3.0.

  • ci/llvmpipe: Skip ssbo.layout.random.all_shared_buffer.36

  • ci: Make the retry policy default for all jobs.

  • ci: Rename .ci-run-policy rules to .build-rules.

  • ci: Drop .build-rules from container jobs.

  • ci: Remove .build-rules from core test job definitions.

  • ci: Fix non-freedreno performance jobs running during Marge merges.

  • ci: Remove GIT_STRATEGY from .test-manual-mr.

  • mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0.

  • vc4: Disable OES_texture_3D being exposed.

  • vc4: Move previous existing 3D xfails up to the group of 3d xfails.

  • vc4: Add notes on the remaining dEQP failures.

  • turnip: Account for additional_cs_reserve_size for both bin and render.

  • turnip: Fix up per-stage additional size accounting.

  • turnip: Fix the reservation for vertex inputs.

  • turnip: Refactor vertex input setup a little.

  • ir3: Make sure to pass the interp_mode through in our load_bary lowering.

  • nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3.

  • freedreno: Rename the “SIZE” regs for interpolateAtOffset to “CENTERRHW”

  • ir3: Clarify what’s happening in the interpolateAtOffset() math.

  • ir3: Use non-persp interpolation when appropriate for interpolateAtOffset.

  • zink: Do the timestamp-to-ns math in a double to have better precision.

  • mesa: Fix the error check for VertexAttrib*.

  • vc4: Work around a HW bug with 2-vert line loops.

  • vc4: Add links to test bug reports.

  • kopper: Fix the return value of kopperSetSwapInterval().

  • kopper: Use the swap interval that was set at swapchain creation time.

  • glx: Unify dri2/dri3 vblank_mode logic.

  • kopper: Respect the vblank_mode env var.

  • egl: Just include driconf.h instead of redefining its values.

  • loader: Just include driconf.h instead of redefining its values.

  • ci/freedreno: Mark an occasional flake pass that happens on a530.

  • ir3: Fix the no-emitted-vertex condition emission in geom lowering.

  • mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir.

  • mesa/arbprog: Use nir_lower_io_to_temporaries.

  • mesa/arbprog: Stop doing optimization in the ARB program IR.

  • ci/crocus: Update portal 2 trace shas for the recent fix.

  • ci/crocus: Disable the blender trace.

  • ci/crocus: Drop xfails for the recent image external fix.

  • freedreno/ir3: Enable core NIR’s 16-bit ALU optimizations.

  • intel/fs: Simplify brw_barycentric_mode() args.

  • ci/bare-metal: Add timeouts to the shell commands called in fastboot.

  • ci/freedreno: Switch a630 to manual/disabled for lab maintenance.

  • glsl: Use the same NIR path for shared mem lowering as SPIRV does.

  • glsl: Remove optimize_swizzles.

  • ci/turnip: Add a bit of spilling-vs-ballot testing on a618.

  • ci/freedreno: Add some more known flakes for a630 from our IRC logs.

  • ci/turnip: Bump up the a630 full run timeout.

  • ci/turnip: Add a couple of missing a630 fails.

  • Revert “ci/freedreno: Switch a630 to manual/disabled for lab maintenance.”

  • nir: Split usub_sat lowering flag from uadd_sat.

  • ci/crocus: Fix the commented YML for blender-demo-cube_diorama.

  • gallivm: Refactor a bit of UBO/SSBO range checking.

  • gallivm: Optimize single-invocation SSBO stores.

  • ci/virgl: Make the VM-based virgl tests report to #virgl-ci, too.

  • nir: Add an opt pass for phis after if choosing between true/false.

  • glsl: Remove opt_conditional_discard().

  • nir: Use nir_foreach_phi_src consistently.

  • ci/freedreno: Update a630 s8 resolve xfails.

  • turnip: Split the tiling config into separate layouts based on CCU usage.

  • turnip: Use the GMEM CCU space for attachments when the stores won’t.

  • zink: Make sure that we keep the existing ici pNext chain on inserts.

  • spirv: Mark phis as mediump instead of directly lowering them to 16 bit.

Enrico Galli (6):

  • microsoft/spirv_to_dxil: Add pass to lower dynamic accesses on ubo[1]

  • microsoft/compiler: Fix for arrays of 1 element

  • microsoft/compiler: Add support for unnamed ubos

  • microsoft/compiler: Fix when using a shadow sampler more than once

  • microsoft/spirv_to_dxil: Fix discard semantics

  • microsoft/spirv_to_dxil: Fix discard to demote + return pass

Eric Engestrom (36):

  • meson: replace deprecated dep.get_pkgconfig_variable(…) with dep.get_variable(pkgconfig : …)

  • docs: show and link to the new macOS CI

  • broadcom/rpi4-skips: drop duplicated lines

  • mailmap: update my email to Igalia

  • vulkan: add script to detect & replace identifiers that have been renamed

  • vulkan/runtime: use updated tokens from vk.xml

  • vulkan: use updated tokens from vk.xml

  • nir: use updated tokens from vk.xml

  • radv: use updated tokens from vk.xml

  • v3dv: use updated tokens from vk.xml

  • turnip: use updated tokens from vk.xml

  • zink: use updated tokens from vk.xml

  • lavapipe: use updated tokens from vk.xml

  • pvr: use updated tokens from vk.xml

  • anv: use updated tokens from vk.xml

  • dzn: use updated tokens from vk.xml

  • panvk: use updated tokens from vk.xml

  • venus: use updated tokens from vk.xml

  • zink: fix portability_subset usage after rename from EXTX to KHR

  • zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR

  • zink: add missing guards around `have_{ext}`

  • docs/isl: fix typos & formatting

  • vk/util: handful of pythonic cleanups

  • vk/util: simplify extensions gen code

  • util/list: rename LIST_ENTRY() to list_entry()

  • nouveau: use existing list_first_entry() macro instead of re-writing it

  • docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip

  • docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv

  • docs/ci: fix Emma’s name

  • turnip: expose support for VK_EXT_acquire_drm_display

  • ci: add `–print-errorlogs` to `meson test`

  • bin/gen_release_notes.py: bump advertised vulkan version to 1.3

  • vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties

  • wsi/x11: fix memleak in wsi_x11_connection_create()

  • meson: replace manual compiler flags with meson arguments

  • broadcom: fix dependencies in static_library() calls

Erico Nunes (11):

  • ci: put lima farm back online

  • docs/ci: fix literalinclude of caching setup files

  • lima/ppir: optimize branch comparison when possible

  • lima/ppir: merge branch nodes in simple cases

  • egl: Fix DETECT_OS macro usage

  • ci: update linux kernel to v5.17

  • ci: update docs for linux kernel uprevs

  • lima/ci: update piglit ci expectations

  • lima/ci: reenable lima-mali450-piglit-gpu:arm64

  • lima/ci: Don’t skip piglit opengl 1 tests

  • ci: Revert “CI: Lima farm is offline”

Erik Faye-Lund (157):

  • gallium: rename continue shader-cap

  • gallium: rename dround shader-cap

  • gallium: rename dfracexp/dldexp shader-cap

  • gallium: rename ldexp shader-cap

  • vulkan: explicitly cast object-type enum

  • meson: turn on -Wno-unused-function project-wide

  • nir: introduce and use nir_component_mask

  • dzn: fixup indent

  • dzn: remove all usage of ComPtr<T>

  • dzn: always use ID3D12Device1

  • dzn: always use ID3D12GraphicsCommandList1

  • dzn: use ID3D10Blob instead of ID3DBlob

  • dzn: pass IDXGIAdapter1 to d3d12_create_device

  • dzn: port to d3d12 c-api

  • dzn: add a bunch of missing struct-keywords

  • dzn: drop auto usage

  • dzn: drop using references

  • dzn: no more reinterpret_cast

  • dzn: use define instead of constexpr

  • dzn: use c-style for-statement

  • dzn: use c-style initialization

  • dzn: c-style casts

  • dzn: do not set unused default member initializer

  • dzn: port code to plain c

  • dzn: merge util sources

  • dzn: remove needless defines

  • dzn: remove some needless casts

  • dzn: clean up pointer syntax

  • dzn: add space before += operator

  • dzn: drop needless enum-casts

  • dzn: fixup indent

  • dzn: remove stale cpp_args

  • nir/lower_int64: do not try to clamp floats to int-range

  • vulkan: do not depend on alignof(void)

  • lavapipe: quiet non-conformant warning on ci

  • panvk: quiet non-conformant warning on ci

  • vulkan: use c_msvc_compat_args for shared code

  • vulkan: drop empty vulkan_wsi_args

  • microsoft/clc: fixup indentation

  • microsoft/compiler: fixup indentation

  • meson: remove unused defines

  • meson: deprecate specifying osmesa-bits

  • gallium/xlib: fix stale comment

  • mesa: add missing error-path

  • util: fix test on msvc

  • lavapipe: tolerate NULL UBO and SSBO descriptors

  • lavapipe: tolerate NULL image view descriptors

  • lavapipe: tolerate NULL image-buffer view descriptors

  • lavapipe: tolerate NULL sampler-buffer view descriptors

  • lavapipe: add support for null-descriptors from EXT_robustness2

  • zink: pass screen to extension getter macro

  • zink: initialize dispatch-table before queue-init

  • zink: call vk-functions through dispatch-table

  • zink: pass screen to zink_create_instance

  • zink: dynamically load a few functions

  • zink: use run-time linking to loader

  • dzn: reset correct list

  • vulkan/wsi: unbreak win32-support

  • d3d12: add missing cmath includes

  • tests/graw: rename shaders from .sh to .txt

  • dzn: add a debug-flag to wait for the debugger

  • dzn: add debug option to redirect stdout/stderr

  • editorconfig: remove scons-ism

  • editorconfig: remove html-rule

  • editorconfig: remove m4-rule

  • editorconfig: remove pl-rule

  • radeonsi: port amdgcn_glslc build to meson

  • freedreno: remove stale makefile

  • intel: remove stale makefile

  • meson: modernize win_flex stdint.h logic

  • c99_compat.h: remove inline and __func__ shims

  • util: add missing c99_compat.h includes

  • util: remove needless c99_compat.h includes

  • egl: remove needless c99_compat.h includes

  • gallium/util: remove needless c99_compat.h includes

  • mapi: remove needless c99_compat.h includes

  • gallium: remove needless c99_compat.h includes

  • include: remove needless c99_compat.h includes

  • intel/isl: remove needless c99_compat.h includes

  • vulkan/device_select: remove needless c99_compat.h includes

  • include: drop c99_math.h

  • include: drop c11_compat.h

  • pvr: do not use c_msvc_compat_args

  • turnip: do not do STATIC_ASSERT on a variable

  • freedreno: do not use variable in STATIC_ASSERT

  • freedreno: do not do STATIC_ASSERT on variables

  • iris: do not do STATIC_ASSERT on variables

  • freedreno: degrade STATIC_ASSERT to assert

  • v3dv: do not do STATIC_ASSERT on variables

  • util: add IS_POT macro

  • intel/compiler: use macro for power-of-two check

  • broadcom/compiler: use macro for power-of-two check

  • util: use static_assert directly

  • util: implement STATIC_ASSERT using c++11 / c11 primitives

  • meson: add back -Werror=vla to msvc compat flags

  • ci/windows: add back build-error detection

  • d3d12: move cubemap-lowering to common-code

  • microsoft/compiler: mark image-functions as such

  • microsoft/compiler: make sampler-lowering optional

  • microsoft/spirv_to_dxil: lower cube-images to 2d arrays

  • dzn: enable binding_model tests

  • dzn: correct assert-condition

  • dzn: handle stencil-attachment-optimal layout

  • dzn: use a null-rtv to handle no-attachment

  • dzn: enable two more exts

  • util: sort includes

  • llvmpipe: assume c99

  • c99_compat.h: assume c99 support

  • util: use c11 alignof instead of our own

  • glcpp: remove outdated msvc hack

  • vulkan/util: assume c11

  • util: use c11 alignas instead of rolling our own

  • gallium: remove stale comment

  • gallium: reorganize includes

  • llvmpipe: use c11 alignas instead of PIPE_ALIGN_VAR

  • gallium: use c11 alignas instead of PIPE_ALIGN_VAR

  • gallium: use c++11 alignas instead of PIPE_ALIGN_VAR

  • gallium: remove repeated check

  • gallium: remove unused macros

  • gallium: clean up PIPE_ALIGN_STACK-definition

  • docs/zink: correct requirement for tbos

  • dzn: correct order of src_subres and dst_subres args

  • docs: set language to english

  • dzn: fill in driver name and info

  • dzn: fill in float-control details

  • dzn: fill in non-uniform-indexing props

  • dzn: fill in bindless props

  • dzn: fill in depth/stencil resolve props

  • dzn: fill in minmax props

  • dzn: fill misc props

  • dzn: expose VK_KHR_driver_properties

  • zink: remove pointless test

  • zink: remove pointless assert

  • zink: drop pointless comment

  • llvmpipe: merge all rasterizer-bind functions

  • llvmpipe: consolidate legacy_points and multisample state

  • llvmpipe: clean up bresenham implementation

  • docs/zink: correct name of extension

  • docs/zink: document required VK_KHR_swapchain_mutable_format

  • docs/zink: document required VK_EXT_border_color_swizzle

  • zink: warn about missing image2DViewOf3D feature

  • zink: fix EXT_color_write_enable check

  • gallium/hud: do not use texture-rect for font

  • llvmpipe: remove unused function

  • zink: clean up extension emitting

  • zink: set dynamic indexing caps

  • zink: more accurately set {Sampled,Image}1D caps

  • zink: more accurately set {Sampled,Image}Buffer caps

  • zink: more accurately set ImageQuery cap

  • docs: fixup link to virgl docs

  • zink: add have_D32_SFLOAT_S8_UINT boolean

  • zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking

  • zink: type_main -> type_void_func

  • zink: add spirv_builder_function_call

  • zink: wrap discard in a function

  • zink: clamp miplodbias when creating sampler

  • docs/zink: document rgtc requirement

Fabrice Fontaine (1):

  • src/util/futex.h: fix build on 32-bit architectures using 64-bit time_t

Feng Jiang (1):

  • virgl: do not share virgl_screen between different drm_files

Filip Gawin (5):

  • r300: Print warning when stubbing derivatives

  • r300: keep negation if w is an inline constant

  • r300: don’t check for unitialized reads when rewriting register

  • r300: prefer old not native swizzle in constant folding

  • r300: don’t read from output transform_r300_vertex_SEQ/SNE

Francisco Jerez (2):

  • intel/dev: Compute pixel pipe information based on geometry topology DRM query.

  • intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC.

Frank Binns (3):

  • pvr: replace p_compiler.h boolean with stdbool

  • pvr: shorten error to err in label names

  • pvr: Add TI AM62 as a supported device.

Georg Lehmann (74):

  • aco: Remove some old comments in aco_opcodes.py.

  • aco: Reuse previous -1 result in find_msb to avoid using VOP3.

  • nir/lower_mediump: Add a new pass to fold 16bit image load/store.

  • aco: Implement image_store d16.

  • aco: Implement image_load d16.

  • radv: Use nir_fold_16bit_image_load_store_conversions.

  • radv: Run copy_prop and dce after folding 16bit sampling/load/store.

  • aco: Remove D16 zero components from image stores.

  • meson: Tell glslang to be quiet.

  • radeonsi: Use nir_fold_16bit_image_load_store_conversions.

  • wsi/x11: Don’t leak xcb_get_geometry_reply_t.

  • nir/opt_algebraic: Fix mask in shift by constant combining.

  • nir/opt_algebraic: Simple xor/ishr optimizations.

  • radv: Don’t retile read-only images.

  • nir/opt_algebraic: Optimize Doom Eternal’s word extract by LSB.

  • aco: Implement uclz.

  • aco: Implement usub_sat.

  • aco: Implement isub_sat.

  • ac/llvm: Implement usub_sat and isub_sat.

  • ac/llvm: Implement uclz.

  • nir: Add optional lowering for mul_32x16.

  • radv: Lower mul_32x16.

  • radv: Lower hadd/hradd.

  • radv: Lower 8bit isub_sat/usub_sat.

  • radv: Enable VK_INTEL_shader_integer_functions2.

  • radv, aco: Packed usub_sat/isub_sat.

  • zink: Use VK_USE_64_BIT_PTR_DEFINES to check for 64bit platforms.

  • radv, aco: Lower txf offset in NIR.

  • nir/lower_tex: Add lower_array_layer_round_even option.

  • radv, aco: Round texture array layer in NIR.

  • vulkan: Spec update to 1.3.217.

  • radv: Implement VK_EXT_non_seamless_cube_map.

  • aco: Print r128/a16 MIMG bits separately.

  • aco: Remove r128_a16 MIMG builder option.

  • anv: Implement VK_EXT_non_seamless_cube_map.

  • radv: Remove redundant memset in radv_descriptor_set_create.

  • radv: Check descriptor pool entry count before allocating a new set.

  • radv: Don’t check if we need to copy immutable samplers for non push templates.

  • aco/ra: Static assert that changing instruction type to VOP2 is valid.

  • aco/ra: Move mac encoding optimization to its own function.

  • aco/ir: Pad SOP2 and SOPC to the same size as SOPK.

  • aco/ra: Optimize some SOP2 instructions with literal to SOPK.

  • aco/optimizer: Convert s_add_u32 with literals to s_add_i32 if carry is not used.

  • aco/optimizer: Optimize SOPC with literal to SOPK.

  • r600/sfn: Add missing std::array include.

  • aco: Fix swapping sources in SOPC -> SOPK optimization.

  • aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9.

  • nir_to_tgsi: Lower uadd_sat/usub_sat.

  • zink: Lower uadd_sat/usub_sat.

  • r600: Lower uadd_sat/usub_sat.

  • ir3: Implement [iu]sub_sat.

  • nir/lower_io: Fix array length of buffers larger than INT32_MAX.

  • nir/opt_algebraic: Optimize [ui](add|sub)_sat with 0.

  • radv,aco: Don’t lower and vectorize 16bit iabs.

  • radv: Remove ineg from lower_bit_size_callback callback.

  • radv, aco: Don’t lower 16bit isign.

  • ir3: Only run 16bit tex NIR passes on a5xx+.

  • ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress.

  • nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass.

  • nir/lower_mediump: Add an option to only fold if all tex sources can be folded.

  • ir3: Stop using nir_legalize_16bit_sampler_srcs.

  • aco: Add G16 opcodes.

  • aco: Support 16bit sources for texture ops.

  • radv: Fold 16bit tex sources.

  • aco: Add a note about G16 without A16 to README-ISA.md.

  • aco/ir: Generalize (un)ordered_swapped.

  • aco/ir: Add integer get_cmp_info.

  • aco/ir: Fix swapped nle.

  • aco/ir: Add vcmpx opcode to get_cmp_info.

  • aco/ir: Add v_cmp_class to get_cmp_info.

  • aco/ir: Add swapped opcode for v_cmp_u/v_cmp_o.

  • radeonsi: Stop using nir_legalize_16bit_sampler_srcs.

  • nir/fold_16bit_tex_image: Default to only_fold_all.

  • aco: Force tex operand to have the correct sub dword size before packing.

Gert Wollny (85):

  • virgl: Extend integer write out output fix to all non-move integers ops

  • r600/sfn: Fix store_shared_r600 write masks

  • ntt: Add option to not optimized register allocation

  • nir: Add pass to split 64 bit vec3 and vec4 variable access and phis

  • ntt: Split 64 bit vec3 and vec4 local variables and phis

  • virgl: Clear all indirect flags when writing to a temporary

  • nir: Don’t optimize to 64 bit fsub if the driver doesn’t support it

  • ntt: remove dead input variables before lowering FS IO

  • r600/sb: Don’t optimize float GT and GE

  • r600: tune nir options

  • r600: Add support for TGSI_OPCODE_ATOMIMIN and IMAX

  • virgl: don’t move input gl_SampleMaskIn to a temp

  • r600: Allow eight bit, three channel formats for vertex buffers

  • nir: honor lower_double options for ffloor and ffract

  • r600/sb: Don’t create three source ops with all kcache values

  • r600/sb: Fall back to un-optimized shader if scheduling fails

  • virgl: Don’t let ntt optimize the register allocation

  • tgsi: Don’t bother setting the TGSI_PROPERTY_SEPARABLE_PROGRAM

  • virgl: Submit the TGSI_PROPERTY_SEPARABLE_PROGRAM when the host understands it

  • r600: Add number of ALU groups to statistics

  • r600: Make sure that LDS instructions only use bank swizzle 012

  • r600: Update nir options

  • r600/sfn: rewrite NIR backend

  • r600: enable sb by default also for NIR

  • r600: Cleanup nir options

  • r600/sfn: support nir_op_mulz and legazy math rules

  • r600/sfn: Add missing copyright information

  • r600/sfn: Don’t use assert to test success of operation

  • r600: limit loops when trying to merge alu groups

  • r600/sfn: Never consider an op with register dest as dead

  • r600/sfn: Add support for fdph

  • r600/sfn: lower CLIPVERTEX to clip planes

  • r600/sfn: emulate pmr::monotonic_buffer_resource if needed

  • r600/sfn: clean up some commented out code

  • r600/sfn: fix storing shader output array locations

  • r600/sfn: lower undef to zero

  • r600/sfn: count LDS queue pop reads separately in assembler

  • r600/sfn: Fix the kcache failure handling

  • r600/sfn: Fix emmision of LDS instructions

  • r600/sfn: Fix used of local shared write mask

  • r600/sfn: Fix streamout with non-zero component write

  • r600: don’t dump shader info to files on debug

  • r600/sfn: make sure the memory pool is released after shader translation

  • r600/sfn: lower the right shader variant

  • r600: Delete possible old NIR variant when translating from TGSI

  • r600: don’t clone the nir shader, just use it

  • r600ß/sfn: Don’t fall through with VS outputs

  • r600/dri-tweak: Disable ARB_shader_image_load_store for “Tomb Raider 2013”

  • r600/sb: fall back to unoptimized shader if RA fails

  • r600: Don’t set TES_EVAL UCP dirty when it is not supported

  • virgL: lower fneg and fabs

  • virgl: honor host caps in shader cash sha evaluation

  • r600/sfn: Factor out fragement shader class for EG

  • r600/sfn: Support pre-EG style FS input

  • r600/sfn: Prepare StreamOut instruction for pre EG opcodes

  • r600/sfn: Handle slot differences between pre-EG and EG

  • r600/sfn: Handle chipclass with kcache allocation

  • r600/sfn: Handle buffer size query on pre-EG

  • r600/sfn: Handle pre-EG GS input fetch

  • r600/sfn: Handle primitive input handling for pre-EG

  • r600/sfn: Handle non-compressed MS texel fetch

  • r600/sfn: Set trans flag for flt_to_int on pre-EG

  • r600/sfn: Handle pre-EG buffer fetch

  • r600/sfn: Disable FP64 for pre-CEDAR

  • r600/sfn: Advertise support for fmulz

  • r600/sfn: Enable NIR for pre RG hardware

  • r600/sfn: remove leftover debug output

  • r600/sfn: Handle R600 class sin/cos

  • r600: increase possible stack size in binary code

  • r600/sfn: Schedule shift instruction on R600 in t-slot

  • r600/sfn: Add GS thread fix just like the TGSI code path

  • r600/sfn: Sort FS inputs to make interpolated values come first

  • r600/sfn: Fix color outputs when color0 writes all

  • r600: Force NOPs when loading AR on R600 class hardware

  • r600/sfn: Handle R600 scratch read

  • r600: Don’t use SB with R600 style scratch reads

  • r600/sfn: Handle color0 writes all on R700 like on EG

  • r600/sfn: Don’t tag mem-ring and stream instructions as exports

  • r600/sfn: Don’t schedule GDS instructions early

  • r600/sfn: Don’t scan the whole block for ready instructions

  • r600/sfn: Use a heuristic to keep SSBO setup and store close

  • r600: Fix reporting TGSI IR support

  • r600/sfn: Use a low number for unused target register

  • virgl: when reading back wait first, then do the transfer

  • r600/sfn: override register ID when it doesn’t matter

Glenn Kennard (1):

  • nv30: Fix non-scissored clears after a scissor has been set

Greg Depoire–Ferrer (5):

  • util/queue: add missing space to comment in util_queue_destroy

  • zink: check for error when initializing util_queue

  • lavapipe: check for error when initializing lvp_queue

  • util/queue: handle thread cration failure in util_queue_adjust_num_threads

  • zink: fix crash on dlclose when zink_internal_create_screen failed

Guilherme Gallo (48):

  • ci/lava: Encapsulate job data in a class

  • ci/lava: Filter log lines from LAVA return

  • ci/lava: Return test-suite result as exit code

  • ci/lava: Fix LAVA job validation

  • ci/lava: Set lava-signal to kmsg

  • ci/lava: Always validate the lava job

  • ci/lava: Use lava-test-case to run custom scripts in LAVA

  • ci/lava: Trap init-stage2.sh background processes

  • ci/lava: Fix shebang in deqp-runner.sh

  • ci/lava: Parse all test cases from 0_mesa suite

  • ci/lava: Let LAVA job submitter run without JWT file

  • ci/lava: Reduce LAVA boot phase timeout to 3 minutes

  • ci/lava: Cancel the job if the script is interrupted

  • ci/lava: Improve exception handling

  • ci/lava: Retry when data fetching log RPC call is corrupted

  • ci: Fix init-stage2 exit code

  • ci: Update trace after CI fix

  • ci: Fix tests expectations

  • ci/lava: Make job submitter parse the job result

  • ci/lava: Fix colored LAVA outputs

  • ci/lava: Fix Gitlab Section markers

  • ci/lava: Add support for more complex color codes

  • ci/lava: Filter out undesired messages

  • ci/lava: Improve result parsing regex

  • ci/lava: Create Gitlab log sections handler

  • ci/lava: Create LogFollower and move logging methods

  • ci/lava: Follow job execution via LogFollower

  • ci/lava: heartbeat: don’t consider kernel message logs

  • ci/lava: Wrap job info into a collapsed section

  • ci/lava: Wrap job definition dump into a collapsed section

  • ci/lava: Add an integration test for LAVA jobs

  • ci/lava: Don’t print LAVA debug messages

  • ci/lava: Highlight kernel messages in bold

  • ci/lava: Highlight job retrying message

  • ci/lava: Stop printing after the result line

  • ci/lava: Flexibilize section marker regexes

  • ci/lava: Rename console color names

  • ci/lava: Update license header

  • ci/lava: Split lava_log into modules

  • ci/lava: Detect R8152 issues preemptively and retry

  • ci/lava: Make hung job status yellow

  • ci/lava: Color red for fatal and yellow for warning

  • ci/lava: Add `slow` pytest marker

  • ci/lava: Add canceled job status

  • ci/lava: Add delay before calling lava-test-case

  • ci/lava: Increase boot timeout

  • ci/lava: Only parse result within testcase section

  • ci/lava: Remove MINIO_RESULTS_UPLOAD if requested

Hans-Kristian Arntzen (4):

  • radv: Fix RTPSO hashing of pGroups.

  • vulkan: Update Vulkan XML and headers to 1.3.219.

  • vulkan: Add common code for VK_EXT_shader_module_identifier.

  • radv: Implement VK_EXT_shader_module_identifier.

Hoe Hao Cheng (4):

  • zink/codegen: remember the fields in feats/props structs of extensions

  • zink/codegen: do not include compilation structs with extension structs

  • zink/codegen: do not automatically consider extensions promoted

  • zink/codegen: use the updated identifiers

Hyunjun Ko (15):

  • radv: fix assertion for the count of xfb output

  • turnip: add an assertion for max descriptor set count.

  • ir3: handle intrinsic_load_draw_id when scanning driver constants

  • turnip: clamp to 1 for negative bottom-right of viewport.

  • turnip: fix wrong size for lrz cs when lrz_track_quirk is enabled.

  • freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE

  • turnip: keep the depth_clip_disable state at the pipeline builder.

  • zink: remove the workaround for depth_clip_enable on turnip

  • turnip: Remove an unnecessary assert.

  • turnip: expose VK_EXT_shader_module_identifier

  • ir3: change maximum size of const files.

  • ir3: handle shared consts.

  • turnip: clean up unused parameters for user consts.

  • turnip: Change to use shared consts for PushConstants

  • tu: increase maxPushConstantsSize to 256.

Iago Toral Quiroga (79):

  • v3dv: ignore no-op barriers

  • v3dv: drop obsolete comment

  • v3dv: don’t emit final noop job if we don’t have anything to signal

  • v3dv: document Vulkan requirements for signaling operations

  • v3dv: ignore barriers for image layout transitions from undefined layouts

  • v3dv: drop default sampler states if not used

  • v3dv: flag BCL barriers for all-graphics and all-commands stages

  • v3dv: skip binning sync if binning shaders don’t access external resources

  • v3dv: use wait stage for wait semaphores to decide about binning syncs

  • v3dv: check input attachment usage as sampled usage

  • v3dv: implement VK_EXT_separate_stencil_usage

  • v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts

  • docs/features: flag VK_KHR_timeline_semaphore as implemented

  • v3dv: expose separateDepthStencilLayouts

  • v3dv: implement vkGetPipelineExecutableInternalRepresentationsKHR

  • v3dv: implement vkGetPipelineExecutablePropertiesKHR

  • v3d/compiler: add more stats to prog_data

  • v3dv: implement vkGetPipelineExecutableStatisticsKHR

  • v3dv: expose VK_KHR_pipeline_executable_properties

  • v3dv: don’t leak variant QPU when pipeline compile fails

  • v3dv: drop unsused struct

  • v3dv: expose VK_KHR_shader_float_controls

  • v3dv: expose VK_KHR_spirv_1_4

  • v3dv: expose VK_KHR_vulkan_memory_model

  • v3dv: fix format usage checks when extended usage flag is set

  • broadcom/compiler: don’t promote early fragment tests when writing sample mask

  • v3dv: consume barriers at the right stages

  • v3dv: only clear BCL barrier state if we don’t have pending graphics barriers

  • v3dv: use an explicit struct type to track barrier state

  • v3dv: track sources of barriers

  • v3dv: limit sync for barriers to hw queues selected by source mask

  • v3dv: don’t be too aggressive disabling early Z

  • v3dv: use the global RCL EZ disable if we don’t have any EZ draws in the job

  • broadcom/compiler: disable flags optimization for loop conditions

  • v3dv: merge pending secondary barrier state into primary command buffers

  • v3dv: handle barriers at the end of a command buffer

  • v3dv: fix leak

  • broadcom/compiler: fix postponed TMU spills with multiple writes

  • broadcom/compiler: don’t predicate postponed spills

  • v3dv: fix pool descriptor count for inline uniform buffers

  • v3dv: fix merge jobs

  • nir: fix documentation for uadd_carry and usub_borry opcodes

  • v3dv: don’t lower uadd_carry and usub_borrow

  • v3dv: remove obsolete comment

  • nir/serialize: fix missing divergence info after deserialization

  • broadcom/compiler: don’t try to hide TMU latency at QPU scheduling

  • broadcom/compiler: don’t over-estimate latency of TMU instructions

  • v3dv: fix comment in texel buffer shader copy path

  • v3dv: remove unused lowering for nir_intrinsic_load_layer_id

  • v3dv: fix comment for point_sprite_mask filed in shader key

  • v3dv: clean up get_internal_type_bpp_for_image_aspects

  • v3d,v3dv: stop hardcoding various image limits

  • v3d: stop using a smaller texture limit in OpenGL

  • v3d,v3dv: stop copying and pasting the translate_swizzle helper

  • v3dv: only save/restore push constant data for meta operations if needed

  • v3dv: move push constant data to the command buffer state

  • v3dv: limit upload of indirect push constant data

  • v3dv: don’t allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO

  • v3dv: stop tracking push constant buffer references

  • v3dv: allocate more push constant buffers if needed

  • v3dv: add a dirty state for pending push constants UBO updates

  • v3dv: explain why we clear certain state after a draw call

  • v3dv: remove duplicate condition

  • nir: add nir_address_format_2x32bit_global

  • broadcom/compiler: implement 2x32 global intrinsics

  • broadcom/compiler: track if a shader uses global intrinsics

  • v3dv: implement VK_KHR_buffer_device_address

  • v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT

  • v3dv: remove assert that poolSizeCount must be larger than 0

  • v3dv: fix offset reported by vkGetImageSubresourceLayout

  • v3dv: remove unnecesary FIXME comment

  • v3d,v3dv: add a common v3d_hw_prim_type helper

  • v3dv: only check binning sync for semaphores for the first CL job

  • v3dv: explicitly set sampler min/max filter properties to false

  • v3dv: fill in Vulkan 1.2 missing device properties

  • v3dv: enable missing Vulkan 1.2 features for supported extensions

  • v3dv: enable missing mandatory Vulkan 1.2 features

  • broadcom/ci: update list of slow tests

  • v3dv: expose Vulkan 1.2

Ian Romanick (26):

  • glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp

  • glsl: Add flag to disable part of do_vec_index_to_cond_assign

  • nir: Use nir_vector_extract to generate code for ir_binop_vector_extract

  • glsl: Delete lower_extracts code

  • anv: Remove FS executables when applying the null FS optimization

  • intel/fs: Add missing synchronization for WaW dependency

  • nir: i32csel opcodes should compare with integer zero

  • nir/algebraic: Fix NaN-unsafe fcsel patterns

  • nir: Add and use algebraic property “is selection”

  • nir/range_analysis: Teach range analysis about fdot opcodes

  • intel/compiler: Rename vec4 state URB opcodes to have VEC4_ prefix

  • intel/eu: Validate some aspects of URB messages

  • intel/compiler: Move logical-send lowering to a separate file

  • intel/fs: Add _LOGICAL versions of URB messages

  • intel/fs: Lower URB messages to SEND

  • intel/fs: Remove non-_LOGICAL URB messages

  • intel/vec4: Set lower_usub_sat

  • spirv: Fix array length of buffers larger than INT32_MAX.

  • intel/fs: Make logical URB write instructions more like other logical instructions

  • intel/fs: Make logical URB read instructions more like other logical instructions

  • intel/fs: Don’t pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send

  • intel/fs: Eliminate “masked” and “per slot offset” URB messages

  • intel/fs: Use canonical form for “work around” tags

  • nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident

  • radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering

  • glsl: Remove lower_offset_arrays pass

Icecream95 (45):

  • clc: Use stringstream for printing spirv errors

  • panfrost: Stop overallocating compressed textures

  • panfrost: Enable NIR lowering of half float packing

  • panfrost: Fix pack_32_2x16 implementation

  • pan/bi: Use texture index instead of sampler for message preloading

  • pan/va: Improvements to LEA_TEX instructions

  • pan/va: Add more transcendental operations

  • pan/va: Add SEG_ADD/SEG_SUB operation

  • pan/va: Add absneg modifier to V2F32_TO_V2F16

  • pan/va: Improve texture instructions

  • nir/lower_tex: Copy more fields in lower_tex_to_txd and friends

  • pan/mdg: Keep min_bound at 16 when alignment requires it

  • pan/mdg: Use MAX2 to set min_alignment

  • pan/mdg: Fix mask usage when filling before a spill

  • pan/mdg: Fix disassembly of store instructions

  • pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled

  • pan/mdg: Fix multiple spilt writes in the same bundle

  • panfrost: Copy blend constant into variant even when reusing it

  • panfrost: Increase the limit for blend shader variants

  • panfrost: Move patched_s out of the pan_blitter_views struct

  • nir: Add store_combined_output_pan BASE back

  • pan/bi: Read base for combined stores

  • pan/mdg: Read base for combined stores

  • panfrost: Only write depth / stencil once if MRT is used

  • pan/bi: Reverse linear constraint bits

  • pan/bi: Add nodearray datastructure

  • pan/bi: Use nodearrays for linear constraints

  • panfrost: Constant stencil value tracking

  • panfrost: Stop using sparse_array for batch BOs

  • panfrost: Use a macro for checking for a shared bind type

  • panfrost: Remove sync arguments from panfrost_batch_submit

  • panfrost: Enable QUAD_STRIP and POLYGON on v6

  • pan/va: Use the _safe iterator when adding blend shader calls

  • pan/decode: fflush buffers after dumping and before aborts

  • pan/decode: Use tag bits for resource entry count

  • pan/decode: Change indent when decoding resources

  • panfrost: Only emit images when they are present

  • panfrost: Emit the correct number of attributes

  • panfrost: Clear the GENERAL varying buffer field if unused

  • panfrost: Correctly calculate prefetch suppresion varying index

  • panfrost: Add a debug option for checking overflows on pool uploads

  • panfrost: Don’t unbind recently bound streamout targets

  • panfrost: Allow NULL streamout targets

  • nine: Make vdecl_index_map always signed

  • nine: Only enable thread_submit by default on x86

Igor Torrente (2):

  • venus: Add support to VK_KHR_maintenance4 extension

  • venus: Use maintenance4 to get max_size_buffer

Ikshwaku Chauhan (1):

  • Revert “radeon: add EFC support to only VCN2.0 devices”

Illia Abernikhin (1):

  • vulkan/runtime: repair thread safety in method vk_queue_start_submit_thread()

Indrajit Das (5):

  • radeonsi/gfx11: VRS changes

  • radeonsi: save the fs constant buffer to the util blitter context

  • gallium/u_blitter: clear color buffers using color from a constant buffer

  • mesa/st: add nir shader to clear color buffers using constant value

  • mesa/st: clear color buffers using color from a constant buffer

Italo Nicola (3):

  • mesa/st: call pipe->link_shader even if ir loaded from cache when linking

  • tgsi: Add SEPARABLE_PROGRAM property

  • virgl: overpropagate precise flags

Iván Briano (6):

  • vulkan/wsi/wayland: Fix double free on error condition

  • anv: re-alloc push constants after secondary command buffers

  • anv: do not get rid of empty/useless fragment shaders

  • anv: assert inheritance_info is not NULL

  • anv: emit scissors when the pipeline changes

  • anv: pipelineStageCreationFeedbackCount is allowed to be 0

James Zhu (5):

  • amd: update headers to support decode software ring

  • radeonsi/gfx11: update codec support for gfx11

  • radeonsi/vcn: add decode software ring support for gfx11

  • radeonsi/vcn: add jpeg decode support for gfx11

  • amd/common: some ASICs with gfx9 use compute rings for render

Jan Beich (1):

  • vulkan/wsi: treat EBADF as missing DMA_BUF_IOCTL_{EXPORT,IMPORT}_SYNC_FILE

Jan Palus (1):

  • pvr: use UINT64_C for 64-bit constant

Faith Ekstrand (312):

  • clc: Only initialize LLVM once

  • clc: Rework logging a bit

  • clc: Declare LLVMContexts on the stack

  • intel/compiler: Set lower_fisnormal

  • nir,microsoft: Move scale_fdiv into a common NIR pass

  • nir/opcodes: fisfinite32 should return bool32

  • util: Remove util_cpu_detect

  • util/blob: Add align helpers

  • util/set: Respect found in search_or_add_pre_hashed

  • vulkan: Add a common VkPipelineCache implementation

  • vulkan/pipeline_cache: Add helpers for storing NIR in the cache

  • anv: Rename a fail label in CreateDevice

  • anv: Switch to the new common pipeline cache

  • anv: Clean up pipeline cache helpers a bit

  • nir: Lower all bit sizes of usub_borrow

  • vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping

  • vulkan: Use ALL_COMMANDS_BIT for waits/signals instead of ~0

  • spirv: Handle Op*MulExtended for non-32-bit types

  • vulkan/log: Allow but warn if called with a NULL object

  • vulkan/log: Allow but warn for client-invisible objects

  • nir/constant_folding: Break TXB folding into a helper function

  • nir: Constant fold sampler/texture offsets

  • panvk: Take buffer offsets into account in BindVertexBuffers

  • panvk: Eliminate unused vertex attributes

  • nir/deref: Add an alu-of-cast optimization

  • panvk: Add address/range helpers for panvk_buffer

  • vulkan/wsi: Close file descriptors in wsi_destroy_image

  • vulkan/wsi: Only use a single fd per wsi_image

  • vulkan/wsi: Hang on to file descriptors

  • vulkan/wsi: Reset the image fence right before vkQueueSubmit

  • vulkan/wsi: Set the right stage flags for semaphore waits

  • vulkan/wsi: Add signal_fence/semaphore_for_image helpers

  • nir/cf: Return a cursor from nir_cf_extract as well

  • nir: Fix constant folding for non-32-bit ifind_msb and clz

  • nir: Set image_buffers and msaa_images in lower_samplers_as_deref

  • nir: Stop assuming shader_info::textures_used is 32-bit

  • shader_info: Make images_used a bitset

  • nir: Gather samplers_used separately from textures

  • util/bitset: Support larger ranges in BITSET_TEST/CLEAR_RANGE

  • shader_info: Bump the number of images and textures supported

  • gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers

  • mesa/st: Clamp MaxImageUniforms to MAX_IMAGE_UNIFORMS

  • gallium: Bump PIPE_MAX_SHADER_IMAGES to 64

  • panvk: Call nir_lower_io_arrays_to_elements_no_indirects

  • nir/lower_blend: Be more explicit about deref assumptions

  • nir/lower_blend: Stop passing the whole options object around

  • nir/lower_blend: Support SNORM and integer formats for logic ops

  • nir/builder: Add a nir_trim_vector helper

  • nir/builder: Add a nir_resize_vector helper

  • midgard: Handle FB fetch from non-vec4 output variables.

  • nir/lower_blend: Expand or shrink output variables as needed

  • panvk: Advertise support for logicOp

  • iris: Set BindingTableEntryCount for compute shaders

  • nir: Mark negative re-distribution on fadd as imprecise

  • clover: Set images/samplers_used when lowering images

  • lavapipe: Set shader_info::samplers_used

  • gallium: Set shader_info::samplers_used in pstipple_fs

  • ttn: Set shader_info::samplers_used

  • mesa/st: Set samplers_used in lower_tex_src_plane

  • gallium/draw: Properly handle nr_samplers != nr_sampler_views in keys

  • llvmpipe: Fill out samplers even if nr_samplers = 0

  • lavapipe: Set images_used in lvp_lower_pipeline_layout

  • ntt: Don’t gather samplers_declared twice

  • panvk: Move CreateDescriptorSetLayout to per-arch

  • panvk: Break descriptor lowering into its own file

  • panvk: Add a buffer to each descriptor set

  • panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned

  • panfrost: Add some sanity checking for sysvals

  • bifrost,midgard: Allow providing a fixed sysval layout

  • panvk: Get rid of the per-pipeline sysvals BO

  • panvk: Use a flat sysvals struct

  • panvk: Stop calling lower_uniforms_to_ubo

  • panvk: Put the sysval and push const UBOs at fixed indices

  • panvk: Interleave UBOs with multiple descriptor sets

  • panvk: Put SSBO addresses in the descriptor buffer

  • vulkan: Start collecting enabled features in vk_device

  • panvk: Enable robustBufferAccess

  • panvk: Advertise VK_KHR_variable_pointers

  • panvk: Add an elems field to panvk_buffer_view

  • pavnk: Pass bind layouts to texture and image descriptor helpers

  • panvk: Implement texture/image queries

  • FIXUP: Use 16-bit things for texture sizes

  • panvk: Re-arrange descriptor set functions

  • panvk: Rewrite the write portion of vkUpdateDescriptorSets

  • panvk: Set immutable samplers properly up-front

  • panvk: Implement descriptor copies properly

  • panvk: Drop panvk_descriptor

  • panvk: Round FillBuffer sizes down to a multiple of 4

  • nir: Preserve metadata if remove_dead_derefs makes no progress

  • nir: Add a var set version of lower_indirect_derefs

  • mesa/st: Use nir_shader_instructions_pass for st_nir_lower_builtin

  • mesa/st: Check deref modes in lower_builtin_instr

  • mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin

  • nir: Put the builder first in lower_input_attachments helpers

  • nir: Use nir_shader_instructions_pass in nir_lower_input_attachments

  • bifrost: Run nir_lower_global_vars_to_local before nir_lower_vars_to_scratch

  • panvk: So more nir_lower_tex before descriptor lowering

  • panvk: Lower blending after lower_var_copies

  • v3dv: Add a create_image_view helper for internal views

  • vulkan: Only be clever about vk_image_view::view_format for normal views

  • v3dv: Don’t use color aspects for depth/stencil images

  • v3dv: Drop the region temporary from blit_shader

  • vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create

  • radv: Only use PLANE_0 in meta when actually needed

  • radv: Use vk_image as the base for radv_image

  • radv: Use vk_image_view as the base for radv_image_view

  • glsl: Drop this != NULL assertions

  • vulkan/pipeline_cache: Implement deserialize for raw objects

  • v3dv: Loosen an assert in copy_buffer_to_image_shader

  • vulkan: Fall back to raw data objects when deserializing if ops == NULL

  • lavapipe: Use the correct ICD path on Win32

  • vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper

  • radv: Add a sqtt entrypoint for CmdBindVertexBuffers2

  • radv: Use the common CmdBindVertexBuffers wrapper

  • lavapipe: Use the common BindVertexBuffers wrapper

  • nir/algebraic: Add two more pack/unpack rules

  • intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround()

  • intel/fs: Copy color_outputs_valid into wm_prog_data

  • anv: Drop alpha_to_coverage from the NULL FS optimization

  • anv: Handle the null FS optimization after compiling shaders

  • anv: Don’t disable the fragment shader if XFB is enabled

  • intel: Only set VectorMaskEnable when needed

  • vulkan: Add a base struct for buffers

  • anv: Use the base vk_buffer struct

  • panvk: Use the vk_buffer base struct

  • vulkan/runtime: Add min_lod to vk_image_view

  • intel/isl: Add isl_view::min_lod_clamp for IVB+

  • anv: Implement VK_EXT_image_view_min_lod

  • isl: Add some asserts about multisampled surfaces

  • nir: Handle register sources in lower_phis_to_regs_block

  • nir: Add a correctness note for nir_lower_phis_to_regs_block

  • nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info

  • nir: Add a nir_xfb_info to nir_shader

  • anv: Use nir_shader_gather_xfb_info

  • lavapipe: Use nir_shader_gather_xfb_info

  • turnip: Use nir_gather_xfb_info

  • radv: Use nir_gather_xfb_info

  • nir: Drop nir_shader_get_xfb_info

  • glsl/nir: Populate nir_shader::xfb_info after linking varyings

  • glsl/nir: Stop leaking varyings_info

  • glsl/nir: Stash the xfb_info in the nir_shader when linking XFB

  • st,nir: Use nir_shader::xfb_info in nir_lower_io_passes

  • nir/glsl: Use rzalloc for nir_xfb_info

  • nir: xfb_buffer_info::stride is in bytes

  • ttn: Populate the images/textures/samplers_used fields in shader_info

  • vulkan,anv: Move the image offset/extent sanitize helpers to common code

  • radv: Use the common image offset/extent sanitize helpers

  • vulkan,radv: Move vk_format_get_plane_format to common code

  • vulkan/format: Add a vk_format_get_aspect_format helper

  • vulkan: Add a helper for image<->buffer copies

  • anv: Use the common image<->buffer copy helper

  • vulkan,radv: Steal some image offset/extent helpers from radv

  • radv: Use vk_image_buffer_copy_layout

  • panvk: Use vk_image_buffer_copy_layout

  • lavapipe: Use vk_image_buffer_copy_layout

  • nir/deref: Break out a helper for compare_deref_paths

  • nir/deref: Use an index in compare_deref_paths

  • nir/deref: Make compare_deref_paths take a stop callback

  • nir/deref: Re-arrange variable checks in compare_deref_paths

  • nir: Increase nir_variable_data::mode to 16 bits

  • nir/vars_tests: Use nir_var_mem_global instead of ssbo

  • nir/deref: Handle SSBO array bindings specially

  • nir/deref: Handle RESTRICT for SSBO deref bindings

  • compiler/types: Don’t place members in the previous substruct’s end padding

  • anv/wsi: Stop resetting semaphores

  • vulkan/wsi: Fix a signal_semaphore_with_memory check

  • wsi: Always signal semaphores and fences in wsi_common_acquire_next_image

  • panvk: Use the common AcquireNextImage implementation

  • v3dv: Use the common AcquireNextImage implementation

  • radv: Drop create_sync_for_memory

  • lavapipe: Use the common AcquireNextImage implementation

  • pvr: Use the common AcquireNextImage implementation

  • turnip: Use the common AcquireNextImage implementation

  • turnip: Use the common QueuePresent implementation

  • turnip: Set supported_sync_types before wsi_init

  • vulkan/wsi: Use a STACK_ARRAY for stage_flags

  • vulkan/wsi: Reset fences earlier in wsi_common_queue_present

  • vulkan/wsi: Re-arrange QueueSubmit code in wsi_common_queue_present

  • vulkan/wsi: Set memory ownership after signaling fences and semaphores

  • vulkan/wsi: Signal semaphores and fences from the dma-buf

  • intel/compiler: Use NIR_PASS(_, …)

  • anv: Use NIR_PASS(_, …)

  • anv: Properly clamp attachment layer counts

  • vulkan/wsi: Use HAVE_LIBDRM to detect DRM instead of !_WIN32

  • vulkan: Add some border color helpers

  • panvk: Use the new border color helpers

  • lavapipe: Use the new border color helper

  • turnip: Use the new border color helpers

  • vulkan/nir: Make spirv_data const in vk_spirv_to_nir

  • vulkan: Add a vk_pipeline_shader_stage_to_nir helper

  • gallium: Add a u_default_clear_buffer helper

  • iris: Use u_default_clear_buffer

  • panfrost: Use u_default_clear_buffer

  • nir/opt_memcpy: lower copies to/from tightly packed types

  • nir: Add an options parameter to deref_instr_has_complex_use

  • nir/opt_memcpy: Add another case for function_temp

  • vulkan/render_pass: Add a better helper for render pass inheritance

  • anv: Use CmdBeginRendering for resumes in BeginCommandBuffer when possible

  • vulkan/render_pass: Use a special layout for self-dependencies

  • vulkan/render_pass: Pass sample locations to barriers

  • vulkan/render_pass: Allow for mixed sample counts

  • vulkan/render_pass: Support VkAttachmentSampleCountInfoAMD

  • vulkan: Fix a comment

  • vulkan: Add a base struct for descriptor set layouts

  • vulkan: Add a common implementation of pipeline layouts

  • lavapipe: Use the vk_descriptor_set_layout base struct

  • lavapipe: Use the vk_pipeline_layout base struct

  • panvk: Use the vk_descriptor_set_layout base struct

  • panvk: Use the vk_pipeline_layout base struct

  • vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue

  • anv: Replace an assert() with unreachable()

  • util: Re-indent util_sign_extend, comment, and add asserts

  • util: Use shifts in util_sign_extend

  • util: Use util_sign_extend in fast_idiv_by_const

  • mesa/bptc: Use util_sign_extend

  • util,nir: Move mask_sign_extend from opt_load_store_vectorize to util

  • isl: Use util_sign_extend

  • intel/decoder: Use util_mask_sign_extend()

  • panfrost,asahi: Use util_sign_extend for unpacking

  • nir: Use util_mask_sign_extend when serializing constants

  • vulkan/wsi: Pass the size to MapMemory in the SW path

  • vulkan/wsi/wayland: Use a single memcpy in the SW path

  • vulkan/wsi: Move select_memory_type to common and rework it a bit

  • vulkan/wsi: Add a helper for creating CPU images

  • vulkan/wsi/x11: Use wsi_configure_cpu_image

  • vulkan/wsi/wayland: Use wsi_configure_cpu_image

  • vulkan/wsi: Delete SW support from configure_native_image

  • vulkan/wsi: Persistently map CPU images

  • vulkan/wsi: Compute stride and size in configure_buffer_image

  • vulkan/wsi: Align buffer image strides to optimalBufferCopyRowPitchAlignment

  • vulkan/wsi: Support tiled CPU images

  • vulkan/wsi/x11: Don’t leak shm_reply if we don’t have dri3 or present

  • vulkan/wsi/x11: Only use MIT_SHM if the device supports EXT_external_memory_host

  • vulkan/wsi/wayland: Use host pointer import when available

  • dozen: Increase optimalBufferCopy*Alignment

  • dozen: Stop asking for semaphore/fence signaling

  • vulkan/wsi/wayland: Only memcpy if the swapchain is actually software

  • vulkan/wsi: Fix structure chaining in wsi_create_buffer_image_mem

  • shader_info: Move subgroup_size out of cs and make it an enum

  • vulkan: Re-order pipeline hashing

  • vulkan: Constify vk_spirv_version

  • spirv,vulkan: Set shader_info::subgroup_size

  • mesa,glsl,ttn: Set subgroup_size to UNIFORM

  • anv: Drop unnecessary parameters to anv_pipeline_compile_cs

  • anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage

  • anv: Use vk_pipeline_shader_stage_to_nir

  • vulkan: Add a vk_pipeline_shader_stage_is_null() helper

  • anv: Use vk_pipeline_shader_stage_is_null()

  • intel,anv,iris,crocus: Drop subgroup size from the shader key

  • anv: Implement VK_EXT_shader_module_identifier

  • anv: Init supported extensions before WSI

  • vulkan/wsi/wayland: Add a BUFFER_TYPE flag

  • vulkan/wsi: Add debug variables to force the SW and PRIME buffer blit paths

  • intel/blorp: Set uses_sample_shading for MSAA blit shaders

  • radv: Set uses_sample_shading for copy shaders

  • intel/fs: Use shader_info::fs::uses_sample_shading

  • nir: Add a pass for lowering shaders to single-sampled

  • intel/fs: Use nir_lower_single_sampled

  • glsl/nir: Set uses_sample_shading from glsl_to_nir

  • spirv/nir: Set uses_sample_shading from spirv_to_nir

  • nir/gather_info: Stop gathering uses_sample_shading

  • glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue

  • intel/fs: Simplify persample_dispatch

  • intel/fs,anv: Rework handling of coarse and sample shading

  • panfrost: Simplify sample_shading

  • anv: Stop compacting surface state tables

  • vulkan: Detect pNext chain loops in vk_foreach_struct()

  • anv: Don’t use the wrong ARRAY_SIZE

  • Revert “vulkan: Detect pNext chain loops in vk_foreach_struct()”

  • vulkan: Add a vk_limits.h file for runtime limits

  • vulkan: Add an enum for all dynamic graphics states

  • vulkan: Add standard sample locations

  • vulkan: Add data structures to store all graphics state

  • util: Constify __bitset_test_range

  • vulkan: Add a common data structure for dynamic states

  • vulkan: Add a common implementation of vkCmdSet*

  • vulkan: Copy the depth/stencil state optimization code from ANV

  • Use vk_foreach_struct_const where needed

  • vulkan: Detect pNext chain loops in vk_foreach_struct() (v2)

  • anv: Add an instance multiplier to anv_pipeline

  • anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2

  • anv: Stop recording sample locations per-sample-count

  • anv: Convert to using vk_graphics_pipeline_state

  • anv: Rework setting primitive topology

  • anv: Stop merging DEPTH_STENCIL state

  • anv: Switch to using common dynamic state tracking

  • vulkan/graphics_state: Improve the depth/stencil optimization code

  • anv: Do depth/stencil optimization for dynamic depth/stencil

  • vulkan: Call lower_clip_cull_distance_arrays in vk_spirv_to_nir

  • vulkan/nir: Don’t remove dead XFB outputs

  • vulkan: Call gather_xfb_info in vk_spirv_to_nir

  • lavapipe: Use more Vulkan NIR heleprs

  • vulkan: Allow up to 12 pointers in multialloc

  • vulkan: Record shader stages in vk_graphics_pipeline_state

  • vulkan: Multisample state isn’t always included in fragment shader state

  • vulkan: Depth/stencil isn’t fragment output state but multisample is

  • vulkan: Add a fully_dynamic_state_groups() helper

  • vulkan: Fix pipeline libraries with dynamic-only VI or FSR state

  • vulkan: Input assembly and depth/stencil can also be fully dynamic

  • vulkan: Don’t assert VkGraphicsPipelineLibraryCreateInfoEXT::Flags == 0

  • zink: Mark depth/stencil visual tests as flakes

  • lavapipe: Use shader_info::uses_sample_shading

  • lavapipe: Use the vk_graphics_pipeline_state

  • vulkan: Append subpass structures to VkRenderingInfo last

  • vulkan: Include self-dep info in rendering continues

  • intel/eu: Don’t throw validation errors on float MOV_INDIRECT

  • lima: Lower undefs to zero and run DCE after from_ssa

  • nir/from_ssa: Ignore undef sources

  • vulkan: Always populate vk_render_pass_state::render_pass

  • vulkan: Improve the docs for vk_subpass_info

  • radv: Use both aspects for depth/stencil blit destinations

  • vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors

Jason Volk (4):

  • r600: Fix userspace pointer support for evergreen compute.

  • r600: Improve compute memory pool performance; reduce fragmentation.

  • r600: Elide downloads for discarded and immutable compute memories.

  • radeon: Support shared memory user pointers.

Jeffrey Knockel (1):

  • egl: dereference XCB drawable pointers

Jesse Natalie (158):

  • d3d12: Don’t block DISPLAY based on format for non-Windows

  • wgl: Implement WGL_ARB_create_context_robustness

  • d3d12: Hook up robustness extensions

  • d3d12: Plug context leaks

  • d3d12: Plug screen leaks

  • d3d12: Attempt screen reset during context create

  • d3d12: Add a unit test for context reset recovery

  • microsoft/compiler: Unload DXIL validator library *after* calling Release()

  • d3d12: Correctly key off of polygon stipple enable cap

  • mesa: Ensure ARB programs end in a newline

  • bin: Add a script for filtering/sorting refcount logs

  • u_debug_refcnt: Don’t loop for initial refcounts if the initial value is huge

  • u_debug_symbol: Use correct sizeof for DbgHelp

  • u_debug_symbol: Allow re-init of DbgHelp to work around already-initialized issues

  • u_debug_stack: Lock around stack dumps to prevent interleaving

  • u_primconvert: Refactor to remove recursion from util_primconvert_draw_vbo

  • u_primconvert: Handle take_index_buffer_ownership

  • d3d12: Use a pipe_reference in d3d12_bo

  • d3d12: Fix leaks in map with do-not-wait

  • d3d12: Fix a couple over-releases from incorrect take-ownership flag

  • d3d12: Destroy empty residency bo set during early-return

  • d3d12: Include windows.h on Windows before dxgicommon.h

  • d3d12: Don’t use VLAs

  • d3d12: MSVC warning around operator precedence causing uint32_t==bool

  • dzn: Add new parameter to vk_image_view_init call

  • d3d12: Fix forward decl for sw_winsys

  • microsoft/compiler: Fixup sampler derefs in tex instrs that don’t *need* samplers

  • nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*)

  • d3d12: Re-add missed bitmasks to shader key comparison

  • CI/d3d12: Add skips and update baselines

  • CI/windows: Install Vulkan SDK from LunarG directly isntead of Chocolatey

  • CI/windows: Delete comment for installing vulkan-runtime

  • CI/windows: Disable chocolatey progress printing to shrink log size

  • CI/windows: Disable LLVM CMake developer warnings to shrink log size

  • CI/windows: Move –fraction for deqp-runner to run command line

  • dzn: Update fails/flakes

  • CI: Re-enable Windows builds

  • nir: Consider PNTC to be a varying

  • wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains

  • meson: Use a C99 STDC_VERSION for flex

  • d3d12: Report PIPE_CAP_ACCELERATED as 0 on WARP

  • d3d12: Properly set HS input control point count

  • dzn: Init sync types before wsi

  • mesa: Use C11 alignas for cross-platform SSE support

  • CI: Lima farm is offline

  • dzn: Skip another test that times out occasionally

  • util/disk_cache: Implement disk_cache_get_function_identifier for Windows

  • microsoft/clc: Enable tests that pass on server 2022

  • microsoft/clc: Fix test double free in the case of compilation failure

  • microsoft/clc: Remove dead image vars

  • microsoft/clc: Add a unit test for unused image kernel args

  • vulkan_entrypoints_gen: For Windows ARM64EC, prefix symbols with ‘#’

  • dzn: Support Vulkan loader interface v5

  • gallium/util: Move u_dl and u_pointer to src/util

  • dzn: Annotate sometimes-unused variables.

  • dzn: Define COBJMACROS before any includes

  • dzn: Use os_time_sleep instead of Win32 Sleep

  • dzn: Use u_dl to load D3D12

  • dzn: Add an eventfd sync implementation

  • dzn: Fix maybe-uninitialized warning

  • dzn: Skip dxil validator and some Windows-only debug flags for Linux

  • dzn: Use IUnknown instead of IDXGIAdapter1 as the stored adapter

  • dzn: Don’t hash adapter LUID as part of device UUID

  • dzn: Use a custom adapter desc instead of DXGI adapter desc

  • dzn: Move DXGI code to a separate file and only build it on Windows

  • dzn: Add a DXCore enumeration path

  • dzn: Add ABI helpers for D3D12 functions returning structs

  • vulkan/wsi/win32: Use the new helpers and persistent map

  • dzn: Fix winsys reporting

  • dzn: Only support high/normal queue priorities

  • dzn: Add for condition to break nested loop

  • dzn: Re-order includes in dzn_nir to prevent TRUE redefinition

  • dzn: Only add Werror flags that the compiler supports

  • dzn: Disable Wunused-value for C++ files

  • dzn: Fix comparison order for format enums

  • dzn: Re-order dzn_physical_device_desc so = { 0 } doesn’t initialize a struct

  • dzn: Use fabs on floating point input

  • CI: Add dzn to debian vulkan/clang/release builds

  • mesa: Support D3D11/D3D12 memory imports

  • mapi: Add more EXT_external_objects_win32 functions/enums

  • gallium: Add a ‘name’ field to winsys_handle

  • mesa: Implement ImportMemoryWin32NameEXT

  • gallium: Add ‘name’ field to Win32 semaphore import

  • driver_noop: Remove infinite recursion from create_fence_win32

  • gallium: Add a new fence type with a pipe cap to indicate it can be imported

  • mesa: Support importing D3D12 fences as timeline semaphores

  • mesa: Implement ImportSemaphoreWin32NameEXT

  • gallium, mesa: Support setting timeline semaphore values

  • d3d12: Store the rest of the device IDs in the screen

  • d3d12: Compute UUIDs required by external objects extension

  • d3d12: Add pipe getters for Win32 and base external objects device matching

  • d3d12: Support B4G4R4A4 format

  • d3d12: Get adapter LUID after device creation

  • d3d12: PIPE_BIND_SHARED doesn’t mean linear and is always on opened resources

  • d3d12: Support creating memory objects

  • d3d12: Implement resource_from_memobj

  • d3d12: Support opening resources and memobj by name

  • d3d12: Support importing fences / timeline semaphores

  • d3d12: Implement server signal/wait

  • d3d12: Implement fence opening and value setting

  • d3d12: Fix up resource import validation

  • docs: Update features.txt and new_features.txt

  • u_atomic: Implement p_atomic_xchg for Windows

  • u_atomic: Fix MSVC p_atomic_add_return

  • u_atomic: Add p_atomic_fetch_add which returns the old value

  • zink: Use p_atomic_fetch_add

  • simple_mtx: Replace GCC sync intrinsics with u_atomic ops

  • Fix static glapi on Windows

  • dzn: Missed ABI fixes for GetCustomHeapProperties

  • dzn: Remove the cast when the SDK version is high enough

  • d3d12: Add helpers to build with correct ABI for MinGW

  • microsoft/clc: Add helpers to build with correct ABI for MinGW

  • subprojects: Point DirectX-Headers at a specific tag

  • CI: Update DirectX-Headers dependency for MinGW/Debian

  • util: Add a Win32 futex impl

  • d3d12: Support clip halfz

  • d3d12: Add a blank d3d12_resource_state.h/cpp

  • d3d12: Move desired resource state to new files

  • d3d12: Move current resource state to new files

  • d3d12: Give bos a unique identifier to be used for state tracking

  • d3d12: Add a list of contexts alive for the current screen

  • d3d12: Hold lock when removing resources from residency list

  • d3d12: Notify contexts about deletion of bos

  • d3d12: Add a context state tracking structure

  • d3d12: Treat depth/stencil as planar for plane count helper

  • d3d12: Create/free context state entries

  • d3d12: Track a global resource state for non-simultaneous-access resources

  • d3d12: Record a state fixup command list when necessary

  • d3d12: Swap the remainder of state tracking to new method

  • d3d12: Rename bind invalidate options to transition flags

  • d3d12: Extract core barrier logic

  • d3d12: Optimize transition_subresource_states that covers a whole resource

  • d3d12: Add a transition flag indicating that state accumulation is needed

  • d3d12: Remove state tracking implementation details from header

  • nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O

  • microsoft/compiler: Add a max shader model option

  • microsoft/compiler: Add a max validator version

  • microsoft/compiler: Emit metadata based on long-lived signature data

  • microsoft/compiler: Split signature processing into two parts

  • microsoft/compiler: Ensure 4-component position writes via NIR

  • microsoft/compiler: Lower I/O to scalar

  • microsoft/compiler: Support DXIL validator version 1.5

  • microsoft/compiler: Compute correct usage masks

  • microsoft/compiler: Correctly compute dynamic indexing I/O masks

  • microsoft/compiler: DXIL validator 1.6 counts UAVs differently

  • microsoft/compiler: Set the barycentrics flag for attribute_at_vertex

  • microsoft/compiler: DXIL validator 1.6 uses a new PSV struct version

  • microsoft/compiler: DXIL validator 1.6 uses a new PSV resource struct

  • microsoft/compiler: Support DXIL validator 1.6

  • microsoft/compiler: Support DXIL validator 1.7

  • microsoft/compiler: Add a getter for a validator version from DXIL.dll

  • microsoft/compiler: Blacklist DXIL validator 1.6 from 20348 SDK

  • d3d12: Retrieve validator version

  • dzn: Retrieve validator version

  • egl/wgl: Delete unused variables/code

  • egl/wgl: Fix some awkward sizeof formatting

  • microsoft/compiler: Discard shouldn’t be marked readnone

  • microsoft/compiler: Fix PSV struct when numthreads is 0

Jiang Feng (1):

  • virgl: Set res->maybe_busy to true when creating resources

Jonathan Gray (2):

  • intel/dev: sync ADL-S pci ids with linux

  • intel/dev: add RPL-S pci ids from drm-intel-next

Jonathan Marek (2):

  • freedreno/registers: add a7xx registers for drm/msm kernel driver

  • freedreno/registers: add missing varset=”chip” for new enum values

Jonathan Weinstein (1):

  • lavapipe: support instance rate zero for VK_EXT_vertex_attribute_divisor

Jordan Justen (48):

  • intel/dev: Add device info for RPL-P

  • vulkan/wsi/x11: Use atomic_int with sent_image_count

  • drm-uapi/i915_drm.h: Update from drm-next (2022-04-28)

  • intel/dev: Read hwconfig from i915

  • intel_dev_info: Add –hwconfig command line parameter

  • iris: Fix assertion meant to only target the clear-color stride

  • nir/divergence: handle more *_intel intrinsics

  • intel/dev: Add INTEL_PLATFORM_DG2_G12

  • intel/dev: Add DG2 G12 PCI IDs

  • intel/dev: Enable first set of DG2 PCI IDs

  • intel: Fix build of mi_builder_tests by including c99_compat.h

  • intel: Build mi_builder_test whenever build-tests is set

  • drm-uapi/i915_drm.h: Update from drm-next (2022-05-06)

  • intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE

  • anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state()

  • anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state()

  • anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE

  • anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1

  • intel/dev: Add devinfo::mem to store i915 regions information

  • anv: Use memory info from devinfo

  • iris/bufmgr: Use memory info from devinfo

  • intel/tools: Print memory info in intel_dev_info

  • vulkan/wsi: Disable dma-buf sync file if ENOSYS is returned

  • intel/compiler: Don’t create vec4 reg-set for gen8+

  • intel/dev: Add vram.unmappable.size region info

  • intel/tools: Print unmappable region info in intel_dev_info

  • intel/dev: Add intel_vram_all_mappable()

  • iris/resource: Assert that DG2 CCS buffers don’t also try to set BO_ALLOC_SMEM

  • iris/resource: Avoid mapping when not needed in iris_resource_init_aux_buf()

  • iris/bufmgr: Add all_vram_mappable which is currently always true

  • iris/bufmgr: Set mmap_mode to IRIS_MMAP_NONE for lmem in small-BAR mode

  • iris: Set clear_color_unknown if the bo is not mappable

  • anv/allocator: Add assert and TODO comment for future small BAR uapi

  • iris/bufmgr: Add assert and TODO comment for future small BAR uapi

  • drm-uapi/i915_drm.h: Update from drm-next (2022-07-22)

  • iris/bufmgr: Add I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS for vram mappable buffers

  • intel/dev: Use i915 region probed_cpu_visible_size when non-zero

  • intel/dev: Determine the amount of free vram using small BAR uapi

  • intel/dev: Enable remaining DG2 and ATS-M device IDs

  • anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR

  • intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE

  • iris: Drop extra file-descriptor dup in iris_drm_screen_create()

  • intel/pci_ids: Update ADL-S strings

  • intel/pci_ids: Add 0x468b ADL-S PCI-id

  • intel/pci_ids: Update ATS-M device names

  • intel/pci_ids: Update DG2 device names

  • intel/pci_ids: Add dg2 0x5698 pci-id

  • intel/pci_ids: Drop non-upstream dg2 pci-ids

Jose Maria Casanova Crespo (1):

  • v3dv/ci: increase fraction to 10 on v3dv ci jobs.

Josh Billingsley (1):

  • driconf: add SD Gundam G Generation Cross Rays

Joshua Ashton (1):

  • aco: Use movk for AddressHi bits in vertex prolog

José Fonseca (6):

  • trace: Fix framebuffer state serialization.

  • lavapipe: Prevent mapping buffers beyond their size.

  • trace: Dump NIR.

  • trace: Parse character data more efficiently.

  • trace: Bring state dump up to speed.

  • trace: Allow to control nir dumping via an environment variable.

Juan A. Suarez Romero (37):

  • ci/v3dv: remove fixed test

  • v3d: report the correct unsupported blit format

  • v3d: do not tile 1D textures

  • v3d: use surface format defined on pipe_blit

  • gallium: add hook on getting canonical format

  • v3d: define our own canonical supported formats

  • v3d: enable GL_ARB_copy_image extension

  • gallium: move get_canonical_format hook to pipe_screen

  • v3dv: duplicate key on hashtable insert

  • v3dv: store device_id on device init

  • v3d/simulator: add support for AMD cards

  • mesa: unref syncobj after wait_sync

  • v3d: enable early Z/S clears

  • v3d: disable early-Z on odd frame dimensions

  • v3d: allow TFU blitting for single layer textures

  • v3d: add hook to get on-disk shader cache

  • v3d: add builtin support for white/black clamp-to-border

  • v3dv: check BO mapping result

  • docs: update extensions for V3D and VC4

  • ci: bump VK-GL-CTS to 1.3.2.0

  • v3d: fix blending for mixed RT formats

  • v3d: expose GL_ARB_draw_buffers_blend

  • v3d/ci: Add traces

  • v3d: use function to initialize refcount

  • v3d: save only required states in blitter

  • v3d: release all color buffers on context destroy

  • v3d: add ARB_polygon_offset_clamp extension support

  • v3dv/ci: Update expected results

  • v3d/ci: Add flake test

  • v3d/v3dv/ci: update expected results

  • v3d/ci: (partially) revert expected list

  • v3d/ci: update expected list

  • v3dv/ci: Update expected list

  • v3d/vc4/ci: Fix typo in skipped list

  • v3d/ci: enable ARB_texture_cube_map piglit tests

  • vc4: properly restore vc4 debug option

  • vc4: store tex sampler in proper register

Kai Wasserbäch (1):

  • fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods.

Karmjit Mahil (26):

  • pvr: Fix clang-format errors caused by vk outarrays.

  • pvr: Add PVR_WORKGROUP_DIMENSIONS define.

  • pvr: Add initial implementation of vkCmdDispatch().

  • pvr: Add compute update shared kernel in vkCmdDispatch().

  • pvr: Add compute update kernel in vkCmdDispatch().

  • pvr: Update pvrsrvkm to fw 1.17 .

  • pvr: Add csb enum helper funciton header.

  • pvr: Write immutable descriptor words in vkAllocateDescriptorSets().

  • pvr: Fix incorrect samples to cr_isp_aa_mode conversion.

  • pvr: Change compute hard coding infrastructure.

  • pvr: Remove unused output_selects from vertex shader state.

  • pvr: Add graphics pipeline hard coding infrastructure.

  • pvr: Enable 33.15.11.3 (AXE-1-16M).

  • pvr: Extend hard coding infra to allow per stage per device coding.

  • pvr: Fix physical device limits.

  • pvr: Handle vdm degen_cull_enable.

  • pvr: Fix off by 1 error in buffer_id for ubo pds program.

  • pvr: Rename loop iterator variable.

  • pvr: Update pvrsrv build version for fixed size fw.

  • pvr: Rename uniform pds program to descriptor pds program.

  • pvr: Add static consts support in descriptor pds program.

  • pvr: Add static consts to hard coding infra.

  • pvr: Add initial vkCmdPipelineBarrier skeleton.

  • pvr: Add IDF/WDF program for compute pipeline barrier.

  • pvr: Add compute pipeline barrier handling.

  • pvr: Fix cdm shared reg usage reported to fw.

Karol Herbst (20):

  • nir: add 16 and 64 bit fisnormal lowering

  • vtn: clamp SpvOpImageQuerySize dest to 32 bit

  • nv50/nir: align tlsspace to 0x10

  • llvmpipe: PIPE_COMPUTE_CAP_GRID_DIMENSION is uint64_t

  • llvmpipe/fence: make the fence id counter atomic

  • iris/cs: take buffer offsets into account for CL

  • nir: add a nir_remove_non_entrypoints helper

  • ntt: make use of new samplers_used field

  • clc: parse localSize and localSizeHint

  • clc: drop parsingComplete check

  • clc: speed up compilation by not relying on opencl-c.h

  • clc: fix compiler features_macro CTS Test

  • clc: disable opaque pointers until they are supported

  • nv50/ir: recoginse Ampere

  • nir/gather_info: allow to run it before inlining

  • nir/gce: pin call instructions

  • nouveau: use the contexts pushbuf and client where possible

  • nouveau/mm: make code thread safe

  • nv50/ir: fix OP_UNION resolving when used for vector values

  • nv50: properly flush the TSC cache on 3D

Ken Johnson (2):

  • d3d12: Reduce cost of shader key comparisons

  • mesa/main: Correct alloca() use without #include “c99_alloca.h”

Kenneth Graunke (41):

  • anv: Fix INTEL_DEBUG=bat on XeHP

  • intel/decoder: Fix binding table pointer decoding with large offsets

  • iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS

  • Revert “st/mesa: Transcode ASTC to BC7 (BPTC) where possible”

  • intel: Drop Wa_1409226450 (stall before instruction cache invalidation)

  • intel: Fix analysis invalidation in eliminate_find_live_channel

  • intel/compiler: Move spill/fill tracking to the register allocator

  • mesa: Split tx_compress_dxtn into per-format functions

  • mesa: Avoid temp images in _mesa_texstore_rgb_dxt1 for GL_RGBA source

  • mesa: Require temp images in _mesa_texstore_*_dxt* with SkipImages

  • mesa: Avoid temp images in _mesa_texstore_*_dxt* for stride = 0

  • isl: Add an isl_format_supports_typed_atomics() helper.

  • intel/compiler: Fix uncompaction of signed word immediates on Tigerlake

  • intel: Set a more useful fake devinfo->gtt_size in no-hw mode

  • intel/compiler: Fix A64 header construction with a uniform address

  • iris: Update comment about 2GB dynamic state range

  • iris: Use PIPE_* defines rather than ones from main/config.h

  • crocus: Use PIPE_* defines rather than ones from main/config.h

  • intel/compiler: Stop including src/mesa/main/config.h

  • intel/compiler: Fix brw_gfx_ver_enum.h to be a proper header file

  • intel/compiler: Split 3DPRIM_* defines out to a separate header.

  • intel/tools: Stop malloc’ing device info in i965_disasm

  • intel/compiler: Move opcode_desc handling to a separate header

  • intel/compiler: Introduce a new brw_isa_info structure

  • intel/compiler: Remove use of thread_local for opcode tables

  • intel/compiler: Convert brw_eu.cpp back to brw_eu.c

  • intel/compiler: Handle split-sends in EOT high-register pinning case

  • intel/fs: Opportunistically split SEND message payloads

  • intel/compiler: Avoid copy propagating large registers into EOT messages

  • intel/compiler: Don’t set SBID on EOT send messages

  • mesa: Mark render-to-texture as unsafe if there’s no pipe_resource

  • intel/eu: Handle compaction when inserting validation errors

  • intel/eu: Fix XeHP register region validation for hstride == 0

  • intel/eu: Clarify spec citations for XeHP region restrictions

  • intel/eu: Mark header present in URB memory fences on XeHP

  • intel/compiler: Lower FIND_[LAST_]LIVE_CHANNEL in IR on Gfx8+

  • intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel()

  • iris: Fix PIPE_CAP_UMA

  • iris: Use linear for exported resources if we can’t convey tiling

  • iris: Delete unused iris_screen::aperture_bytes field

  • crocus: Fix memory leaks on iris_resource_create failure paths

Konrad Dybcio (1):

  • freedreno: Enable A619

Konstantin Seurer (106):

  • radv: Fix ray queries with !15854

  • radv: Do not discard hits with t=tmax

  • radv: Enable rt primitive culling for spirv2nir

  • ci: Always install glslangValidator for build jobs

  • radv: Use VAs in radv_meta_buffer.c

  • radv: Make fill_buffer_shader non-static

  • radv: Format radv_acceleration_structure.c

  • radv: Add the fuchsia radix sort

  • radv: Build accaleration structures using LBVH

  • radv: Clean up the accel-struct build shaders

  • radv: Fix lowering ignore_ray_intersection

  • vulkan: Add a shader module clone helper

  • radv: Copy shader modules to avoid use after free

  • radv/radix_sort: Add missing entry points

  • radv/radix_sort: Make variable names consistent

  • radv: Use the entire morton code as sort key

  • ac/nir: Do not set cursor in try_extract_additions

  • radv: Move RADV_RT_STAGE_BITS to radv_private.h

  • radv: Treat rt stages like compute stages

  • radv: Use RADV_RT_STAGE_BITS more often

  • nir: Add a ray launch size addr intrinsic

  • radv: Allow indirect dispatches without bo

  • radv: Indirect unaligned dispatches

  • radv,aco: Use ray_launch_size_addr

  • radv: Implement vkCmdTraceRaysIndirectKHR

  • radv: Advertise indirect ray tracing

  • radv: Allow radv_fill_buffer to work with VAs only

  • radv: Use radv_fill_buffer for accel struct builds

  • Revert “radv: Make fill_buffer_shader non-static”

  • radv: Add and use radv_bvh_node_type

  • radv: Replace nir_vector_extract with nir_channel

  • radv/ci: Skip slow/crashing ray tracing tests

  • radv: Enable ray queries by default

  • radv: Add fail/flake lists for navi22

  • radv: Fix handling of primitiveOffset

  • radv: Ignore transformOffset if transformData is 0

  • vulkan: Use the correct enum in vk_buffer

  • radv: Use vk_buffer

  • radv: Add and use radv_cp_dma_wait_for_stages

  • radv: Some acceleration structure cleanups

  • radv: Require an alignment of 64 for accel structs

  • radv: Fix the accel struct size calculation

  • vulkan: Spec update to 1.3.216

  • spirv: Add plumbing for ray_cull_mask

  • radv: ray cull mask

  • radv: Add accel struct queries for maintenance1

  • radv: ACCESS_2_SHADER_BINDING_TABLE_READ_BIT_KHR

  • radv: PIPELINE_STAGE_2_ACCELERATION_STRUCTURE_COPY

  • nir: Handle ray_launch_size_addr in opt_preamble

  • radv: vkCmdTraceRaysIndirect2KHR

  • radv: Advertise VK_KHR_ray_tracing_maintenance1

  • nir,spirv: Preserve inbounds access information

  • nir/nir_lower_io: Optimize 32-bit inbounds access

  • radv: Move some rt intrinsics to the top

  • radv: Introduce an enum for the any hit status

  • radv: Fix the t comparison for triangles

  • radv: Refactor lower_rt_derefs

  • nir: Add a nir_test_mask helper

  • radv: Use nir_test_mask instead of i2b(iand)

  • nir: Use nir_test_mask instead of i2b(iand)

  • d3d12: Use nir_test_mask instead of i2b(iand)

  • intel: Use nir_test_mask instead of i2b(iand)

  • microsoft: Use nir_test_mask instead of i2b(iand)

  • dozen: Use nir_test_mask instead of i2b(iand)

  • radv: Use two bools for ahit_status

  • radv: Skip pipeline_no_null_shaders_flag tests

  • radv: Fix acceleration structure size queries

  • radv: Use a global address for ray_launch_size

  • radv: Use a global address for sbt_base

  • radv: Only set rt stack size for dynamic stacks

  • vulkan: Common vk_format_get_component_bits

  • nir: Add a common gen_rect_vertices implementation

  • radv: Use nir_gen_rect_vertices

  • v3dv: Use nir_gen_rect_vertices

  • radeonsi: Set vertex_id_zero_based

  • aco: Remove dead nir_intrinsic_load_vertex_id case

  • ac/llvm: Remove load_vertex_id handling

  • util/format: Use an explicit length for the descs

  • util/format: Assert that formats are valid

  • radv: Remove format desc null checks

  • radv: Use desc->format

  • v3dv: Remove format desc null asserts

  • turnip: Remove format desc null assert

  • gallivm: Remove format desc null checks

  • tgsi: Remove format desc null check

  • util: Remove format desc null check

  • agx: Remove format desc null check

  • etnaviv: Remove format desc null check

  • llvmpipe: Remove format desc null checks

  • panfrost: Remove format desc null check

  • r300: Remove format desc null check

  • r600: Remove format desc null checks

  • radeonsi: Remove format desc null checks

  • softpipe: Remove format desc null check

  • virgl: Remove format desc null checks

  • pvr: Remove format desc null check

  • util/format: Remove format desc null checks

  • gallium/tests: Remove format desc null checks

  • radv: Add meta saving/restoring for predicating

  • radv: Use RADV_META_SUSPEND_PREDICATING

  • radv: Simplify the meta init fail path

  • radv: Fix out of bounds access in RTPSO creation

  • vulkan: Fix descriptor set layout allocation scope

  • vulkan: Handle descriptor set layout alloc fails

  • radv: Use vk_descriptor_set_layout

  • radv: Fix stack size calculation with stage ids

Kristian Høgsberg (1):

  • util/u_debug: Use ‘initialized’ instead of ‘first’

Kyle Russell (1):

  • etnaviv: don’t halve max_instructions for GC400

Leandro Ribeiro (1):

  • Revert “egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()”

Lionel Landwerlin (120):

  • anv: use shadow surface for stencil input attachment on gfx7

  • intel: fix URB programming for GT1s

  • intel/fs: require memory fence commit bit on Gfx9

  • anv: improve INTEL_DEBUG for submit

  • anv/intel: add a new debug flag for stalling after every draw/dispatch

  • anv: implement DEBUG_SYNC

  • anv: allow getting the address of the beginning of the batch

  • intel/fs: tidy up lower of ray queries

  • ci/iris: mark another test as flaky

  • anv: implement VK_EXT_primitives_generated_query

  • nir/divergence: handle load_global_block_intel

  • anv: fix INTEL_DEBUG=sync

  • intel: fixup number of threads per EU on XeHP

  • anv: remove unused enum

  • anv: fix acceleration structure descriptor template writes

  • anv: skip acceleration structure in binding table emission

  • anv: reemit 3DSTATE_STREAMOUT after memcpy

  • anv: fix clflush usage on utrace copy batch

  • anv: limit clflush usage

  • vulkan/runtime: 0-out pipeline cache object

  • iris: use new kernel uAPI to compute video memory

  • anv: fix invalid utrace memcpy l3 config on gfx < 11

  • anv: reset all dynamic state after secondary execution

  • anv: add missing logic op set in pipeline dyn state

  • Revert “anv: fix dynamic state emission”

  • anv: rework sample location

  • anv: never emit 3DSTATE_CPS in the pipeline batch

  • anv: don’t emit 3DSTATE_STREAMOUT in pipeline batch

  • anv: don’t emit 3DSTATE_WM in pipeline batch

  • anv: don’t emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch

  • anv: don’t emit 3DSTATE_DEPTH_BOUNDS in pipeline batch

  • anv: don’t emit 3DSTATE_VF_TOPOLOGY in pipeline batch

  • anv: remove static_state_mask

  • docs: explain state emission in Anv

  • intel: fix stall debug option

  • nir/cf: return cursor after insertion of cf_list

  • nir/lower_shader_calls: don’t use nop instructions as cursors

  • nir/lower_shader_calls: don’t insert code after break/continue

  • nir/lower_shader_calls: put inserted instructions into a dummy block

  • anv: fix primitives generated queries values

  • intel/perf: add support for new opcodes in code generation

  • intel/perf: store a copy of devinfo

  • intel/perf: add support for dualsubslice count variable

  • intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables

  • intel/perf: stop overriding oa_format

  • intel/perf: add OA A counter type

  • intel/perf: add support new variable counting the number of EUs in slice0-3

  • intel/perf: add max vfuncs

  • intel/perf: add new layout for Gfx12.5 products

  • intel/perf: disable sseu setting on Gfx12.5+

  • intel/perf: deal with OA reports timestamp values on DG2

  • intel/ds: fix compilation

  • u_trace/anv/iris: drop cs argument for recording traces

  • intel/disasm: add missing handling of <1;1,0>

  • util/debug: add an enable parsing helper

  • util/perf: add options to enable/disable tracepoints

  • anv: move internal RT shaders to the internal cache

  • nir/divergence: handle load_scratch_base_ptr

  • nir/divergence: handle load_ray_num_dss_rt_stacks_intel

  • anv: don’t expose EXT_border_color_swizzle on gfx7

  • intel/validator: validate dst/src types against devinfo support

  • anv: fix index buffer emission

  • vulkan/wsi: fix crash with debug names on swapchain

  • intel/dev: add a helper to update memory info

  • intel/dev: add a fallback when memory regions are not available

  • anv: limit RT writes to number of color outputs

  • anv: disable perf queries on non RCS engines

  • anv: make apply_pipeline_layout/compute_push_layout visible to NIR debug

  • anv: silence border color swizzle debug message

  • clc: add new feature options for intel_clc

  • intel/clc: enable fp16 & subgroups for GRL

  • anv: disable non uniform indexing of UBOs

  • anv: remove device features handling now in runtime

  • anv: use the right helper to invalidate memory

  • intel/fs: make sure memory writes have landed for thread dispatch

  • intel/nir: temporarly disable opt_uniform_atomics for RT/CL

  • anv/utrace: use a bo pool for utrace buffers

  • intel/fs: ray query fix for global address

  • nir/serialize: restore ray query variables

  • isl: add new helper for format component compatibility

  • anv: track if images can be fast clear with non-zero color

  • anv: move gfx12+ CCS_E clear color restriction

  • anv: deal with isl format swizzles for buffer views

  • anv: validate image view lowered storage formats for storage

  • anv: add a comment about handling buffer view swizzles on gfx7

  • intel: protect against empty invalidate ranges

  • anv: skip flush/invalidate faster

  • spirv: switch to uint64 for rayquery internal type

  • anv: ensure tile flush before streamout writes

  • intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it.

  • anv: remove local computation of dynamic states

  • anv: simplify dynamic buffer count in pipeline layout

  • anv: break up anv_pipeline_compile_graphics()

  • anv: split graphics nir loading

  • anv: rename internal function for consistency

  • anv: move CreateGraphicsPipelines to common code

  • anv: move CreateComputePipelines to common code

  • anv: move CreateRayTracingPipelines to common code

  • anv: reorder & document fields of anv_graphics_pipeline

  • anv: name non dynamic state fields correctly

  • anv: fix primitive topology dynamic state emission on gfx7

  • anv: move restart index to gfx state

  • anv: remove pipeline setting dynamic state

  • intel/dev: deal with i915 unallocated_size on smem

  • anv: allocate RT scratch in local memory

  • anv: make use of the new smallbar uAPI

  • anv: decode init batch with INTEL_DEBUG=bat

  • intel/dev: move verification function to a header

  • intel/tools/drm-shim: fixup eu_stride for topology

  • intel/tools/stub: fixup parsing of –platform=

  • intel/dev: add a test verifying that device override works

  • intel/dev: remove INTEL_DEVID_OVERRIDE

  • anv: fixup PIPE_CONTROL restriction on gfx8

  • anv: don’t return incorrect error code for vkCreateDescriptorPool

  • anv: fix GetPipelineExecutableStatistics for ray tracing pipelines

  • anv: fixup assertions on lowered storage formats

  • anv: fix assert in memory budget code when extension is not supported

  • intel/fs: fix load_scratch intrinsic

  • intel/fs: fixup scratch load/store handling on Gfx12.5+

  • intel/fs: fixup SEND validation check on overlapping src0/src1

Liviu Prodea (7):

  • dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32

  • ci/MSVC: LLD is not really used anywhere

  • ci/MinGW: LLD is not really used anywhere

  • d3d12/dzn/spirv2dxil: Require version library

  • Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can’t load it

  • meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover

  • meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don’t depend on it directly

Louis-Francis Ratté-Boulianne (4):

  • d3d12: Move d3d12_nir_lower_vs_vertex_conversion() to a common place

  • microsoft/compiler: Add support for more scaled formats

  • dzn: Allow some non-native formats to be used as vertex inputs

  • dzn: Enable dEQP-VK.pipeline.vertex_input.* test cases

Lucas Stach (37):

  • etnaviv: update headers from rnndb

  • etnaviv: set VIVS_GL_VERTEX_ELEMENT_CONFIG depending on prim type

  • etnaviv: don’t supertile textures if supertiling is disabled via debug option

  • etnaviv: fill all minor GPU features from the kernel

  • etnaviv: update headers from rnndb

  • etnaviv: correct bits per tile and clear value for HALTI5

  • etnaviv: use new PE pipe address states on >= HALTI0

  • etnaviv: use feature flag to determine which RS states to use

  • etnaviv: use feature bit for one const src per instuction limitation

  • etnaviv: clean up tiling setup in etna_compile_rs_state

  • etnaviv: add support for big tile RS states

  • etnaviv: properly set additional DEC400 compression states

  • etnaviv: use feature bit to check for big tile support

  • etnaviv: add tile size helper

  • etnaviv: reinstate accidentially commented assert

  • etnaviv: improve single buffer setup

  • etnaviv: add linear PE support

  • etnaviv: switch depth mode when linear PE is used

  • etnaviv: expose real map buffer alignment

  • etnaviv: allow mapped buffers during execution

  • etnaviv: drm: always use hash to track BO index

  • etnaviv: drm: don’t cache mmap offset

  • etnaviv: drm: make etna_bo_map thread safe

  • etnaviv: drm: rename etna_drm_table_lock

  • etnaviv: add function to get resource status

  • etnaviv: rework resource status tracking (again)

  • etnaviv: drop scanout assumption for resources allocated via modifier interface

  • kmsro: add ‘imx-lcdif’ driver support

  • etnaviv: move dummy BOs to screen

  • etnaviv: add texture descriptor suballocator

  • etnaviv: tex_desc: make error handling more consistent

  • etnaviv: tex_desc: remove descriptor patch TODO comment

  • etnaviv: fix use after free in async shader compile

  • etnaviv: remove etna_align_up() function

  • etnaviv: move checking for MC2.0 for TS into screen init

  • etnaviv: use linear PE rendering only on properly aligned surfaces

  • etnaviv: add debug option to disable linear PE feature

M Henning (7):

  • nouveau/nir: Allow up to 6 nested joins

  • nouveau: Emit cctl to flush L1 cache for atomics

  • nouveau: Skip cctl for atomic counters in tgsi

  • nouveau/nir: Split fewer 64-bit loads

  • ttn: Make ttn_optimize_nir match gl_nir_opts

  • nv50: Fix tlsSpace alignment

  • nouveau: Drop C++03 compat code

Manas Chaudhary (2):

  • panvk: Add PANVK_DEBUG_DUMP dump mappings

  • panvk: Allow PAN_BO_INVISIBLE to be mapped with dump flag

Manuel Stoeckl (4):

  • dri_interface: add XBGR16161616 format

  • gbm: add GBM_FORMAT_[AX]BGR16161616

  • dri: fix incorrect fourcc for R16G16B16A16_UNORM format

  • dri: add [ax]bgr16161616 to format lookup tables

Marcin Ślusarz (40):

  • anv: invalidate all metadata in anv_nir_lower_ubo_loads

  • intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader

  • intel/compiler: invalidate metadata in brw_nir_initialize_mue

  • compiler: add VARYING_SLOT_CULL_PRIMITIVE

  • intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh

  • intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup

  • anv: update task/mesh distribution with the recommended values

  • intel/compiler: print name of the unhandled intrinsic

  • anv: disable streamout before emitting mesh shading state

  • intel/compiler: implement primitive shading rate for mesh

  • anv: handle primitive shading rate for mesh

  • intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu

  • intel/tools: clean up mmio regs definitions

  • intel/tools: drop wrappers around mmio regs macros

  • intel/tools/aubinator: add support for 2 “new” subopcodes

  • intel/tools/aubinator: list all platforms in help message

  • intel/tools: add macros for gfx12+ variant of VCSUNIT0

  • nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis

  • nir: add load_task_payload intrinsic to nir_divergence_analysis

  • intel/tools: fix 32-bit build

  • anv: remove redundant code calculating dynamic states mask

  • anv: remove invalid copy/pasted comment

  • anv: mask out not applicable state flags when setting up mesh pipeline

  • intel/dev: fixup max_cs_workgroup_threads after hwconfig is read

  • nir/opt_load_store_vectorize: handle task payloads

  • intel/compiler: vectorize task payload loads/stores

  • nir/lower_task_shader: insert barrier before/after shared memory read/write

  • intel/compiler: assert that base is 0 for [load|store]_shared intrins

  • intel: fix typos found by codespell

  • intel/common: allocate space for at least one task urb

  • intel/compiler: adjust task payload offsets as late as possible

  • anv: disable injection of primitive shading rate for mesh

  • intel/compiler: print shaders after nir_remove_unused_varyings

  • nir: convert unused mesh outputs to shared memory

  • intel/compiler: emit URB fences for TASK/MESH

  • intel/compiler: there are 4 types of fences on gfx >= 12.5

  • intel/compiler: insert URB fence before task/mesh termination

  • intel/compiler: use NIR_PASS more

  • intel/compiler: fix mesh urb write regression

  • anv: disable task redistribution

Marek Olšák (311):

  • gallium: move drivers/radeon/* into drivers/radeonsi/

  • gallium: move radeon_winsys.h into gallium/include/winsys/

  • gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h

  • gallium/ddebug: implement pipe_vertex_state callbacks

  • nir: fix an uninitialized variable valgrind warning in nir_group_loads

  • ac/surface: fix an addrlib race condition on gfx9

  • radeonsi: set more state bits for disabled binning

  • radeonsi: don’t compile monolithic PS for dual source blending

  • radeonsi: add an SQTT workaround for chips with disabled RBs

  • radeonsi: set PA_SC_EDGERULE according to bottom_edge_rule state

  • radeonsi: initialize line stipple config registers

  • radeonsi: fix ALPHA_IS_ON_MSB determination

  • radeonsi: handle an out-of-bounds sampler/image index better

  • radeonsi: don’t treat skipped GS copy shader compilation as successful

  • radeonsi: dump shaders after si_shader_binary_upload to fix printed LDS stat

  • radeonsi: ignore AMD_DEBUG=nodcc if the texture is imported

  • radeonsi: run nir_io_add_const_offset_to_base for TES/TCS as late optimizations

  • radeonsi: scalarize IO instructions

  • radeonsi: add pipe_stream_output_info into si_shader_context

  • radeonsi: use nir_shader_get_entrypoint in si_nir_scan_shader

  • radeonsi: apply key.ge.opt.kill_{outputs,pointsize,clipdistance} in NIR

  • radeonsi: gather pipe_stream_output_info from NIR intrinsics

  • radeonsi: move most “info” fields from si_shader_selector into si_shader_info

  • radeonsi: move si_shader_info::stage into si_shader_selector

  • ac: add ac_nir_optimize_outputs, a NIR version of ac_optimize_vs_outputs

  • radeonsi: assign param export indices before compilation

  • ac/llvm: remove LLVM pass ac_optimize_vs_outputs

  • amd: document chips

  • ac/surface/tests: generalize and extend gfx10 tests

  • ac: add more non-shadowed registers to the lists

  • ac/llvm: add AC_WAIT_EXP for ac_build_waitcnt

  • ac/llvm: remove immoffset parameter from ac_build_tbuffer_store

  • ac/llvm: remove immoffset parameter from ac_build_tbuffer_load

  • ac/llvm: remove inst_offset parameter from ac_build_buffer_store_dword

  • ac/llvm: use soffset correctly in ac_build_buffer_load

  • ac/llvm: remove inst_offset parameter from ac_build_buffer_load

  • ac/llvm: remove immoffset parameter from ac_build_tbuffer_load_byte/short

  • ac/llvm: remove unused code from ac_llvm_build.c/h

  • ac/llvm: don’t use tbuffer_store as a fallback for swizzled stores

  • ac/llvm: rename tbuffer -> buffer where tbuffer instructions are not generated

  • ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c

  • ac/gpu_info: remove old and unused fields from radeon_info

  • ac/llvm: remove unused LLVM helpers

  • ac/llvm: don’t create the target machine if the LLVM processor is unsupported

  • radeonsi: remove an obsolete comment about LLVM 14

  • radeonsi: fail to create pipe_screen if LLVM doesn’t support the GPU

  • winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen

  • radeonsi/ci: add polaris11 baseline

  • util: add pipe formats R10G10B10X2_SINT, B10G10R10X2_SINT, G16R16_SINT

  • util: change util_format_snorm8_to_sint8 -> snorm_to_sint for radeonsi

  • ac/gpu_info: rework how num_se is derived

  • ac: remove really_needs_scratch, parameter from ac_parse_shader_binary_config

  • ac/llvm: set the correct cache policy for sparse buffer loads

  • ac/llvm: don’t set ac_dlc in visit_intrinsic

  • ac: support GR channel order in ac_choose_spi_color_formats

  • ac,radeonsi: report SCALED formats as unsupported by samplers and color buffers

  • radeonsi: always print the lower-case GPU name for radeonsi-run-tests.py

  • radeonsi: cleanup local includes

  • radeonsi: align the tessellation ring address to 2MB

  • radeonsi: remove unused si_cs_shader_state::uses_scratch

  • radeonsi: simplify setting COMPUTE_PGM_RSRC2.SCRATCH_EN

  • radeonsi: code style fix

  • radeonsi: add si_can_use_compute_blit

  • radeonsi: don’t report R64_*INT as a sampler format because it doesn’t work

  • radeonsi: don’t allow broken compute image copies into A8R8_UNORM

  • radeonsi: add color buffer support for 2_10_10_10 pipe formats

  • radeonsi: add color buffer support for R9G9B9E5_FLOAT on gfx10.3

  • radeonsi: implement compute_copy_image between 1D_ARRAY and other texture types

  • radeonsi: cosmetic changes in si_compute_copy_image

  • radeonsi: assert format support in si_compute_copy_image

  • radeonsi: don’t expose formats that CB supports, but not shaders

  • radeonsi: rename si_test_blit -> si_test_image_copy_region

  • radeonsi/test_image_copy: remove whole image copies

  • radeonsi/test_image_copy: remove linear and power-of-two size testing

  • radeonsi/test_image_copy: remove special SDMA codepaths

  • radeonsi/test_image_copy: simplify texture size generation

  • radeonsi/test_image_copy: separate generating a random pipe_resource

  • radeonsi/test_image_copy: separate printing pipe_resource into a function

  • radeonsi/test_image_copy: test all interesting texture targets

  • radeonsi/test_image_copy: test copying a block-compressed format

  • radeonsi/test_image_copy: test mipmaps

  • radeonsi/test_image_copy: test all formats

  • radeonsi/ci: update sienna_cichlid results

  • radeonsi: handle compressed formats in si_compute_copy_image

  • radeonsi: improve the subsampled copy_image calculation

  • radeonsi: remove compressed and subsampled gfx copy from resource_copy_region

  • radeonsi: remove si_create_sampler_view_custom and related code

  • radeonsi: remove si_create_surface_custom

  • mesa: consider the sample count when choosing a texture format

  • frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers

  • amd/registers: hardcode GC base offsets in the json generator

  • amd/registers: add gfx11 to the json generator

  • amd/registers: add gfx11.json

  • amd/registers: add gfx11-rsrc.json

  • amd: enable gfx11 in header generator, fix drivers with renamed gfx6-10 defs

  • amd: add gfx11 to packet definitions

  • amd: import gfx11 addrlib

  • amd: add Mesa-only addrlib changes

  • ac: don’t align VGPRs to 8 or 16 for gfx11

  • ac: implement ac_get_tbuffer_format for gfx11

  • ac: scratch buffer register changes for gfx11

  • ac: implement register shadowing for gfx11

  • ac/llvm: export mrt0 instead of null on gfx11

  • ac/llvm: don’t set DLC on gfx11 because it means something else there

  • ac/llvm: update pknorm and waitcnt for gfx11

  • ac/llvm: don’t set GLC for stores on gfx11

  • ac/gpu_info: set cu_mask correctly for gfx11

  • ac/surface: gfx11 changes

  • ac/surface: define gfx11 modifiers

  • ac/surface: add gfx11 support to modifiers tests

  • radeonsi/gfx11: increase the hw screen offset alignment

  • radeonsi/gfx11: always allow DCC stores

  • radeonsi/gfx11: expect packed threadID VGPRs

  • radeonsi/gfx11: enable NGG-only draw paths

  • radeonsi/gfx11: enable arbitrary DCC format reinterpretation

  • radeonsi/gfx11: use the new TCS WaveID SGPR to compute vs_rel_patch_id

  • radeonsi/gfx11: program inst_pref_size for graphics

  • radeonsi/gfx11: buffer descriptor changes

  • radeonsi/gfx11: add CB deltas

  • ac,radeonsi/gfx11: remove FMASK loads

  • ac,radeonsi/gfx11: set SWIZZLE_ENABLE correctly

  • radeonsi/gfx11: don’t set non-existent SPI_SHADER_USER_DATA_VS_x

  • radeonsi/gfx11: don’t set COMPR for exports, use 0x3 channel mask instead

  • radeonsi/gfx11: implement attributes through memory

  • radeonsi/gfx11: scattered register deltas

  • radeonsi/gfx11: don’t set non-existent CP_COHER_START_DELAY

  • radeonsi/gfx11: emit SQ_NON_EVENT for tessellation at the end of IBs

  • radeonsi/gfx11: don’t use FLUSH_AND_INV_DB_META

  • radeonsi/gfx11: TF_RING_SIZE changed to a per-SE size

  • radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order

  • radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS

  • radeonsi/gfx11: change LDS allocation granularity for PS

  • radeonsi/gfx11: don’t count the non-existent scratch_byte_offset SGPR

  • radeonsi/gfx11: resolve MSAA using u_blitter

  • radeonsi/gfx11: mark streamout as unimplemented for now

  • radeonsi/gfx11: limit CP DMA to max 32KB sizes

  • radeonsi: inline si_cp_dma_prefetch in si_draw_vbo for lower overhead

  • radeonsi/gfx11: add a workaround for CB perf counters

  • amd: add chip identification for gfx1100-1103

  • radeonsi: remove tautologies when setting CU_EN

  • amd: rename ring_type –> amd_ip_type and match the kernel enum values

  • amd: rename RING_* enums to AMD_IP_*

  • amd: replace num_rings[type] with ip[type].num_queues

  • ac/gpu_info: simplify HW IP querying

  • ac/gpu_info: remove dead GDS query code

  • ac/gpu_info: print all IP versions reported by the kernel

  • ac/gpu_info: fix incorrect IP versions reported by the kernel

  • radeonsi: report 10_10_10_2 scaled formats as unsupported on gfx6-9 as well

  • radeonsi: report more missing/broken texture formats as unsupported on gfx6-9

  • ac/llvm: implement memory_barrier_tcs_patch properly to fix LLVM 15

  • radeonsi: wait before s_barrier in TCS epilog to fix LLVM 15

  • ac/llvm: inline emit_membar

  • ac/llvm: replace LLVMBuildFence with ac_build_waitcnt(VSTORE)

  • ac/llvm: add gl_shader_stage parameter into ac_build_s_barrier

  • ac/llvm: move the gfx6 optimization for TCS barriers into ac_build_s_barrier

  • ac/llvm: fix the remaining s_barriers for LLVM 15

  • radeonsi: add si_get_tcs_epilog_key

  • ac/llvm: skip s_barrier if tess patches don’t cross a wave boundary

  • radeonsi: inline si_nir_build_llvm

  • ac/llvm: remove ac_shader_abi::emit_outputs

  • radeonsi: cleanups getting rid of the ambigous “epilogue” word

  • radeonsi: get rid of the ambiguous “prologue” word

  • radeonsi: try to group stage-specific code in si_llvm_translate_nir

  • ac: replace 5 ac_shader_abi::load_* callbacks with 1 intrinsic_load callback

  • amd: change chip_class naming to “enum amd_gfx_level gfx_level”

  • radeonsi: rename RADEON_FLAG_UNCACHED -> RADEON_FLAG_GL2_BYPASS

  • winsys/amdgpu-radeon: rework how we describe heaps

  • winsys/amdgpu: handle RADEON_FLAG_ENCRYPTED as a heap modifier

  • radeonsi: use PIPE_RESOURCE_FLAG_UNMAPPABLE and DRIVER_INTERNAL more

  • winsys/amdgpu-radeon: clean up code related to heaps

  • include: update amdgpu_drm.h

  • radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE

  • radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache)

  • radeonsi: fix a crash when failing to create a context

  • radeonsi: print an error when failing to create a context

  • radeonsi/gfx11: set BIG_PAGE for the attribute ring

  • ac/gpu_info: silence a valgrind warning due to amdgpu_query_hw_ip_info

  • radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition

  • ac/gfx11: fix the scratch buffer

  • radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export

  • radeonsi/gfx11: don’t insert shader code for GS_PIPELINE_STATS_EMU

  • radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs

  • Revert “radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order”

  • radeonsi/gfx11: fix VM faults due to the attribute ring

  • radeonsi/gfx11: optimize attribute stores

  • radeonsi: merge all preamble states into one

  • radeonsi: move CS preamble emission into the winsys

  • winsys/amdgpu: initialize IB_PREAMBLE in advance

  • radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney)

  • Revert “frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers”

  • Revert “mesa: consider the sample count when choosing a texture format”

  • amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt

  • r300,r600,svga: save the FS constant buffer for u_blitter to fix a regression

  • st/mesa: remove unused GENERIC input and output from the clear VS

  • ac/gpu_info: always retile DCC on gfx10 and newer chips

  • mesa: clamp UINT values greater than INT_MAX for glGetInteger

  • mesa: change GL_MAX_COMPUTE_WORK_GROUP_COUNT to UINT

  • mesa: change GL_MAX_UNIFORM_BLOCK_SIZE to UINT

  • mesa: change GL_MAX_TEXTURE_BUFFER_SIZE to UINT

  • mesa: change GL_MAX_SHADER_STORAGE_BLOCK_SIZE to UINT

  • st/mesa: limit MaxComputeWorkGroupCount to INT_MAX - 1 due to failing tests

  • st/mesa: round down MaxUniformBlockSize to a multiple of 4 for piglit

  • gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to *_BUFFER0_*

  • gallium: add PIPE_CAP_MAX_CONSTANT_BUFFER_SIZE_UINT for UBOs

  • gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT

  • gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> *_UINT

  • radeonsi: increase the max compute LDS size to 64KB for gfx7+

  • radeonsi: report correct maximum compute grid sizes

  • radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly

  • radeonsi: change max TBO/SSBO sizes again and rework max alloc size

  • radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly

  • radeonsi: set the max UBO size same as the max SSBO size

  • winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+

  • radeonsi: print LDS size in bytes

  • radeonsi: don’t use info.gs.invocations if it’s not GS

  • radeonsi: fix uninitialized wait_mem_scratch_tmz

  • radeonsi: remove unused dword from wait_mem_scratch

  • radeonsi: fix polygon stippling without color and Z outputs (v2)

  • radeonsi: restructure PS no-export fixups

  • radeonsi: determine DB_SHADER_CONTROL in si_shader_ps

  • radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug

  • radeonsi: add BREAK_BATCH at the beginning of IBs

  • radeonsi: simplify how pipeline statistic offsets are computed

  • radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS

  • radeonsi: rename and regroup VS_STATE definitions

  • radeonsi: move GS_STATE bits to the end to make space at the beginning

  • radeonsi: rework how vs_state_bits is set and unpacked

  • radeonsi: fix an NGG streamout hang with monolithic shaders

  • radeonsi: fix a crash in gfx10_sh_query_get_result_resource

  • radeonsi: unconditionally enable the streamout overflow query with NGG

  • radeonsi: inline gfx10_emit_streamout_begin/end

  • radeonsi: fix NGG streamout hang by allocating GDS in the right place

  • radeonsi: remove streamout code from shaders if no streamout buffers are bound

  • winsys/amdgpu: add a kernel GDS management workaround retrying on -ENOMEM

  • radeonsi: allocate GDS only once per process

  • radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout

  • ac,radeonsi: don’t export null from PS if it has no effect on gfx10+

  • ac/gpu_info: clamp gart_size_kb and vram_size_kb to fix buggy kernel driver

  • gallium/u_threaded: fix buffer busyness tracking

  • ac/llvm: fix tcs_wave_id unpacking on gfx11

  • amd/gfx11: add PixelWaitSync packet fields

  • radeonsi/gfx11: compile monolithic PS if it writes memory

  • radeonsi/gfx11: allocate more space for pipeline statistics

  • radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_*

  • radeonsi/gfx11: don’t use memory for waiting for cache flushes

  • radeonsi/gfx11: don’t allocate unused wait_mem_scratch

  • radeonsi/gfx11: drop the ES vertex count requirement

  • radeonsi/gfx11: add missing register shadowing code

  • radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout

  • amd: update addrlib - trivial changes

  • ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11

  • radeonsi: fix random PS wave size

  • util: remove our redefinition of assert

  • Change all debug_assert calls to assert

  • util: remove now unused debug_assert

  • ttn: set dest_type for TXQ

  • gallium: add pipe_blit_info::dst_sample, update u_blitter to write that sample

  • ac/gpu_info: rename info fields to num_cu, memory_bus_width, memory_freq_mhz

  • ac/gpu_info: remove tabs

  • ac/gpu_info: rework and extend device info to be more informative

  • ac/gpu_info: use hw_ip::ip_discovery_version to set IP versions

  • ac/gpu_info: use the kernel-reported GFX IP version to set gfx_level

  • ac/gpu_info: remove vram_size and gtt_size in favor of *_kb variants

  • ac/gpu_info: remove redundant uvd_decode

  • ac/gpu_info: remove redundant jpeg_decode

  • ac/gpu_info: remove redundant uvd_encode

  • ac/gpu_info: remove redundant vce_encode

  • ac/gpu_info: remove redundant vcn_encode

  • amd: require amdgpu DRM 3.2.0 from April 2016

  • gallium/radeon: require radeon DRM 2.45.0 from April 2016

  • radeonsi: remove workarounds for radeon DRM < 2.45.0

  • ac/gpu_info: use drm_amdgpu_device_info instead of amdgpu_gpu_info

  • ac/gpu_info: remove amdgpu_gpu_info parameter from ac_query_gpu_info

  • ac/gpu_info: remove unused has_unaligned_shader_loads

  • gallium/radeon: require radeon DRM 2.50.0 (kernel 4.12) from July 2017

  • ac/gpu_info: require amdgpu DRM 3.15.0 (kernel 4.12) from July 2017

  • util/u_threaded: fix a CPU storage crash due to an invalid codepath taken

  • glthread: don’t ignore glPushAttrib/glPopAttrib when tracking GL_CULL_FACE

  • glthread: don’t sync on IsEnabled(GL_DEPTH_TEST) by tracking it in glthread

  • util/format: add util_format_rgbx_to_rgba helper

  • amd/addrlib: fix 3D texture allocation failures on gfx11

  • ac/surface: don’t forbid 256KB swizzle modes on smaller gfx11 chips

  • radeonsi: unify VGT_TESS_DISTRIBUTION programming

  • radeonsi: move SI_MAX_VRAM_MAP_SIZE to si_debug_options.h

  • radeonsi: call pipe->blit instead of util_blitter_blit after MSAA resolving

  • radeonsi: make si_launch_grid_internal static

  • radeonsi: add common helper si_launch_grid_internal_images that is more robust

  • radeonsi: remove compute-based DCC decompression because it’s broken

  • radeonsi: don’t do image stores with RGBX, L, LA, I, and SRGB formats

  • radeonsi: follow shader_info.float_controls_execution_mode (mostly)

  • radeonsi: add need_fmask_expand parameter into si_decompress_subresource

  • radeonsi: check for 16-bit hw support instead of relying on options.fp16

  • radeonsi: move compute-related code from si_blit.c to si_compute_blit.c

  • radeonsi: fold async_copy into the preceding conditional in si_blit

  • radeonsi: cosmetic changes around do_hardware_msaa_resolve

  • radeonsi/gfx11: use correct VGT_TESS_DISTRIBUTION settings

  • radeonsi/gfx11: set SAMPLE_MASK_TRACKER_WATERMARK = 15 and clean up

  • radeonsi/gfx11: use better PRIM_GRP_SIZE_GFX11 setting

  • radeonsi/gfx11: skip code in si_update_shaders that has no effect

  • radeonsi/gfx11: rename si_calc_inst_pref_size -> si_get_shader_prefetch_size

  • radeonsi/gfx11: enable shader prefetch except for initial chip revisions

  • radeonsi/gfx11: use a better workaround for the export conflict bug

  • radeonsi: make various blit functions non-static

  • radeonsi: allow texture_map to upload only 1 sample for MSAA instead of all

  • radeonsi: add a randomized blit test

  • radeonsi: move the no-AA small prim precision cull constant into an SGPR

  • radeonsi: move small prim precision computation out of si_emit_cull_state

  • radeonsi: don’t assume that TC_ACTION_ENA invalidates L1 cache on gfx9

  • glthread: unbind framebuffers in glDeleteFramebuffers

  • glthread: call _mesa_glthread_DeleteBuffers unconditionally

  • ac/surface: disallow 256KB swizzle modes on gfx11 APUs

Marek Vasut (3):

  • dri: Check buffer height and avoid overflow

  • etnaviv: Rename etna_emit_tex() args

  • etnaviv: Implement TXD

Mark Collins (1):

  • tu: Clamp priority in DRM submitqueue creation

Mark Janes (11):

  • intel: parse intel_measure environment without side effects

  • intel: relax assertion in INTEL_MEASURE

  • anv: fix performance bug in INTEL_MEASURE

  • intel: provide a callback to clean up after intel_measure_gather

  • iris: provide a callback to INTEL_MEASURE to clean up snapshots

  • util: define helpful macros for compiler diagnostic features

  • iris: pad all structures used in a shader key

  • iris: reorder to minimize padding

  • intel/compiler: pad all data structures used by shader cache keys

  • intel/compiler: reorder shader cache keys to minimize padding

  • intel/dev: drop warning for unhandled hwconfig keys

Martin Roukala (né Peres) (27):

  • ci/b2c: fix the generation of the IMAGE_UNDER_TEST variable

  • ci/b2c: bump the kernel to 5.17.1

  • ci/b2c: update b2c to enable monitoring support

  • radv/ci: add a Van Gogh dEQP runner

  • ci: add ACO_DEBUG to the list of variables to pass down for testing

  • ci/radv: remove amdgpu.gpu_recovery=0 from the kernel cmdline

  • radv/ci: reboot the DUTs when hitting a ring timeout

  • radv/ci: reduce the execution timeout

  • radv/ci: re-enable fossils testing

  • radv/ci: enable the experimental support for mesh shaders

  • radv/ci: update the list of flakes

  • radv/ci: allow polaris10 to retry more times

  • radv/ci: bump the timeout of vangogh/renoir

  • radv/ci: update the vkcts expectations for the CI machines

  • radv/ci: remove an expected fail from all families

  • radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22

  • ci/valve: do not extract the install tarball on the runner

  • ci/valve: uprev boot2container

  • Revert “ci/valve: uprev boot2container”

  • radv/ci: test vkd3d on kabini

  • ci/radv: move a test from the fail list to the flake list

  • ci/radv: add a missing flake

  • ci/radv: enable vkcts testing on kabini

  • ci/x86: update to llvm 13

  • radv/ci: test the llvm backend on navi21

  • radv/ci: allow specifying a warning pattern

  • radv/ci: do not reboot on soft-recovered hangs, just warn

Matt Coster (20):

  • pvr: csbgen: Formatting pass (PEP-8 plus other minor changes)

  • pvr: csbgen: Add a missing @staticmethod decorator

  • pvr: csbgen: Make some loops more pythonic

  • pvr: csbgen: Cleanup imports

  • pvr: csbgen: Remove “dim” functionality from Group

  • pvr: csbgen: Remove unused function parameters

  • pvr: csbgen: Add typing information

  • pvr: csbgen: Isolate “main” function with __name__ check

  • pvr: csbgen: Replace map() and filter() with list comprehension

  • pvr: csbgen: Fix “local variable may be undefined” lints

  • pvr: csbgen: Add *_unpack() functions for all generated struct types

  • pvr: debug: Implement PVR_DEBUG variable

  • pvr: csbgen: Make all generated enums unambiguous

  • pvr: Add a const qualifier to inline pvr_csb_*() helpers

  • pvr: Remove pvr_cmd_struct() macro

  • pvr: Make pvr_cmd_pack() macro clearly internal

  • pvr: Split out unioned structs from struct pvr_sub_cmd

  • pvr: Implicitly assert that the correct sub-command type is present

  • pvr: Split pvr_dev_addr_t into a separate header

  • pvr: Add helper macros for creating pvr_dev_addr_t instances

Matt Turner (3):

  • mesa: Drop dead #include “sparc/sparc.h”

  • freedreno/ir3: Move the texture array coord fixup to nir

  • freedreno/ir3: Pass 16-bit sampler coordinates when possible.

Matti Hamalainen (7):

  • gallium/tools: fixes to option handling

  • gallium/tools: add option for ignoring junk calls in trace dumper

  • gallium/tools: reimplement tracediff completely in Python

  • pytracediff: add per-line difference highlighting for blocks

  • pytracediff: make -M (“method only”) option print arguments for differing calls

  • pytracediff: change how ‘junk’ calls are handled

  • pytracediff: implement pager (‘less’) invocation internally

Max Kellermann (2):

  • gallium/u_threaded: add missing reference counts for draw_multi slots

  • gallium/u_threaded: fix offset calculation for draw_multi slots

Michael Olbrich (1):

  • crocus: export GEM handle with RDWR access rights

Michael Skorokhodov (2):

  • glsl: Fix ir_quadop_vector validation

  • anv: Update line range

Michel Zou (7):

  • gallium: fix unused symbols warnings

  • meson: call run_command with check=true

  • vulkan/wsi: fix missing unistd include

  • zink: fix pointer size conversion warning

  • lavapipe: allow null handleTypes

  • meson: drop dladdr check on win32

  • util/xmlconfig: fix -Wmaybe-uninitialized warning

Mihai Preda (28):

  • gallium: add opaque pointers shim for LLVM < 8.0

  • gallium: use LLVM opaque pointers in draw_llvm.c

  • gallium/llvmpipe: use LLVM opaque pointers in lp_bld_interp.c

  • gallium/llvmpipe: use LLVM opaque pointers in lp_bld_depth.c

  • gallivm: use LLVM opaque pointers in lp_bld_arit.c

  • gallivm: use LLVM opaque pointers in lp_bld_assert.c

  • gallivm: use LLVM opaque pointers in lp_bld_format_aos.c

  • gallivm: use LLVM opaque pointers in lp_bld_conv.c

  • gallivm: use LLVM opaque pointers in lp_bld_coro.c

  • gallivm: use LLVM opaque pointers in lp_bld_printf.c

  • gallivm: use LLVM opaque pointers in lp_bld_flow.c

  • gallivm: use LLVM opaque pointers in lp_bld_intr.c

  • gallivm: use LLVM opaque pointers in lp_bld_format_s3tc.c

  • gallivm: use LLVM opaque pointers in lp_bld_gather.c

  • gallivm: use LLVM opaque pointers in lp_bld_nir_soa.c

  • gallivm: use LLVM opaque pointers in lp_bld_sample.c

  • gallivm: use LLVM opaque pointers in lp_bld_sample_soa.c

  • gallivm: use LLVM opaque pointers in lp_bld_struct.c

  • gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c

  • gallivm: use LLVM opaque pointers in lp_bld_tgsi_aos.c

  • gallivm: LLVM opaque pointers small changes

  • gallium: refactor a channel loop in draw_llvm.c

  • Revert “gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c”

  • gallium: LLVM-15 contexts use non-opaque pointers

  • gallivm: fix a few llvm non-opaque pointers

  • gallivm: refactor a bit the cache access in view of LLVM opaque pointers

  • gallivm: add lp_build_struct_get() variants that take the LLVM type

  • gallivm: push LLVM version guard into assert

Mike Blumenkrantz (634):

  • nir/lower_tex: fix rect queries with lower_rect set

  • mesa/st: set normalized coords for RECT samplers if rects are unsupported

  • zink: unset PIPE_CAP_TEXRECT

  • zink: radv baseline fix

  • zink: nv ci update

  • zink: clamp max shader images to 32

  • zink: remove tcs patch slot map

  • zink: disable EXT_extended_dynamic_state2 on AMDPRO

  • zink: add baseline for amdpro

  • zink: reject resource creation if format features don’t match attachment

  • zink: use mixed zs renderpass for depth read/write

  • zink: unconditionally set line width on rasterizer state change

  • zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available

  • zink: fix extended restart prim types without dynamic state2

  • zink: do not create fences at all if timeline semaphores are supported

  • zink: require timeline semaphores

  • zink: remove batch lock

  • zink: rename zink_query::batch_id

  • zink: remove driver-based max_fences throttling

  • zink: hook up VK_KHR_create_renderpass2

  • zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available

  • zink: require renderpass2

  • zink: require KHR_imageless_framebuffer

  • zink: delete all non-imageless framebuffer code

  • zink: remove framebuffer indirection

  • zink: rename imageless framebuffer functions

  • zink: remove the worst part of basic framebuffer support

  • zink: remove ZINK_NO_TIMELINES

  • zink: make a kopper debug print into an error

  • zink: rework zink_kopper_update() assert

  • zink: always bind gfx pipeline at the top of draw

  • zink: store min required stride values on the vertex state

  • zink: selectively disable dynamic vertex stride

  • nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims

  • lavapipe: run nir_fold_16bit_sampler_conversions

  • zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo

  • zink: NV_linear_color_attachment

  • zink: nv ci update

  • zink: fix synchronization when drawing from streamout

  • zink: fix xfb counter buffer barriers

  • zink: remove xfb_barrier flag

  • zink: handle device-local unsynchronized maps

  • util/draw: fix map size of indirect buffer in util_draw_indirect_read

  • util/draw: handle draw_count=0 when reading indirect parameters

  • util/draw: fix indirect draw count readback

  • gallium: add PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE

  • zink: unset PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE

  • zink: move the kopper present fence to the displaytarget object

  • wgl: support GL 4.6

  • zink: make unsupported blit formats an error

  • zink: make update_framebuffer_state() public

  • zink: rework clear_depth_stencil hook

  • zink: add a stencil blit fallback path

  • zink: fix tcs control barriers for use without vk memory model

  • zink: fix semantics mask for compute control barriers

  • zink: hook up VK_KHR_vulkan_memory_model

  • zink: use vulkan memory model shader semantics for tcs barriers

  • zink: always use vk mem model for shaders if available

  • zink: fix memory_barrier intrinsic

  • zink: fix stride=0 for dynamic state vertex stride checking

  • zink: add synchronization for buffer clears

  • mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries

  • aux/trace: make get_sparse_texture_virtual_page_size useful

  • glsl/nir: set new_style_shadow for sparse tex ops as necessary

  • zink: fix group memory barrier emission

  • vulkan: bump layer api versions to current vk header version

  • vk/cache: unbreak vk_pipeline_cache_create with flags

  • kopper: add DISPLAY_TARGET bind for depth buffer

  • zink: fix/improve swapchain surface info updating

  • zink: fix up swapchain depth buffer geometry during fb update

  • drisw: remove dead code

  • kopper: always fetch and store drawable info

  • kopper: move drawable geometry updating up in function

  • kopper: store whether screen has dmabuf support

  • kopper: copy a bunch of code for texture_from_pixmap

  • zink: add supported present modes to kopper displaytarget

  • zink: add a util function for finding swapchain resource

  • zink: flag swapchains when updating fails

  • zink: handle dead swapchains in acquire

  • zink: unset image layout when killing swapchain

  • zink: rework swapchain object check for acquires

  • zink: fix return for zink_kopper_acquire

  • zink: handle swapchain acquire failures more directly

  • zink: flag swapchain resources as swapchains

  • zink/kopper: add a mechanism for checking swapchain status

  • zink: radv ci updates

  • ac: remove gallivm header include

  • kopper: add some error logging for pixmap->texture failure

  • kopper: check for modifiers to use modifier functionality

  • zink/kopper: add a function for determining if running on software

  • kopper: check whether zink is using sw

  • kopper: rename a confusing variable

  • kopper: implement texture-from-pixmap for software (non-dmabuf)

  • zink: fix dmabuf plane returns

  • zink: export fd info for all 2d images

  • zink: ignore KMS handle types

  • zink: remove drm_fd

  • llvmpipe: add a ci flake

  • llvmpipe: disable statistic queries during draw if queries are disabled

  • llvmpipe: remove misleading query comment

  • llvmpipe: disable compute statistics queries if queries are disabled

  • wgl: always set alpha on kopper windows

  • zink: hook up VK_EXT_primitives_generated_query

  • zink: pass query object to get_num_queries()

  • zink: pass query object to get_num_query_pools()

  • zink: pass query object to get_num_results()

  • zink: pass screen param to convert_query_type()

  • zink: add and use a function to detected emulated primgen queries

  • zink: add a flag to zink_query to trigger rasterizer discard workaround

  • zink: use VK_EXT_primitives_generated_query when available

  • wgl: don’t auto-load zink before software drivers

  • lavapipe: lower quad_broadcast intrinsics

  • zink: add a ci flake

  • kopper: invalidate drawables when resizing textures in place

  • kopper: pass the current context to dri_flush

  • util/blitter: fix sampler restore with 0 saved samplers

  • radv: emit fewer framebuffer registers

  • zink: restore conditional ordering for query begin/end

  • nir/gather_info: flag fbfetch on subpass image loads

  • llvmpipe: never infer early zs tests when fbfetch is active

  • zink: split renderpasses for TextureBarrierNV() usage

  • zink: scale depth bias by factor of 2

  • zink: pass sparse backing page offset to binding function

  • zink: pass sparse bind bo offset through for texture binds

  • zink: set all usage flags when querying sparse features

  • zink: fix multisample conditional in sparse image query

  • zink: bump number of image binds that can be batched to 50

  • zink: semi-handle 1D sparse texture rewrites for drivers that don’t support them

  • zink: decouple descriptor templates from layouts

  • zink: flatten push descriptor template into normal template array

  • zink: implement indirect buffer indexing

  • zink: delete nir_lower_dynamic_bo_access

  • lavapipe: enqueue pipeline destruction

  • zink: fix null buffer/surface formats

  • zink: add more image usage for null surfaces

  • zink: use descriptor surfaces for notemplates ref updating

  • lavapipe: handle null samplerviews

  • gallivm: fix ssbo oob reads

  • lavapipe: fix CmdPushDescriptorSetWithTemplateKHR with refcounting

  • st/draw_feedback: set constant buffer stride

  • gallivm/draw: fix oob ubo reads

  • llvmpipe: always set ssbo data pointers for draw

  • gallivm: fix oob txf swizzling

  • lavapipe: set robustBufferAccess2 and robustImageAccess2

  • lavapipe: ci updates

  • docs: EXT_robustness2 for lavapipe

  • zink: ci updates

  • kopper: remove unused zink include

  • zink: remove unused headers/struct members/linkage

  • zink: outdent some code

  • zink: add a flag to indicate whether a resource is exportable

  • zink: break out resource bind-adding into separate function for reuse

  • zink: introduce copy contexts

  • zink: create a copy context for the screen on init

  • Revert “zink: export fd info for all 2d images”

  • zink: always check for fb rebinds when starting renderpass

  • zink: unset pipe_resource::next pointers when creating resource copies

  • zink: add a bind flag to indicate a resource is exported as a dmabuf

  • zink: fix conditional for modifier usage

  • zink: force memory exports if dmabuf bind is specified

  • zink: add a LINEAR drm modifier if rebinding to add dmabuf export

  • zink: rebind resources for export as needed

  • zink: create images with modifiers any time there is an import handle

  • zink: fix up sparse texture sampling for shadow samplers

  • zink: clamp out min_lod operands for explicit lod ops

  • zink: fix sparse binding for arrayed textures

  • zink: set sparse flag in cubemap lowering

  • zink: remove misleading sparse comment

  • zink: fix sparse texture depth calcs for arrayed textures

  • zink: remove implicit query resets

  • zink: remove refs from shader images

  • zink: reuse local res pointer in set_shader_images

  • zink: simplify dumb update flagging in set_shader_images

  • zink: read shader image r/w usage from incoming data struct

  • zink: copy incoming shader image struct after doing updates

  • zink: stop leaking shader image surfaces

  • zink: fix 3d shader image miplevel calc for depth

  • zink: directly reuse surface ivci when rebinding

  • gallium/noop: implement pipe_screen::create_fence_win32

  • aux/trace: screen::create_fence_win32

  • zink: add win32 guards for device extensions with win32 in name

  • lavapipe: ignore depth/stencil states if attachment is not present

  • lavapipe: fix pipeline handling for dynamic render zs state with pipeline library

  • spirv: fix barrier scope assert

  • zink: manually validate shaders in debug builds

  • zink: fix init with MESA_SHADER_CACHE_DISABLE enabled

  • zink: use copy context to eliminate dependency on EXT_calibrated_timestamps

  • zink: move get_timestamp to screen

  • zink: add tu baseline for a630

  • zink: add an anv-tgl ci job

  • zink: never create a sw context unless sw is requested

  • zink: only reject non-blittable resource creation if it has no other binds

  • zink: add separate error message for push descriptor set alloc fail

  • zink: add extra validation for resource creation

  • zink: remove input attachment usage when pruning surface usage

  • zink: add extended usage and retry if resource creation fails

  • zink: fix surface/bufferview cache comparisons

  • zink: force render target usage for sampler-only resources

  • zink: clamp 1D_ARRAY sparse textures to 2D as needed

  • zink: handle PIPE_BUFFER sparse texture queries

  • zink: fix non-dynamic vertex stride update flagging

  • zink: add c++ guards for zink_screen.h

  • zink: add ZINK_DEBUG=sync

  • zink: update radv baseline

  • zink: rename a fb surface variables

  • zink: always use zink_batch_rp to init renderpass

  • zink: use zink_batch_no_rp to end renderpasses

  • zink: move renderpass handling to zink_render_pass.c

  • zink: use global image rebind counter for dmabuf export

  • zink: remove fb rebind check during renderpass begin

  • zink: use local variable for getting rt barrier info

  • zink: break out sample location updating to separate function

  • zink: break up swapchain renderpass update to util function

  • zink: remove renderpass state flag for swapchain

  • zink: pass rt attrib to zink_render_pass_attachment_get_barrier_info()

  • zink: fix indentation

  • zink: break out color rt attrib analysis into util function

  • zink: break out zs rt attrib analysis into util function

  • zink: track whether there’s any transient attachments bound

  • zink: lift some renderpass mechanics up a level in the api

  • zink: split out scissor flagging from framebuffer updating

  • zink: use dynamic rendering (most of the time)

  • turnip: fix assert for max xfb outputs

  • zink: only warn once for SRGB fb without KHR_swapchain_mutable_format

  • zink: clamp dynamic render layerCount to 1

  • zink: create all descriptor ref templates at once

  • zink: add screen-based indexing to descriptor sets

  • zink: use descriptor indices in compiler

  • zink: use descriptor set index indirection in program init

  • zink: use descriptor set index indirection in cache state updates

  • zink: flag descriptor sets as changed before updating hashes

  • zink: add a “compact” descriptor mode

  • zink: use XOR for descriptor hash accumulation

  • zink: radv ci updates

  • zink: use a zink_render_pass_pipeline_state bit for fbfetch

  • zink: add renderpass bits for color/depth r/w

  • zink: update radv piglit fails

  • vtn: assert that vector shuffle indices are in-bounds

  • vtn: assert that composite members have the same bit size as the result

  • zink: require draw params ext/cap in ntv if drawid is used

  • zink: emit spirv cap for subgroup vote ops

  • zink: use correct scope for subgroup vote ops

  • zink: fix subgroup vote detection

  • zink: lower subgroup width to 1 for unsupported subgroup vote stages

  • zink: remove first_frame stalling

  • zink: drop wideLines requirement

  • zink: drop largePoints requirement

  • st/pbo_compute: do pbo readback directly to the buffer object if it exists

  • st/pbo_compute: use different calc for non-3d compute buffer sizing

  • st/pbo_compute: make compute download conditional in shader slightly more readable

  • st/pbo_compute: fix z coords for compute pbos

  • zink: only infer modifiers if winsys handle has a stride

  • zink: drop requirement for 6 descriptorsets to use bindless

  • zink: store the real (non-compact) binding usage for programs

  • zink: fix change flagging for compact descriptor cache

  • zink: flag all cached descriptors as needing update on program change

  • zink: add a #define for descriptor compaction

  • zink: make swapchain kill message more descriptive

  • st/texture: use base teximage for compute blits

  • st/texture: allow compute blits for (some) non-finalized textures

  • lavapipe: fix dynamic patchControlPoints

  • mesa: throw a log warning any time a fallback texture is used

  • zink: fix radv piglit fails

  • zink: force queries to start/end out of renderpass

  • zink: tu ci updates

  • zink: reorder pipeline dynamic state1 substruct

  • zink: improve packing for pipeline dynamic state1 info

  • zink: break out dynamic state1 pipeline info to struct

  • zink: use memcmp for comparing dynamic state1 info

  • zink: use dynamic cull mode

  • zink: break out pipeline dynamic state2 into struct

  • zink: use dynamic patch vertices if available

  • zink: expand patch_vertices pipeline key bitsize

  • zink: fix framebuffer attachment usage asserts for dummy attachments

  • aux/trace: fix sampler view dumping

  • aux/trace: implement pipe_screen::is_compute_copy_faster

  • zink: disable compute pbos on turnip

  • egl/wayland: manually swap backbuffer when using zink

  • egl/wayland: skip buffer creation on zink

  • kopper: use get_drawable_info path for non-x11 drawables

  • ci: disable unit tests

  • zink: fix dmabuf plane layout struct scoping

  • zink: represent plane offsets using offset from plane 0 vs size of plane

  • zink: rename a variable

  • zink: handle aux plane imports

  • Revert “zink: remove drm_fd”

  • zink: add back kms handling

  • mesa: improve relocation problem message

  • mesa/st: bump param reservation to 20

  • format_utils: properly parenthesize macro params

  • d3d12: skip time-elapsed piglit tests in ci

  • virgl: add some ci flakes

  • mesa/st: bump param reservation to 28

  • mesa: add statevar for atomic counter offsets

  • mesa: conditionally set constants dirty for atomic counter binds

  • st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible

  • mesa: handle atomic counter lowering for drivers with big ssbo offset aligns

  • zink: remove ANV depth clip control workaround

  • zink: disable EXT_primitives_generated_query on turnip

  • zink: track invalidation for image resources

  • zink: flag renderpass for change if image resource changes valid state

  • zink: rename renderpass attrib value

  • zink: split out a dynamic render ternary

  • zink: hook up surface invalidation to LOAD_OP_DONT_CARE

  • zink: invalidate blit dsts if fully covered

  • zink: remove buffer valid range tracking from blit

  • mesa/streaming-memcpy: add memcpy fallback

  • streaming-memcpy: move to src/util/ and compile unconditionally

  • mesa/st: use util_streaming_load_memcpy for compute pbo copying

  • zink: break out of zs mixed layout update loop when work is done

  • zink: only update layout when doing mixed zs attachment renderpass check

  • aux/trace: add pipe_context::link_shader

  • aux/tc: handle link_shader

  • zink: simplify some compiler bo code

  • zink: fix buffer descriptor hashing

  • zink: always create descriptor templates

  • zink: flag optimization pass after inlining uniforms

  • zink: break out nir alu typing in ntv

  • zink: run nir_lower_alu_to_scalar in optimizer loop for 64bit lowering

  • zink: always scalarize pack/unpack alu ops

  • zink: don’t lower pack/unpack intrinsics

  • zink: run more int64 passes during optimization loop if int64 isn’t available

  • zink: fix ntv partial stores

  • zink: flag all shader i/o variables as !is_xfb

  • zink: fix variable locations in manual xfb emission

  • zink: if a variable is flagged as being xfb, treat it as manual xfb output

  • zink: improve/expand manual xfb emission

  • zink: implement nir_op_unpack_64_2x32 in ntv

  • zink: rewrite 64bit shader i/o as 32bit

  • zink: scalarize when rewriting explicit 64bit io

  • zink: avoid adding more 64bit alus during bo rewriting

  • zink: fix 32bit bo rewriting

  • zink: update tu a630 baseline

  • zink: add new flakes

  • zink: fix feature requirement

  • zink: add feature req for border colors

  • zink: more requirement docs

  • zink: rewrite atomic ssbo intrinsics as atomic derefs

  • zink: run copy_prop_vars during optimization

  • zink: warn on missing customBorderColorWithoutFormat

  • zink: more lavapipe glcts fails

  • zink: cap driver inlining using ssa allocation limit

  • zink: use separate pointer for swapchain presents

  • zink: keep a count of async presents occuring for a given swapchain

  • zink: defer old swapchain destruction

  • zink: add implicit sync workaround for non-mesa drivers

  • zink: unset res->swapchain upon killing a swapchain

  • zink: fix up KILL to a more sensible log message

  • lavapipe: VK_EXT_border_color_swizzle

  • anv: VK_EXT_border_color_swizzle

  • zink: fix dynamic stride conditional in pipeline creation

  • zink: fix generated tcs update path

  • zink: use the patch_vertices value from the shader key, not the context

  • zink: unset generated tcs pointer from tes on deletion

  • zink: fix generated tcs deletion

  • zink: only add necessary binds during dmabuf export

  • zink: avoid uninit values in renderpass state

  • zink: don’t print VK_EXT_shader_atomic_float warning for nir_intrinsic_image_deref_atomic_exchange

  • zink: hook up VK_EXT_non_seamless_cube_map

  • zink: use VK_EXT_non_seamless_cube_map when available

  • zink: rename nonseamless struct members to emulate_nonseamless

  • zink: skip some emulated cube code if the extension is present

  • zink: fix cubemap lowering for array samplers

  • zink: fix cubemap lowering bit size

  • zink: clamp dynamic renderArea to framebuffer size

  • zink: check for pending clears to determine write status of zs attachments

  • zink: add a function for getting the minimum framebuffer layers

  • zink: clamp dynamic fb layers better

  • zink: clamp renderpass layers better

  • zink: anv ci updates

  • zink: use uint64_t for batch_id

  • zink: remove thread_queue

  • zink: use a dynarray for semaphore wait flags

  • zink: add batch api for adding a wait semaphore

  • zink: add semaphore handling for sparse binds

  • zink: detect sparse-only binding queue

  • zink: always bind on the sparse queue

  • zink: only enable vtn debugging if validation is active

  • zink: tu ci baseline update

  • zink: radv baseline updates

  • drisw: probe kms if fd is passed

  • kmsro: add error message on drm ioctl failure

  • zink: rename a variable

  • zink: unify actual map calls in buffer mapping

  • zink: use only the extents for buffer unmap flushes

  • zink: fix buffer transfer flushes with offsets

  • zink: store and reuse memory heap flags for buffer placement

  • zink: move BAR allocation demotion up the stack

  • zink: be a little selective about BAR fallback memory type

  • zink: hook up VK_EXT_border_color_swizzle

  • zink: use VK_EXT_border_color_swizzle

  • zink: print runtime warning on missing EXT_border_color_swizzle

  • mesa/st: add PIPE_CAP_DITHERING

  • zink: unset PIPE_CAP_DITHERING

  • mesa/st: increment num_uniforms for hw accel select shaders

  • zink: move draw-time barrier generation down a little

  • zink: track vertex buffer bind counts on resources

  • zink: rework buffer barrier generation

  • zink: track image binds

  • zink: add a #define for vk shader bits

  • zink: collect gfx stages for all bindings during barrier generation

  • zink: don’t short-circuit gfx stage finding during barrier generation

  • zink: relax zink_resource_buffer_needs_barrier checks

  • zink: ci updates

  • glx/drisw: store the flush extension to the screen

  • glx/drisw: invalidate drawables upon binding context if flush extension exists

  • zink: fix dual_src_blend driconf workaround

  • nir/lower_point_size: apply point size clamping

  • mesa: break out PointSizeIsOne setting to util function

  • mesa: rename PointSizeIsOne -> PointSizeIsSet

  • mesa: skip pointsize exports if pointsize attenuation is enabled

  • mesa: enforce pointsize exports if pointsize is being clamped

  • mesa: explicitly disallow multiple pointsize exports from generating

  • zink: ci updates

  • zink: add flag to indicate if a resource is a dmabuf

  • zink: flag dmabufs for foreign queue transition on flush_resource call

  • lavapipe: zero out blend info if blend isn’t enabled

  • lavapipe: copy more pNexts for pipeline creation

  • lavapipe: fix renderpass info handling during pipeline creation

  • lavapipe: add a pipeline library assert

  • lavapipe: skip post-copy pNext checking during pipeline creation for composites

  • zink: use the bigger of the variable type and interface type for bo sizing

  • zink: track ssbo bind counts

  • zink: track overall samplerview bind counts

  • zink: fix image bind counting

  • zink: track gfx/compute descriptor barrier info

  • zink: use tracked barrier info for generated barriers

  • zink: remove another zink/tu fail

  • zink: fix-ish depth clipping without VK_EXT_depth_clip_enable

  • zink: add a turnip driver workaround for EXT_depth_clip_enable

  • zink: remove swizzle from fbfetch lowering

  • lavapipe: always set point_tri_clip

  • zink: fix kopper_acquire return value

  • zink: track whether current swapchain has data

  • zink: move to AoS for storing swapchain struct data

  • zink: move ‘acquired’ flag onto swapchain struct

  • zink: use better determination for handling swapchain acquire submits

  • zink: remove broken kopper assert

  • zink: remove unnecessary code in kopper readback acquire

  • zink: improve no-op acquire detection

  • zink: flag batch as having work during frontbuffer flush

  • zink: ensure pending present flushes are handled during frontbuffer flush

  • zink: delete zink_resource_object::acquire

  • zink: no-op pixmap frontbuffer flush calls

  • zink: acquire swapchain images on image map

  • zink: use kopper_displaytarget type directly

  • zink: delete zink_resource_object::dt_has_data

  • zink: set PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY for device-local resources

  • lavapipe: break out part of shader optimization into function

  • lavapipe: streamline xfb shader setup

  • nir/types: fix glsl_matrix_type_is_row_major() assert

  • zink: store context flags

  • zink: don’t increment screen->num_contexts for copy context

  • zink: disable turnip traces temporarily

  • lavapipe: don’t remove xfb outputs

  • zink: enforce viewport depth clamping

  • zink: size uniform_0 ubo accurately

  • zink: further improve bo sizing

  • zink: force unsized array in bo creation using the last block member type

  • zink: move/refactor bo_vars struct creation in compiler

  • zink: handle bitsizes in get_bo_vars() analysis

  • zink: don’t run nir_lower_uniforms_to_ubo repeatedly

  • zink: pass zink_shader to optimize_nir during final optimize pass

  • zink: add an optimizer pass to enforce bo bounds

  • zink: use std430 packing by default

  • zink: use a std430 interface for bo types

  • mesa: remove incomplete texture warning

  • mesa/st: make get_sampler_view_format() public

  • mesa/st: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO

  • zink: init driver workarounds earlier in screen creation

  • zink: disable customBorderColorWithoutFormat on turnip

  • zink: don’t warn for missing customBorderColorWithoutFormat on turnip

  • zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO

  • zink: flush pending clears for fb texture barriers

  • mesa: fix SignalSemaphoreEXT behavior

  • lavapipe: don’t overwrite entire VkFormatProperties3 struct

  • lavapipe: don’t crash on null xfb buffer pointer

  • zink: reject swizzled format blits

  • lavapipe: support inlined shader spirv for compute

  • zink: allow multiple tex components for depth tg4

  • zink: always use 32bit sample ops

  • zink: add a compiler pass to match up tex op dest types

  • zink: use sampler_bind_count to simplify some code

  • zink: break out samplerview layout reset code

  • zink: always update sampler descriptor layouts on fb surface unbind

  • zink: always use 32bit floats for so output types

  • zink: handle bare matrix types in xfb emission

  • zink: split xfb block emission from array/matrix handling

  • zink: fix xfb array inlining

  • zink: use right glsl length getter for ntv partial stores

  • zink: bitcast extracted streamout components to uint before creating uvec

  • zink: add a compiler pass to split xfb block outputs

  • zink: fix xfb emit check in compiler

  • mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate

  • mesa: track which sampler wrap params use GL_CLAMP

  • mesa: update GL_CLAMP emulation when binding/unbinding textures

  • zink: remove bogus range tracking from texture clear

  • zink: fix transient attachment rp assert

  • zink: remove format check from clear texture

  • zink: fix coverage check for texture clears

  • zink: stop using u_blitter for texture clears

  • zink: improve zink_clear_depth_stencil check for current attachment

  • zink: massively simplify zink_clear_depth_stencil

  • zink: remove non-renderpass clear path from zink_clear_texture

  • zink: remove u_blitter usage from zink_clear_render_target

  • zink: always use storeOp=STORE for depth renderpass

  • zink: track a bitmask of fb attachments with mismatched layer counts

  • zink: add explicit (awful) handling for fb layer mismatch clears

  • zink: clamp color clear values based on format

  • zink: lift clearing on fb state change up a level

  • zink: remove out-of-renderpass clears

  • zink: delete zink_fb_clear_util_unpack_clear_color

  • zink: delete srgb tracking for clears

  • zink: simplify zink_framebuffer_clear_data union

  • zink: inject a 0,0,0,1 clear for RGBX formats

  • zink: break out tc/trace context unwrapping

  • zink: nv baseline update

  • zink: tu a630 baseline update

  • mesa/st: add implicit zeroing of clipdistance array

  • zink: move descriptor_mode to be a global

  • zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK

  • zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED

  • zink: default to lazy/templated descriptors

  • zink: handle max_vertices=0 in geometry shader

  • zink: handle null samplerview in get_imageview_for_binding()

  • zink: fix crashing on bo allocation failure

  • zink: improve oom error message

  • zink: call bind_last_vertex_stage() when binding vs

  • zink: fix viewport count tracking

  • zink: use gfx_barrier for synchronization when binding buffer descriptors

  • vulkan/wsi: fix multiple acquires for sw without mit-shm

  • llvmpipe: ignore multisample state for alpha_to_one

  • zink: don’t try to start a renderpass before CmdClearAttachments

  • zink: don’t flag renderpass change on depth write change as often

  • zink: flag renderpass change if renderpass clears change

  • zink: do dynamic render implicit clear updates on rp_changed

  • zink: unflag ctx->rp_changed after dynamic render update

  • zink: remove redundant renderpass update check for clears

  • zink: add an assert to verify that deferred clears aren’t added in renderpass

  • zink: split out rp_changed to be more granular for dynamic render

  • zink: break out renderpass attachment load op calc

  • zink: break out zs renderpass layout calc

  • zink: break out color rt layout calc

  • zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info()

  • zink: add a bits union for zink_render_pass_state comparisons

  • zink: improve granularity of renderpass switching

  • zink: allow no-op renderpass updates in zink_batch_rp()

  • zink: move layout-setting for dynamic render attachments after no-op case

  • zink: handle null samplerview/image rebinds more gracefully

  • zink: scale number of shader cache-get threads

  • zink: invoke descriptor_program_deinit for programs on context destroy

  • vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize

  • zink: add env var to abort on device-lost if no reset callback is set

  • util/vbuf: handle multidraws

  • anv: disable dev.i915.perf_stream_paranoid=0 warning

  • lavapipe: propagate xfb info for pipeline library rasterization stages

  • zink: remove deqp fails for lavapipe

  • zink: only flag fbfetch as rp update if fbfetch wasn’t already configured

  • zink: assert !rp_changed after renderpass has started

  • zink: avoid crashing when bindless texture updates aren’t used

  • lavapipe: EXT_non_seamless_cube_map

  • lavapipe: accurately set image/ssbo access based on shader usage

  • lavapipe: implement extreme uniform inlining

  • lavapipe: remove some skips

  • zink: add a ci flake

  • lavapipe: move vertex input dynamic states into vertex input block

  • lavapipe: make binding/layout helper functions static inline

  • lavapipe: use correct offset calculation for flagging ssbo/image write access

  • lavapipe: use 64bit masks for shader access

  • lavapipe: set image access on pipeline bind, not descriptor bind

  • llvmpipe: remove weird clamping for format support with sample counts

  • llvmpipe: allow resource_copy_region to perform replicate operations

  • lavapipe: fix stencil clears

  • llvmpipe: add a shortcut for blitter resolves

  • vulkan: hook up VK_EXT_multisampled_render_to_single_sampled

  • lavapipe: VK_EXT_multisampled_render_to_single_sampled

  • lavapipe: advertise VK_EXT_multisampled_render_to_single_sampled

  • util/blitter: respect PIPE_TEXTURE_RECT

  • lavapipe: fix shader access stage conversion from mesa -> pipe

  • zink: stop ORing unordered barrier access

  • zink: convert ZINK_DEBUG to enum

  • zink: add doc for ZINK_DEBUG=compact

  • zink: add ZINK_DEBUG=noreorder

  • zink: zink_resource::unordered_barrier -> unordered_exec

  • zink: explicitly unset unordered_exec when resource can’t be unordered

  • zink: rework unordered_exec handling

  • zink: add a second resource param to get_cmdbuf() for unordered promotion

  • zink: promote image/buffer copy operations to unordered

  • zink: move buffer/image functions down in file

  • zink: expand unordered_exec

  • zink: make get_cmdbuf() public

  • zink: enable unordered blit/resolve ops

  • virgl: more ci flakes

  • d3dadapter: fix sw pipe loading

  • nine: set view_mask=0

  • zink: fix gfx program cache pruning with generated tcs

  • zink: don’t flag lazy push constant set dirty on batch change

  • zink: combine loops for lazy descriptor program deinit

  • zink: handle mutable swapchain images with dmabuf

  • zink: store VkFormatFeatureFlags on creation

  • zink: use modifier feature flags during surface creation when necessary

  • zink: init cache_put program fence on program creation

  • zink: force flush clears on fb change if fb geometry changes

  • zink: force a new framebuffer for clear_depth_stencil if the clear region is big

  • zink: drop mode_changed check from linewidth/depthbias draw updates

  • zink: handle unscaled depth bias from nine

  • zink: handle !half_pixel_center

  • zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE

  • nine: check return on resource_get_handle

  • zink: demote dmabuf tiling to linear if modifiers aren’t supported

  • zink: use modifier_aspect to check for modifier plane in zink_resource_get_param

  • zink: don’t add modifiers if EXT_image_drm_format_modifier isn’t present

  • zink: add all format modifiers when adding for dmabuf export

  • zink: don’t fixup sparse texops

  • vulkan: Update the XML and headers to 1.3.224

  • radv: fix return type for meta resolve shaders

  • nir/validate: clamp unsized tex dests to 32bit

  • mesa: fix blending when using luminance/intensity emulation

  • mesa: require render target bind for A/L/I in format selection

  • vk/render_pass: don’t deref null resolve attachments

  • zink: handle nir_intrinsic_sparse_residency_code_and mechanics

  • zink: ignore nir_texop_lod for tex dest matching

  • tu: fix invalid free on alloc failure

  • zink: don’t call util_queue_fence_init in zink_screen_get_pipeline_cache()

  • zink: don’t emit entrypoints for function temp variables

  • zink: check the variable mode before taking samplemask path in ntv

  • zink: explicitly use unsigned types for bit shifts

  • zink: only add srgb mutable for images with modifiers

  • zink: don’t emit illegal interpolation

  • llvmpipe: don’t assume pipe_context is always available in flush_frontbuffer

  • zink: flag all assigned output slots as mapped

  • zink: handle split acquire/present

  • st_pbo/compute: fix 1D_ARRAY offsets

  • st_pbo/compute: fix 1D coord dimension by pre-trimming vectors

Mike Lothian (1):

  • util: Fix invalid usage of alignas in u_cpu_detect.c

Mykhailo Skorokhodov (3):

  • anv: Use sampleLocationsEnable for sample locations

  • iris: Move Wa_1806527549 and enable by default

  • anv: Move Wa_1806527549 and enable by default

Nagappa Koppad, Basanagouda (1):

  • iris:Duplicate DRM fd internally instead of reuse.

Nanley Chery (11):

  • iris: Return a 64B stride for clear color plane

  • intel/isl: Add a score for I915_FORMAT_MOD_4_TILED

  • isl,iris: Add DG2 CCS modifier support for XeHP

  • mesa: Simplify UNORM8 conversions for sRGB ASTC textures

  • mesa: Follow ASTC decode mode extension for RGBA8 output

  • iris: Don’t leak surface states for compressed resources

  • iris: Don’t leak compressed resources in iris_create_surface

  • iris: Use fill_surface_states for compressed resources

  • anv: Make the D16 reg mode single-sampled

  • iris: Make the D16 reg mode single-sampled

  • iris: Dedent enum iris_depth_reg_mode

Nicolas Caramelli (1):

  • egl: Fix EGL_EXT_platform_xcb name string to match the registry

Panagiotis Apostolou (1):

  • util: Don’t block SIGSEGV for new threads

Patrick Lerda (1):

  • panfrost: Fix unwanted valgrind message related to restart_index

Paulo Zanoni (4):

  • iris: fix race condition during busy tracking

  • intel/compiler: compute int64_options based on devinfo->has_64bit_int

  • intel/compiler: split handling of 64 bit floats and ints

  • intel/compiler: call ordered_unit() only once at update_inst_scoreboard()

Pavel Asyutchenko (7):

  • gallium: add PIPE_CAP_FBFETCH_ZS and expose extension

  • glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil

  • nir_to_tgsi: Don’t count ZS fbfetch vars as outputs

  • llvmpipe: enable per-sample shading when FB fetch is used

  • llvmpipe: simplify early/late zs tests selection

  • llvmpipe: implement FB fetch for depth/stencil

  • llvmpipe: enable PIPE_CAP_FBFETCH_ZS

Pavel Ondračka (13):

  • r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader

  • r300: skip draws instead of using a dummy vertex shader

  • r300: fix vertex shader control flow in loops

  • r300: deduplicate common NIR options

  • r300: be less agresive with copy propagate in loops

  • r300: don’t try to use inline constants instead of constant swizzles

  • r300: Update list of RV515 dEQP failures and add some flakes

  • r300: merge simple movs with constant swizzles together

  • r300: expose PIPE_CAP_LEGACY_MATH_RULES

  • r300: only run merge_movs pass on R500

  • r300: Keep rc_rename_regs() from overflowing RC_REGISTER_MAX_INDEX

  • r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF

  • r300: fix variables detection for paired ALU and TEX instructions in different branches

Pierre-Eric Pelloux-Prayer (81):

  • radeonsi: make si_pc_emit_shaders non static

  • radeonsi: add helpers to deal with spm counters

  • radeonsi: add spm counters setup code

  • radeonsi/sqtt: setup spm bo

  • radeonsi/sqtt: enable spm counters on gfx10+

  • radeonsi: make sure profile_peak is used before using sqtt

  • ac/spm: setup write broadcasting correctly

  • ac/surface: adjust gfx9.pitch[*] based on surf->blk_w

  • radeonsi: remove commutative_blend_add option

  • radeonsi: drop assume_no_z_fights option

  • drirc: remove i965 entry

  • gallium: add PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER

  • radeonsi: enable PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER

  • radeonsi: fix VS kill_outputs handling

  • radeonsi: don’t always strip NIR debug infos

  • radeonsi: generate unique shader name in si_get_nir_shader

  • amd: update gfx10_format_table.py for gfx11

  • radeonsi/gfx11: add assert in legacy vs path

  • radeonsi/gfx11: register changes

  • radeonsi/gfx11: image descriptor changes

  • radeonsi/tests: allow empty line and comments in csv files

  • radeonsi/tests: add a heuristic to pick the baseline

  • radeonsi/tests: reformat with black

  • u_threaded: clear non-async debug callback correctly

  • radeonsi: flush VGT streamout like PAL

  • radeonsi: implement pipeline stats workaround

  • gallium: add a union to access queries counters

  • gallium: always return the correct pipeline stats index

  • gallium: deduplicate target -> stats index code

  • radeonsi: deduplicate query offsets

  • radeonsi: store the pipeline stats index

  • radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders

  • radeonsi/test: update sienna_cichlid results

  • radeonsi/query: use the qbo correct size

  • radeonsi/ngg: reuse the pipeline stats buffer when using atomics

  • radeonsi: fix gs_invocation query with NGG

  • radeonsi: don’t use wave32 for GE on gfx10 if culling is used

  • amd: fix ac_build_mbcnt_add in wave32 mode

  • util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster

  • radeonsi: remove unused params from si_shader_select_with_key

  • radeonsi: scale the number of shader compiler threads

  • radeonsi: store shader variants in an array

  • glx: set max values for pbuffer width / height

  • radeonsi: fix glTexBuffer max size handling

  • radeonsi: wait for PS idle in si_set_framebuffer_state

  • radeonsi/tests: add a –slow option

  • radeonsi/tests: use a smaller tests-per-group value

  • radeonsi: don’t use sel->nir in si_check_blend_dst_sampler_noop

  • ac: export LLVM c++ ResetCommandLineParser method

  • ac: reset LLVM command line parser

  • ac/llvm: add use_waterfall_for_divergent_tex_samplers option

  • radeonsi: enable use_waterfall_for_divergent_tex_samplers

  • st/mesa: use mutex in st_get_texture_sampler_view_from_stobj

  • gallium: update winsys_handle documentation

  • gallium: add size attribute to winsys_handle

  • va/surface: set the correct size in vaExportSurfaceHandle

  • radeonsi: set size in si_texture_get_handle

  • tradeonsi: fix preamble state producing incorrect packets

  • radeonsi: prevent recursion in si_decompress_dcc

  • radeonsi: add helper to use si_screen::aux_context

  • radeonsi: use helpers to access si_screen::aux_context

  • Revert “winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+”

  • dri: store internal_format when known

  • mesa/st: don’t guess the internal format if it’s known

  • util/blob: use memcpy in read functions

  • mesa: skip draws with invalid indices offset

  • vulkan/wsi: define pWaitDstStageMask in the blit submission

  • ac: use LLVMContextSetOpaquePointers if available

  • radeonsi: use LLVMBuildLoad2 where possible

  • ac: add per output is_16bit flag to ac_shader_abi

  • ac/llvm: handle opaque pointers in visit_store_output

  • ac/llvm: use LLVMBuildLoad2 in visit_load

  • radeonsi: use LLVMBuildLoad2 in llvm PS

  • radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads

  • radeonsi: check last_dirty_buf_counter and dirty_tex_counter

  • amdgpu/bo: update uses_secure_bos when importing buffers

  • nir: add a nir_opt_if_options enum

  • radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot

  • radeonsi: use nir_opt_large_constants earlier

  • mesa: avoid reading back textures from VRAM

  • radeonsi: invalidate L2 when using dcc stores

Qiang Yu (63):

  • radeonsi: lower nir_intrinsic_sparse_residency_code_and

  • mesa/program: fix nir output reg overflow

  • mesa/vbo: remove unused vbo_context->binding

  • nir/builder: add load/store array variable helper functions

  • mesa: add hardware accelerated select constant

  • mesa: add _mesa_bufferobj_get_subdata

  • mesa: refine name stack code to prepare for hw select

  • mesa: add hw select name stack code path

  • mesa/vbo: enclose none-vertex functions with HW_SELECT_MODE

  • mapi: add api setup header for hw select mode

  • mesa: set CurrentServerDispatch too when glBegin/End

  • mesa: add HWSelectModeBeginEnd dispatch table

  • mesa: pass select result buffer offset as attribute/varying

  • mesa/st: implement hardware accelerated GL_SELECT

  • zink: reset PIPE_CAP_ACCELERATED when cpu soft rendering

  • virgl: return -1 for PIPE_CAP_ACCELERATED

  • mesa: enable HardwareAcceleratedSelect

  • ac/llvm: get back nir_intrinsic_load_tess_rel_patch_id_amd

  • nir: add nir_intrinsic_load_lshs_vertex_stride_amd

  • ac/nir: use nir_intrinsic_load_lshs_vertex_stride_amd

  • radeonsi: implement load_lshs_vertex_stride abi

  • radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader

  • ac/nir: skip gl_Layer/gl_ViewportIndex write for LS

  • radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones

  • gallium: disable hardware select for crocus

  • ac/llvm: conditionally check wave id in gs sendmsg

  • radv: no need to do gs_alloc_req for newer chips in ngg vs/tes

  • radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop

  • radeonsi: use si_shader as parameter in si_get_nir_shader

  • radeonsi: support multi stage shader state creation in nir shaderlib

  • radeonsi: add si_create_passthrough_tcs

  • radeonsi: replace llvm based fixed tcs with nir

  • radeonsi: implement nir_intrinsic_load_tcs_num_patches_amd

  • ac/nir: remove unused parameter in tes input lower

  • ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd

  • ac/nir: use nir_intrinsic_load_hs_out_patch_data_offset_amd in tess lower

  • ac/nir: add pass_tessfactors_by_reg param to hs output lower

  • ac/nir: add wave_size parameter to ac_nir_lower_hs_outputs_to_mem

  • radeonsi: preload tess offchip ring for tcs

  • radeonsi: implement nir_intrinsic_load_ring_tess_offchip(_offset)_amd

  • radeonsi: implement nir_intrinsic_load_tess_rel_patch_id_amd for both tcs and tes

  • ac/llvm: handle write mask for nir_intrinsic_store_buffer_amd

  • ac/nir: add no_input_lds_space param to hs output lower

  • radeonsi: replace llvm tcs output with nir lower pass

  • radeonsi: enable PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS

  • radeonsi: set uses_vmem_load_other for nir_intrinsic_load_buffer_amd

  • radeonsi: replace llvm tes input load with nir lowering

  • ac/llvm: fix code format alignment in visit_load_local_invocation_index

  • radeonsi: implement two esgs ring nir intrinsic

  • ac/nir: skip gl_ViewportIndex and gl_Layer write in ES

  • radeonsi: set lds for gs/es to handle nir shared memory load/store

  • ac/nir: remove unused param num_reserved_es_outputs from gs input lower

  • ac/nir: change es output lower param to esgs_itemsize

  • radeonsi: replace llvm es output with nir lowering

  • ac/nir: add triangle_strip_adjacency_fix to gs input lower

  • ac/llvm: get back nir_intrinsic_load_gs_vertex_offset_amd

  • radeonsi: replace llvm gs input handle with nir lowering

  • ac/nir/ngg: fix nogs culling scratch size

  • ac/nir/ngg: add a barrier before prim id export

  • ac/nir/ngg: Decouple primitive ID store and primitive export.

  • nir/lower_gs_intrinsics: fix primitive count for points

  • radeonsi: fix tcs_out_lds_offsets arg alignment

  • winsys/amdgpu: fix non-page-aligned sparse buffer creation

Rajnesh Kanwal (29):

  • pvr: Remove double error reporting in error path.

  • pvr: Change phase_rate_change type to bool from uint32_t.

  • pvr: Setup common nop shader.

  • pvr: Add services winsys transfer context support.

  • pvr: Add support to create transfer context and setup required shaders.

  • pvr: Add vk_sync support and remove service winsys syncobjs interface.

  • pvr: Fix argument comments in render job submission.

  • pvr: Pass device info struct in pds generator.

  • pvr: Add support for R32G32B32A32_UINT, R32_UINT and R8_UINT formats.

  • pvr: Allow signal_sync pointer to be NULL in job submission.

  • pvr: Remove vk_format_is_pure_integer and use common helper.

  • pvr: Add transfer 3d parameter heap support.

  • pvr: Fix ISP_RENDER_MODE_TYPE enum in rogue_cr.xml

  • pvr: Restrict argument to dev_info where possible.

  • pvr: Fix UTIL_ARCH_BIG_ENDIAN check in vk_format.h.

  • pvr: Change ALIGN macro to ALIGN_ATTR.

  • pvr: Add services winsys transfer cmd submit interface.

  • pvr: Move transfer logic to pvr_job_transfer.[ch]

  • pvr: Remove clang-format off comment from vk_format.h

  • pvr: Implement vkCmdDraw API.

  • pvr: Init and store default sampler in pvr_device.

  • pvr: Use common vk_buffer implementation.

  • pvr: Implement vkCreateBufferView and vkDestroyBufferView.

  • pvr: Move binding related checks in common code.

  • pvr: Add support for sampler descriptor.

  • pvr: Add support for combined image and sampler descriptor.

  • pvr: Add support for image descriptor.

  • pvr: Add support for storage and uniform texel descriptor.

  • pvr: Add support for input attachment descriptor.

Renato Pereyra (4):

  • venus: add support for vk_xwayland_wait_ready

  • venus: Add driconf option to enable implicit fencing

  • Revert “wsi/x11: Don’t leak xcb_get_geometry_reply_t.”

  • Revert “wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes”

Rhys Perry (94):

  • aco: fix disassembly of SMEM with both SGPR and constant offset

  • ac/nir: properly handle large global access constant offsets

  • nir/gather_info: fix system_value_read for rt/mesh system values

  • nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper

  • radv: remove unnecessary align_mul/offset initialization

  • radv: initialize meta shader options earlier

  • radv: use radv_meta_build_nir_vs_generate_vertices() more

  • radv: use nir_op_imm helpers

  • radv: use inot(a) instead of ine(a, true)/ieq(a, false)

  • ac/nir: fix 64-bit NGG GS output stores

  • radv: always use nir_lower_io_lower_64bit_to_32

  • nir/copy_prop_vars: fix non-vector shader call payloads

  • radv: fix clearing of TRUNC_COORD with tg4 and immutable samplers

  • radv: consider radix sort shaders to be internal/meta

  • aco/ra: fix vgpr_limit

  • aco: ensure that definitions fixed to operands have matching regclasses

  • radv: add radv_has_shader_buffer_float_minmax

  • radv: allow LBVH on GFX6+

  • radv: remove subtractions in address calculations

  • radv: use SMEM for a few load_global

  • aco: split load_sbt_amd result

  • aco: skip s_barrier if TCS patches are within subgroup

  • ac/nir: skip s_barrier if TCS patches are within subgroup

  • aco: fix cmpswap global atomic definition on GFX6

  • nir: allow NIR_PASS(_, )

  • nir: print file when validation fails

  • nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base

  • nir: call nir_metadata_preserve in nir_vectorize_tess_levels

  • nir: call nir_metadata_preserve in nir_lower_memory_model

  • radv: call nir_metadata_preserve in various lowering passes

  • radv: add missing NIR_PASS() and switch from NIR_PASS_V()

  • radv: validate shaders after linking passes

  • aco: fix p_constaddr with a non-zero offset

  • aco/tests: add test for p_constaddr with a non-zero offset

  • aco: only add/subtract low bits of program addresses

  • aco/ra: fix usage of invalidated iterator

  • aco: use p_parallelcopy for uniform reduction with zero source

  • aco: clarify a portion of do_pack_2x16

  • aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11

  • aco/tests: fix gfx11 variants printed as gfx12

  • aco: disable sdwa on gfx11

  • aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00

  • aco/tests: update for GFX11’s removal of SDWA

  • radv,aco: disable shader cache when ACO debug options are used

  • aco: fix SMEM load_global_amd with non-zero offset

  • aco: fix SMEM load_global with VGPR address and non-zero offset

  • aco: consider fma with multiplication by power-of-two unfused

  • ac: use ResetAllOptionOccurrences instead of ResetCommandLineParser

  • gitlab: ask for useful titles

  • gitlab: ask that reporters don’t include long logs in descriptions

  • nir/deref: stop assuming coherent accesses of different SSBOs may alias

  • aco: update SMEM offset workaround for LLVM 15

  • aco: don’t skip VS->TCS barrier if TCS output vertices doesn’t match input

  • aco: cleanup force-waitcnt output

  • aco/ra: update register file when updating phi definition

  • radv/ci: re-enable vega10 fossils testing

  • aco: add and use constantValue16()

  • aco: don’t accept med3 opcodes in get_minmax_info()

  • aco: include _e64 variants of 16-bit min/max in minmax optimizations

  • aco: ignore precise flag when optimizing integer clamps

  • aco: fix single-alignbyte do_pack_2x16() path with fp inline constants

  • aco: fix propagate_constants_vop3p with integer vop3p and 16-bit constants

  • aco: don’t use opsel to fold constants into dot accumulation sources

  • aco: fix redirect combine in propagate_constants_vop3p() with negatives

  • aco: try sign-extending or shifting constants in propagate_constants_vop3p

  • aco: don’t use 32-bit fp inline constants for fp16 vop3p literals

  • aco/tests: add vop3p constant combine tests

  • nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd

  • nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b))

  • nir: allow 16-bit fsin_amd/fcos_amd

  • radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd

  • aco: make flat access latency match mtbuf/mubuf/mimg

  • aco: include flat-like in vmem clause statistics

  • aco: make FLAT_instruction::offset signed

  • aco: improve support for scratch_* instructions

  • aco: combine additions and constants into scratch load/store

  • aco: handle subtractions in parse_base_offset

  • aco: refactor VGPR spill/reload lowering

  • aco: avoid WAW hazard with BVH MIMG and other VMEM

  • aco: include scratch/global in VMEM WAW optimization

  • aco: treat flat-like as vmem in some scheduling heuristics

  • aco: initialize scratch base registers on GFX9-GFX10.3

  • aco: use scratch_* for scratch load/store on GFX9+

  • aco: use scratch_* for VGPR spill/reload on GFX9+

  • radv: fix vbo_bound_mask indexing

  • radv: don’t update misaligned_mask in CmdBindVertexBuffers2

  • aco: remove unnecessary exp_cnt increments

  • aco: fix LdsBranchVmemWARHazard with 2+ branch chains

  • aco: fix hash statistic

  • radv: remove claimed support for sRGB vertex buffer formats

  • radv: fix 16-bit support in radv_lower_vs_input

  • aco: fix 16-bit VS inputs

  • aco: don’t expand vec3 VS input load to vec4 on GFX6

  • aco: add SCC clobber in build_cube_select

Ricardo Garcia (1):

  • vulkan: allow null descriptor set layouts in pipeline layouts

Riteo (1):

  • vulkan/device_select_wayland: fix a memory leak with DRM device handling

Rob Clark (64):

  • freedreno: Misc indent fixes

  • freedreno/drm/virtio: Appease valgrind

  • freedreno/drm: Drop FD_PP_PGTABLE

  • freedreno/drm: Close bo handle after bo->destroy()

  • freedreno/drm: Move bo common init

  • freedreno/drm/virtio: Support ring_idx

  • freedreno/drm/virtio: Use userspace IOVA allocation

  • freedreno/drm/virtio: Drop blocking in host

  • freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE

  • freedreno/drm/virtio: Protocol updates

  • freedreno/drm/virtio: Pass guest handles to execbuf

  • freedreno/drm/virtio: Async ccmd batching

  • freedreno/drm/virtio: Split up large uploads

  • freedreno/a6xx: A couple more FD_BO_NOMAP

  • freedreno: Split out helper for staging blit

  • freedreno: Use staging transfer if mmap fails

  • freedreno/drm: Fix bos_on_stack calculation

  • drm-shim: Add GET_UNIQUE support

  • freedreno/drm-shim: Update to latest uapi version

  • freedreno/drm-shim: Robustify error handling

  • drm-shim: Better mmap offsets

  • drm-shim: Add error handling for bo_init()

  • freedreno/drm-shim: Better iova handling

  • drm-shim: Cleanup on device file close

  • freedreno/drm: Use DEBUG_GET_ONCE_OPTION()

  • freedreno/drm/virtio: Don’t try to mmap imported bo’s

  • freedreno/devices: Add another SKU

  • Revert “ci: freedreno farm is down”

  • freedreno: Add pkt4 assert

  • freedreno/registers: Small cleanup

  • freedreno/a6xx: Split VFD_FETCH[] if needed

  • freedreno/a6xx: Initialize VFD_FETCH[n].SIZE to zero

  • freedreno/a6xx: assert valid vertex_flags reg

  • turnip: assert valid vertex_flag reg

  • freedreno/ir3: Don’t lower_gs multiple times

  • freedreno/ir3: Add copy_vars() helper

  • freedreno/ir3: Copy vars if needed on EndPrimitive()

  • freedreno/ir3: Add more tess varying slots

  • freedreno/a6xx: Fix indentation

  • freedreno/a6xx: Fix VS const packet size

  • freedreno/ir3: Fix GS clip-plane lowering

  • freedreno: rename ir3_emit_driver_params()

  • freedreno/a6xx: Handle driver-params in GS/DS

  • freedreno/a6xx: Do clip-plane lowering in backend

  • freedreno/gmem: Reverse order of alternative tile rows

  • util: Fix c++ enum casting pickiness

  • libsync: Fix compiler warning

  • freedreno/a6xx: Remove fd6_format.[ch]

  • freedreno/a6xx: Fix enum tag

  • freedreno: Extract common helper macros

  • freedreno/ir3: Remove unneeded forward declaration

  • freedreno/autotune: Make ‘offset’ macro “private”

  • freedreno/drm: Combine upper and lower 32b of OR val

  • freedreno/a6xx: De-open-code CACHE_FLUSH enum

  • freedreno: Use enum for primtypes table

  • freedreno/ci: Update unit test reference decodes

  • freedreno/registers: Whitespace fix for gen_header.py

  • freedreno/registers: Allow varset to be specified on enum

  • freedreno/registers: Move varset to <enum>

  • freedreno/registers/a6xx: Some reg64 conversion

  • freedreno/drm: Fix potential bo cache vs export crash

  • freedreno/gmem: Fix col0 calc

  • llvmpipe: Add some missing locking

  • freedreno: We really don’t need aligned vbo’s

Roman Stratiienko (3):

  • Android.mk: Fix c11-related build failures

  • v3dv: Enable sync_fd importing/exporting on Android

  • v3dv: Limit API version to v1.0 for Android

Ruijing Dong (6):

  • radeonsi/vcn: Add support of array_mode for gfx11

  • radeonsi/vcn: update av1 decoding to support vcn4

  • radeonsi/vcn: prepare for unified queue in vcn4

  • radeonsi/vcn: support unified queue in vcn4

  • frontends/va: do texture_map when needed

  • frontends/va: WA for ffmpeg 10bit encoding crash

Ryan Neph (5):

  • venus: update venus-protocol with VK_EXT_image_view_min_lod

  • venus: enable VK_EXT_image_view_min_lod

  • venus: support VK_KHR_copy_commands2

  • venus: refactor VkCommandBufferBeginInfo fixups to function

  • venus: add support for VK_KHR_dynamic_rendering

Sagar Ghuge (1):

  • anv: Disable storage image compression for possible atomic ops

Sami Kyöstilä (2):

  • subprojects: Roll Perfetto to v27.1

  • util: Shut down Perfetto before driver unload

Samuel Pitoiset (213):

  • radv: exclude PRIMITIVE_{COUNT,INDICES} from the per-vertex output mask

  • radv: use shader_info::writes_memory

  • radv: use shader_info::gs::active_stream_mask

  • radv: use shader_info::inputs_read/outputs_written for FS IO

  • radv: use shader_info::outputs_written/per_primitive_outputs for VS outputs

  • radv: use shader_info::system_values_read

  • radv: fix enabling adjust_frag_coord_z and apply per-pipeline

  • radv: add support for independent descriptor set layouts

  • radv: add support for VkShaderModuleCreateInfo as pNext

  • radv: fix initializing pipeline_key::topology for GFX9 and older

  • radv: mark all active stages earlier in the pipeline creation path

  • radv: use the hardware primitive topology everywhere

  • radv: initialize the vertex input interface state in only one place

  • radv: do not support UNIFORM_TEXEL_BUFFER with SRGB

  • radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs

  • radv/ci: update the flakes list for GFX9 chips

  • aco: fix load_barycentric_at_{sample,offset} on GFX6-7

  • nir: fix marking XFB varyings as always active IO

  • nir: mark XFB varyings as unmoveable to prevent them to be remapped

  • radv: clarify why STAGE_2_CLEAR_BIT needs to wait for CP DMA to be idle

  • radv: remove redundant VK_PIPELINE_STAGE_2_TRANSFER_BIT for CP DMA idle

  • radv: add an SQTT workaround for chips with disabled RBs

  • radv: fix handling divisor == 0 with dynamic vertex input state

  • radv: allow to disable sinking of load inputs for FS via drirc

  • radv: enable radv_disable_sinking_load_input_fs for Grid Autosport

  • radv: use correct push constants range for internal operations

  • radv/ci: update list of expected failures for Bonaire (GFX7)

  • radv/ci: remove empty flakes lists for GFX9

  • radv: fix the number of generated primitive queries with NGG GS vs legacy

  • radv/ci: stop skipping dEQP-VK.synchronization.* on Bonaire

  • radv: use 3D views for 3D internal operations on GFX6-8

  • radv: re-emit dynamic line stipple state if the primitive topology changed

  • radv/radix: handle intentional allocation failures properly

  • ac,radeonsi: add has_sqtt_auto_flush_mode_bug

  • radv/sqtt: fix configuring AUTO_FLUSH_MODE on GFX10.3

  • radv: only init acceleration structure if RT is enabled

  • radv: disable instance packing to fix pipeline query stats

  • radv: remove useless check against nir_texop_samples_identical

  • radv/winsys: allow to use RADV_FORCE_FAMILY=gfx1100

  • radv: disable DCC for Senra Kagura Shinovi Versus

  • aco: recognize GFX11 in few places

  • aco: do not align VGPRS to 8 or 16 on GFX11

  • radv,aco: add support for packed threadID VGPRs on GFX11

  • radv,aco: use the new TCS WaveID SGPR to compute vs_rel_patch_id on GFX11

  • aco: export MRT0 instead of NULL on GFX11

  • aco: do not set DLC for loads on GFX11

  • aco: do not set GLC stores on GFX11

  • aco: update LDS allocation granularity for PS on GFX11

  • aco: update waitcnt on GFX11

  • aco: do not set RESOURCE_LEVEL for buffer descriptors on GFX11

  • aco: do not set COMPR for exports but use 0x3 channel mask on GFX11

  • radv,aco: export alpha-to-coverage via MRTZ on GFX11

  • aco: only retrieve the scratch offset when it’s declared

  • radv,aco: do not implicitly export the primitive ID for mesh shaders

  • aco: remove unreachable code about viewport index/layer and mesh shaders

  • radv,aco: add a workaround for binding 2D views of a 3D image on GFX9

  • radv: implement VK_EXT_image_2d_view_of_3d

  • radv: advertise VK_EXT_image_2d_view_of_3d

  • radv: do not declare the scratch offset on GFX11

  • aco: do not emit the primitive ID twice for NGG VS or TES with GS

  • radv: do not align VGPRS to 8 or 16 on GFX11

  • radv: use the new format table on GFX11

  • radv: update scratch buffer registers on GFX11

  • radv: update sampler registers on GFX11

  • radv: update buffer descriptor registers on GFX11

  • radv: update image descriptor registers on GFX11

  • radv: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE on GFX11

  • radv: configure DB_RENDER_CONTROL on GFX11

  • radv: do not emit non-existent CP_COHER_START_DELAY on GFX11

  • radv: do not emit FLUSH_AND_INV_DB_META on GFX11

  • radv: update TF_RING_size to a per-SE size on GFX11

  • radv: update VRS registers on GFX11

  • radv: apply a workaround for CB perf counters on GFX11

  • radv: limit CP DMA to max 32KB sizes on GFX11

  • radv: more register changes on GFX11

  • radv: do not lower loading TESS/ESGS rings using the ABI for LLVM

  • aco: use ac_is_llvm_processor_supported() for checking LLVM asm support

  • radv: rename shader compile functions to spirv_to_nir/nir_to_asm

  • radv: do not try to dump the NIR of the trap handler shader

  • radv: add new pipeline helpers for NIR->ASM compilation

  • radv: init states from pTessellationState at only one place

  • radv: init states from pViewportState at only one place

  • radv: init states from pRasterizationState at only one place

  • aco: remove unnecessary intrinsics that are lowered at the ABI level

  • radv: determine if shaders use NGG before linking

  • radv: export implicit primitive ID in NIR for legacy VS or TES

  • radv: constify radv_pipeline in more radv_pipeline_generate_XXX() helpers

  • radv: rename radv_pipeline_generate_XXX() to radv_pipeline_emit_XXX()

  • radv: remove couple of useless pCreateInfo pointers in the emit path

  • radv: remove unused radv_get_conservative_raster_mode() helper

  • radv: move gs_table_depth to the physical_device

  • radv: cleanup using device/physical_device during pipeline creation

  • radv: split radv_pipeline into radv_{graphics,compute,library}pipeline

  • radv: introduce radv_graphics_pipeline_info and split existing info structs

  • radv/ci: fix fails list for NAVI21

  • radv: move HS info and task_num_entries to the physical device

  • radv: rename radv_cmd_state::pipeline to graphics_pipeline

  • radv: unify radv_pipeline_has_XXX() helpers

  • radv: move streamout_shader to radv_graphics_pipeline

  • radv: move active_stages to radv_graphics_pipeline

  • radv: rename VERT_GRP_SIZE to VERTS_PER_SUBGRP on GFX11

  • radv: do not emit more non-existent registers on GFX11

  • radv: update framebuffer registers on GFX11

  • radv: update VRS rates on GFX11

  • radv: use the fragment resolve path by default on GFX11

  • radv: report adjusted LDS size for fragment shaders on GFX11

  • radv: fix configuring COLOR_INVALID on GFX11

  • radv: fix VK_BLEND_FACTOR_CONSTANT_COLOR translation on GFX11

  • radv: configure DB_Z_INFO.NUM_SAMPLES correctly on GFX11

  • radv: disable VK_AMD_shader_fragment_mask on GFX11

  • radv: disable attachementFragmentShadingRate on GFX11

  • radv: use pipeline->slab_bo in more places

  • radv: fix writing buffer markers with non-zero memory offset

  • radv: copy viewport/scissor when initializing radv_viewport_info

  • radv: init states from pMultisampleState at only one place

  • radv: init states from pDepthStencilState at only one place

  • radv: init states from VkPipelineRenderingCreateInfo at only one place

  • radv: use AMD values for computing blend related state

  • radv: init states from pColorBlendState at only one place

  • radv: init states from VkAttachmentSampleCountInfo at only one place

  • radv: init states from VkPipelineDiscardRectangleState at only one place

  • radv: init states from VkPipelineFragmentShadingRateState at only one place

  • radv: ignore DYNAMIC_STENCIL_OP if stencil test isn’t enabled

  • radv: reduce radv_tessellation_info::patch_control_points to 8-bit

  • radv: reduce radv_input_assembly_info::primitive_topology to 8-bit

  • radv: reduce radv_rasterization_info::polygon_mode to 8-bit

  • radv: reduce radv_discard_rectangle_info::count to 8-bit

  • radv: fix lowering GS intrinsics if NGG is disabled per pipeline

  • radv/ci: update list of failures for Pitcairn

  • radv: enable radv_zero_vram for Hammerting

  • radv/winsys: add get_chip_name() to the null winsys

  • radv: report the marketing name as part of the device name

  • radv: rename ngg_gs_state to ngg_query_state

  • radv: track if primitives generated query features are enabled

  • radv: add primitives_generated_query to the graphics pipeline key

  • radv: declare the NGG query argument for primitives generated query

  • ac/nir/ngg: count the number of generated primitives for VS and TES

  • radv: flush the NGG query state when the argument is declared

  • radv: add few helpers related to streamout

  • radv: implement VK_EXT_primitives_generated_query

  • radv: advertise VK_EXT_primitives_generated_query

  • radv/ci: rename deqp-XXX jobs to vkcts-XXX

  • zink/ci: skip two KHR-GL46 tests that timeout most of the time with RADV

  • zink/ci: update list of expected failures with RADV

  • radv: remove redundant check when importing vertex input info

  • radv: update radv_is_vrs_enabled() to use radv_graphics_pipeline_info

  • radv: update the check to determine if dynamic discard rectangle is needed

  • radv: update the check to determine if dynamic sample location is needed

  • radv: update the check to determine if rasterization is enabled

  • radv: remove remaining unused pCreateInfo pointers

  • ci: bump the hang-detection tool in the test image for Vulkan

  • ci: fix passing down MESA_LOADER_DRIVER_OVERRIDE for zink-anv-tgl

  • zink/ci: change the surface type used for dEQP

  • ci/valve: split .b2c-test into .b2c-test-vk and .b2c-test-gl

  • radv/ci: add GLES/GLCTS testing with Zink on NAVI10

  • zink/ci: update list of failures again

  • radv/ci: refactor test machines and dEQP rules to reduce copy&paste config

  • ci: update vkd3d-proton builder/runner

  • ci: print a message when vk3d-proton returns successfully

  • ci: add VKD3D_PROTON_RESULTS to the list of variables to pass down for testing

  • radv/ci: add vkd3d-proton testing

  • radv: advertise VK_EXT_border_color_swizzle on GFX10+

  • ci: uprev vkd3d-proton to 39d07dea2cef34bfb3ed39741f026bc637e3eec4

  • radv/amdgpu: make sure to reset the number of BO when there is no ranges

  • radv/amdgpu: do not add “global” BO to the virtual BO list

  • aco: fix validation of SOP1 instructions without definitions

  • radv,aco: rename radv_prolog_binary/radv_shader_prolog to shader_part

  • radv/llvm: always emit a null export even if the FS doesn’t discard

  • radv: disable small primitive culling for user sample locations

  • radv: use RADEON_FLAG_VA_UNCACHED for the trace BO

  • radv: fix command line for dumping waves with UMR

  • radv: dump UMR waves before UMR rings

  • radv: do not emit SQTT user data packets on the SDMA queue

  • radv/ci: add CI lists for LLVM on NAVI21

  • radv: use LOAD_CONTEXT_REG to load the opaque buffer size on GFX10+

  • radv: do not abort if SPM isn’t supported for the current GPU

  • radv: remove old workaround for HTILE layers with F1 2021

  • zink/ci: update list of failures for piglit with RADV

  • radv: disable DCC for Melty Blood Actress Again Current Code

  • radv: always enable VK_EXT_debug_utils

  • zink/ci: update list of failures

  • radv/ci: update list of failures against CTS 1.3.3.0

  • aco: fix load_barycentric_at_sample without MSAA

  • radv: fix dumping VS prologs assembly

  • radv/ci: enable fossils testing for GFX1100

  • radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2

  • radv: fix wide points/lines by configuring the guardband correctly

  • zink/ci: update list of CTS flakes for RADV

  • ci: uprev vkd3d-proton to 5b73139f182d86cd58a757e4b5f0d4cfad96d319

  • radv,aco: track if a fragment shader needs an epilog

  • radv,aco: introduce {radv,aco}_ps_epilog_key

  • radv: declare a new user SGPR arg in FS for the epilog PC

  • radv: add a function that declares PS epilog shader arguments

  • aco: add new pseudo instruction p_jump_to_epilog

  • aco: emit p_jump_to_epilog if the main fragment shader has an epilog

  • aco: do not abort if the FS doesn’t export anything but has an epilog

  • aco: prevent adding DONE/VM to the last export if the FS has an epilog

  • aco: add aco_postprocess_shader() helper

  • radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part

  • aco: refactor export_fs_mrt_color() for PS epilogs preparation

  • aco: add support for compiling PS epilogs

  • radv: implement PS epilogs

  • radv: disable viewport depth clamping only when necessary

  • aco: requires Exact for p_jump_to_epilog

  • radv: only force 1x sample for Bresenham lines when pipeline draws lines

  • vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT

  • radv: implement VK_EXT_attachment_feedback_loop_layout

  • radv: ignore out-of-order rasterization if stencil write mask is dynamic

  • radv: fix gathering XFB info if there is dead outputs

  • radv: fix cleaning the meta query state if an error occured

  • radv: re-emit viewports if negative one to one or depth clamp mode changed

  • radv: fix bogus assertion with RADV_FORCE_VRS

  • radv: fix pipelineStageCreationFeedbackCount when it’s 0

Sarah Walker (2):

  • pvr: Update for firmware 1.17@6256262

  • pvr: Move BRN 44079, 48492 and 66011 code into pvrsrvkm specific directory

Sathishkumar S (2):

  • radeon/vcn: skip create and destroy message for jpeg

  • radeon/vcn: engage all available jpeg engines

Sebastian Keller (1):

  • egl/wayland: Don’t try to access modifiers u_vector as dynarray

Sergi Blanch Torne (1):

  • ci/crosvm: clean crosvm socket

Sidney Just (19):

  • wgl: add a flag to determine if running on zink

  • wgl: add zink to the list of auto-loaded drivers

  • zink: support VK_KHR_win32_surface

  • kopper: add win32 loader interface

  • zink: support win32 wsi

  • wgl: support kopper

  • gallium: support for implementing EXT_external_objects_win32

  • mapi: added EXT_external_objects_win32 definitions

  • gallium/noop: implement win32 device LUID and node mask getters

  • gallium/trace: implement win32 device LUID and node mask getters

  • mesa: implement GL_DEVICE_LUID_EXT and GL_DEVICE_NODE_MASK_EXT getters

  • mesa: implement import win32 memory and semaphore

  • mesa: pipe cap for EXT_memory_object_win32 and EXT_semaphore_win32

  • zink: codegen for Win32 external object extensions

  • zink: implement win32 fence import

  • zink: implement win32 memory handle import

  • zink: implement device LUID and node mask

  • zink: enable pipe cap for win32 external memory and fences

  • docs: updated zink features and release notes

Sil Vilerino (39):

  • gallium/vl: Add software winsys and offscreen winsys

  • util/u_format: Drop assert that has valid/well-defined behavior

  • util/vl_vlc: Support compiling in C++

  • st_vdpau: Pass format when opening resource from handle in st_vdpau_resource_from_description

  • d3d12: Add .clang_format file

  • gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords

  • gallium va: VaDeriveImage support stride/offset being different for NV12 planes

  • gallium: Add values to pipe_video_cap for multi-slice and multi-reference encode

  • gallium va: Handle new VA attributes with new pipe video caps

  • gallium: Add multiple slice support to pipe_h264_enc_picture_desc

  • gallium va: Add support for multiple slices encoding

  • gallium radeon/r600/omx/va: Adds support for multiple reference encoding

  • gallium: Add MinLumaBiPredSize8x8 to pipe_h264_sps, pic_init_qs_minus26 to pipe_h264_pps

  • gallium omx: Fill out MinLumaBiPredSize8x8 and pic_init_qs_minus26

  • gallium vdpau: Fill out level_idc and MinLumaBiPredSize8x8

  • gallium va: VaHandlePictureParameterBufferH264 fill out new pipe params MinLumaBiPredSize8x8, pic_init_qs_minus26, chroma_format_idc, bit_depth_chroma, bit_depth_chroma_minus8

  • d3d12: Add d3d12_promote_to_permanent_residency

  • ci: Update x86 debian build to pick up tag v1.602.0-r1 from DirectX-Headers

  • d3d12: Add video decode implementation of pipe_video_codec

  • d3d12: Add video encode implementation of pipe_video_codec

  • d3d12: Add create_video_codec and create_video_buffer entrypoints

  • d3d12: Add util video functions to d3d12_format

  • d3d12: Improve planar resource support to handle video requirements

  • d3d12: Add support for d3d12 video in d3d12_screen

  • d3d12: Add GLSL singleton refcounting to screen for offscreen non-GL rendering

  • meson: Support d3d12 as a video-supporting driver

  • winsys: Do not use Display type when X11 is not present in build

  • meson: Update DirectX-Headers dependency to 1.602.0 version with fallback to the wrap

  • meson: Add build option for gallium-d3d12-video feature

  • d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment

  • d3d12: Fix debug_printf format strings for cross platform builds

  • d3d12: Video - Remove unused spCopyQueues from enc/dec objects

  • pipe/video: Add PIPE_VIDEO_ENTRYPOINT_PROCESSING

  • vl: Replace usage of entrypoint UNKNOWN with PROCESSING for VP

  • va: Replace usage of entrypoint UNKNOWN with PROCESSING for VP

  • gallium/video: Add video post processing interface

  • va: Add support for VPP rotation, flip, alpha blend, crop, scaling

  • d3d12: Add pipe_video_codec::process_frame implementation

  • d3d12: Enable VPP rotation, flip, alpha blend, crop, scaling via pipe_video_codec::process_frame

SoroushIMG (4):

  • zink: Fix BO size when it’s not aligned to 16 bytes

  • zink: Fix spirv stream 0 vertex emit for multistream shaders

  • Zink: Fix clear being missed when using emulated draws in zink_blit

  • zink: Fix incorrect emission of SPIR-V shift ops

SureshGuttula (4):

  • radeonsi: Set display_remote for non-refernced frames

  • radeonsi/vcn : update enc->dpb ref_use for index 0

  • Revert “radeonsi: Set display_remote for non-refernced frames”

  • Revert “radeon: hardcode uvd/vce encoder not_referenced value to false”

Sviatoslav Peleshko (7):

  • mesa: flush bitmap caches when changing scissors or window rects state

  • anv: workaround apps that assume full subgroups without specifying it

  • intel/blorp/gen6: Set BLEND_STATEChange only if emitting the blend state

  • anv: Dirty all dynamic state bits when creating command buffer state

  • intel/blorp: Dirty depth bounds dynamic state bits after blorp

  • intel/nullhw: Use correct macro to fix build regression

  • iris: Always initialize shader compilation queue ready fence

TSnake41 (1):

  • zink: print result code string on vulkan failure

Tapani Pälli (5):

  • isl: disable mcs (and mcs+ccs) for color msaa on DG2

  • anv: use anv_cmd_dirty_mask_t type for dynamic state

  • intel/fs: setup SEND message descriptor from nir scope

  • iris: implement Wa_14015264727 for DG2

  • anv: implement Wa_14015264727 for DG2

Tatsuyuki Ishi (9):

  • radv: Fix redundant subpass barriers due to erroneous comparison

  • radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit

  • radv/ci: Move transient_attachment_bit from fail to skip list

  • radv/ci: skip image.sample_texture.*_compressed_format*

  • radv/ci: Move sample_texture.*_compressed_format_* to faillist for gfx<=9

  • radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard.

  • amd: Revert gfx10 addrlib changes

  • radv: Only set pstate for the first hw_ctx.

  • radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs.

Thomas Debesse (2):

  • gallium/clover: LLVM setLangDefaults moved from clangFrontend to clangBasic

  • gallium/clover: pass -no-opaque-pointers to Clang

Tiago Koji Castro Shibata (2):

  • d3d12: fallback to integrated adapter instead of arbitrary one

  • d3d12: add more formats to supported conversions

Timothy Arceri (71):

  • glsl/st: vectorise interfaces of SSO shader programs

  • nir: fix setting varying from uniform as flat

  • nir: fix sorting before assigning varying driver locations

  • nir: always set the exact_trip_count_unknown loop terminator property

  • nir: remove unreachable loop terminators

  • svga: remove duplicate nir compile options

  • nir/glsl: add glsl_record_compare() wrapper

  • nir/glsl: wrapper contains_{double,interger}()

  • nir/glsl: wrapper field_index()

  • nir/glsl: wrap component_slots_aligned()

  • glsl/mesa: move parse_program_resource_name() to common linker_util code

  • glsl: add new build program resource helpers

  • nir: add variable data fields required for NIR glsl varying linking

  • nir: abort io info gathering if location is not set or is a temp value

  • nir: skip lowering io to scalar for must_be_shader_input

  • glsl: implement lower_packed_varyings() as a NIR pass

  • glsl: add a NIR based varying linker

  • glsl: implement opt_dead_builtin_varyings() as a NIR pass

  • glsl: implement lower_xfb_varying() as a NIR pass

  • glsl: enable the use of the nir based varying linker

  • glsl: remove now unused GLSL IR varying linker code

  • glsl: simplify finding cursor in varying packing code

  • nir: add support for forced sampler indirect loop unrolling

  • gallium/drivers: set force_indirect_unrolling_sampler for all required drivers

  • glsl: move validation of sampler indirects to the nir linker

  • i915g: use a valid setting for force_indirect_unrolling

  • softpipe: switch to NIR loop unrolling

  • asahi: switch to NIR loop unrolling

  • d3d12: switch to NIR loop unrolling

  • etnaviv: switch to NIR loop unrolling

  • i915g: switch to NIR loop unrolling

  • r300: disable GLSL IR loop unrolling

  • r600: switch to NIR loop unrolling

  • vc4: disable GLSL IR loop unrolling

  • nir/i915g/r300/nv30: skip marking varyings as flat in some drivers

  • etnaviv: assert if etna_shader_io_file reg overrun

  • etnaviv: vectorise io

  • nir/gcm: fix pushing instructions into if blocks

  • ci: uprev piglit 2022-05-31

  • nir: add nir based version of the lower_const_arrays_to_uniforms pass

  • glsl/nir: skip adding hidden uniforms to the remap tables

  • glsl/nir: allow the nir linker to remove dead uniforms we created

  • glsl: move common link time optimisation calls to linker code

  • glsl: move gl_nir_link_opts() call out of the st code

  • glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms()

  • glsl: remove now unused lower_const_arrays_to_uniforms()

  • gallivm: disable GLSL IR loop unrolling in LLVMPIPE

  • freedreno/ir3: tidy up duplication of common nir options

  • freedreno: switch to NIR loop unrolling

  • lima: lower all undefs to zero in vs

  • lima: fixup nir indirect unroll options to match gallium CAP

  • lima: switch to NIR loop unrolling

  • nouveau/nv30: disable GLSL IR loop unrolling

  • nouveau/nv50: disable GLSL IR loop unrolling

  • nouveau/nvc0: disable GLSL IR loop unrolling

  • svga: disable GLSL IR loop unrolling

  • gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP

  • glsl: remove the now unused GLSL IR loop unrolling code

  • glsl: drop extra optimise swizzles call

  • glsl: tidy up link_varyings_and_uniforms()

  • glsl: remove never true do_dead_code() parameter

  • glsl: inline do_common_optimization() call

  • util: add dri config option to disable GL_MAP_UNSYNCHRONIZED_BIT

  • util: use force_gl_map_buffer_synchronized workaround with RAGE

  • glsl: merge lower_buffer_access with lower_shared_reference

  • glsl: drop non-nir path for atan in builtin functions

  • st/glsl: fix broken vertex attrib mapping

  • nir/loop_unroll: clean up after complex_unroll_single_terminator()

  • glsl: correctly track cross slot component packing

  • Revert “nir: Preserve offsets in lower_io_to_scalar_early”

  • glsl: dont lower precision for textureGatherOffsets

Timur Kristóf (119):

  • radv: Minor formatting fix in radv_device.c file.

  • radv: Move queue submit sparse bindings to a separate function.

  • radv: Move empty queue submit code path to a separate function.

  • radv: Move normal (non-empty) queue submit to a separate function.

  • radv: Move up early exit for transfer queues in radv_get_preamble_cs.

  • radv: Simplify some boolean code in radv_get_preamble_cs.

  • radv: Initialize BO pointers when creating preambles.

  • radv: Refactor cache flush code for the initial preambles.

  • radv: Don’t create continue preamble when it’s not needed.

  • radv: Rename fill_geom_tess_rings to radv_fill_shader_rings.

  • radv: Don’t use pointers to pointers when updating the preambles.

  • radv: Only emit what is appropriate to the queue family for preambles.

  • ac/nir: Add ac_nir_load_arg helper for shader arguments.

  • radv: Move radv_nggc_settings enum out of radv_cmd_buffer.

  • nir: Add upper bound for AMD shader arg intrinsics.

  • radv: Lower ABI in NIR for tess/ESGS/NGG shader arguments.

  • ac/nir: Remove now-superfluous ac_nir_lower_tess_to_const.

  • aco: Remove now-superfluous intrinsics.

  • ac/llvm: Remove now-superfluous intrinsics and ABI callbacks.

  • radv/amdgpu: Group queue submit info into a structure.

  • radv/amdgpu: Initialize semaphore info with a designated initializer.

  • radv/amdgpu: Pass new queue submit info structure to internal function.

  • radv/amdgpu: Pass correct struct type instead of repeating the cast.

  • radv/amdgpu: Use scheduled dependency for submitting to multiple queues.

  • ac/nir: Add I/O lowering for task and mesh shaders.

  • ac: Add task ring entry shader argument.

  • radv: Add radv_pipeline_has_task helper.

  • radv: Set user data register for task shaders.

  • radv: Set wave size for task shaders.

  • radv: Fill task shader info.

  • radv: Add task shader arguments.

  • radv: Add task ring entry argument for mesh shaders.

  • radv: Implement task shader intrinsics in the ABI.

  • radv: Enable nir_opt_offsets for task shaders.

  • radv: Use I/O lowering for task and mesh shaders.

  • radv: Postprocess task shader configuration.

  • radv: Allow linking task shaders.

  • radv: Lower shared and task_payload variables in task/mesh shaders.

  • radv: Print task shader stage name before disasm.

  • radv: Fix task shader info.

  • radv: Fix loading task shader ring buffer addresses.

  • aco: Fix scratch with task shaders.

  • radv: Introduce ring info structure for queues, refactor preamble generation.

  • radv: Use a ws variable during preamble creation.

  • radv: Pass radv_device pointer to some functions.

  • radv: Refactor queue state to separate structure.

  • radv: Clarify emitting graphics shader pointers.

  • nir: Add explicit task payload atomic intrinsics.

  • radv: Implement conditional rendering for async compute queue.

  • ac: Remove trailing white space.

  • aco: Remove trailing whitespace.

  • radv: Remove trailing whitespace.

  • radv: Disable predication for supass clear and image clears.

  • nir: Add new launch_mesh_workgroups intrinsic.

  • nir: Add common task shader lowering to make the backend’s job easier.

  • radv: Add mesh and task stage names to pipeline executable properties.

  • nir: Keep track of cross-invocation mesh shader output access.

  • radv, ac/nir: Fix multiview layer export for mesh shaders.

  • ac/nir/taskmesh: Use task shader lowering from common NIR code.

  • ac/nir/taskmesh: Use 3 dimensional workgroup ID.

  • ac/nir: Add remappability to tess and ESGS I/O lowering passes.

  • ac: Add RDNA2 task+mesh shader draw packet opcodes.

  • ac: Add task shader ring information.

  • radv: Implement task shader draw and payload rings.

  • nir: Introduce new intrinsics for AMD specific mesh shader task ring.

  • ac/nir/ngg: Clean up mesh shader output LDS layout.

  • ac/nir/ngg: Use mesh shader scratch ring when outputs don’t fit LDS.

  • ac/nir/ngg: Use variables for outputs without cross-invocation access.

  • radv: Implement mesh shader scratch ring.

  • radv: Enable NGG wave ID for mesh shader scratch ring.

  • ac/nir/ngg: Add mesh shader workgroup index.

  • radv: Lower mesh shader 3D workgroup ID to 1D index.

  • ac/nir/taskmesh: Preserve workgroup ID Y and Z when applying firstTask.

  • radv: Always use 3D block ID and grid size in task shaders.

  • gallium/u_blitter: Fix depth.

  • zink: Enable the VK_EXT_depth_clip_enable extension.

  • zink: Always enable depth clamping, make depth clipping independent.

  • nir/lower_task_shader: don’t use base index for shared memory intrinsics

  • ac/nir/ngg: Refactor LDS instructions in NGG GS vertex emit and export.

  • ac/nir/ngg: Ignore driver location for mesh shader outputs.

  • radv: Don’t assign driver locations to mesh shader outputs.

  • radv: Add CULL_PRIMITIVE to special output mask.

  • radv: Use NIR optimization to move discards to the top.

  • radv: Only initialize DGC state when DGC is enabled.

  • radv: Remove trailing whitespace introduced by DGC commits.

  • radv: Refactor predication for compute queues.

  • radv: Refactor some CP DMA functions to work with radeon_cmdbuf.

  • radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf.

  • radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf.

  • radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf.

  • radv: Refactor view index emit to use a per-stage function.

  • radv: Move inline push constants to a new function.

  • radv: Minor cleanup of radv_queue_submit_normal.

  • radv: Allow reusing pipeline compute state emit functions.

  • radv: Add dispatch_initiator_task field to radv_device.

  • radv: Create internal cmdbuf when a graphics pipeline needs compute.

  • radv: Flush descriptors and push constants for task shaders.

  • radv: Introduce radv_before_taskmesh_draw.

  • radv: Implement mesh shading draw calls with task shaders.

  • radv: Synchronization for task shaders.

  • radv: Support task shaders in secondary cmd buffers.

  • radv: Workaround MEC taskmesh dispatch hang when count buffer has zero.

  • radv: Submit internal compute cmdbuf.

  • radv: Copy BO list to ACE internal CS.

  • radv: Enable task shader feature for NV_mesh_shader.

  • ac/nir/ngg: Move primitive ID workgroup barrier to proper place.

  • aco: Remove hack for primitive ID export.

  • ac/nir/ngg: Create output variable for primitive ID export.

  • ac/llvm: Implement load_num_subgroups for NGG shaders.

  • ac/llvm: Add LLVM bug workaround to ac_build_mbcnt_add.

  • ac/llvm: Use gs_prim_id for NGG VS.

  • ac/nir/ngg: Copy comment about LDS layout for NGG GS.

  • ac/nir/ngg: Remember proper bit sizes of GS output variables.

  • ac/llvm: Implement GDS atomic add NIR intrinsic.

  • radv/llvm: Remove incorrect hardcoded workgroup size from NGG GS.

  • radv: Properly set LDS size for LLVM NGG shaders.

  • radv: Use NIR-based NGG lowering with LLVM.

  • aco: Fix p_init_scratch for task shaders.

  • nir/gather_info: Clear cross-invocation output mask.

Tomeu Vizoso (4):

  • lima/ci: Disable lima-mali450-piglit-gpu:arm64

  • panvk/ci: Disable CI for a while

  • ci: Disable jobs to the Collabora lab

  • Revert “ci: Disable jobs to the Collabora lab”

Vadym Shovkoplias (4):

  • anv: Fix geometry flickering issue when compute and 3D passes are combined

  • driconf: Add a limit_trig_input_range option

  • drirc: Set limit_trig_input_range option for the Wolfenstein Youngblood

  • drirc: Set limit_trig_input_range option for glmark2

Vasily Khoruzhick (7):

  • lima: Implement texture_barrier

  • lima: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT

  • lima: Lower nir_op_idiv with other scalar math operations.

  • u_transfer_helper: flush temporary transfer first for MSAA

  • lima: implement lima-specific blitter

  • lima: wire up MSAA 4x support

  • lima: fix buffer overallocation for index, vertex and constant buffers

Victor Hermann Chiletto (1):

  • radv: always check entry count in descriptor pool when allocating

Viktoriia Palianytsia (1):

  • anv: Add workaround for sample mask with multisampling

Vinson Lee (14):

  • panfrost: Fix memory leaks on unit test failures.

  • zink: Fix memory leak on error path.

  • radeonsi: Move NULL check before dereference.

  • dri2: Fix memory leak.

  • d3d12: Initialize d3d12_video_encoder_bitstream member m_uiOffset.

  • amd: Initialize Gfx11Lib members in constructor.

  • d3d12: Remove unnecessary NULL check.

  • clc: Fix build with llvm-15.

  • microsoft/spirv_to_dxil: Fix missing-prototypes errors.

  • radv: Remove unnecessary null check.

  • microsoft/compiler: Fix assert.

  • tu: Check dereferenced value of rop_reads_dst.

  • aco: Initialize spill_ctx members in constructor.

  • r600/sfn: Initialize TestShaderFromString member m_instr_factory.

Vlad Zahorodnii (2):

  • winsys/amdgpu-radeon: Allow specifying context priority

  • radeonsi: Add support for EGL_IMG_context_priority

Yevhenii Kolesnikov (1):

  • nir: Remove single-source phis before opt_if_loop_last_continue

Yiwei Zhang (51):

  • venus: override aspectMask for internal tiling modifier

  • venus: use linear modifier for legacy common wsi path

  • venus: fix view format for ahb image

  • venus: refactor to add struct vn_env

  • venus: add env perf options and introduce no_async_set_alloc

  • venus: add VN_PERF option no_async_buffer_create

  • venus: add VN_PERF option no_async_queue_submit

  • venus: refactor vn_cmd_submit

  • venus: flush when batched draw calls reach a threshold

  • venus: disable VK_EXT_extended_dynamic_state2

  • venus: refactor android gralloc pieces

  • venus: cache front_rendering_usage bit at gralloc init

  • venus: update vn_GetSwapchainGrallocUsage2ANDROID for shared present

  • venus: enable ANB shared presentable image prop

  • venus: add VN_RELAX_BASE_SLEEP_US env var for tuning cpu utils

  • venus: log performance envs upon instance creation with debug init

  • venus: extend buffer cache to cover layering usage

  • vulkan: remove the VkPhysicalDeviceMemoryProperties workaround

  • radv: remove VkPhysicalDeviceMemoryProperties2 workaround

  • venus: use common entry point for VkPhysicalDeviceMemoryProperties

  • venus: resolve AHARDWAREBUFFER_FORMAT_Y8Cb8Cr8_420 to external format

  • venus: fix a bug on buffer cache init failure path

  • venus: fix the queue init failure path

  • venus: fix a missing mtx_destroy in vn_device_init

  • venus: adjust device resources init order

  • venus: let device track queue families used for device creation

  • venus: add vn_feedback_pool backed by coherent buffer (part 1)

  • venus: add vn_feedback_pool backed by coherent buffer (part 2)

  • venus: add feedback check/set/reset helpers

  • venus: add event feedback

  • venus: add NO_FENCE_FEEDBACK perf option and disable sparse resource

  • venus: create per queue family command pool for feedback cmd

  • venus: renderer to store allow_vk_wait_syncs capset

  • venus: add fence feedback

  • venus: resolve YV12 AHB to VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM

  • venus: use narrow range to match up with mesa EGL

  • venus: swizzle the chroma channels for YVU420 to match the VkFormat

  • venus: GetPhysicalDeviceSparseImageFormatProperties to match sparse support

  • venus: suballocate more for layering

  • venus: add more tracepoints for perf analysis

  • venus: filter out VK_EXT_physical_device_drm on the driver side

  • venus: check dynamic state for raster enablement

  • venus: reenable VK_EXT_extended_dynamic_state2

  • Revert “venus: suballocate more for layering”

  • anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support

  • venus: fix external memory ext filtering

  • venus: avoid feedback for external fence

  • venus: allow no external memory handle when renderers lacks support

  • zink: fix zink_create_fence_fd to properly import

  • zink: fix in-fence lifecycle

  • venus: ignore pInheritanceInfo if not secondary command buffer

Yogesh Mohan Marimuthu (12):

  • radeonsi/gfx11: make flat_scratch changes for graphics

  • radeonsi/gfx11: make flat_scratch changes for compute

  • radeonsi/gfx11: interp changes for 32bit

  • radeonsi/gfx11: interp changes for 16bit

  • radeonsi/gfx11: instruction cache line size is 128 bytes

  • radeonsi/gfx11: program inst_pref_size for compute

  • radeonsi/gfx11: program db render control register

  • radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there

  • ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending

  • vulkan/wsi: fix extra free if buffer_blit_queue

  • radeonsi: remove tabs from code

  • radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state()

Yogesh Mohanmarimuthu (1):

  • radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE

Yonggang Luo (119):

  • zink: Remove redundant framebuffer_mtx from zink_screen.h

  • win32: Fixes 32 bits visual studio module definition files by add script gen_vs_module_defs.py

  • win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols.

  • misc: Replace `#ifdef\t__cplusplus` with `#ifdef\s\s__cplusplus`

  • misc: replace `#ifdef\s\s__cplusplus` with `#ifdef\s__cplusplus`

  • zink: Trim spaces in zink.rst

  • zink: Improve zink.rst with links

  • st: Improve _mesa_error about GL_OUT_OF_MEMORY in st_cb_texture.c

  • llvmpipe: Do not use _Atomic keyword that doesn’t support by MSVC

  • radv: fixes msvc compiling error

  • radv: Fixes compiling error with msvc

  • radv: Add losing member hw_ip_version_major and hw_ip_version_minor for win32

  • radv: Add macros for win32 that accessed

  • nir: Fixes MSVC compiling error about unused variable `_`

  • egl/wgl: On win32, there is no support for EGL_EXT_device and EGL_EXT_platform_device

  • util: Fixes test_util_get_process_exec_path on windows host with msys2/mingw

  • util: Fixes typo in test_util_get_process_exec_path

  • ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson

  • radv: Remove usage of `cnd_monotonic.h`

  • d3d12: Fixes d3d12 compiling errors in `D3D12ResourceState.h` for mingw

  • d3d12: Move shared code that include d3d12 headers into d3d12_common.h

  • d3d12: Fixes compiling error by include `<unknwn.h>`

  • d3d12: `#include <windows.h>` instead `#include <Windows.h>` for building under linux with mingw

  • d3d12/wgl/test: Fixes wgl_tests.cpp for d3d12 with mingw

  • dzn: Fixes compiling error when build with msys2/mingw

  • dzn: Fixes compiling error by include `<unknwn.h>`

  • microsoft/clc: Fixes compiling of microsoft clc with mingw

  • docs: Fixes dispatch.rst with __THREAD_INITIAL_EXEC

  • meson: Fixes timespec_get detect code

  • c11/threads: Remove the detecting code for `HAVE_TIMESPEC_GET` in threads_win32.h

  • c11: Implement c11/time.h with c11/impl/time.c

  • c11: Implement timespec_get on win32 properly when not available

  • util: Use timespec_get directly, it’s always present

  • util: Remove usage of designated initializers in timespec_test.cpp

  • util: always enable timespec_test.cpp

  • util: Remove unused function timespec_passed

  • util: Add unittest for timespec_get

  • ci/vs2019: There is no meaning to set $env:ErrorActionPreference

  • ci/vs2019: switch to powershell 7

  • ci/vs2019: Update editorconfig for ps1 files

  • ci/vs2019: Give each option its own line in mesa_build.ps1

  • ci/vs2019: Rename to vsInstallPath

  • ci/vs2019: Do not install subprojects

  • ci/vs2019: Ignore cert install output, it’s too long

  • ci/vs2019: Ignore error when removing directory recursively

  • ci/vs2019: Disable progress when install python

  • ci/vs2019: Disable progess bar for pip3 install

  • ci/vs2019: Use shared VULKAN_SDK_VERSION for install and build vulkan related packages

  • ci/vs2019: Add a dedicated script for entering VS dev shell

  • ci/vs2019: Split call to C:\vs_buildtools.exe into multiple lines

  • ci/vs2019: Show the total vs2019 install time

  • ci/vs2019: Reduce the vs2019 build tools components to be installed

  • ci/vs2019: ignore _build and _install directory by write * into .gitignore

  • ci/vs2019: Install all build artifacts into C:\mesa-deps

  • ci/vs2019: Split choco install output

  • ci/vs2019: Split install of vs2019 and choco into separate Docker image

  • ci/vs2019: Remove comment for windows_test_vs2019 that not true

  • ci/vs2019: Convert mesa_build.ps1 to using PowerShell 7 && operator

  • microsoft/compiler: Include stddef.h before using size_t

  • microsoft/compiler: Fix error from double extern

  • dxil: Fixes compiling dxcapi.h with mingw.

  • dxil: Sync dxil_spirv_shader_stage with gl_shader_stage

  • dxil: Fixes warning about comparing different enum value and enum literal

  • microsoft/spirv_to_dxil: Delete unused local variable

  • microsoft/clc: Delete unused local variables

  • microsoft/clc: Add missing void to no-parameter function signature

  • microsoft/spirv_to_dxil: Fixes maybe-uninitialized compiling error in dxil_spirv_nir.c

  • ci: Building spirv-to-dxil on both mingw and debian/x64

  • intel: using C++11 keyword thread_local

  • glx: using C++11 keyword thread_local

  • util: Define WIN32_LEAN_AND_MEAN before include of `windows.h` in u_thread.h

  • c11: Move the implementation of threads.h into c source code

  • c11: Fixes memory leak of pack in thrd_create on win32

  • c11: Implement thread_local in c11/threads.h

  • util/c11: Update function u_thread_create to be c11 conformance

  • meson: Using get_argument_syntax as the `–compiler_id` option for gen_vs_module_defs.py

  • meson: Use cc.get_argument_syntax instead cc.get_id when possible.

  • meson/rtti: Using get_argument_syntax to check msvc compatible compiler

  • meson/clover: Getting clover can be built with no-rtti llvm in mingw

  • ci/vs2019: Upgrade Windows 10 SDK version to 20348

  • docs: Update windows build requirement with Visual Studio

  • d3d12: Use static_cast instead of dynamic_cast in d3d12_video_enc_h264.cpp

  • d3d12: Convert #include <Windows.h> to #include <windows.h> for mingw on linux

  • d3d12: Fixes compiling error in d3d12/wgl/d3d12_wgl_framebuffer.cpp with gcc

  • microsoft/clc: Fixes narrowing error in clc_compiler_test.cpp with mingw/gcc

  • microsoft/clc: Disable clc_compiler_test on non-windows platform

  • dzn: Fixes incompatible pointer type error

  • ci/x86_build: Getting pushd popd be paired, avoid using cd

  • ci: Prepare the container for building all mesa components with mingw under linux

  • ci: Trigger the new mingw/linux dockers to be build

  • d3d12: Turn d3d12_format.h to include d3d12_common.h

  • ci: Building all mesa functional with mingw on debian

  • meson: Enable wgl tests on mingw

  • CODEOWNERS: Update c11 code owners

  • CODEOWNERS: evelikov renamed to xexaxo

  • gallium: Move -DHAVE_PIPE_LOADER_DRI and -DHAVE_PIPE_LOADER_KMS to be pre_args

  • gallium: Do not call to pipe_loader_sw_probe_kms when not present

  • gallium: Rename macros HAVE_PIPE_LOADER*

  • gallium: Disable dri2 interface on OSX

  • glx: __glX_tls_Context should use __THREAD_INITIAL_EXEC

  • util: thread_local doesn’t support to be exported on OSX

  • glx: Convert tab to space in applegl_glx.c

  • dri: Trim trailing space in dri/dri_util.*

  • ci: Building osmesa on OSX

  • util: Disable tests/sparse_array_test.cpp on MacOS as it’s too slow

  • mapi: Trim trailing spaces in stub.c and u_current.c

  • meson: Remove usage of use_elf_tls

  • egl: Remove usage of USE_ELF_TLS macro

  • mapi: Remove usage of USE_ELF_TLS

  • loader: Remove usage of USE_ELF_TLS in loader.c

  • glx: Remove usage of USE_ELF_TLS

  • util: Remove usage of USE_ELF_TLS in u_thread.h

  • meson: Remove pre_args += ‘-DUSE_ELF_TLS’

  • glx: Remove usage of pipe_tsd_set and pipe_tsd_get

  • auxiliary: Remove pipe_tsd

  • d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library

  • microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp

  • util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func

  • c11: #include <threads.h> when the os/platform provide it

Yurii Kolesnykov (2):

  • Remove Travis CI config

  • Init macOS GitHub Action

Zack Rusin (4):

  • svga: Don’t try to build x86/x64 assembly on different arch’s

  • svga: finish readbacks before mapping resources

  • svga: Use direct maps when GB objects are present

  • svga: Add support for SVGAv3

Zhang, Jianxun (1):

  • iris: Wa_14016820455 for GFX_VERx10 == 12.5

illiliti (1):

  • Use proper types for meson objects

jheaff1 (1):

  • build(glx): Fix build by adding missing deps

naveen (1):

  • github/ci: Set permissions for GitHub actions

newbluemoon (1):

  • nine: replace ulimit with sysconf call

prashanth (1):

  • vulkan/device_select: force default device if environment variable set

sjfricke (2):

  • isl: fix bug where sb.MOCS is not being set

  • anv: fix assert to build with shader cache disabled

xperia64 (1):

  • wgl: Add driver_zink as a dependency of the wgl frontend

İlhan Atahan (1):

  • Add Adreno 616 and 620 to use turnip on these GPU’s .