moltenvk

Author	SHA1	Message	Date
Bill Hollings	6327b767e0	Reinstate memory barriers on non-Apple GPUs. Ensure non-Apple GPU's enable memory barriers. A previous commit inadvertently disabled GPU memory barriers. Change tests for memory barriers to runtime test for Apple GPU, instead of build-time test for Apple Silicon, to accommodate running on Rosetta2, and refactor tests for Apple Silicon and OS version on some macOS GPU feature settings.	2022-03-08 17:01:50 -05:00
Bill Hollings	e42b33e593	Don't attempt to store the depth component of a stencil-only renderpass attachment. For a combined depth-stencil format in a MVKImageView attachment with VK_IMAGE_ASPECT_STENCIL_BIT, the attachment format may have been swizzled to a stencil-only format. In this case, we want to guard against an attempt to store the non-existent depth component. Pass MVKImageView attachment to MVKRenderPassAttachment::encodeStoreAction() and MVKRenderPassAttachment::populateMTLRenderPassAttachmentDescriptor() to check attachment depth format component. Consolidate calls to MVKImageView::populateMTLRenderPassAttachmentDescriptor() by calling it from within MVKRenderPassAttachment::populateMTLRenderPassAttachmentDescriptor().	2022-02-24 11:42:12 -05:00
Bill Hollings	e28a16d76b	Update MoltenVK version number to 1.1.9. Adjust Whats_New.md to accommodate earlier trivial 1.1.8 patch release for SDK 1.3.204.1.	2022-02-22 14:18:55 -05:00
Bill Hollings	afd997ab31	Align flattened shader inputs to previous stage output structs. When flattening shader inputs for stage_in, which are to be read from a buffer that was populated as nested structs during an earlier stage, the structs will be aligned according to C++ rules, which can affect the alignment of the first member of the flattened input struct. Add SPIRVShaderOutput::firstStructMemberAlignment to track the alignment requirements of the first member of a nested structure, and recursively determine the alignment of the first member of each nested output structure. Move sizeOfOutput() from MVKPipeline.mm to SPIRVReflection.h, rename to getShaderOutputSize(), and add getShaderOutputAlignment() to extract member alignment.	2022-02-22 12:17:15 -05:00
Bill Hollings	f7ca132844	Update glslang version, to use python3 in glslang scripts, to replace missing python on macOS 12.3.	2022-02-16 14:02:01 -05:00
Bill Hollings	16408fd6ae	Remove logged warning if MoltenVK does not support VkApplicationInfo::apiVersion value. Update MoltenVK version to 1.1.8. Minor spelling fixes in comments.	2022-02-09 13:36:08 -05:00
Bill Hollings	24ff2106d9	Update dependency libraries to match Vulkan SDK 1.3.204. Update What's New document.	2022-02-06 19:55:44 -05:00
Nikita Fediuchin	4efb90b3c1	Update license year	2022-02-04 13:33:27 +02:00
Bill Hollings	9986e92f35	Merge pull request #1497 from billhollings/apple-silicon-deviceID On Apple Silicon, set VkPhysicalDeviceProperties::deviceID from GPU capabilities.	2021-12-28 22:08:44 -05:00
Bill Hollings	9633f4843d	Improve accuracy of VkPhysicalDeviceLimits::timestampPeriod. If using GPU counters, on all Apple GPUs lock timestampPeriod to 1.0, since Apple GPUs use nanoseconds, and on non-Apple GPUs, dynamically adapt value of timestampPeriod by correlating GPU ticks with GPU ticks. If using CPU sync, set timestampPeriod to OS CPU timestamp tick period.	2021-12-28 17:19:11 -05:00
Bill Hollings	df043487e4	On Apple Silicon, set VkPhysicalDeviceProperties::deviceID from GPU capabilities. Previously, on Apple Silicon (iOS, tvOS & macOS M1), we tried to guess deviceID from GPU parameters, but this is becoming harder as the types of Apple Silicon is growing, and the actual device SoC itself is less relevant that the GPU capabilities. So we now set deviceID from the combination of OS version and GPU type. Rename MVKDevice::getHighestMTLFeatureSet() to getHighestGPUCapability().	2021-12-27 16:45:12 -05:00
Bill Hollings	5810772644	Fix merge conflicts and syntax build error in iOS build.	2021-12-26 18:30:31 -05:00
Bill Hollings	92712e240a	Fix memory leak of dummy MTLTexture in render subpasses that use no attachments. Older Metal does not support rendering without subpass attachments. In this case, a dummy attachment with a dummy MTLTexture is created whenever the subpass begins, but was not being correctly used. Move the creation, retaining, and releasing of the dummy MTLTexture to MVKFramebuffer, where the extent and layer count is known and can be reused. Pass framebuffer to MVKCommandEncoder::beginRenderpass() and remember current framebuffer in MVKCommandEncoder. Add getFramebufferExtent() and getFramebufferLayerCount() to MVKCommandEncoder. Pass framebuffer to MVKRenderSubpass:populateMTLRenderPassDescriptor() and retrieve dummy MTLTexture from framebuffer.	2021-12-25 16:18:18 -05:00
Bill Hollings	355bfed457	Fix Metal object retain-release errors in assignment operators.	2021-12-25 12:58:55 -05:00
Bill Hollings	18642002ce	Updates to better support Rosetta2 runtimes, and MSL 2.4 and 2.3 versions. Do not use MTLEvent for VkSemaphore under Rosetta2. Remove compile test for MVK_MACOS_APPLE_SILICON and MVK_APPLE_SILICON when testing for Apple GPU families, to allow x86 builds to test for Apple GPU under Rosetta2. Simplify identifying M1 GPU. All M1 SoCs currently support the A14 (Apple7) GPU. Support compiling MSL 2.4 in runtime pipelines and MoltenVKShaderConverterTool. Fix issue where MSL 2.3 only available on Apple Silicon, even on macOS. Update to latest SPIRV-Cross (unrelated to Rosetta2).	2021-12-01 18:14:07 -05:00
Bill Hollings	5de7f5551c	Support building MoltenVK with static Vulkan linkage symbols hidden. Add build environment variable MVK_HIDE_VULKAN_SYMBOLS. to allow MoltenVK to be built with static Vulkan API symbols hidden, to avoid library linking conflicts when bound to a Vulkan Loader that also exports identical symbols. The default value of MVK_HIDE_VULKAN_SYMBOLS is 0, meaning Vulkan static symbols are exposed by default. Add MVK_PUBLIC_VULKAN_SYMBOL directive to mark each Vulkan call symbols for exporting or hiding. Update the MoltenVK Xcode project to add the MVK_HIDE_VULKAN_SYMBOLS build setting, and set the ENABLE_TESTABILITY build setting to NO, because it conflicts with stripping symbols. Update MoltenVK version to 1.1.7.	2021-11-17 18:22:33 -05:00
Bill Hollings	a0ed3345b6	Update library dependencies to match Vulkan SDK 1.2.198. Update What's New.md document.	2021-11-13 19:57:41 -05:00
Bill Hollings	6e054ad5db	Restore support for BC1_RGB compressed format. For Vulkan BC1_RGB formats, swizzle alpha of substituted Metal BC1_RGBA to 1.0, to return value expected by Vulkan.	2021-11-09 12:13:51 -05:00
Bill Hollings	bab17a52b7	Remove advertised support for BC1_RGB texel formats. Metal does not provide direct support for BC1_RGB formats (VK_FORMAT_BC1_RGB_UNORM_BLOCK & VK_FORMAT_BC1_RGB_SRGB_BLOCK). We have been faking it by mapping these Vulkan formats to Metal formats containing alpha (MTLPixelFormatBC7_RGBAUnorm & MTLPixelFormatBC7_RGBAUnorm_sRGB, respectively), and advertising support for BC1_RGB formats. However, this triggers CTS failures, because the BC1_RGBA formats can return an alpha value of 0.0 when constructed that way, whereas the BC1_RGB formats always expect 1.0 (opaque) to be returned. This change moves to indirect support for BC1_RGB formats. They will still be covered by MTLPixelFormatBC7_RGBAUnorm & MTLPixelFormatBC7_RGBAUnorm_sRGB, and will effectively work (except transparency), but are no longer advertised through physical device format and image format queries.	2021-10-31 19:00:25 -04:00
Bill Hollings	cc10af04c9	Ensure dynamic pipeline state always respects pipeline dynamic flags. Dynamic pipeline state set before the pipeline is set was reading dynamic flags from previous pipeline. This is fixed here by accepting the dynamic state, but deferring the decision to use either dynamic or static state until the pipeline is encoded.	2021-09-30 21:09:07 -04:00
Bill Hollings	36fae88ee2	Move multilayer-rendering validation from MVKImage to MVKImageView. An MVKImageView that renders to only one layer of a multilayer MVKImage is not performing multilayer-rendering. Only validate multilayer-rendering when it is definitely requested in an MVKImageView, instead of presuming when a multilayer MVKImage is marked for rendering.	2021-09-30 15:30:23 -04:00
Bill Hollings	ba9623cad5	Update to latest version of SPIRV-Cross. Add additional SPIRV-Cross header files to ExternalDependencies.xcodeproj to allow browsing these files in Xcode.	2021-09-30 13:45:54 -04:00
Bill Hollings	4ad5930263	Improved checks for timestamp GPU counter support on older devices. Check that appropriate GPU counter set is supported on the device before creating GPU counter sample buffer. Don't attempt to timestamp using GPU counters unless a GPU sample counter buffer has been created.	2021-09-13 17:20:00 -04:00
Bill Hollings	333084f739	Several updates for CTS test fixes. Support maximum point primitive size of 511. Update to latest SPIRV-Cross version to add support for OpSpecConstantOp ops OpQuantizeToF16 and OpSRem. Update MoltenVK version to 1.1.6.	2021-09-08 12:05:08 -04:00
Bill Hollings	a5ee3ead35	Update library dependencies to match Vulkan SDK 1.2.189. Update What's New document.	2021-08-30 13:54:37 -04:00
Bill Hollings	f5dd310c55	Merge pull request #1426 from billhollings/partial-clear-stencil Several renderpass clearing and input fixes from CTS	2021-08-23 19:42:13 -04:00
Bill Hollings	43a133ff10	Update to latest version of SPIRV-Cross. MSL: Support row-major transpose when storing matrix from constant RHS matrix. MSL: Fix casting in constant expressions with different sizes. MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused.	2021-08-23 18:31:45 -04:00
Bill Hollings	8a7e20d348	Fix GPU race condition when clearing a renderpass input attachment on Apple GPUs. Apple GPUs do not support rendering/writing to an attachment and then reading from that attachment within a single Metal renderpass. On Apple Silicon, restart the Metal renderpass if an input attachment is cleared inside renderpass. Don't clear render area when restarting Metal renderpass, as it should not occur, and itself causes a recursive loop restarting the renderpass as a result. Add MVKCommandUse::kMVKCommandUseRestartSubpass. MVKCommandEncoder::beginMetalRenderPass() pass MVKCommandUse to help determine Metal renderpass attachment load and clearing options.	2021-08-23 14:28:20 -04:00
Bill Hollings	ce583f4f3d	Support stencil-only partial attachment clearing. Rename MVKRenderPassAttachment::shouldUseClearAttachment() to shouldClearAttachment(), pass whether testing for stencil clearing, and check stencil clearing distinct from depth clearing. Refactor MVKRenderSubpass::populateClearAttachments() to work with this. Remove MVKRenderPassAttachment::getAttachmentStencilLoadOp() as obsolete.	2021-08-19 17:22:29 -04:00
Bill Hollings	8c7db31cd7	Prefer MTLEvent for VkSemaphore, except on NVIDIA, prefer emulation. Disable MVK_ALLOW_METAL_FENCES by default. Disable use of MTLEvent on NVIDIA. By default, use MTLEvent for VkSemaphore everywhere except NVIDIA. By default, use CPU synchronization on NVIDIA. These changes fix a large number of CTS synchronization test failures.	2021-08-10 11:32:21 -04:00
Bill Hollings	8e6731fd8e	Revert to prefer MTLEvent over MTLFence for VkSemaphore, except on NVIDIA. Prefer MTLEvent over MTLFence for VkSemaphore, because MTLEvent handles sync across MTLCommandBuffers and MTLCommandQueues, except on NVIDIA GPUs, which have demonstrated trouble with MTLEvents, prefer MTLFence. Add MVKDevice::VkSemaphoreStyle enum.	2021-08-10 10:29:09 -04:00
Bill Hollings	12f0089d0c	Support resolving attachments with formats that Metal does not natively resolve. Metal does not support resolving all formats that support MSAA, whereas Vulkan assumes any MSAA format can be resolved. We fix that by running an optional post-renderpass compute shader that resolves such textures by simply taking the first sample as the resolved sample. This works to fix all failing CTS tests, because such formats are all integer formats, and Vulkan allows an arbitrary single sample value to be selected. If we need to resolve, but the Metal format doesn't support it, cause the Metal renderpass to store the MSAA attachment results. MVKRenderSubpass don't establish Metal resolve attachment textures if format is not natively resolvable, and encode Metal renderpass store actions accordingly. MVKCommandEncodingPool add MTLComputePipelineStates to run simple resolve compute shaders on attachments that cannot be resolved in Metal renderpass. Add MVKRenderSubpass::resolveUnresolvableAttachments() and call from MVKCommandEncoder::endMetalRenderEncoding(), before subpass index is updated. Rename MVKCommandEncodingPool::getClearStateIndex() to getRenderpassLoadStoreStateIndex() and remove MVK_MACOS restriction on clearing shaders to allow compatibility with resolve shader handling. MVKRenderPassAttachment remove validation of whether a format can be resolved. MVKPixelFormats::getMTLTextureUsage() add read and write usage as appropriate to allow compute shader to run to resolve formats not natively resolvable. MVKPixelFormats remove obsolete unit test code. MVKImageView clean up access functions and obsolete constructor use of MVKConfiguration.	2021-08-03 18:47:13 -04:00
Bill Hollings	086c680436	Fix pipeline barriers not working inside self-dependent subpasses on Apple GPUs. On Apple GPUs, MVKCmdPipelineBarrier restarts Metal renderpass when encountered inside self-dependent Vulkan subpass where the same attachment acts as both input attachment and render attachment.	2021-07-30 15:25:59 -04:00
Bill Hollings	665dbfd632	Fix issue with vkCmdBlitImage() from compressed textures.	2021-07-27 18:08:50 -04:00
Bill Hollings	b40dba904d	Fix issue where swapchain images were acquired out of order under heavy load. Move update of MVKSwapchainImageAvailability::acquisitionID to the acquisition time instead of the become available again time, so other images will be preferred if either all images are available or no images are available.	2021-07-27 15:42:43 -04:00
Bill Hollings	2adb24bde2	Merge master branch into timestamp-using-metal-gpu-counters branch.	2021-07-23 12:42:27 -04:00
Bill Hollings	386bde9c78	Update to latest SPIRV-Cross version. MSL: Adjust gl_SampleMaskIn for sample-shading and/or fixed sample mask. MSL: Fix setting SPIRVCrossDecorationInterpolantComponentExpr decoration. MSL: Simplify spvSubgroupBallot().	2021-07-22 11:03:52 -04:00
Bill Hollings	6ae1745a9c	Vulkan timestamp query pools use Metal GPU counters when available. Add MVKPhysicalDeviceMetalFeatures::counterSamplingPoints to track platform availability of GPU counters. MVKPhysicalDevice creates and manages MTLCounterSets and checks for and enables flags within MVKPhysicalDeviceMetalFeatures::counterSamplingPoints. Add abstract MVKGPUCounterQueryPool class as parent of MVKTimestampQueryPool and MVKPipelineStatisticsQueryPool concrete classes and refactor access to host and command copy tracking data to allow extraction from MTLCounterSampleBuffer. MVKTimestampQueryPool uses MTLCounterSampleBuffer if supported, otherwise reverts to using host data for timestamps. MVKCommandEncoder encodes Vulkan timestamp commands either as Metal staged or command timestamps, depending on whether the GPU is tile-based or immediate-mode. For Metal stage counters, we use a light-weight dummy BLIT encoder to mark timestamp commands executed in the previous Metal encoding pass. Add MVKDevice::getDummyBlitMTLBuffer() to supply a dummy single-byte buffer that can be used by a stand-alone MTLBlitCommandEncoder as dummy work to mark timestamps.	2021-07-20 22:13:04 -04:00
Bill Hollings	b1a8b59f66	Support alpha-to-coverage without a color attachment. If alpha-to-coverage is enabled, we must enable the fragment shader first color output, even without a color attachment present or in use, so that coverage can be calculated.	2021-07-10 15:38:02 -04:00
Bill Hollings	b3cf74401f	Disable VK_FORMAT_FEATURE_COLOR_ATTACHMENT_BLEND_BIT for VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 on macOS Apple Silicon. On Apple Silicon (iOS/tvOs/macOS M1), format VK_FORMAT_E5B9G9R9_UFLOAT_PACK32 is fully supported as a color attachment except that format components cannot be individually write-enabled. All components must either be write-enabled or write-disabled together. This is causing several hundred Vulkan CTS blending tests to fail on M1. The least intrusive behavioural change to allow the CTS tests to report Not Supported instead, is to disable blending for this format.	2021-07-08 15:52:34 -04:00
Bill Hollings	53a2223abd	Fix swizzle of depth and stencil values into RGBA (float4) variable in shaders. MVKImageViewPlane track Vulkan component swizzle instead of just packed swizzle, and modify swizzle when using depth or stencil sampling or reading.	2021-07-06 17:57:47 -04:00
Bill Hollings	2d30c0ae13	Fix incorrect translation of clear color values on Apple Silicon. The same set of CTS tests either fails or passes on different GPUs based on whether or not we adjust float clear colors by one ULP. Add MVKFloatRounding enum. Add MVKPhysicalDeviceMetalFeatures::clearColorFloatRounding. Disable ULP adjustment for clear colors on Apple Silicon. For consistency and to simplify bookkeepping, calculate clear color ULP adjustment from bit width of format component. Update MoltenVK version to 1.1.5. Update VK_MVK_MOLTENVK_SPEC_VERSION to 32.	2021-07-04 11:41:39 -04:00
Bill Hollings	4f7f0dc209	Merge pull request #1392 from billhollings/sdk-1.2.182 Update dependency libraries to match Vulkan SDK 1.2.182.	2021-06-28 08:11:38 -04:00
Bill Hollings	e3cf071ace	Merge pull request #1387 from billhollings/occlusion-query-fixes-for-M1 Occlusion query fixes for M1	2021-06-28 08:10:17 -04:00
Bill Hollings	55c5cee233	Update dependency libraries to match Vulkan SDK 1.2.182.	2021-06-25 15:32:34 -04:00
Bill Hollings	84e8e9ed73	Merge pull request #1389 from billhollings/add-cts-script Add Scripts/runcts script as a convenience for running Vulkan CTS tests.	2021-06-24 13:14:16 -04:00
Bill Hollings	25d1349579	Add Scripts/runcts script as a convenience for running Vulkan CTS tests. Update .gitignore to ignore CTS artifacts in Scripts directory.	2021-06-22 19:45:24 -04:00
Bill Hollings	664296abd0	Fix small memory leak during swapchain creation. Add ability to profile Cube demo on macOS.	2021-06-22 11:50:58 -04:00
Bill Hollings	e72cd614e1	Fix issue where M1 GPU does not support reusing Metal visibility buffer offsets across separate render encoders within a single Metal command buffer (Vulkan submit). Add MVKCommandEncodingContext to track information across multiple MVKCommandEncoders, and use it to track temporary visibility buffer and offset. Add support for more than one temporary visibility buffer per MTLCommandBuffer when current temporary visibility buffer is exhausted.	2021-06-21 11:17:40 -04:00
Bill Hollings	6b502bcb60	Upgrade projects to Xcode 13 SDK APIs. Add MVK_XCODE_13 code macro. Support MTLLanguageVersion2_4 enum value. MoltenVKPackaging project add DISABLE_MANUAL_TARGET_ORDER_BUILD_WARNING build setting to suppress build warnings about using manual build orders.	2021-06-14 16:24:32 -04:00

... 2 3 4 5 6 ...

546 Commits