This was a good heuristic, because the index buffer must be bound for
indexed draws. However, it may also be bound for non-indexed draws--for
example, if an indexed draw is immediately followed by a non-indexed
draw, as happens in some `dEQP-VK.synchronization.*` tests. Therefore,
we can't tell from the presence or absence of the index buffer what kind
of draw we're in. We'll have to keep track of this state in the command
encoder.
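A minimal sketch of what that tracking might look like; the type and member names here are illustrative, not MoltenVK's actual ones:
```objc
// Sketch: record the kind of the current draw explicitly, instead of
// inferring it from whether an index buffer happens to be bound.
typedef enum { MVKDrawKindNone, MVKDrawKindNonIndexed, MVKDrawKindIndexed } MVKDrawKind;

typedef struct {
    MVKDrawKind currentDrawKind;   // updated by every draw command
} EncoderDrawState;

// Each draw command records its own kind before encoding:
//   state.currentDrawKind = MVKDrawKindIndexed;     // vkCmdDrawIndexed*
//   state.currentDrawKind = MVKDrawKindNonIndexed;  // vkCmdDraw*
```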
According to the Vulkan spec:
> If `VK_WHOLE_SIZE` is used and the remaining size of the buffer is not
> a multiple of 4, then the nearest **smaller** multiple is used.
> [emphasis added]
Therefore, we should round down when calculating the number of words to
write.
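Concretely (a sketch; I'm assuming the `vkCmdFillBuffer` path, and the parameter names are illustrative), truncating integer division already gives the right answer:
```objc
#include <vulkan/vulkan.h>

// Sketch: number of 32-bit words to fill. Integer division truncates,
// which yields the nearest smaller multiple of 4 bytes, as the spec requires.
static VkDeviceSize wordsToFill(VkDeviceSize bufferSize, VkDeviceSize dstOffset, VkDeviceSize size) {
    VkDeviceSize byteCount = (size == VK_WHOLE_SIZE) ? (bufferSize - dstOffset) : size;
    return byteCount / 4;   // round down, never up
}
```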
We only want the window server to use the high-performance GPU if we
will use it to present to the display. If we won't use it to present, we
can save some battery life by not pointing the window server at the
high-performance GPU at all. I had hoped this would also help window
server stability in case something goes horribly wrong while using the
GPU, but my experience has sadly not borne this out.
My testing shows that the device returned by
`MTLCreateSystemDefaultDevice()` is exactly equal (i.e. has the same
pointer value) to one of the devices returned by `MTLCopyAllDevices()`,
so we should see no problems from doing this at swapchain create time
instead of device create time.
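For illustration, a sketch of that identity check, assuming macOS, where `MTLCopyAllDevices()` is available:
```objc
#import <Metal/Metal.h>

// Sketch: confirm the system default device is, by pointer identity,
// one of the devices returned by MTLCopyAllDevices().
static BOOL defaultDeviceIsInAllDevices(void) {
    id<MTLDevice> defaultDev = MTLCreateSystemDefaultDevice();
    for (id<MTLDevice> dev in MTLCopyAllDevices()) {
        if (dev == defaultDev) { return YES; }   // same object, not merely equal
    }
    return NO;
}
```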
I anticipated this, and I tried to design the support in SPIRV-Cross so
that they would just work. And they do... well, they work as well as
32-bit types currently do, which is to say, there's plenty of room for
improvement.
Believe it or not, this is valid usage. If an image is aliasable and it
has a dedicated alloc, it is valid for multiple images to bind to the
dedicated memory. Some tests actually try this--for example, the
`dEQP-VK.device_group.afr_dedicated` test.
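A sketch of the pattern such tests exercise; presumably the images are created with `VK_IMAGE_CREATE_ALIAS_BIT`, and error handling is omitted:
```objc
#include <vulkan/vulkan.h>

// Sketch: two aliasable images both bound to memory that was allocated
// with VkMemoryDedicatedAllocateInfo::image pointing at imageA.
// Dedicated allocations are bound at offset 0.
static void bindAliasedImages(VkDevice device, VkImage imageA, VkImage imageB,
                              VkDeviceMemory dedicatedMem) {
    vkBindImageMemory(device, imageA, dedicatedMem, 0);
    vkBindImageMemory(device, imageB, dedicatedMem, 0);   // valid: the images alias
}
```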
Normally, we would have to check the framebuffer, but we don't know its
contents until draw time. To avoid yet another situation where we must
compile multiple pipelines, I've used a simple heuristic: if the vertex
pipeline writes to `BuiltInLayer`, this is likely for a layered
framebuffer, and we should use `texture_2darray` for subpass input.
Hopefully this is good enough for all intents and purposes. If not, then
we really will have to wait until draw time. And God help us if someone
tries to do this with a 3D texture!
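A sketch of the heuristic itself; `vertexShaderWritesLayer()` is a hypothetical helper that would inspect the converted shader's outputs for `BuiltInLayer`, not a real MoltenVK or SPIRV-Cross function:
```objc
#import <Metal/Metal.h>

// Hypothetical helper: true if the vertex pipeline writes BuiltInLayer.
extern bool vertexShaderWritesLayer(const void* vtxShader);

// Sketch: assume a layered framebuffer iff the Layer built-in is written,
// and choose the subpass input texture type accordingly.
static MTLTextureType subpassInputTextureType(const void* vtxShader) {
    return vertexShaderWritesLayer(vtxShader) ? MTLTextureType2DArray
                                              : MTLTextureType2D;
}
```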
At this point in device initialization, the device properties have not
yet been initialized. Unfortunately, this includes the vendor ID, on
which the maximum SIMD-group size depends. Initialize that property so
we can use it to set the subgroup size correctly.
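A sketch of the dependency; the 64-wide value for AMD (PCI vendor ID 0x1002) and 32 elsewhere are assumptions based on typical hardware, not values taken from MoltenVK's tables:
```objc
#include <stdint.h>

// Sketch: derive the maximum SIMD-group size from the vendor ID.
static uint32_t maxSIMDGroupSize(uint32_t vendorID) {
    const uint32_t kVendorIDAMD = 0x1002;
    return (vendorID == kVendorIDAMD) ? 64 : 32;   // AMD wave64 vs. 32-wide SIMD-groups
}
```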
If a shader uses an input attachment and doesn't do layered rendering,
but the image view is of type `MTLTextureType2DArray`, Metal's
validation layer will complain about the texture type mismatching what
the shader expects. This change makes the texture types line up.
If there are no attachments and `renderTargetWidth` and
`renderTargetHeight` are zero, the Metal validation layer complains. To
prevent this, ensure both are at least 1.
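A minimal sketch of the clamp, using `MTLRenderPassDescriptor`'s `renderTargetWidth` and `renderTargetHeight` properties:
```objc
#import <Metal/Metal.h>

// Sketch: with no attachments, never hand Metal a zero render target extent.
static void setRenderTargetExtent(MTLRenderPassDescriptor* rpDesc,
                                  NSUInteger width, NSUInteger height) {
    rpDesc.renderTargetWidth  = MAX(width,  1);
    rpDesc.renderTargetHeight = MAX(height, 1);
}
```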
One aspect of the `VK_EXT_vertex_attribute_divisor` spec that I
apparently missed is that, for a vertex buffer binding with a divisor of
0, the base instance determines where the attributes are read from. This
cannot be expressed in Metal, but we can emulate it by offsetting the
buffer by `firstInstance * stride`.
Unfortunately, we can't do this for indirect draws. If a program tries
this, we're hosed.
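A sketch of the emulation for direct draws, where `firstInstance` is known on the CPU; the parameter names are illustrative:
```objc
#include <vulkan/vulkan.h>

// Sketch: bake the base instance into the vertex buffer bind offset when
// the binding's divisor is 0. Indirect draws can't be handled this way,
// because firstInstance lives in GPU memory.
static VkDeviceSize adjustedBindOffset(VkDeviceSize offset, uint32_t stride,
                                       uint32_t divisor, uint32_t firstInstance) {
    if (divisor == 0) { offset += (VkDeviceSize)firstInstance * stride; }
    return offset;
}
```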
When the extent covers both the source and destination images
completely, we can use the copy method on `MTLBlitCommandEncoder` which
can copy multiple slices at once. This should hopefully reduce CPU
overhead and command buffer memory usage.
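For reference, a sketch of the multi-slice variant (availability on older OS versions is an assumption worth checking; the wrapper itself is illustrative):
```objc
#import <Metal/Metal.h>

// Sketch: when the region covers both textures completely, copy every
// slice of one mip level in a single blit command.
static void copyAllSlices(id<MTLBlitCommandEncoder> blit,
                          id<MTLTexture> src, id<MTLTexture> dst,
                          NSUInteger level, NSUInteger sliceCount) {
    [blit copyFromTexture: src
              sourceSlice: 0
              sourceLevel: level
                toTexture: dst
         destinationSlice: 0
         destinationLevel: level
               sliceCount: sliceCount
               levelCount: 1];
}
```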
Combine MoltenVKSPIRVToMSLConverter and MoltenVKGLSLToSPIRVConverter
frameworks into a single MoltenVKShaderConverter framework.
Update corresponding directory structures, symlinks, scripts, and build paths.
Update MoltenVK code to use the new framework name for headers.
Add symlinks in API-Samples demo to support legacy
MoltenVKGLSLToSPIRVConverter header paths.
In addition to simplifying shader converter code and build management, the
use of only one shader converter framework fixes a race condition within Xcode,
prior to Xcode 12, when multiple targets use the same dependency XCFramework.
When calculating the vertices, we need to use the render area's
extent--but only if the implementation supports constraining the render
area using `renderTargetWidth` and `renderTargetHeight`. Otherwise, the
quad will be stretched and/or squashed because of the render area
constraint.
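A sketch of the divisor choice when mapping a clear-quad vertex into clip space; all names here are illustrative:
```objc
#include <stdbool.h>
#include <stdint.h>

// Sketch: normalize against the render area only when the implementation
// constrains the render target to it; otherwise use the full framebuffer.
static float clearQuadClipX(uint32_t x, uint32_t renderAreaWidth,
                            uint32_t framebufferWidth, bool canConstrainRenderTarget) {
    uint32_t divisor = canConstrainRenderTarget ? renderAreaWidth : framebufferWidth;
    return (2.0f * x) / divisor - 1.0f;
}
```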
Fall through to the 2D case, so all the special handling for 2D is used
for 1D as well. Also, make sure 1D doesn't report multisampling or
support for 420 subsampled formats. There is no
`MTLTextureType1DMultisample` anyway.
Also, clear the `VkImageFormatProperties` struct if the format is not
supported with the given parameters. Some tests seem to expect this.
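A minimal sketch of that failure path:
```objc
#include <string.h>
#include <vulkan/vulkan.h>

// Sketch: zero the output struct before reporting the format unsupported,
// since some tests check its contents even on failure.
static VkResult failFormatQuery(VkImageFormatProperties* pProps) {
    memset(pProps, 0, sizeof(*pProps));
    return VK_ERROR_FORMAT_NOT_SUPPORTED;
}
```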
We don't want to do this for stencil attachment views, because we use
the original packed depth/stencil format in render pipelines, and
Metal's validation layer for some reason doesn't consider packed formats
and their corresponding stencil view formats to match. So only do this
if the image view usage includes `SAMPLED` or `INPUT_ATTACHMENT`.
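A sketch of the resulting usage check:
```objc
#include <stdbool.h>
#include <vulkan/vulkan.h>

// Sketch: substitute the stencil view format only for sampling or subpass
// input, never for attachment use, where the packed format must be kept.
static bool shouldUseStencilViewFormat(VkImageUsageFlags usage) {
    return (usage & (VK_IMAGE_USAGE_SAMPLED_BIT |
                     VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT)) != 0;
}
```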
If the image has a format that supports atomic access, or can be cast to
a format which supports atomic access, then use a texel buffer,
regardless of the memory type. If we can't use the `MTLBuffer` from the
device memory, then create our own.
For #1027.
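A sketch of the decision, with hypothetical helpers standing in for the real format queries:
```objc
#include <stdbool.h>
#include <vulkan/vulkan.h>

// Hypothetical helpers, not MoltenVK's actual API.
extern bool formatSupportsAtomics(VkFormat fmt);
extern bool isCastableToAtomicFormat(VkFormat fmt);

// Sketch: a texel buffer is needed whenever atomic access is possible,
// directly or via a format cast, regardless of the memory type.
static bool needsTexelBuffer(VkFormat fmt) {
    return formatSupportsAtomics(fmt) || isCastableToAtomicFormat(fmt);
}
```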