dynarmic

Author	SHA1	Message	Date
Lioncash	2047683777	emit_x64_vector: Correct static asserts for < 64-bit type checks in saturated accumulate fallbacks I had initially meant to use BitSize() here, not sizeof()	2018-09-11 07:08:32 +01:00
MerryMage	55e9e401aa	emit_x64_vector: EmitVectorSignedSaturatedAccumulateUnsigned64: SSE implementation	2018-09-10 22:39:30 +01:00
MerryMage	1076651426	emit_x64_vector: Simplify fpsr_qc related code Move the bool conversion into A64JitState::GetFpsr so we don't have to continuously pay the cost of conversion for every saturation instruction.	2018-09-10 21:24:07 +01:00
Lioncash	4039030234	A64: Implement CLZ's vector variant	2018-09-10 18:30:40 +01:00
Lioncash	0bb908fb53	ir: Add opcodes for vector CLZ operations We can optimize these cases further for with the use of a fair bit of shuffling via pshufb and the use of masks, but given the uncommon use of this instruction, I wouldn't consider it to be beneficial in terms of amount of code to be worth it over a simple manageable naive solution like this. If we ever do hit a case where vectorized CLZ happens to be a bottleneck, then we can revisit this. At least with AVX-512CD, this can be done with a single instruction for the 32-bit word case.	2018-09-10 18:30:40 +01:00
MerryMage	3b13259630	A64/translate: VectorZeroUpper for V(64) stores Ensures correctness.	2018-09-09 19:59:02 +01:00
MerryMage	1931d44495	simd_two_register_misc: FNEG (vector) with Q == 0 had dirty upper	2018-09-09 19:55:37 +01:00
Lioncash	a0790f02d0	emit_x64_vector: Remove unnecessary [[maybe_unused]] attributes These were unintentionally left in when introducing SUQADD and USQADD	2018-09-09 19:30:14 +01:00
Lioncash	b0e1eb5a15	A64: Implement USQADD's scalar and vector variants	2018-09-09 17:06:03 +01:00
Lioncash	28424c7ad1	ir: Add opcodes form unsigned saturated accumulations of signed values	2018-09-09 17:06:03 +01:00
Lioncash	9923ea0b71	A64: Implement SUQADD's scalar and vector variants	2018-09-09 17:06:03 +01:00
Lioncash	4c0adbb7f1	ir: Add opcodes for signed saturated accumulations of unsigned values	2018-09-09 17:06:03 +01:00
Lioncash	799bfed2df	A64: Implement SMLAL{2}, SMLSL{2}, UMLAL{2}, and UMLSL{2}'s vector by-element variants We can simply modify the general function made for SMULL{2} and UMULL{2}'s by-element variants to also handle the other multiply-based by-element variants.	2018-09-09 13:55:40 +01:00
Lioncash	94451ec321	A64: Implement UMULL{2}'s vector by-element variant	2018-09-09 13:55:40 +01:00
Lioncash	45867deac9	A64: Implement SMULL{2}'s vector by-element variant	2018-09-09 13:55:40 +01:00
Lioncash	02357939ac	ir/value: Replace includes with forward declarations enum classes are still considered complete types when forward declared (as the compiler knows the exact size of the type from the declaration alone). The only difference in this case being that the members of the enum class aren't visible. Given we don't use the members within this header in any way, we can simply forward declare them here and remove the inclusions.	2018-09-09 09:04:22 +01:00
Lioncash	450f721df5	ir/cond: Migrate to C++17 nested namespace specifiers	2018-09-09 09:03:42 +01:00
Lioncash	e649988cd6	CMakeLists: Add missing cond.h header to file listing Allows the file to show up within IDEs more easily.	2018-09-09 09:03:42 +01:00
Lioncash	d20e7694dd	A64: Implement URSQRTE	2018-09-09 00:37:28 +01:00
Lioncash	4f3bde5f12	ir: Add opcodes for performing unsigned reciprocal square root estimates	2018-09-09 00:37:28 +01:00
Lioncash	cfeeaec1c6	A64: Implement URECPE	2018-09-09 00:37:28 +01:00
Lioncash	622b60efd6	ir: Add opcodes for unsigned reciprocal estimate	2018-09-09 00:37:28 +01:00
Lioncash	d17599af40	Update Xbyak to 5.71 Merge commit 'f7c26e9f7ace572f440b80b0e71625295755c38b'	2018-09-08 17:09:25 -04:00
Lioncash	f7c26e9f7a	Squashed 'externals/xbyak/' changes from 671fc805..1de435ed 1de435ed bf uses Label class 613922bd add Label L() for convenience 43e15583 fix typo 93579ee6 add protect-re.cpp 60004b5c fix url of protect-re.cpp 348b2709 fix typo of doc f34f6ed5 update manual 232110be update test 82b78bf0 add setProtectMode dd8b290f put warning message if pageSize != 4096 64775ca2 a little refactoring 7c3e7b85 fix wrong VSIB encoding with idx >= 16 git-subtree-dir: externals/xbyak git-subtree-split: 1de435ed04c8e74775804da944d176baf0ce56e2	2018-09-08 16:52:55 -04:00
Lioncash	8782b69c93	travis: Make macOS build with Xcode 9.4.1 Builds against the latest release version of the Xcode toolchain	2018-09-08 13:21:58 +01:00
Lioncash	b575b23ea9	A64: Implement SQNEG's scalar and vector variant	2018-09-08 11:23:32 +01:00
Lioncash	06062a91c5	A64: Add opcodes for signed saturating negations	2018-09-08 11:23:32 +01:00
Lioncash	1c40579de5	emit_x64_vector: Simplify "position == 0" case for EmitVectorExtract() In the event position is zero, we can just treat it as a NOP, given there's no need to move the data.	2018-09-08 11:23:32 +01:00
Lioncash	e335050886	emit_x64_vector: Simplify "position == 0" case for EmitVectorExtractLower() In the event position == 0, we can just treat it as a simple movq, clearing the upper half of the XMM register. This also makes that case use only one register.	2018-09-08 11:23:32 +01:00
Lioncash	8b13421bac	A64: Implement SQDMULH's by-element scalar variant	2018-09-08 11:23:32 +01:00
Lioncash	9122a6e19e	A64: Implement SQDMULH's by-element vector variant	2018-09-08 11:23:32 +01:00
MerryMage	176e60ebb1	backend/x64: Do not clear fast_dispatch_table if not enabled There is no need to pay for the cost of setting a large block of memory if we're not using it.	2018-09-08 11:23:32 +01:00
MerryMage	959446573f	A64: Implement FastDispatchHint	2018-09-07 22:07:44 +01:00
MerryMage	2be95f2b3b	A32: Implement FastDispatchHint	2018-09-07 22:07:44 +01:00
MerryMage	96f23acd00	ir/terminal: Add FastDispatchHint	2018-09-07 21:29:47 +01:00
Lioncash	f5ca9e9e4a	A64: Implement SQDMULH's scalar variant	2018-09-06 20:35:43 +01:00
Lioncash	af8bea59d5	ir: Add opcodes for scalar signed saturated doubling multiplies	2018-09-06 20:35:43 +01:00
Lioncash	fed4220dc0	A64: Implement SQDMULH's vector variant	2018-09-06 20:35:43 +01:00
Lioncash	72eb6ad362	ir: Add opcodes for signed saturated doubling multiplies	2018-09-06 20:35:43 +01:00
Lioncash	0f8ae84c80	externals: Update catch to 2.4.0 Keeps the unit testing library up to date.	2018-09-06 20:33:10 +01:00
Lioncash	235165ba70	A64: Implement SQABS' scalar variant	2018-09-06 15:49:25 +01:00
Lioncash	1adca93d4a	A64: Implement SQABS' vector variant.	2018-09-06 15:49:25 +01:00
Lioncash	f978c445fa	ir: Add opcodes for signed saturated absolute values	2018-09-06 15:49:25 +01:00
MerryMage	d895a84e72	emit_x64_floating_point: EmitFPToFixed: maxsd optimization maxsd is not required when doing a signed conversion, because x64 produces a 0x80...00 value for out of range values.	2018-09-06 15:44:09 +01:00
MerryMage	c624fe3ff3	emit_x64_floating_point: ZeroIfNaN: pxor -> xorps xorps is shorter and more appropriate here.	2018-09-05 22:00:36 +01:00
MerryMage	e987a84062	IR: Simplify FP{Single,Double}ToFixed{U,S}{32,64}	2018-09-05 21:04:40 +01:00
Lioncash	f1babc8924	externals: Update catch to 2.3.0 Keeps the unit-testing library up to date.	2018-09-03 17:56:02 +01:00
Lioncash	a0c587ac1a	A32/decoder: Add missing <algorithm> includes These includes should be present, as we use std::find_if() within these headers.	2018-09-03 13:53:26 +01:00
Lioncash	0435ac2d80	emit_x64_vector: Provide AVX path for EmitVectorMinU64()	2018-08-31 21:15:48 +01:00
Lioncash	5e40b8fcf8	emit_x64_vector: Provide AVX path for EmitVectorMinS64()	2018-08-31 21:15:48 +01:00

1 2 3 4 5 ...

1666 Commits