MerryMage
158d9b16f0
A64: Implement SQSHRUN, SQRSHRUN (vector)
2018-07-24 17:20:49 +01:00
MerryMage
f886013526
simd_shift_by_immediate: Simplify ShiftRight
2018-07-24 16:38:51 +01:00
MerryMage
d9b59c69de
A64: Implement SQXTUN
2018-07-24 16:32:10 +01:00
MerryMage
50fe28b976
microinstruction: Reorganize FPSCR related instruction queries
2018-07-24 12:13:18 +01:00
Lioncash
d9d036acc9
microinstruction: Add missing FP scalar opcodes to ReadsFromFPSCR() and WritesToFPSCR()
...
These were forgotten when the opcodes were added.
2018-07-24 11:55:15 +01:00
Lioncash
db96163637
u128: Make Bit() a const-qualified member function
...
This function doesn't modify the struct members, so it can be made
const.
2018-07-24 09:15:44 +01:00
MerryMage
f7052ae04d
A64: Implement FRSQRTS (vector), single/double variant
2018-07-23 22:58:52 +01:00
MerryMage
0925ef6248
A64: Implement FRSQRTE (vector), single/double variant
2018-07-23 22:46:12 +01:00
MerryMage
f4cbbe3218
A64: Implement FRSQRTS (scalar), single/double variant
2018-07-23 22:05:17 +01:00
MerryMage
4ef864e81c
IR: Implement FPRSqrtStepFused
2018-07-23 22:05:17 +01:00
MerryMage
9dffeebc44
fp: Implement FPRSqrtStepFused
2018-07-23 22:05:17 +01:00
MerryMage
aa0455667e
fp: Implement FPNeg
2018-07-23 22:03:07 +01:00
MerryMage
cbde1c5a15
process_nan: Add two operand variant
2018-07-23 22:03:07 +01:00
Lioncash
1ec2663de3
A64: Implement FMAXP, FMINP, FMAXNMP and FMINNMP's scalar double/single-precision variant
2018-07-23 21:22:32 +01:00
MerryMage
027ddf9e2c
emit_x64_floating_point: Fixup special NaN case in FMA FPMulAdd implementation
2018-07-23 21:10:52 +01:00
Lioncash
75a9f7799a
fp: Use a forward declaration in fused.h
...
It's permissible to forward declare here, so we can do so and eliminate
a direct header dependency
2018-07-23 20:46:34 +01:00
Lioncash
1ee16303bd
u128: Implement comparison operators in terms of one another
...
We can just implement the comparisons in terms of operator< and
implement inequality with the negation of operator==.
2018-07-23 20:23:10 +01:00
MerryMage
3b77f48a76
tests: Print cpu info
2018-07-23 20:22:38 +01:00
MerryMage
bed3cc03f9
u128: StickyLogicalShiftRight requires special-casing for amount == 64
...
In this case (128 - amount) == 64, and this invokes undefined behaviour
2018-07-23 20:22:01 +01:00
Lioncash
15d04f489b
A64: Implement FMLA and FMLS (by element)'s double/single-precision scalar variant
2018-07-23 19:13:39 +01:00
Lioncash
7cfccdfa29
A64: Implement FMUL (by element)'s scalar double/single-precision variant
2018-07-23 19:13:39 +01:00
MerryMage
7d2d62ece7
emit_x64_floating_point: Implement accurate fallback for FPMulAdd{32,64}
2018-07-23 18:52:09 +01:00
MerryMage
a599eacebf
fp: Implement FPMulAdd
2018-07-23 18:52:07 +01:00
MerryMage
d70b90ed5d
process_nan: Add FPProcessNaNs3
2018-07-23 18:51:36 +01:00
MerryMage
38ef0e04cb
block_of_code: Add SysV ABI fifth and sixth parameters
2018-07-23 18:51:36 +01:00
MerryMage
8e2ff56569
u128: Add StickyLogicalShiftRight
2018-07-23 18:51:36 +01:00
MerryMage
3b337df076
u128: Add Multiply64To128
2018-07-23 18:51:36 +01:00
MerryMage
8219075ea3
u128: Add u128::Bit
2018-07-23 18:51:11 +01:00
MerryMage
a574dcb2ae
u128: Add comparison operators
2018-07-23 18:51:11 +01:00
MerryMage
391d6d46a4
unpacked: Use ResidualErrorOnRightShift in FPRoundBase
...
Fixes a bug relating to exponents that are severely out of range.
2018-07-23 18:29:44 +01:00
MerryMage
5e0cf9c83e
fp: Remove MantissaT
2018-07-23 14:23:47 +01:00
MerryMage
8c0a84cbd3
FPRSqrtEstimate: Improve documentation of RecipSqrtEstimate
2018-07-23 11:26:51 +01:00
Lioncash
c41d8552a7
FPRSqrtEstimate: Deduplicate array bounds
...
Dehardcodes a few constants in the loops.
2018-07-23 10:56:46 +01:00
Lioncash
4cf055ba47
A64: Implement FMAXV, FMINV, FMAXNMV, and FMINNMV
2018-07-23 10:45:12 +01:00
Lioncash
bf24f0febf
FPRSqrtEstimate: Use forward declarations where applicable
2018-07-23 10:36:49 +01:00
Lioncash
206230e9c4
translate: Return by bool in helpers where applicable
...
Gets rid of a bit of duplication regarding the early-out cases and makes
all helpers functions consistent (previously some had a return type of
bool, while others had a return type of void).
2018-07-23 10:36:28 +01:00
Lioncash
346b725878
Simplify fallback case for EmitVectorSetElement64()
2018-07-23 10:34:44 +01:00
MerryMage
2c34e1d964
emit_x64_floating_point: s/Esimate/Estimate/
2018-07-22 21:49:08 +01:00
MerryMage
5213fb670d
simd_scalar_two_register_misc: Implement FRSQRTE, scalar variant
2018-07-22 18:35:43 +01:00
MerryMage
7ed089fd8e
IR: Implement FPRSqrtEstimate
2018-07-22 18:35:43 +01:00
MerryMage
cd2e286313
simd_vector_x_indexed_element: Implement FMUL (by element), vector variant
2018-07-22 17:43:08 +01:00
MerryMage
fc6b73bd85
a64_emit_x64: Ensure host has updated ticks in EmitA64GetCNTPCT
...
Discovered by @Subv.
Fixes incomplete fix begun in 5a91c94dca47c9702dee20fbd5ae1f4c07eef9df.
That fix fails to take into account that LinkBlock doesn't update ticks until there
are no remaining ticks to be executed.
Test added to confirm fix.
2018-07-22 16:16:26 +01:00
MerryMage
888c6783a1
a64_emit_x64: Fix stack misalignment on Windows for 128-bit exclusive writes
...
Discovered by @Subv.
Includes a test to ensure this codepath is exercised on Windows.
2018-07-22 15:26:25 +01:00
Lioncash
352d53908a
emit_x64_aes: Eliminate extraneous usage of a scratch register in EmitAESInverseMixColumns()
...
We can just use the same register the data is in as the result register,
eliminating the need to use a completely separate register to store the
result.
2018-07-22 07:06:09 +01:00
Lioncash
ab7fe77799
A64: Implement SADDLV
2018-07-22 07:05:36 +01:00
Lioncash
09bd2b2c6e
A64: Implement UADDLV
2018-07-22 07:05:36 +01:00
Lioncash
62e86d7287
fp: Use forward declarations where applicable
...
Minimizes the amount of files that need to be rebuilt if the headers
ever change.
2018-07-22 07:04:47 +01:00
Lioncash
b3edb7a956
emit_x64_vector: Append 'v' prefix onto movq in AVX path
...
This is something I missed when adding in the AVX broadcast code.
2018-07-22 07:04:35 +01:00
Subv
7ea1241953
A64: The A64SetTPIDR IR instruction writes to a system register and should not be eliminated by the dead code elimination pass.
...
Previously this instruction was alway eliminated, resulting in incorrect values for TPIDR_EL0.
2018-07-21 09:34:23 +01:00
MerryMage
853bdd6b98
fp: A64::FPCR -> FP::FPCR
2018-07-20 11:39:39 +01:00