Lioncash
c623a94a4d
A64: Implement SRHADD and URHADD
2018-05-26 11:48:56 +01:00
Lioncash
2652e92928
ir: Add opcodes for performing rounding halving adds
2018-05-26 11:48:56 +01:00
Lioncash
990a569b7a
emit_x64_vector: Simplify AVX-512 codepath in EmitVectorMultiply64
...
I realized I introduced a helper for simple AVX operation emitting, so
use that instead of writing it all out long-form.
2018-05-23 08:02:12 +01:00
Lioncash
66c5d8fb06
A64: Implement UMLAL{2}, UMLSL{2}, and UMULL{2}
...
Now that we have the helper function set up for the signed variants, we
can also modify it to be used with the unigned ones by performing a zero
extension instead of a sign extension.
2018-05-23 07:58:41 +01:00
Lioncash
5f53fd2be8
A64: Implement SMLSL{2}
2018-05-23 07:58:41 +01:00
Lioncash
b6ed3f9c66
A64: Implement SMLAL{2}
2018-05-23 07:58:41 +01:00
Lioncash
2bfe1ce838
A64: Implement SMULL{2}
2018-05-23 07:58:41 +01:00
Lioncash
be453b0e1c
fuzz_with_unicorn: Remove exclusion of FMOV (imm) for FP-16 floats
...
Qemu, or rather, Unicorn now supports FP-16, since I backported support
for the recent changes to mainline Qemu relating to FP-16 support.
2018-05-19 12:25:43 +01:00
Lioncash
1f00e53b54
A64: Implement SABAL/SABAL2 and SABDL/SABDL2
...
Now that we have a helper function for the unsigned variants, we can
modify it to also be usable with the signed variants.
2018-05-14 23:10:01 +01:00
Lioncash
de3b545e57
A64: Implement UABAL/UABAL2
2018-05-14 23:10:01 +01:00
Lioncash
e1ab52c057
A64: Implement UABDL/UABDL2
2018-05-14 23:10:01 +01:00
Lioncash
7cd0ff18bf
emit_x64_vector: Emit VPMULLQ in EmitVectorMultiply64 on AVX-512{DQ, VL} capable CPUs
...
Shortens code-gen down to a single instruction in the 64-bit path.
2018-05-14 23:09:31 +01:00
Lioncash
f8f4f9abb4
A64: Implement LDR (literal, SIMD&FP)
2018-05-14 23:09:15 +01:00
Lioncash
b00f6d1044
Correct typo in DataCacheOperation enum
...
Fixes a typo for the InvalidateByVAToPoC enum entry. Given yuzu is the
only known user of 64-bit mode and it doesn't use this value, we can get
away with changing this.
2018-05-14 15:39:08 +01:00
Lioncash
ced64f7dda
A64: Implement FABS' half-precision variant
2018-05-12 11:17:46 +01:00
Lioncash
672641de2c
A64: Implement FABS' single and double precision variant
2018-05-12 11:17:46 +01:00
Lioncash
34e1b030dd
A64: Implement URSHR (scalar) and URSRA (scalar)
...
Now that the utility function is all set up from implementing SRSRA, the
unsigned variants can now be trivially implemented by modifying the
utility function to perform a logical shift right instead of an
arithmetical shift right for the unsigned case.
2018-05-12 11:17:19 +01:00
Lioncash
e662942ee3
A64: Implement SRSRA (scalar)
2018-05-12 11:17:19 +01:00
Lioncash
4a81075641
A64: Implement SRSHR (scalar)
2018-05-12 11:17:19 +01:00
Lioncash
ae666903c4
A64: Implement SABA
2018-05-12 11:16:42 +01:00
Lioncash
1629beed25
A64: Implement SABD
2018-05-12 11:16:42 +01:00
Lioncash
d7951233bd
ir: Add opcodes for signed absolute differences
2018-05-12 11:16:42 +01:00
Tillmann Karras
0718ee4482
decoder_detail: use structured bindings
2018-05-12 11:15:39 +01:00
Lioncash
cee8bfa797
CMakeLists: Add detection for Aarch64 compiler environments
...
Just closes a small hole in architecture detection for the ARM family.
2018-05-10 00:35:50 +01:00
Lioncash
e0faf3277e
simd_two_register_misc: Handle 64-bit case for SCVTF_int_4
2018-05-08 18:14:50 +01:00
Lioncash
b166981ff5
ir: Add opcode to perform the vector conversion S64->F64
...
Unfortunately x86 prior to AVX-512 doesn't really give us any convenient instruction to do the work for us
2018-05-08 18:14:50 +01:00
Lioncash
4d2c5184ff
A64: Implement SHLL/SHLL2
2018-05-08 17:57:55 +01:00
Lioncash
ae57f6eb58
A64: Add missing decoding for PRFM (unscaled offset)
2018-05-08 15:01:53 +01:00
Lioncash
3e0861d013
A64: Implement UHSUB
2018-05-07 19:04:10 +01:00
Lioncash
93255e6dbd
A64: Implement SHSUB
2018-05-07 19:04:10 +01:00
Lioncash
a0e3943ade
ir: Add opcodes for performing vector halving subtracts
2018-05-07 19:04:10 +01:00
Lioncash
b4a2d497f9
A64: Implement SM4EKEY
2018-05-07 19:01:22 +01:00
Lioncash
4372700ec9
A64: Implement SM4E
2018-05-07 19:01:22 +01:00
Lioncash
284afd18cb
ir: Add an opcode for doing an SM4 lookup table query
2018-05-07 19:01:22 +01:00
Lioncash
97a9fce094
emit_x64_vector: Use VPOPCNTB in EmitVectorPopulationCount() if AVX-512 BITALG is available
2018-05-07 16:40:28 +01:00
Lioncash
f5fe2af89c
fuzz_with_unicorn: Silence unused variable warning
...
Currently, structured bindings don't provide a way to ignore unused variables.
2018-05-07 16:39:42 +01:00
Lioncash
a86f56eb43
externals: Update Catch to v2.2.2
...
Keeps the unit-testing library up to date.
2018-05-07 16:39:42 +01:00
Lioncash
6d0b58039e
A64: Implement UHADD
2018-05-07 16:39:17 +01:00
Lioncash
e4efd365fb
A64: Implement SHADD
2018-05-07 16:39:17 +01:00
Lioncash
1da4671b53
ir: Add opcodes for performing halving adds
2018-05-07 16:39:17 +01:00
Lioncash
fae2d940f2
emit_x64_vector: Emit VPMINSQ and VPMINUQ for 64-bit vector min operations if AVX-512VL is available
2018-05-04 07:52:22 +01:00
Lioncash
ec66d94121
emit_x64_vector: Emit VPMAXSQ and VPMAXUQ for 64-bit vector max operations if AVX-512VL is available
2018-05-04 07:52:22 +01:00
Lioncash
69ac6dfca6
emit_x64_vector_floating_point: Deduplicate accurate NaN handling code
...
Allows the code to both be used from the 32 bit and 64 bit operations without duplicating code.
2018-05-03 23:22:50 +01:00
Lioncash
4ca546ce4d
emit_x64_vector: Emit VPABSQ in EmitVectorAbs() for the 64-bit case if AVX-512VL is available
2018-05-03 23:22:18 +01:00
Lioncash
c9a6d6264e
emit_x64_vector: Use VPSRAQ in EmitVectorArithmeticShiftRight64() if AVX-512VL is available
2018-05-03 16:13:11 +01:00
Lioncash
7a066fb011
disassembler_arm: Remove rotation helper function in favor of Common::RotateRight
...
Mildly reduces the amount of duplicated behavior
2018-05-02 17:14:13 +01:00
Lioncash
75d0b1ebd8
emit_x64_vector: Vectorize fallback path of EmitVectorMaxS32()
2018-05-02 17:14:07 +01:00
Lioncash
7f3cfff647
emit_x64_vector: Vectorize fallback path of EmitVectorMaxS8()
2018-05-02 17:14:07 +01:00
Lioncash
9607376f2f
emit_x64_vector: Vectorize fallback path in EmitVectorMinU32()
2018-05-02 17:13:41 +01:00
Lioncash
1162be609d
emit_x64_vector: Vectorize fallback path in EmitVectorMinU16()
2018-05-02 17:13:41 +01:00