llvm-project

Author	SHA1	Message	Date
Lei Zhang	b5192cbe50	[mlir][spirv] Fix result type for arith.cmpi/cmpf conversion We cannot directly use the original result type; instead we need to deduce it from the converted operand type. This addresses invalid ops generated from converting single element vectors. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D127574	2022-06-13 13:15:23 -04:00
Lei Zhang	91de20c36d	[mlir][spirv] Use UnrealizedConversionCast in ArithmeticToSPIRV This avoids pulling in function converion patterns, which is not part of what we want to test in ArithmeticToSPIRV. It also allows using ConvertArithmeticToSPIRVPass as a standalone step. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D127573	2022-06-13 13:13:57 -04:00
Mitch Phillips	d3ddc251ac	Revert "[CodeGen] Keep track info of lazy-emitted symbols in ModuleBuilder" This reverts commit `b8f9459715`. Broke the ASan buildbot. See https://reviews.llvm.org/D126781 for more information.	2022-06-13 10:12:38 -07:00
Mitch Phillips	d90eecff5c	Revert "Also move WeakRefReferences in CodeGenModule::moveLazyEmssionStates" This reverts commit `0ecbedc098`. Parent change broke the ASan buildbot. See https://reviews.llvm.org/D126781 for more information.	2022-06-13 10:12:38 -07:00
Lei Zhang	cc020a2236	[mlir][spirv] Convert math.ctlz to spv.GLSL.FindUMsb Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D127582	2022-06-13 13:02:37 -04:00
Valentin Clement	f1c84d0ff0	[flang][NFC] Add TODOs for KIND = 2 Add TODO for KIND=2 so the user is notified correctly. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier, PeteSteinfeld Differential Revision: https://reviews.llvm.org/D127619 Co-authored-by: Peter Steinfeld <psteinfeld@nvidia.com>	2022-06-13 18:45:32 +02:00
Simon Pilgrim	64eea34420	[X86] combineEXTEND_VECTOR_INREG - don't attempt to shuffle combine ANY_EXTEND_VECTOR_INREG without SSE41 Without SSE41, ANY_EXTEND_VECTOR_INREG nodes are likely to be prematurely combined to a target shuffle preventing generic sign extension folds. Fixes a number of sign-extend regressions in D127115.	2022-06-13 17:42:04 +01:00
Stanislav Mekhanoshin	0f81830632	[AMDGPU] Make temp vgpr selection stable in indirectCopyToAGPR This uses rotating reminder of division by 3 to select another temp vgpr each next time in a sequence of several agpr copies. Therefore, temp vgpr selection depends on the generated agpr number. This number could change with any unrelated change to the register definitions. Stabilize the selection by using a real agpr number. Differential Revision: https://reviews.llvm.org/D127524	2022-06-13 09:39:46 -07:00
Thomas Raoux	1c84800c42	[mlir][vector] Add patterns to ppropagate vector distribution Add patterns to propagate vector distribution and remove dead arguments. This handles propagation for several vector operations. Differential Revision: https://reviews.llvm.org/D127167	2022-06-13 16:38:50 +00:00
Kiran Chandramohan	c030f46703	[Flang][OpenMP] Avoid double privatisation of loop variables Loop variables of a worksharing loop and sequential loops in parallel region are privatised by default. These variables are marked with OmpPreDetermined. Skip explicit privatisation of these variables. Note: This is part of upstreaming from the fir-dev branch of https://github.com/flang-compiler/f18-llvm-project. Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D127249 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Mats Petersson <mats.petersson@arm.com>	2022-06-13 16:27:34 +00:00
Mogball	e16d13322b	[mlir] (NFC) Clean up bazel and CMake target names All dialect targets in bazel have been named Dialect and all dialect targets in CMake have been named MLIRDialect.	2022-06-13 16:24:15 +00:00
Lei Zhang	a10c09d1e3	[mlir][spirv] Remove unused `traits` from `SPV_Attr` This addresses the warning of unused template argument.	2022-06-13 12:20:57 -04:00
Lei Zhang	a4360efb2c	[mlir][spirv] Convert single element vector.splat/fma Reviewed By: ThomasRaoux, hanchung Differential Revision: https://reviews.llvm.org/D127572	2022-06-13 12:18:16 -04:00
Mark de Wever	23b10a4a66	[libc++][NFC] Use concepts in <bit>. All supported compilers have concepts support so use that in the C++20 functions in <bit>. s/_LIBCPP_INLINE_VISIBILITY/_LIBCPP_HIDE_FROM_ABI/ as drive-by fix. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D127594	2022-06-13 18:17:48 +02:00
Philip Reames	aaeb958ced	[RISCV] Mutate instruction after computing transfer rule in InsertVSETVLI [nfc] If we defer the mutation of the instruction, we can add the assert discussed in D126921. Once we do that, the API becomes subject to revision - but let's do that in a separate change.	2022-06-13 09:08:25 -07:00
Craig Topper	cef03e3dcd	[RISCV] Move creation of constant pools from isel to lowering. This simplifies the isel code by removing the manual load creation. It also improves our ability to use 0 strided loads for vector splats. There is an assumption here that Mask and ShiftedMask constants are cheap enough that they don't become constant pool loads so that our isel optimizations involving And still work. I believe those constants are 3 instructions in the worst case. The rv64zbp-intrinsic.ll changes is a regression caused by intrinsics being expanded to RISCVISD also occuring during lowering. So the optimizations were only happening during the last DAGCombine, which can't see through the load. I believe we can fix this test by implementing TargetLowering::getTargetConstantFromLoad for RISC-V or by adding the intrinsic to computeKnownBitsForTargetNode to enable earlier DAG combine. Since Zbp is not a ratified extension, I don't view these as blocking this patch. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D127520	2022-06-13 09:07:57 -07:00
Mark de Wever	c36870c8e7	[libc++] Removes unneeded includes. This removes all "TODO: remove these headers" comments from our headers. Note there seem to be more headers that can be removed, that will be done in separate commits. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D127592	2022-06-13 17:56:50 +02:00
Mark de Wever	26465c8337	[libc++] Removes a GCC bug work-around. Based on the comments in [1] this should be fixed in GCC-11. [1] https://gcc.gnu.org/bugzilla/show_bug.cgi?id=37804 Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D127590	2022-06-13 17:55:43 +02:00
Mark de Wever	883dd770d7	[libc++][test] Remove support old compiler support. The compilers clang-11, clang-12, and apple-clang-12 are no longer supported, so remove their annotations in the tests. Reviewed By: #libc, philnik Differential Revision: https://reviews.llvm.org/D127588	2022-06-13 17:54:27 +02:00
Louis Dionne	5b386ac912	[libc++] Do not yield from __sp_mut::lock() Instead of trying to be clever and design our own locking primitive, simply rely on the OS-provided implementation to do the right thing. Indeed, manually yielding to the OS does not provide the necessary information for it to make good prioritization decisions. For example, if a thread with higher priority yields while waiting for a lock held by a thread with lower priority but the system is contended, it is possible for the thread with lower priority to not run until the higher priority thread has yielded 16 times and goes for __libcpp_mutex_lock(). Once that happens, the OS can bump the priority of the thread that currently holds the lock to unblock everyone. So instead, we might as well give the system all the information from the start so it can make appropriate decisions. As a fly-by change, also increase the number of locks in the table. The size increase is modest, but has the potential to half the amount of contention on those locks. rdar://93598606 Differential Revision: https://reviews.llvm.org/D126882	2022-06-13 11:48:13 -04:00
Valentin Clement	4a8305ce85	[flang] Add TODO for half-precision intrinsic reductions Add TODO for half-precision for reduction. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier, PeteSteinfeld Differential Revision: https://reviews.llvm.org/D127622 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2022-06-13 17:40:01 +02:00
Guillaume Chatelet	2b89a4dc51	[NFC] Remove dead code	2022-06-13 15:38:27 +00:00
Guillaume Chatelet	8865700f90	[NFC] Remove dead code	2022-06-13 15:38:27 +00:00
jeanPerier	a370a4ffce	[flang] Avoid raising a TODO in fir.boxproc rewrite when not needed (#1560 ) The pass was raising TODOs when a function both had a fir.boxproc<> argument and a fir.type<> argument (even if the fir.type<> did not contain a fir.boxproc itself). Prevent the TODO from firing when a fir.type<> does not actually contain a fir.boxproc. Add the location for the remaining TODO (it will be needed when procedure pointer components are supported in lowering). FYI, I actually tried to just implement the TODO, but I there is a funny issue. When creating the new fir::RecordType, since the name and context are the same as the type being translated, fir::RecordType:get just returns the existing type, and there is no way to change it (finalize() does nothing since it is already finalized). So this will require to add the ability to mutate the existing type, and I am not sure what are the MLIR constraints here, so I escaped and left the TODO for that case. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier, PeteSteinfeld Differential Revision: https://reviews.llvm.org/D127633 Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-06-13 17:36:56 +02:00
Jean Perier	c8a9afe7c8	[flang] Handle reversed bounds and negative length in inlined allocation ALLOCATE statement allows reversed bounds (see Fortran 2018 9.7.1.2 point 1) in which case the extents are zero. The same applies for the character length provided in the type spec that can be negative. In which case the new length is zero. Use genMaxWithZero to deal with these cases. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier, PeteSteinfeld Differential Revision: https://reviews.llvm.org/D127617 Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-06-13 17:35:03 +02:00
Joseph Huber	1054a73187	[Clang] Change host/device only compilation to a driver mode We use the flags `--offload-host-only` and `--offload-device-only` to change the driver's code generation for offloading programs. These are currently parsed out independently in many places. This patch simply refactors this to work as a mode for the Driver. This stopped us from emitting warnings if unused because it's always used now, but I don't think this is a great loss. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D127515	2022-06-13 11:33:54 -04:00
vdonaldson	70ade047a4	[flang] system_clock intrinsic calls with dynamically optional arguments system_clock intrinsic calls with dynamically optional arguments Modify intrinsic system_clock calls to allow for an argument that is optional or a disassociated pointer or an unallocated allocatable. A call with such an argument is the same as a call that does not specify that argument. Rename (genIsNotNull -> genIsNotNullAddr) and (genIsNull -> genIsNullAddr) and add a use of genIsNotNullAddr. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D127616 Co-authored-by: V Donaldson <vdonaldson@nvidia.com>	2022-06-13 17:33:28 +02:00
Guillaume Chatelet	111b32ecb4	[NFC][Alignment] Use getAlign in Attributor classes	2022-06-13 15:13:05 +00:00
Guillaume Chatelet	2887dd754e	[NFC][Alignment] Use getAlign in VNCoercion	2022-06-13 15:13:05 +00:00
Guillaume Chatelet	dff32e36f6	[NFC][Alignment] Use getAlign in SPIRVEmitIntrinsics	2022-06-13 15:13:05 +00:00
Guillaume Chatelet	5a293d21fc	[NFC][Alignment] Use getAlign in SelectionDAGBuilder	2022-06-13 15:13:05 +00:00
Jan Svoboda	d9390b6ac3	Reapply "[clang][lex] NFCI: Use DirectoryEntryRef in HeaderSearch::load*()" This reverts commit `340654e0f2`, essentially reapplying `1d3ba05e4a`. The test VFS/real-path-found-first.m that was failing on Windows is now passing with a workaround.	2022-06-13 17:03:32 +02:00
Matthias Springer	6ab1ed43f5	[mlir][shape][bufferize] Fix typo in external model Differential Revision: https://reviews.llvm.org/D127639	2022-06-13 16:38:56 +02:00
Kazu Hirata	23d9ca10ae	[CodeGen] Remove EvictionTrack (NFC) The last of getEvictor use was removed on Jun 5, 2022 in commit `5c06f7168f`, which was itself a patch to remove unused code. Once we remove getEvictor, EvictionTrack becomes a write-only data structure. The data in it won't affect compilation, so the entire class is essentially dead.	2022-06-13 07:21:29 -07:00
Quinn Pham	35aaf54823	[clang][driver] fix to correctly set devtoolset on RHEL This patch correctly sets the devtoolset on RHEL. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D127310	2022-06-13 09:12:49 -05:00
Andrzej Warzynski	e9bf76675d	[flang] Add target/triple in a test A test added in https://reviews.llvm.org/D127207 is missing target/triple. This has caused the PowerPC buildbot to start failing: * https://lab.llvm.org/buildbot/#/builders/21/builds/42860 (on PowerPC `; CHECK: ret` should be replaced with `; CHECK: `blr`). Sending this without a review as the fix is rather straightforward. Note that I've decided to add triple/target instead of e.g. removing: `; CHECK: ret`. That's for consistency with other tests that generate assembly. We could change that if that's what folks prefer.	2022-06-13 14:01:37 +00:00
Kazu Hirata	246e83e973	[GlobalISel] Remove buildSequence (NFC) The last use was removed on Jun 27, 2019 in commit `8138996128`.	2022-06-13 06:58:36 -07:00
Sanjay Patel	310adb658c	[InstCombine] reorder mask folds for efficiency This shows narrowing improvements on the logic tests (transforms recently added with `e247b0e5c9`). This is not a complete fix. That would require adding folds to visitOr/visitXor. But it enables the expected transforms for the basic patterns in the affected tests.	2022-06-13 09:49:57 -04:00
Stephen Tozer	30bb659c6f	[Dexter] Allow Dexter watch commands to specify a range of acceptable FP values This patch adds an optional argument to DexExpectWatchBase, float_range, which defines a +- acceptance range for expected floating point values. If passed, this assumes every expected value to be a floating point value, and an exception will be thrown if this is not the case. Differential Revision: https://reviews.llvm.org/D124511	2022-06-13 14:44:28 +01:00
Arnamoy Bhattacharyya	3f4a63e5f8	[Flang][OpenMP] Implementation of lowering of SIMD construct. This patch adds code so that using bbc we are able to see an end-to-end lowering of simd construct in action. Reviewed By: kiranchandramohan, peixin, shraiysh Differential Revision: https://reviews.llvm.org/D125282	2022-06-13 09:46:20 -04:00
Guillaume Chatelet	45a5cd41e5	[NFC][Alignment] Simplify code in MemorySanitizer	2022-06-13 13:36:36 +00:00
Guillaume Chatelet	4296f91323	[NFC][Alignment] Simplify code in JSONExporter	2022-06-13 13:36:36 +00:00
Guillaume Chatelet	310e3279d5	[NFC] Remove dead code in MipsFastISel	2022-06-13 13:36:36 +00:00
Guillaume Chatelet	93082108b7	[NFC][Alignment] Use getAlign in DXILBitcodeWriter	2022-06-13 13:36:36 +00:00
Guillaume Chatelet	01a8b89edb	[NFC][Alignment] Use getAlign in ARMFastISel	2022-06-13 13:36:36 +00:00
Nikita Popov	b9a7dea917	[SelectionDAG] Handle trapping aggregate (PR49839) Call canTrap() on Constant to account for trapping ConstantAggregate.	2022-06-13 15:06:53 +02:00
Nikita Popov	483a4b2226	[SelectionDAG] Add test for PR49839 (NFC)	2022-06-13 15:06:53 +02:00
Guillaume Chatelet	86f455750b	[NFC] Remove dead code	2022-06-13 12:59:38 +00:00
Guillaume Chatelet	eeda07e14b	[NFC][Alignment] Use proper type in tests	2022-06-13 12:59:38 +00:00
Guillaume Chatelet	a6c2ab0c3f	[NFC][Alignment] Use proper type in instrumentLoadOrStore	2022-06-13 12:59:38 +00:00

... 2 3 4 5 6 ...

426708 Commits