LLVM / project - FreshBSD

LLVM/project 0a0cac6 — llvm/lib/Target/SystemZ SystemZISelLowering.cpp, llvm/test/CodeGen/SystemZ atomic-store-08.ll atomic-load-08.ll

2024-05-06 10:17:19 UTC by Ulrich Weigand via GitHub on ⎇

main

[SystemZ] Simplify f128 atomic load/store (#90977)

Change definition of expandBitCastI128ToF128 and expandBitCastF128ToI128
to allow for simplified use in atomic load/store.

Update logic to split 128-bit loads and stores in DAGCombine to also
handle the f128 case where appropriate. This fixes the regressions
introduced by recent atomic load/store patches.

Delta		File
+155	-116	llvm/lib/Target/SystemZ/SystemZISelLowering.cpp
+8	-26	llvm/test/CodeGen/SystemZ/atomic-store-08.ll
+5	-16	llvm/test/CodeGen/SystemZ/atomic-load-08.ll
+2	-4	llvm/test/CodeGen/SystemZ/atomicrmw-fmin-03.ll
+2	-4	llvm/test/CodeGen/SystemZ/atomicrmw-fmax-03.ll
+172	-166	5 files

LLVM/project 522b4bf — llvm/include/llvm/CodeGen SDPatternMatch.h, llvm/lib/CodeGen/SelectionDAG DAGCombiner.cpp

2024-05-06 10:13:05 UTC by Simon Pilgrim via GitHub on ⎇

main

[DAG] Fold bitreverse(shl/srl(bitreverse(x),y)) -> srl/shl(x,y) (#89897)

Noticed while investigating GFNI per-element vector shifts (we can form SHL but not SRL/SRA)

Alive2: https://alive2.llvm.org/ce/z/fSH-rf
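
The fold can be sanity-checked outside LLVM. The following is a minimal Python sketch, not the DAGCombiner code: the helper names (`bitreverse`, `shl`, `srl`, `fold_holds`) and the 8-bit width are assumptions chosen for illustration. It exhaustively compares both directions of the fold for every value and every in-range shift amount.

```python
WIDTH = 8  # assumed width for illustration only
MASK = (1 << WIDTH) - 1


def bitreverse(x):
    """Reverse the low WIDTH bits of x."""
    out = 0
    for i in range(WIDTH):
        if x & (1 << i):
            out |= 1 << (WIDTH - 1 - i)
    return out


def shl(x, y):
    """Logical shift left, truncated to WIDTH bits."""
    return (x << y) & MASK


def srl(x, y):
    """Logical shift right on a WIDTH-bit value."""
    return (x & MASK) >> y


def fold_holds():
    """Check both folds for all values and all shift amounts below WIDTH."""
    for x in range(1 << WIDTH):
        for y in range(WIDTH):
            # bitreverse(shl(bitreverse(x), y)) -> srl(x, y)
            if bitreverse(shl(bitreverse(x), y)) != srl(x, y):
                return False
            # bitreverse(srl(bitreverse(x), y)) -> shl(x, y)
            if bitreverse(srl(bitreverse(x), y)) != shl(x, y):
                return False
    return True


if __name__ == "__main__":
    print(fold_holds())
```

Shift amounts of WIDTH or more are left out on purpose, since such shifts do not produce a defined value in LLVM IR.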

Delta		File
+11	-285	llvm/test/CodeGen/X86/combine-bitreverse.ll
+27	-112	llvm/test/CodeGen/RISCV/bitreverse-shift.ll
+14	-0	llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
+5	-0	llvm/include/llvm/CodeGen/SDPatternMatch.h
+57	-397	4 files

LLVM/project 0933a7a — llvm/lib/Target/LoongArch LoongArchOptWInstrs.cpp, llvm/test/CodeGen/LoongArch prefer-w-inst.ll

2024-05-06 10:07:30 UTC by WANG Rui on ⎇

main

[LoongArch] Rename some OptWInstrs functions. NFC

Delta		File
+25	-21	llvm/lib/Target/LoongArch/LoongArchOptWInstrs.cpp
+7	-7	llvm/test/CodeGen/LoongArch/prefer-w-inst.ll
+32	-28	2 files

LLVM/project 69d740e — clang/lib/AST/Interp ByteCodeEmitter.cpp, clang/test/AST/Interp cxx23.cpp

2024-05-06 09:38:06 UTC by Timm Bäder on ⎇

main

[clang][Interp] Fix creating functions with explicit instance parameters

Delta		File
+7	-5	clang/lib/AST/Interp/ByteCodeEmitter.cpp
+7	-0	clang/test/AST/Interp/cxx23.cpp
+1	-0	clang/test/SemaCXX/cxx2b-deducing-this-constexpr.cpp
+15	-5	3 files

LLVM/project d98a785 — llvm/test/CodeGen/LoongArch rotl-rotr.ll

2024-05-06 08:43:57 UTC by WANG Rui on ⎇

main

[LoongArch] Mark data type i32 are sign-extended. NFC

Delta		File
+9	-9	llvm/test/CodeGen/LoongArch/rotl-rotr.ll
+9	-9	1 files

LLVM/project e9bcd2b — llvm/lib/Target/LoongArch LoongArchOptWInstrs.cpp, llvm/test/CodeGen/LoongArch sextw-removal.ll opt-pipeline.ll

2024-05-06 08:41:26 UTC by hev via GitHub on ⎇

main

[LoongArch] Optimize *W Instructions at MI level (#90463)

Following RISC-V, add an MI-level pass to optimize *W instructions
for LoongArch.

First it removes unneeded sext(addi.w rd, rs, 0) instructions, either
because the sign-extended bits aren't consumed or because the input was
already sign-extended by an earlier instruction.

Then:
1. Unless explicitly disabled or the target prefers instructions with W
suffix, it removes the -w suffix from opw instructions whenever all
users are dependent only on the lower word of the result of the
instruction. The cases handled are:
* addi.w because it helps reduce test differences between LA32 and LA64
w/o being a pessimization.

2. Or if explicitly enabled or the target prefers instructions with W
suffix, it adds the W suffix to the instruction whenever all users are

    [4 lines not shown]
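
The reasoning behind both steps can be illustrated with a small model. This is a hedged Python sketch rather than the pass itself: `sext32`, `addi_w`, and `addi_64` are hypothetical helpers that model a 32-bit add-immediate with sign extension and a full 64-bit add-immediate on 64-bit registers, and immediate range limits are ignored.

```python
MASK32 = (1 << 32) - 1
MASK64 = (1 << 64) - 1


def sext32(value):
    """Sign-extend the low 32 bits of value into a 64-bit register image."""
    value &= MASK32
    if value & (1 << 31):
        value -= 1 << 32
    return value & MASK64


def addi_w(rs, imm):
    """Model of a *W add: 32-bit add, result sign-extended to 64 bits."""
    return sext32(rs + imm)


def addi_64(rs, imm):
    """Model of the same add without the W suffix: plain 64-bit add."""
    return (rs + imm) & MASK64


def low_word(value):
    return value & MASK32


if __name__ == "__main__":
    rs, imm = 0x7FFFFFFF, 1
    # The two forms differ in the upper word but agree in the lower word,
    # so users that only read the lower word cannot tell them apart.
    print(hex(addi_w(rs, imm)), hex(addi_64(rs, imm)))
    # A value that is already sign-extended is unchanged by addi.w rd, rs, 0.
    already_extended = addi_w(rs, imm)
    print(addi_w(already_extended, 0) == already_extended)
```

This shows why the W form can be swapped in either direction when only the lower word is used, and why a sign extension of an already sign-extended value is redundant.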

Delta		File
+815	-0	llvm/lib/Target/LoongArch/LoongArchOptWInstrs.cpp
+554	-32	llvm/test/CodeGen/LoongArch/sextw-removal.ll
+164	-163	llvm/test/CodeGen/LoongArch/opt-pipeline.ll
+121	-121	llvm/test/CodeGen/LoongArch/atomicrmw-uinc-udec-wrap.ll
+50	-96	llvm/test/CodeGen/LoongArch/ir-instruction/atomic-cmpxchg.ll
+0	-80	llvm/test/CodeGen/LoongArch/ir-instruction/atomicrmw-minmax.ll
+1,704	-492	15 files not shown
+1,797	-627	21 files

LLVM/project 9a521e2 — clang/lib/AST/Interp ByteCodeExprGen.cpp, clang/test/AST/Interp lambda.cpp

2024-05-06 08:37:30 UTC by Timm Bäder on ⎇

main

[clang][Interp] Fix primitive lambda capture defaults

We need to use InitField here, not SetField.

Delta		File
+16	-0	clang/test/AST/Interp/lambda.cpp
+1	-1	clang/lib/AST/Interp/ByteCodeExprGen.cpp
+17	-1	2 files

LLVM/project 8a65ee8 — llvm/test/Analysis/UniformityAnalysis/AMDGPU/MIR temporal-divergence.mir, llvm/test/CodeGen/AMDGPU/GlobalISel divergence-structurizer.mir divergence-divergent-i1-used-outside-loop.mir

2024-05-06 08:37:11 UTC by Sameer Sahasrabuddhe via GitHub on ⎇

main

[AMDGPU] don't mark control-flow intrinsics as convergent (#90026)

This is really a workaround to allow control flow lowering in the
presence of convergence control tokens. Control-flow intrinsics in LLVM
IR are convergent because they indirectly represent the wave CFG, i.e.,
sets of threads that are "converged" or "execute in lock-step". But they
exist during a small window in the lowering process, inserted after the
structurizer and then translated to equivalent MIR pseudos. So rather
than create convergence tokens for these builtins, we simply mark them
as not convergent.

The corresponding MIR pseudos are marked as having side effects, which
is sufficient to prevent optimizations without having to mark them as
convergent.

Delta		File
+67	-67	llvm/test/CodeGen/AMDGPU/GlobalISel/divergence-structurizer.mir
+56	-56	llvm/test/CodeGen/AMDGPU/GlobalISel/divergence-divergent-i1-used-outside-loop.mir
+20	-20	llvm/test/CodeGen/AMDGPU/GlobalISel/divergence-temporal-divergent-i1.mir
+14	-14	llvm/test/Analysis/UniformityAnalysis/AMDGPU/MIR/temporal-divergence.mir
+12	-12	llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-amdgcn.if-invalid.mir
+12	-12	llvm/test/CodeGen/AMDGPU/GlobalISel/divergence-divergent-i1-phis-no-lane-mask-merging.mir
+181	-181	11 files not shown
+244	-232	17 files

LLVM/project d3dad7a — llvm/lib/Transforms/InstCombine InstCombineCompares.cpp, llvm/test/Transforms/InstCombine icmp-of-trunc-ext.ll

2024-05-06 08:30:07 UTC by Yingwei Zheng via GitHub on ⎇

main

[InstCombine] Fix miscompilation caused by #90436 (#91133)

Proof: https://alive2.llvm.org/ce/z/iRnJ4i

Fixes https://github.com/llvm/llvm-project/issues/91127.

Delta		File
+74	-0	llvm/test/Transforms/InstCombine/icmp-of-trunc-ext.ll
+1	-0	llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp
+75	-0	2 files

LLVM/project 30367cb — lldb/include/lldb/API SBType.h, lldb/source/API SBType.cpp

2024-05-06 08:06:51 UTC by Pavel Labath via GitHub on ⎇

main

[lldb] Add SBType::GetByteAlign (#90960)

lldb already mostly(*) tracks this information. This just makes it
available to the SB users.

(*) It does not do that for typedefs right now; see llvm.org/pr90958

Delta		File
+21	-0	lldb/test/API/python_api/type/TestTypeList.py
+13	-0	lldb/source/API/SBType.cpp
+3	-0	lldb/test/API/python_api/type/main.cpp
+2	-0	lldb/include/lldb/API/SBType.h
+39	-0	4 files

LLVM/project eb75af2 — llvm/lib/Target/SystemZ SystemZInstrInfo.cpp SystemZInstrInfo.h, llvm/test/CodeGen/SystemZ fold-copy-vector-immediate.mir

2024-05-06 08:00:20 UTC by Matt Arsenault on ⎇

main

Reapply "SystemZ: Fold copy of vector immediate to gr128" (#91099)

This reverts commit a415b4dfcc02e3e82b8c8a7836f7c04b9d65dc9b.

Modify the instruction in place to transform it into a REG_SEQUENCE,
which is what other implementations of foldImmediate do. Also start
erasing the def instruction if there are no other uses.

Fixes #91110.

Delta		File
+206	-0	llvm/test/CodeGen/SystemZ/fold-copy-vector-immediate.mir
+55	-0	llvm/lib/Target/SystemZ/SystemZInstrInfo.cpp
+3	-0	llvm/lib/Target/SystemZ/SystemZInstrInfo.h
+264	-0	3 files

LLVM/project e2c8925 — llvm/lib/Target/AMDGPU AMDGPUPostLegalizerCombiner.cpp AMDGPUCombine.td

2024-05-06 07:55:09 UTC by Jay Foad on ⎇

main

[AMDGPU] Fix typo in function name

Delta		File
+3	-3	llvm/lib/Target/AMDGPU/AMDGPUPostLegalizerCombiner.cpp
+1	-1	llvm/lib/Target/AMDGPU/AMDGPUCombine.td
+4	-4	2 files

LLVM/project 4b61d04 — llvm/test/CodeGen/SystemZ frame-26.mir frame-28.mir

2024-05-06 07:52:35 UTC by Matt Arsenault on ⎇

main

SystemZ: Remove unnecessary REQUIRES asserts from tests

Delta		File
+10	-11	llvm/test/CodeGen/SystemZ/frame-26.mir
+3	-4	llvm/test/CodeGen/SystemZ/frame-28.mir
+0	-1	llvm/test/CodeGen/SystemZ/memcmp-03.ll
+13	-16	3 files

LLVM/project 181e821 — llvm/test/CodeGen/SystemZ zos-ppa2.ll

2024-05-06 07:52:35 UTC by Matt Arsenault on ⎇

main

SystemZ: Remove redundant REQUIRES systemz from test

Delta		File
+0	-1	llvm/test/CodeGen/SystemZ/zos-ppa2.ll
+0	-1	1 files

LLVM/project ef8d814 — llvm/include/llvm/ExecutionEngine/Orc LLJIT.h IndirectionUtils.h

2024-05-06 07:50:06 UTC by Mehdi Amini via GitHub on ⎇

main

Revert "Remove redundant move in return statement" (#91169)

Reverts llvm/llvm-project#90546

This broke some bots; it seems that some toolchains don't consider the
implicit move here.

Delta		File
+4	-4	llvm/include/llvm/ExecutionEngine/Orc/LLJIT.h
+1	-1	llvm/include/llvm/ExecutionEngine/Orc/IndirectionUtils.h
+5	-5	2 files

LLVM/project 0140ba0 — clang/include/clang/Basic LangOptions.h, clang/test/AST ast-dump-fpfeatures.cpp ast-dump-late-parsing.cpp

2024-05-06 07:30:54 UTC by Serge Pavlov via GitHub on ⎇

main

[clang] Enable FPContract with optnone (#91061)

Previously, the treatment of the attribute `optnone` was modified in
https://github.com/llvm/llvm-project/pull/85605 ([clang] Set correct
FPOptions if attribute 'optnone' presents). As a side effect FPContract
was disabled for optnone. It created unneeded divergence with the
behavior of -O0, which enables this optimization.

In the discussion
https://github.com/llvm/llvm-project/pull/85605#issuecomment-2089350379
it was pointed out that FP contraction should be enabled even if all
optimizations are turned off, otherwise results of calculations would be
different. This change enables FPContract at optnone.

Delta		File
+9	-9	clang/test/AST/ast-dump-fpfeatures.cpp
+4	-4	clang/test/AST/ast-dump-late-parsing.cpp
+1	-4	clang/include/clang/Basic/LangOptions.h
+2	-2	clang/test/AST/ast-dump-fpfeatures.m
+16	-19	4 files

LLVM/project d654278 — llvm/docs AMDGPUUsage.rst LangRef.rst, llvm/lib/Target/AMDGPU SIModeRegisterDefaults.cpp SIISelLowering.cpp

2024-05-06 07:09:19 UTC by Matt Arsenault via GitHub on ⎇

main

Reapply "AMDGPU: Implement llvm.set.rounding (#88587)" series (#91113)

Revert "Revert 4 last AMDGPU commits to unbreak Windows bots"

This reverts commit 0d493ed2c6e664849a979b357a606dcd8273b03f.

MSVC does not like constexpr on the definition after an extern
declaration of a global.

Delta		File
+1,665	-0	llvm/test/CodeGen/AMDGPU/llvm.set.rounding.ll
+119	-0	llvm/lib/Target/AMDGPU/SIModeRegisterDefaults.cpp
+88	-0	llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+7	-0	llvm/lib/Target/AMDGPU/SIModeRegisterDefaults.h
+6	-0	llvm/docs/AMDGPUUsage.rst
+2	-0	llvm/docs/LangRef.rst
+1,887	-0	2 files not shown
+1,890	-0	8 files

LLVM/project db532ff — llvm/include/llvm/ExecutionEngine/Orc LLJIT.h IndirectionUtils.h

2024-05-06 06:30:04 UTC by xiaoleis-nv via GitHub on ⎇

main

Remove redundant move in return statement (#90546)

This pull request removes the unnecessary move in the return statement to
suppress compilation warnings.

Co-authored-by: Xiaolei Shi <xiaoleis at nvidia.com>

Delta		File
+4	-4	llvm/include/llvm/ExecutionEngine/Orc/LLJIT.h
+1	-1	llvm/include/llvm/ExecutionEngine/Orc/IndirectionUtils.h
+5	-5	2 files

LLVM/project 1500dc0 — llvm/test/CodeGen/RISCV/rvv coalesce-vsetvli.mir

2024-05-06 06:23:49 UTC by Luke Lau on ⎇

main

[RISCV] Use virtual registers for AVL instrs in coalesce-vsetvli.mir. NFC

All GPR registers will still be virtual at this stage, so update the test
to reflect that.

Delta		File
+11	-7	llvm/test/CodeGen/RISCV/rvv/coalesce-vsetvli.mir
+11	-7	1 files

LLVM/project 0348e71 — clang/lib/Analysis/FlowSensitive Transfer.cpp, clang/unittests/Analysis/FlowSensitive TransferTest.cpp

2024-05-06 06:15:12 UTC by martinboehme via GitHub on ⎇

main

[clang][dataflow] Fix crash when `operator=` result type is not destination type. (#90898)

The existing code was full of comments about how we assume this is always the
case, but it's not mandated by the standard, and there is code out there that
returns a different type. So check that the result type is in fact the same as
the destination type before attempting to copy to the result.

To make sure that we don't bail out in more cases than intended, I've extended
existing tests to verify that in the common case, we do return the destination
object (by reference or value, as the case may be).

Delta		File
+71	-2	clang/unittests/Analysis/FlowSensitive/TransferTest.cpp
+16	-7	clang/lib/Analysis/FlowSensitive/Transfer.cpp
+87	-9	2 files

LLVM/project d70267f — clang/test/Driver riscv-option-arch.c riscv-option-arch.s, llvm/lib/Target/RISCV/AsmParser RISCVAsmParser.cpp

2024-05-06 05:55:37 UTC by Yeting Kuo via GitHub on ⎇

main

[RISCV] Teach .option arch to support experimental extensions. (#89727)

Previously, `.option arch` rejected extensions that do not belong to the
RISC-V features. But experimental features have the experimental- prefix, so
`.option arch` could not be used for experimental extensions.
This patch uses the features of extensions to identify extension
existence.

Delta		File
+15	-10	llvm/lib/Target/RISCV/AsmParser/RISCVAsmParser.cpp
+10	-3	llvm/test/MC/RISCV/option-arch.s
+7	-0	clang/test/Driver/riscv-option-arch.c
+5	-0	clang/test/Driver/riscv-option-arch.s
+37	-13	4 files

LLVM/project 947b062 — clang/include/clang/Serialization ASTBitCodes.h SourceLocationEncoding.h, clang/lib/Serialization ASTReader.cpp ASTWriter.cpp

2024-05-06 05:35:16 UTC by Chuanqi Xu on ⎇

main

Reland "[Modules] No transitive source location change (#86912)"

This relands 6c31104.

The patch was reverted due to incorrectly introduced alignment. It was
recommitted after fixing the alignment issue.

The original message follows:

This is part of the "no transitive change" patch series, "no transitive
source location change". I discussed this with @Bigcheese at the WG21
meeting in Tokyo.

The idea comes from a post by @jyknight on the LLVM Discourse. That is, for:

```
// A.cppm
export module A;
...

    [246 lines not shown]

Delta		File
+57	-60	clang/include/clang/Serialization/ASTBitCodes.h
+65	-26	clang/include/clang/Serialization/SourceLocationEncoding.h
+87	-0	clang/test/Modules/no-transitive-source-location-change.cppm
+22	-44	clang/lib/Serialization/ASTReader.cpp
+31	-17	clang/include/clang/Serialization/ASTReader.h
+35	-8	clang/lib/Serialization/ASTWriter.cpp
+297	-155	8 files not shown
+326	-174	14 files

LLVM/project b944b54 — llvm/test/CodeGen/RISCV/rvv coalesce-vsetvli.mir

2024-05-06 05:31:11 UTC by Luke Lau on ⎇

main

[RISCV] Add RISCVCoalesceVSETVLI tests for removing dead AVLs. NFC

Delta		File
+62	-0	llvm/test/CodeGen/RISCV/rvv/coalesce-vsetvli.mir
+62	-0	1 files

LLVM/project db0ed55 — clang/lib/Format UnwrappedLineParser.cpp, clang/unittests/Format FormatTest.cpp

2024-05-06 04:33:41 UTC by Owen Pan via GitHub on ⎇

main

[clang-format] Don't remove parentheses of fold expressions (#91045)

Fixes #90966.

Delta		File
+9	-0	clang/unittests/Format/FormatTest.cpp
+6	-1	clang/lib/Format/UnwrappedLineParser.cpp
+15	-1	2 files

LLVM/project c609043 — clang/lib/Format UnwrappedLineParser.cpp, clang/unittests/Format TokenAnnotatorTest.cpp

2024-05-06 03:44:13 UTC by Emilia Kond via GitHub on ⎇

main

[clang-format] Don't allow comma in front of structural enum (#91056)

Assume that a comma in front of `enum` means it is actually a part of an
elaborated type in a template parameter list.

Fixes https://github.com/llvm/llvm-project/issues/47782

Delta		File
+3	-2	clang/lib/Format/UnwrappedLineParser.cpp
+4	-0	clang/unittests/Format/TokenAnnotatorTest.cpp
+7	-2	2 files

LLVM/project 774b7eb — llvm/include/llvm/ADT StringRef.h

2024-05-06 03:08:06 UTC by Kazu Hirata via GitHub on ⎇

main

[ADT] Reimplement operator==(StringRef, StringRef) (NFC) (#91139)

I'm planning to deprecate and eventually remove StringRef::equals in
favor of operator==.  This patch reimplements operator== without using
StringRef::equals.

I'm not sure if there is a good way to make StringRef::compareMemory
available to operator==, which is not a member function.  "friend"
works to some extent but breaks corner cases, which is why I've chosen
to "inline" compareMemory.

Delta		File
+5	-1	llvm/include/llvm/ADT/StringRef.h
+5	-1	1 files

LLVM/project f7bfb07 — llvm/lib/Target/X86 X86ISelLowering.cpp, llvm/test/CodeGen/X86 pr91005.ll

2024-05-06 02:59:44 UTC by Phoebe Wang via GitHub on ⎇

main

[X86][FP16] Do not create VBROADCAST_LOAD for f16 without AVX2 (#91125)

AVX doesn't provide 16-bit BROADCAST instruction.

Fixes #91005

Delta		File
+39	-0	llvm/test/CodeGen/X86/pr91005.ll
+1	-1	llvm/lib/Target/X86/X86ISelLowering.cpp
+40	-1	2 files

LLVM/project 3d6cf53 — mlir/docs/DefiningDialects Operations.md

2024-05-06 02:17:16 UTC by Jeremy Kun via GitHub on ⎇

main

fix formatting issues with ODS docs around assembly format directives (#91149)

- Some sentences are incorrectly split across list items.
- Some pre-formatted syntax is left in plaintext
- Some lines end in spaces

Co-authored-by: Jeremy Kun <j2kun at users.noreply.github.com>

Delta		File
+15	-14	mlir/docs/DefiningDialects/Operations.md
+15	-14	1 files

LLVM/project ddecada — clang/lib/Basic/Targets AArch64.cpp, clang/test/OpenMP distribute_parallel_for_simd_num_threads_codegen.cpp distribute_parallel_for_num_threads_codegen.cpp

2024-05-06 02:05:15 UTC by Doug Wyatt via GitHub on ⎇

main

[clang backend] In AArch64's DataLayout, specify a minimum function alignment of 4. (#90702)

This addresses an issue where the explicit alignment of 2 (for C++ ABI
reasons) was being propagated to the back end and causing under-aligned
functions (in special sections).

This is an alternate approach suggested by @efriedma-quic in PR #90415.

Fixes #90358

Delta		File
+15	-15	clang/test/OpenMP/distribute_parallel_for_simd_num_threads_codegen.cpp
+10	-10	clang/test/OpenMP/distribute_parallel_for_num_threads_codegen.cpp
+6	-6	clang/lib/Basic/Targets/AArch64.cpp
+6	-3	llvm/unittests/Bitcode/DataLayoutUpgradeTest.cpp
+8	-0	llvm/lib/IR/AutoUpgrade.cpp
+4	-4	llvm/lib/Target/AArch64/AArch64TargetMachine.cpp
+49	-38	4 files not shown
+57	-46	10 files

LLVM/project e123643 — llvm/lib/Target/AArch64 AArch64ISelLowering.cpp, llvm/test/CodeGen/AArch64 mul_pow2.ll

2024-05-06 00:56:34 UTC by Allen via GitHub on ⎇

main

[AArch64][SelectionDAG] Lower multiplication by a constant to shl+sub+shl+sub (#90199)

Change the costmodel to lower a = b * C where C = 1 - (1 - 2^m) * 2^n to
              sub  w8, w0, w0, lsl #m
              sub  w0, w0, w8, lsl #n
Fix https://github.com/llvm/llvm-project/issues/89430
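
The arithmetic behind the two-instruction sequence can be checked directly. The sketch below is a Python model under stated assumptions, not the AArch64 lowering code: `mul_constant` and `lowered` are hypothetical helper names, and 32-bit wraparound is modeled with a mask to mirror the `w` registers.

```python
MASK32 = 0xFFFFFFFF


def mul_constant(m, n):
    """The constant C = 1 - (1 - 2^m) * 2^n from the commit message."""
    return 1 - (1 - 2**m) * 2**n


def lowered(x, m, n):
    """Model of: sub w8, w0, w0, lsl #m ; sub w0, w0, w8, lsl #n."""
    w8 = (x - (x << m)) & MASK32      # w8 = x * (1 - 2^m)
    return (x - (w8 << n)) & MASK32   # w0 = x - w8 * 2^n = x * C


if __name__ == "__main__":
    # With m = 2 and n = 1 the constant is 1 - (1 - 4) * 2 = 7.
    print(mul_constant(2, 1), lowered(5, 2, 1))
    # Compare against a plain 32-bit multiply for a few shapes of C.
    for m in range(1, 8):
        for n in range(1, 8):
            c = mul_constant(m, n)
            for x in (0, 1, 5, 0x7FFFFFFF, 0xFFFFFFFF):
                assert lowered(x, m, n) == (x * c) & MASK32
```

Substituting the first `sub` into the second gives x - x * (1 - 2^m) * 2^n, which is x * C, so the pair of shifted subtractions replaces the multiply.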

Delta		File
+73	-2	llvm/test/CodeGen/AArch64/mul_pow2.ll
+30	-0	llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
+103	-2	2 files