diff options
author | Nate Begeman <natebegeman@mac.com> | 2008-02-11 04:19:36 +0000 |
---|---|---|
committer | Nate Begeman <natebegeman@mac.com> | 2008-02-11 04:19:36 +0000 |
commit | d77e59efa7ae2abffd48a834336ee4bedf5399eb (patch) | |
tree | d7bcb670b24ecb227f91407faf81ac2da765ada0 /lib/Target/X86/README-SSE.txt | |
parent | 10c8575ba747e8255d45e04564b2d216ad9aad18 (diff) | |
download | external_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.zip external_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.tar.gz external_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.tar.bz2 |
Enable SSE4 codegen and pattern matching.
Add some notes to the README.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46949 91177308-0d34-0410-b5e6-96231b3b80d8
Diffstat (limited to 'lib/Target/X86/README-SSE.txt')
-rw-r--r-- | lib/Target/X86/README-SSE.txt | 20 |
1 files changed, 20 insertions, 0 deletions
diff --git a/lib/Target/X86/README-SSE.txt b/lib/Target/X86/README-SSE.txt index d3f91bf..d9a03a3 100644 --- a/lib/Target/X86/README-SSE.txt +++ b/lib/Target/X86/README-SSE.txt @@ -761,3 +761,23 @@ an X86 fxor. This means that we need to handle this case in the x86 backend instead of in target independent code. //===---------------------------------------------------------------------===// + +Non-SSE4 insert into 16 x i8 is atrociously bad. + +//===---------------------------------------------------------------------===// + +<2 x i64> extract is substantially worse than <2 x f64>, even if the destination +is memory. + +//===---------------------------------------------------------------------===// + +SSE4 extract-to-mem ops aren't being pattern matched because of the AssertZext +sitting between the truncate and the extract. + +//===---------------------------------------------------------------------===// + +INSERTPS can match any insert (extract, imm1), imm2 for 4 x float, and insert +any number of 0.0 simultaneously. Currently we only use it for simple +insertions. + +See comments in LowerINSERT_VECTOR_ELT_SSE4. |