summaryrefslogtreecommitdiffstats
path: root/lib/Target/X86/README-SSE.txt
diff options
context:
space:
mode:
authorNate Begeman <natebegeman@mac.com>2008-02-11 04:19:36 +0000
committerNate Begeman <natebegeman@mac.com>2008-02-11 04:19:36 +0000
commitd77e59efa7ae2abffd48a834336ee4bedf5399eb (patch)
treed7bcb670b24ecb227f91407faf81ac2da765ada0 /lib/Target/X86/README-SSE.txt
parent10c8575ba747e8255d45e04564b2d216ad9aad18 (diff)
downloadexternal_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.zip
external_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.tar.gz
external_llvm-d77e59efa7ae2abffd48a834336ee4bedf5399eb.tar.bz2
Enable SSE4 codegen and pattern matching.
Add some notes to the README. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@46949 91177308-0d34-0410-b5e6-96231b3b80d8
Diffstat (limited to 'lib/Target/X86/README-SSE.txt')
-rw-r--r--lib/Target/X86/README-SSE.txt20
1 files changed, 20 insertions, 0 deletions
diff --git a/lib/Target/X86/README-SSE.txt b/lib/Target/X86/README-SSE.txt
index d3f91bf..d9a03a3 100644
--- a/lib/Target/X86/README-SSE.txt
+++ b/lib/Target/X86/README-SSE.txt
@@ -761,3 +761,23 @@ an X86 fxor. This means that we need to handle this case in the x86 backend
instead of in target independent code.
//===---------------------------------------------------------------------===//
+
+Non-SSE4 insert into 16 x i8 is atrociously bad.
+
+//===---------------------------------------------------------------------===//
+
+<2 x i64> extract is substantially worse than <2 x f64>, even if the destination
+is memory.
+
+//===---------------------------------------------------------------------===//
+
+SSE4 extract-to-mem ops aren't being pattern matched because of the AssertZext
+sitting between the truncate and the extract.
+
+//===---------------------------------------------------------------------===//
+
+INSERTPS can match any insert (extract, imm1), imm2 for 4 x float, and insert
+any number of 0.0 simultaneously. Currently we only use it for simple
+insertions.
+
+See comments in LowerINSERT_VECTOR_ELT_SSE4.