diff options
author | Evan Cheng <evan.cheng@apple.com> | 2009-07-25 00:33:29 +0000 |
---|---|---|
committer | Evan Cheng <evan.cheng@apple.com> | 2009-07-25 00:33:29 +0000 |
commit | 66ac53165e17b7c76b8c69e57bde623d44ec492e (patch) | |
tree | b9d935e416aac3ec82ffea50d7f543a2f590dba4 /lib/Target/ARM/README-Thumb2.txt | |
parent | 4711326d60251e9394192032e6646fff0d113eee (diff) | |
download | external_llvm-66ac53165e17b7c76b8c69e57bde623d44ec492e.zip external_llvm-66ac53165e17b7c76b8c69e57bde623d44ec492e.tar.gz external_llvm-66ac53165e17b7c76b8c69e57bde623d44ec492e.tar.bz2 |
Change Thumb2 jumptable codegen to one that uses two level jumps:
Before:
adr r12, #LJTI3_0_0
ldr pc, [r12, +r0, lsl #2]
LJTI3_0_0:
.long LBB3_24
.long LBB3_30
.long LBB3_31
.long LBB3_32
After:
adr r12, #LJTI3_0_0
add pc, r12, +r0, lsl #2
LJTI3_0_0:
b.w LBB3_24
b.w LBB3_30
b.w LBB3_31
b.w LBB3_32
This has several advantages.
1. This will make it easier to optimize this to a TBB / TBH instruction +
(smaller) table.
2. This eliminate the need for ugly asm printer hack to force the address
into thumb addresses (bit 0 is one).
3. Same codegen for pic and non-pic.
4. This eliminate the need to align the table so constantpool island pass
won't have to over-estimate the size.
Based on my calculation, the later is probably slightly faster as well since
ldr pc with shifter address is very slow. That is, it should be a win as long
as the HW implementation can do a reasonable job of branch predict the second
branch.
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@77024 91177308-0d34-0410-b5e6-96231b3b80d8
Diffstat (limited to 'lib/Target/ARM/README-Thumb2.txt')
-rw-r--r-- | lib/Target/ARM/README-Thumb2.txt | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/lib/Target/ARM/README-Thumb2.txt b/lib/Target/ARM/README-Thumb2.txt index 810490b..675b1d9 100644 --- a/lib/Target/ARM/README-Thumb2.txt +++ b/lib/Target/ARM/README-Thumb2.txt @@ -5,3 +5,7 @@ * We should model IT instructions explicitly. We should introduce them (even if if-converter is not run, the function could still contain movcc's) before PEI since passes starting from PEI may require exact code size. + +//===---------------------------------------------------------------------===// + +Make use of TBB and TBH for jumptables in small functions. |