Fix ccall return value boxing on ARM/AArch64 #23739

yuyichao · 2017-09-17T05:07:39Z

We previously rely on the extra allocation from the GC to keep the stores inbounds.
This is broken by the allocation optimization since the stack allocation will only have
the requested bytes and not more.

vtjnash · 2017-09-17T14:21:22Z

Seems like this should be using our llvm_type_rewrite so it doesn't inhibit llvm optimizations.

yuyichao · 2017-09-17T14:27:28Z

We are only storing the value so as long as the size is smaller or the same we don't care what type it actually is so unconditionally going through llvm_type_rewrite is unnecessary and can generate extra code.

As long as we are doing this only when there's a size mismatch (i.e. what this PR is doing), it is exactly the same as llvm_type_rewrite. I didn't use it since most of the useful logic from the function is already here to check the size....

yuyichao · 2017-09-17T14:29:58Z

Actually I missed the trunc case in llvm_type_rewrite so I guess I can do that. It doesn't actually matter for the real code though since the type we have here always have an aggregate....

vtjnash · 2017-09-18T18:06:55Z

Ah, didn't realize this was just for structs. (In which case, I think llvm_type_rewrite should be doing the same as this code?) Anyways, this is probably fine, was just considering opportunities for consolidating code.

yuyichao · 2017-09-19T00:06:40Z

FWIW, I think this can share code better with memcpy (the only use of the value is storing anyway). I didn't do it since I don't want to have conflict with #23352 but maybe I should just rebase on it.....

We previously relies on the extra allocation from the GC to keep the stores inbounds. This is broken by the allocation optimization since the stack allocation will only have the requested bytes and not more.

yuyichao added the compiler:codegen Generation of LLVM IR and native code label Sep 17, 2017

yuyichao force-pushed the yyc/codegen/aa64-ccall branch from 658b4de to 9986bfa Compare September 19, 2017 16:00

Fix ccall return value boxing on ARM/AArch64

ca28b5f

We previously relies on the extra allocation from the GC to keep the stores inbounds. This is broken by the allocation optimization since the stack allocation will only have the requested bytes and not more.

yuyichao force-pushed the yyc/codegen/aa64-ccall branch from 9986bfa to ca28b5f Compare September 19, 2017 20:12

yuyichao merged commit 83a89a1 into master Sep 20, 2017

yuyichao deleted the yyc/codegen/aa64-ccall branch September 20, 2017 18:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix ccall return value boxing on ARM/AArch64 #23739

Fix ccall return value boxing on ARM/AArch64 #23739

yuyichao commented Sep 17, 2017 •

edited

Loading

vtjnash commented Sep 17, 2017

yuyichao commented Sep 17, 2017

yuyichao commented Sep 17, 2017

vtjnash commented Sep 18, 2017

yuyichao commented Sep 19, 2017

Fix ccall return value boxing on ARM/AArch64 #23739

Fix ccall return value boxing on ARM/AArch64 #23739

Conversation

yuyichao commented Sep 17, 2017 • edited Loading

vtjnash commented Sep 17, 2017

yuyichao commented Sep 17, 2017

yuyichao commented Sep 17, 2017

vtjnash commented Sep 18, 2017

yuyichao commented Sep 19, 2017

yuyichao commented Sep 17, 2017 •

edited

Loading