-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
8289943: Simplify some object allocation merges #9073
Conversation
Merge branch 'allocation-merges' of https://github.com/JohnTortugo/jdk into allocation-merges
👋 Welcome back JohnTortugo! A progress list of the required criteria for merging this PR into |
@JohnTortugo this pull request can not be integrated into git checkout allocation-merges
git fetch https://git.openjdk.java.net/jdk master
git merge FETCH_HEAD
# resolve conflicts and follow the instructions given by git merge
git commit -m "Merge master"
git push |
|
@JohnTortugo The following label will be automatically applied to this pull request:
When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing list. If you would like to change these labels, use the /label pull request command. |
Yes, experimental flag is good for now if you DO plan to switch it ON later (and convert flag into other type). |
@JohnTortugo I just noticed that both new tests missing |
Yes, our plan is to keep working on this until we can remove all the constraints in
Totally understand your concern. I'm fine with setting the flag to true and making it "Diagnostic" - I just don't see it as a "Diagnostic" patch, but if it's fine for you then it's fine for me. |
test/hotspot/jtreg/compiler/c2/irTests/scalarReplacement/AllocationMergesTests.java
Show resolved
Hide resolved
The
Yes, in that case both allocations are removed. I just confirmed it with a test locally. Also, there is an IR-based test for that case.
The current patch bails out in that test because there is a Phi (or CmpP) consuming the merge Phi. Actually, that code example is one of the tests that I run "internally". There is already work going on to improve the current patch to make it able to handle CmpP with NULL. |
Thank you for answering my questions and doing all research. Let do this: make flag "experimental" and I will start new round of testing (more tiers) and I will ask others to review changes. |
@JohnTortugo Looks like your latest change/fix cause new issue. I see GitHub Action testing failed with:
|
@JohnTortugo I meant you can specify the warmup iterations for the whole test, not just some methods inside with
It would be great if you include all known failure cases of scalar replacement in the IR test. Thanks. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very nice work, Cesar. Most notably, I'm happy to see the test with so many non-trivial cases enumerated.
Speaking of the proposed patch itself, I'm not done yet reviewing it.
As of now, I don't fully grasp what's the purpose and motivation to introduce ReducedAllocationMerge
. I would be grateful for additional information about how you ended up with the current design. (I went through the email thread, but it didn't help me.)
In particular, I still don't understand how it interacts with existing scalar replacement logic when it comes to unique (per-allocation) memory slices.
Also, on bailouts when the new analysis fails: I instrumented all possible failure modes with asserts and all the test failures I saw were in 2 places:
src/hotspot/share/opto/callnode.cpp:1912
} else if (input->bottom_type()->base() == Type::Memory) {
// Somehow the base was eliminated and we still have a memory reference left
assert(false, "");
return NULL;
}
src/hotspot/share/opto/macro.cpp:2612
// In some cases the region controlling the RAM might go away due to some simplification
// of the IR graph. For now, we'll just bail out if this happens.
if (n->in(0) == NULL || !n->in(0)->is_Region()) {
assert(false, "");
C->record_failure(C2Compiler::retry_no_reduce_allocation_merges());
return;
}
How hard would it be to extend the test with cases which demonstrate existing limitations?
Additional high-level comments. (1) Overall, I don't see (yet) enough motivation to justify the addition of
Also, I believe you face some ideal graph inconsistencies because you capture information too early (before (2) Following up on my earlier question about interactions with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor comments. No need to act until we agree on high-level aspects of the patch.
Hi Vladimir, first of all, thank you for reviewing the proposed patch. Sorry for
My first implementation to deal with the problem was just replacing Phi's The proposed idea of using a new type of node to represent the merges have
Some arguably Cons of the current approach:
The RAM node itself doesn't interfere with alias indexes / memory slices During the Macro node elimination phase the allocation nodes will be visited and I hope I have answered your question. If not please let me know and I'll be
I'll try and create some test cases that trigger those limitations.
Can you please elaborate on that?
Can you please explain a bit more about this idea? Are you proposing to split the |
Thanks for the clarifications, Cesar. The concept of RAM node still looks like a hack to me. And I'd like the patch to better fit the overall design of EA rather than trying to workaround different peculiarities of the current implementation. I'll try to elaborate why I see RAMs redundant. As of now, it serves dual purpose. It (1) marks a merge point as safe to be untangled during SR; and (2) caches information about field values. I believe you can solve it in a cleaner manner without introducing placeholder One possible way to handle merge points is:
Do you see any problems with such an approach? One thing still confuses me though: the patch mentions that RAMs can merge both eliminated and not-yet-eliminated allocations. What's the intended use case? I believe it's still required to have all merged allocations to be eventually eliminated. Do you try to handle the case during allocation elimination when part of the inputs are already eliminated and the rest is pending their turn? |
There are places in the patch where you check for unintended graph modifications, like in
I consider that an implementation peculiarity contributed by RAMs rather than an inherent complication coming from the transformation being performed. |
An other important purpose of RAM is to have information at SafePoints after merge point for reallocation during deoptimization. You need Klass information. I don't think having only Phis for values is enough.
I am not sure how this could be possible. Currently EA rely on IGVN to propagate fields values based on unique memory slice. What you do with memory Load or Store nodes after merge point? Which memory slice you will use for them?
There is check for it in |
Klass information is available either from Allocation node in
My understanding of how proposed approach is expected to work: merge points have to be simple enough to still allow splitting unique types for individual allocations. For example, Alternatively, corresponding
That's nice! Now I see It means that only
|
Hi @iwanowww - Thank you for clarifying things! After much thought and some testing, I think I can make the RAM node go away and achieve the results I want. Below are some additional comments & the overall approach that I'm going to switch to.
Does it look like a better approach? TIA! |
It definitely does! Thanks a lot for thinking it through.
IMO it should be fine to customize
From design perspective, it's cleaner to modify the IR during
It would be helpful to see an implementation sketch to get better understanding how much complexity it adds. |
@JohnTortugo This pull request has been inactive for more than 4 weeks and will be automatically closed if another 4 weeks passes without any activity. To avoid this, simply add a new comment to the pull request. Feel free to ask for assistance if you need help with progressing this pull request towards integration! |
Hi all! I'm back from leave and I'm resuming the work on this project. I'll keep you updated. Thanks again for all the comments and feel free to add more in the meantime. |
@JohnTortugo This pull request has been inactive for more than 4 weeks and will be automatically closed if another 4 weeks passes without any activity. To avoid this, simply add a new comment to the pull request. Feel free to ask for assistance if you need help with progressing this pull request towards integration! |
Hi, ALL. I decided to create a new PR since after the latest changes the code looked much different than the version in this PR. I also attacked the problem from another direction: I decided to create an infrastructure for re-materializing objects before anything else since merges being used as debug information is the most common use case (see charts on new PR). Still, in this new approach, I plan to include all the feedback I received here. 1) No need for RAM node; 2) Improve split-unique-types, 3) Make use of split-through-phi. Thank you all again! |
Hi there, can I please get some feedback on this approach to simplify object allocation merges in order to promote Scalar Replacement of the objects involved in the merge?
The basic idea for this approach was discussed in this thread and it consists of:
There are a few conditions for doing the replacement of the Phi by a RAM node though - Although I plan to work on removing them in subsequent PRs:
These are the critical parts of the implementation and I'd appreciate it very much if you could tell me if what I implemented isn't violating any C2 IR constraints:
Testing:
Progress
Issue
Reviewing
Using
git
Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk pull/9073/head:pull/9073
$ git checkout pull/9073
Update a local copy of the PR:
$ git checkout pull/9073
$ git pull https://git.openjdk.org/jdk pull/9073/head
Using Skara CLI tools
Checkout this PR locally:
$ git pr checkout 9073
View PR using the GUI difftool:
$ git pr show -t 9073
Using diff file
Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/9073.diff