Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[JIT] Enable ccmp in X86 emitter backend. #110881

Draft
wants to merge 77 commits into
base: main
Choose a base branch
from

Conversation

anthonycanino
Copy link
Contributor

Overview


This PR is built on top of #108796.

This PR adds APX new ccmp instruction to the X86 backend.

Design

For reference, there is a unique extended evex encoding for ccmp:

image

where SC0 - SC3 encode the condition for ccmp to conditionally execute on (please see SDM Vol 1, Appendix B). If the status codes fail to satisfy the condition encoded by SC0 - SC3, no compare will be performed, and the OF, SF, ZF, and CF flags will be set to the default flag value (DFV) fields of, sf, zf and cf.

Testing

Note: The testing plan for APX work has been discussed in #106557, please refer to that PR for details, only results and comments will be posted in this PR. Results posted below.

Update comments.

Merge the REX2 changes into the original legacy emit path

bug fix: Set REX2.W with correct mask code.

register encoding and prefix emitting logics.

Add REX2 prefix emit logic

bug fixes

Add Stress mode for REX2 encoding and some bug fixes

resolve comments:
1. add assertion check for UD opcodes.
2. add checks for EGPRs.

Add REX2 to emitOutputAM, and let LEA to be REX2 compatible.

Add REX2.X encoding for SIB byte

But fixes: add REX2 prefix on the path in RI where MOV is specially handled.

Enable REX2 encoding for `movups`

fixed bugs in REX2 prefix emitting logic when working with map 1 instructions, and enabled REX2 for POPCNT

legacy map index-er

bug fixes

some clean-up

Adding initial APX unit testing path.

Adding a coredistools dll that has LLVM APX disasm capability.

It must be coppied into a CORE_ROOT manually.

clean up work for REX2

narrow the REX2 scope to `sub` only

some clean up based on the comments.

bug fix

resolve comment
 - SV path is mostly for debugging purposes

Added encoding unit tests for instructions with immediates
Code refactoring: AddX86PrefixIfNeeded.
… missing in JIT, may indicate these instructions are not being used in JIT, drop them for now.
Refactor REX2 encoding stress logics.
(this will have side effect that the estimated code will go up and mismatch with actual code size.)
@dotnet-issue-labeler dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Dec 20, 2024
@dotnet-policy-service dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Dec 20, 2024
Copy link
Contributor

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

@anthonycanino
Copy link
Contributor Author

anthonycanino commented Dec 20, 2024

1. Emitter unit tests

The left is output from JitDisasm, and right from JitLateDisasm.

image
image

2. Intel SDE testing

Test run with SDE:

base

3. SuperPMI results

Diffs are based on 2,623,457 contexts (1,043,127 MinOpts, 1,580,330 FullOpts).

MISSED contexts: 2,983 (0.11%)

Base JIT options: JitBypassApxCheck=1

Diff JIT options: JitBypassApxCheck=1

No diffs found.

Details

Context information

Collection Diffed contexts MinOpts FullOpts Missed, base Missed, diff
aspnet.run.windows.x64.checked.mch 126,540 63,098 63,442 2,665 (2.06%) 2,665 (2.06%)
benchmarks.run.windows.x64.checked.mch 28,757 4 28,753 0 (0.00%) 0 (0.00%)
benchmarks.run_pgo.windows.x64.checked.mch 105,618 52,679 52,939 0 (0.00%) 0 (0.00%)
benchmarks.run_tiered.windows.x64.checked.mch 55,912 38,403 17,509 0 (0.00%) 0 (0.00%)
coreclr_tests.run.windows.x64.checked.mch 582,221 349,625 232,596 0 (0.00%) 0 (0.00%)
libraries.crossgen2.windows.x64.checked.mch 280,377 16 280,361 0 (0.00%) 0 (0.00%)
libraries.pmi.windows.x64.checked.mch 295,086 6 295,080 0 (0.00%) 0 (0.00%)
libraries_tests.run.windows.x64.Release.mch 751,895 517,237 234,658 0 (0.00%) 0 (0.00%)
libraries_tests_no_tiered_compilation.run.windows.x64.Release.mch 342,818 22,045 320,773 0 (0.00%) 0 (0.00%)
realworld.run.windows.x64.checked.mch 24,824 2 24,822 0 (0.00%) 0 (0.00%)
smoke_tests.nativeaot.windows.x64.checked.mch 29,409 12 29,397 318 (1.07%) 318 (1.07%)
2,623,457 1,043,127 1,580,330 2,983 (0.11%) 2,983 (0.11%)

@anthonycanino
Copy link
Contributor Author

anthonycanino commented Dec 23, 2024

We are working to isolate the failures which are likely part of general APX backend changes. I will mark it ready for review in case once they have been fixed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI community-contribution Indicates that the PR has been added by a community member
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants