several caprevoke optimizations #2195

markjdb · 2024-08-14T15:05:53Z

The procstat-related patches should perhaps be upstreamed as well, though procstat vm doesn't have a verbose mode in vanilla FreeBSD.

nwf-msr · 2024-08-14T15:10:27Z

sys/vm/vm_cheri_revoke.c

+SYSCTL_COUNTER_U64(_vm_stats_cheri_revoke, OID_AUTO, skip_obj_no_hascap, CTLFLAG_RD,
+    &cheri_skip_obj_no_hascap,
+    "Virtual pages skipped in VM objects with no capabilities");
+


Reason to prefer this over the caprevoke stats infrastructure? (Though admittedly I'd not be sad to see the latter get ripped out.)

Mostly that I wanted to have a global view of the counter value, and sysctl is more convenient for that. The existing stats infrastructure collects per-process stats and I believe is limited to printing them when the process exits. The right direction might be a hybrid scheme wherein we maintain per-process (really, per-vmspace) and global counters, and use the former to update the latter after each scan.

nwf-msr · 2024-08-14T15:14:07Z

sys/vm/vm_cheri_revoke.c

 		goto fini;
+	if ((entry->max_protection & VM_PROT_READ_CAP) == 0) {


The commit message should probably say "have PROT_READ_CAP clear"?

No, but the commit log is not very clear. I meant that the existing check (max_prot & PROT_READ_CAP) == 0 does not exclude, e.g., mappings of executable ELF file segments, so without this change we end up scanning those pages unnecessarily.

brooksdavis

I've given this all a light skim and it generally seems good. The second commit message makes sense now that I understand what's going on, but spelling out more from the big block comment is probably worthwhile.

brooksdavis · 2024-08-15T03:37:56Z

sys/sys/user.h

@@ -576,7 +576,8 @@ struct kinfo_vmentry {
 	} kve_type_spec;
 	uint64_t kve_vn_rdev;			/* Device id if device. */
 	uint64_t kve_reservation;		/* Map reservation */
-	int	 _kve_ispare[6];		/* Space for more stuff. */
+	int	 kve_max_protection;		/* Max protection bitmask. */


I wonder if we should upstream kve_max_protection and swap them around in CheriBSD before the next release?

https://reviews.freebsd.org/D46293

If, when scanning, we reach a VA that is unmapped and corresponds to a non-resident page, and the VM object is swap-backed and has no swap blocks assigned, we skip to the next resident page. Extend this optimization to the case where the object has swap blocks assigned: the in-memory tracking structure for a swap block includes a bitmap of all the capability tags, so when skipping we can search for the closer of - the next resident page - the next page backed by a swap block that has at least one capability tag.

This skips over vnode mappings that have PROT_READ_CAP set in max_prot and thus avoids a lot of needless scanning. Add some counters so that we can see the relative effectiveness of different checks in reducing the number of scanned virtual pages.

This avoids some needless work in cases where a short-lived process triggers async revocation.

markjdb · 2024-10-09T12:22:35Z

I dropped the procstat changes since those are coming from upstream and aren't critical to the rest of the PR.

bsdjhb · 2024-10-16T16:23:38Z

sys/vm/vm_cheri_revoke.c

@@ -192,15 +199,14 @@ vm_cheri_revoke_kproc(void *arg __unused)
 		/*
 		 * Do the actual revocation pass.
 		 */
-		error = vm_cheri_revoke_pass_locked(&arc->cookie);
+		error = vm_cheri_revoke_pass_locked(arc->vm, &arc->cookie);


It looks a bit weird to use this after it was seemingly free'd above. It might be clearer to the reader with a comment and a local vm temporary like so:

vmspace_switch_aio(&arc->vm); vm = arc->vm; /* * Drop arc's reference on the vmspace. The scanner kproc holds a reference via * its p_vmspace pointer while the scanning the address space. Dropping the * reference here permits the scan to return early if target process exits early * leaving the scanner kproc's reference as the only reference. */ vmspace_free(arc->vm);

nwf-msr reviewed Aug 14, 2024

View reviewed changes

nwf-msr approved these changes Aug 14, 2024

View reviewed changes

brooksdavis approved these changes Aug 15, 2024

View reviewed changes

markjdb and others added 3 commits October 9, 2024 12:21

caprevoke: Bail if the target process exited during an async scan

f0dce4c

This avoids some needless work in cases where a short-lived process triggers async revocation.

markjdb force-pushed the dev-caprevoke-optimizations branch from f4b1b61 to f0dce4c Compare October 9, 2024 12:21

bsdjhb reviewed Oct 16, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

several caprevoke optimizations #2195

several caprevoke optimizations #2195

markjdb commented Aug 14, 2024

nwf-msr Aug 14, 2024

markjdb Aug 14, 2024

nwf-msr Aug 14, 2024

markjdb Aug 14, 2024

brooksdavis left a comment

brooksdavis Aug 15, 2024

jrtc27 Aug 15, 2024

markjdb commented Oct 9, 2024

bsdjhb Oct 16, 2024

		goto fini;
		if ((entry->max_protection & VM_PROT_READ_CAP) == 0) {

several caprevoke optimizations #2195

Are you sure you want to change the base?

several caprevoke optimizations #2195

Conversation

markjdb commented Aug 14, 2024

nwf-msr Aug 14, 2024

Choose a reason for hiding this comment

markjdb Aug 14, 2024

Choose a reason for hiding this comment

nwf-msr Aug 14, 2024

Choose a reason for hiding this comment

markjdb Aug 14, 2024

Choose a reason for hiding this comment

brooksdavis left a comment

Choose a reason for hiding this comment

brooksdavis Aug 15, 2024

Choose a reason for hiding this comment

jrtc27 Aug 15, 2024

Choose a reason for hiding this comment

markjdb commented Oct 9, 2024

bsdjhb Oct 16, 2024

Choose a reason for hiding this comment