Refactor refobj #441

bbartley · 2023-10-14T20:36:07Z

This experimental feature branch aims to eliminate costly lookup operations. It was motivated by the fact that LabOP protocols, even relatively simple ones, bog down during execution. Profiling showed that the time goes to lookup and find operations, which must traverse the Document tree. As the protocol executes, new objects are dynamically generated, making these traversals ever more costly. Moreover, a conventional caching approach (e.g., Python functools) was ineffective, because the lookups are predominantly performed on newly created objects that are not yet cached.

ReferencedObject attributes have been refactored so that they always return an object, not a URI.
Thus ReferencedObject attributes always hold direct pointers to Python objects in memory; no document traversal is required to dereference them.
When a ReferencedObject refers to an external object not currently contained in the Document, a stub SBOLObject is used instead.
Implements a rudimentary reference counter/garbage collection (for lack of better words).
Almost maintains backward compatibility (almost).
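
To make the idea concrete, here is a minimal sketch of the pattern described above: references held as direct pointers, a stub standing in for a target that is not yet in the document, and back-pointers acting as a rudimentary reference count. `ToyObject`, `ToyDocument`, `refer`, and the `example.org` identities are hypothetical names invented for illustration. This is not the pySBOL3 API and not the code in this branch.

```python
class ToyObject:
    """Hypothetical stand-in for an SBOL object (not a pySBOL3 class)."""

    def __init__(self, identity, is_stub=False):
        self.identity = identity
        self.is_stub = is_stub
        self.references = []  # direct pointers to the objects this one refers to
        self.referrers = []   # back-pointers, used as a rudimentary reference count


class ToyDocument:
    """Hypothetical container that hands out stubs for unknown targets."""

    def __init__(self):
        self.objects = {}  # identity -> object contained in the document
        self.stubs = {}    # identity -> stub for an out-of-document target

    def add(self, obj):
        self.objects[obj.identity] = obj
        stub = self.stubs.pop(obj.identity, None)
        if stub is not None:
            # The reference is now resolvable: re-point every referrer
            # from the stub to the real object.
            for referrer in stub.referrers:
                referrer.references = [
                    obj if ref is stub else ref for ref in referrer.references
                ]
                obj.referrers.append(referrer)
        return obj

    def refer(self, source, target_identity):
        # Dictionary access by identity; no traversal of the object tree.
        target = self.objects.get(target_identity)
        if target is None:
            target = self.stubs.get(target_identity)
        if target is None:
            target = ToyObject(target_identity, is_stub=True)
            self.stubs[target_identity] = target
        source.references.append(target)
        target.referrers.append(source)
        return target


doc = ToyDocument()
a = doc.add(ToyObject("https://example.org/a"))
doc.refer(a, "https://example.org/b")            # b is not in the document yet
stub_before = a.references[0].is_stub            # the reference is a stub
b = doc.add(ToyObject("https://example.org/b"))  # adding b replaces the stub
resolved = a.references[0] is b
```

In this sketch, dereferencing `a.references[0]` is a plain attribute access regardless of how large the document grows, which is the property the refactoring is after.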

jakebeal · 2023-10-15T16:10:59Z

I'm quite sympathetic to this idea, having experienced the significant time costs myself.

My key concern is the implementation of out-of-document objects via anonymous stubs.

What would you think of changing from anonymous stubs to objects of a subclass with a name like MissingObject or OutOfDocument? (I'm not sure whether it should be a subclass of TopLevel or of SBOLObject.) A designated subclass would let a program that's traversing a document know explicitly and positively that it has encountered an out-of-document link, and take actions like trying to resolve the link or ignoring it.
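
As a rough illustration of this suggestion, the sketch below shows how a traversal could detect out-of-document links with a plain `isinstance` check. The name `MissingObject` comes from the comment above. `ToyObject`, its `references` list, the `out_of_document_links` helper, and the `example.org` identities are assumptions made for the example, and the base class is a toy rather than TopLevel or SBOLObject.

```python
class ToyObject:
    """Hypothetical stand-in for an SBOL object (not a pySBOL3 class)."""

    def __init__(self, identity):
        self.identity = identity
        self.references = []  # direct pointers to referenced objects


class MissingObject(ToyObject):
    """Marks a reference whose target is not contained in the document."""


def out_of_document_links(root):
    """Walk the object graph from root and collect identities of missing targets."""
    missing, seen, stack = [], set(), [root]
    while stack:
        obj = stack.pop()
        if id(obj) in seen:
            continue  # already visited; also guards against reference cycles
        seen.add(id(obj))
        if isinstance(obj, MissingObject):
            # Explicit, positive signal: the caller can try to resolve or ignore it.
            missing.append(obj.identity)
            continue
        stack.extend(obj.references)
    return missing


root = ToyObject("https://example.org/root")
child = ToyObject("https://example.org/child")
external = MissingObject("https://example.org/external")
root.references = [child, external]
child.references = [external, root]  # shared target plus a cycle back to root

links = out_of_document_links(root)
```

With a dedicated subclass, the check does not depend on inspecting a stub's contents, so a traversing program can choose its policy (resolve, skip, or report) at the point where the link is encountered.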

tcmitchell · 2023-10-16T16:47:18Z

@bbartley we should merge main into this PR because two other PRs have been merged. One of the other PRs fixes the read-the-docs issue that is causing this PR's build to break.
Would you like me to do the merge? I don't want to step on toes if you're actively developing.

bbartley · 2023-10-16T17:03:45Z

@tcmitchell go for it! this PR is at a stable point and the only thing failing is the read the docs. it would be great to see it go green

Bryan Bartley added 10 commits June 4, 2023 11:11

refobj property returns object not string

fa1f13b

Parsing, serialization, and copy works

a73248a

Add equality operator

1983fe3

Prototype implementation of reference counter

2569573

Implement assignment operations on ReferencedObjectList

27ede0e

Duplicate code is bad

11c73a3

Replace stub objects when a reference is resolved

2a5764d

Remove debugging artifact

190c6c3

Update tests

9bd4921

Fix style errors

6e2ea5e

Bryan Bartley added 7 commits October 15, 2023 20:10

Resolve references and replace stubs during copy operations

783f1b5

When cloning, maintain references internal to the object tree

dce93f0

Fix style

1929965

Set up tests by setting namespace

99aaf63

Delete deprecated copy method

8b35f2d

Fix code checks

6b2417b

Restore deprecated copy because too many tests still depend on it

e2f4a5a

bbartley force-pushed the refactor_refobj branch from 3cf2a13 to e2f4a5a on October 16, 2023 05:05

Merge branch 'main' into refactor_refobj

934ce6b
