docs: [FC-0074] add ADR with event design suggestions #438
16. Event Design Best Practices
###############################

Status
------

Draft

Context
-------

It is important to follow standards to ensure that the events are consistent, maintainable, and reusable. The design of the events should be self-descriptive, self-contained, and provide enough information for consumers to understand the message. This ADR aims to provide a set of suggested practices for designing Open edX events.

Decision
--------

We have compiled a list of suggested practices taken from the following sources:

- `Event-Driven Microservices`_
- `Event-Driven article`_
- `Thin Events - The lean muscle of event-driven architecture`_

These are the practices that we recommend reviewing and following when designing an Open edX Event and contributing to the library. The goal is to implement events that are consistent with the architecture, reusable, and maintainable over time.
|
||
Event Purpose and Content
~~~~~~~~~~~~~~~~~~~~~~~~~

- An event should describe as accurately as possible what happened (what) and why it happened (why). It must contain enough information for consumers to understand the message. For instance, an event about a user enrollment should contain the user's data, the course data, and the enrollment status, and the event should be named accordingly.
- Avoid forcing consumers to immediately contact the source service to retrieve additional information. Instead, manage the granularity of the event so that the payload carries what consumers need; if something is missing, consider adding a field for it. This reduces the number of dependencies between services and makes the event more self-contained.
- Keep the event size small. Avoid adding unnecessary information to the event. If the information is not necessary for consumers to react to the event, consider removing it.
- Avoid adding flow-control information or business logic to events. Events should be solely a representation of what took place. If a field is necessary to control the behavior of the consumer, consider moving it to the consumer side. If adding additional data to the event is absolutely necessary, document the reasoning behind it and carefully study the use case and implications.

Here is an example of an event that follows these practices, emitted when a user registers:
|
||
.. code-block:: python

    # Location openedx_events/learning/signals.py
    # .. event_type: org.openedx.learning.student.registration.completed.v1
    # .. event_name: STUDENT_REGISTRATION_COMPLETED
    # .. event_description: emitted when the user registration process in the LMS is completed.
    # .. event_data: UserData
    STUDENT_REGISTRATION_COMPLETED = OpenEdxPublicSignal(
        event_type="org.openedx.learning.student.registration.completed.v1",
        data={
            "user": UserData,
        }
    )

Where:
|
||
- The event name indicates what happened: ``STUDENT_REGISTRATION_COMPLETED``.
- The event description explains why the event happened: ``emitted when the user registration process in the LMS is completed``.
- The event data, ``UserData``, is directly related to what happened and should contain the information needed to understand the event, such as the username and email of the user.
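
To illustrate what a self-contained payload means for a consumer, here is a minimal sketch. ``RegisteredUser`` and ``build_welcome_message`` are hypothetical names used only for illustration; they are simplified stand-ins and not the library's ``UserData`` or any real receiver. The point is that the handler reacts using only what the event carries, without a request back to the source service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RegisteredUser:
    """Hypothetical, simplified stand-in for the library's ``UserData``."""

    username: str
    email: str
    is_active: bool


def build_welcome_message(user: RegisteredUser) -> str:
    """Hypothetical consumer-side handler that uses only the event payload."""
    # No call back to the LMS: everything needed is already in the payload.
    return f"Welcome, {user.username}! A confirmation was sent to {user.email}."


# The payload mirrors the shape of the event's ``data`` mapping: {"user": ...}.
payload = {
    "user": RegisteredUser(username="jdoe", email="jdoe@example.com", is_active=True)
}
message = build_welcome_message(payload["user"])
print(message)
```

If a consumer needed a field that is not in the payload, that would be a signal to revisit the event's granularity rather than to add a lookup in the handler.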
|
||
Responsibility and Granularity
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

- Design events with a single responsibility in mind. Each event should represent a single action or fact that happened in the system. If an event contains multiple actions, consider splitting it into multiple events. For instance, if the course grade is updated to pass or fail, there should be two events: one for the pass action and another for the fail action.

.. note:: For the :doc:`Event Bus <../concepts/event-bus>`, events that are split across multiple actions are an exceptional case where the same event :term:`Topic` should be used to help maintain order across these events.

- Manage the granularity of the event so it is not too coarse (generic with too much information) or too fine-grained (specific with too little information). When making a decision on the granularity of the event, start with the minimum required information for consumers to react to the event and add more information as needed with enough justification. If necessary, leverage API calls from the consumer side to retrieve additional information, but always consider the trade-offs of adding dependencies on other services.

Review comment: I agree with the idea of avoiding generic events. However, we should also avoid split events like course_passed or course_failed when we could use just course_completed with a status. Could we include a practical example of an appropriate level of granularity?

Review comment: I'm interested in understanding why having a

Review comment: I understand your perspective, and I think both approaches have valid use cases depending on the context. Let me elaborate on why I suggested using course_completed with a status field, and we can discuss further to find a balance. If the event represents a single conceptual action, for example completing a course, having one event like course_completed with a clear status (passed, failed, etc.) could simplify the producer's logic and reduce event proliferation. For consumers, interpreting the status field is relatively straightforward if it is well-documented and includes only a few well-defined values. However, if different consumers need to handle the passed and failed cases in significantly different ways, separate events might reduce complexity for them. Emitting separate events like course_passed and course_failed could also create challenges, such as needing to ensure mutual exclusivity. So in cases like this, I think we could start with course_completed and a status field, ensuring it is well-documented and strictly validated. If in the future we observe a clear need for distinct event flows, we could introduce the more granular events without breaking existing consumers.

Review comment: In the example we're using, I still think it'd be more straightforward to send smaller and more specific events. As I see it, course completion and grade passing would be two different critical facts happening in the system; therefore, they should be independent. This is more of a question of what consumers would want with a course completion event or a course passing status change. In any case, these are only suggestions that should be evaluated for each case. We currently have an event called

Review comment: I added some examples illustrating the granularity I'm referring to: https://github.com/openedx/openedx-events/pull/438/files#diff-bdc081c0ccc9885672f08f8b2c853ca7ce8a068db2cb2497d11e04b19013e19aR88-R112

Review comment: @Alec4r I worry that your example is predicated on understanding the needs of the consumers. I don't think it's always possible to predict how various consumers will need to consume events. I agree with @mariajgrimaldi that consuming an event that doesn't require me to also check the status field is conceptually simpler, even if it means I need to handle consuming a few discrete events.
||
- Ensure that the triggering logic is consistent and narrow. For instance, an event that represents a user enrolling in a course should be triggered by every path that can create an enrollment. If it is triggered when a user enrolls through the API, it should also be triggered when the user enrolls through the UI.

For instance, consider the following events:
|
||
.. code-block:: python

    # Location openedx_events/learning/signals.py
    # .. event_type: org.openedx.learning.course.grade.passed.v1
    # .. event_name: COURSE_GRADE_PASSED
    # .. event_description: emitted when the user's course grade is updated to pass.
    # .. event_data: CourseGradeData
    COURSE_GRADE_PASSED = OpenEdxPublicSignal(
        event_type="org.openedx.learning.course.grade.passed.v1",
        data={
            "grade": CourseGradeData,
        }
    )

    # Location openedx_events/learning/signals.py
    # .. event_type: org.openedx.learning.course.grade.failed.v1
    # .. event_name: COURSE_GRADE_FAILED
    # .. event_description: emitted when the user's course grade is updated to fail.
    # .. event_data: CourseGradeData
    COURSE_GRADE_FAILED = OpenEdxPublicSignal(
        event_type="org.openedx.learning.course.grade.failed.v1",
        data={
            "grade": CourseGradeData,
        }
    )

Where:
|
||
- The event names indicate what happened: ``COURSE_GRADE_PASSED`` and ``COURSE_GRADE_FAILED``.
- The event descriptions explain why each event happened: ``emitted when the user's course grade is updated to pass`` and ``emitted when the user's course grade is updated to fail``.
- The event data, ``CourseGradeData``, is directly related to what happened and should contain the information needed to understand the event, such as the user, the course, the grade, and the date of the grade update.
- The granularity of the event is managed by having two events: one for the pass action and another for the fail action.
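
The following sketch shows one way the two-event split could look from both sides. It uses a plain in-memory registry instead of the library's signal machinery; ``GradeChange``, ``receivers``, ``emit_grade_event``, and the ``passing_percent`` threshold are all hypothetical and exist only for illustration. The producer picks exactly one event type per grade update, and a consumer subscribes only to the event it cares about, so it never has to inspect a status field.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, DefaultDict, List


@dataclass(frozen=True)
class GradeChange:
    """Hypothetical, simplified stand-in for ``CourseGradeData``."""

    username: str
    course_key: str
    percent: float


PASSED = "org.openedx.learning.course.grade.passed.v1"
FAILED = "org.openedx.learning.course.grade.failed.v1"

# Hypothetical in-memory registry: event type -> list of receivers.
receivers: DefaultDict[str, List[Callable[[GradeChange], None]]] = defaultdict(list)


def emit_grade_event(grade: GradeChange, passing_percent: float = 0.5) -> str:
    """Producer side: choose exactly one event type per grade update."""
    event_type = PASSED if grade.percent >= passing_percent else FAILED
    for receiver in receivers[event_type]:
        receiver(grade)
    return event_type


# Consumer side: only interested in passing grades, so no status check is needed.
certificates: List[str] = []
receivers[PASSED].append(lambda grade: certificates.append(grade.username))

emit_grade_event(GradeChange("alice", "course-v1:demo+101+2024", 0.82))
emit_grade_event(GradeChange("bob", "course-v1:demo+101+2024", 0.31))
print(certificates)
```

Because the producer selects the event type in a single expression, the two events stay mutually exclusive for any given grade update.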
|
||
Each of these practices should be reviewed case by case, and the granularity of the event should be adjusted according to the use case and the information required by the consumers.
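
The consistent-triggering practice from this section can be sketched as follows. All names here are hypothetical, the ``emitted`` list stands in for a real signal send, and the event type string is an assumption used for illustration. The idea is that every enrollment path delegates to one function that owns the emit logic, so no path can skip the event.

```python
from typing import Dict, List

# Assumed event type string, used here for illustration only.
ENROLLMENT_CREATED = "org.openedx.learning.course.enrollment.created.v1"

# Hypothetical in-memory log standing in for a real signal send.
emitted: List[Dict[str, str]] = []


def enroll(username: str, course_key: str) -> None:
    """Single domain-level entry point: every enrollment path goes through here."""
    # ... the enrollment itself would be persisted here ...
    emitted.append(
        {"event_type": ENROLLMENT_CREATED, "username": username, "course_key": course_key}
    )


def enroll_via_api(request_data: Dict[str, str]) -> None:
    """API path: delegates instead of duplicating the emit logic."""
    enroll(request_data["username"], request_data["course_key"])


def enroll_via_ui(form_data: Dict[str, str]) -> None:
    """UI path: delegates to the same function, so the event is emitted the same way."""
    enroll(form_data["username"], form_data["course_key"])


enroll_via_api({"username": "alice", "course_key": "course-v1:demo+101+2024"})
enroll_via_ui({"username": "bob", "course_key": "course-v1:demo+101+2024"})
print([event["username"] for event in emitted])
```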
|
||
Event Structure and Clarity
~~~~~~~~~~~~~~~~~~~~~~~~~~~

- Use appropriate data types and formats for the event fields. Don't use generic data types like strings for all fields. Use specific data types like integers, floats, dates, or custom types when necessary.
- Avoid ambiguous data fields or fields with multiple meanings. For instance, if an event contains a field called ``status``, it should be clear what the status represents. If the status can have multiple meanings, consider splitting the event into multiple events or adding a new field to clarify the status.

For instance, consider the ``CourseEnrollmentData`` class:

- The ``mode`` field is a string that represents the course mode. It could be a string like "verified", "audit", "honor", etc.
- The ``is_active`` field is a boolean that represents whether the enrollment is active or not.
- The ``creation_date`` field is a datetime that represents the creation date of the enrollment.
- The ``created_by`` field is a ``UserData`` that represents the user who created the enrollment.
- The ``user`` field is a ``UserData`` that represents the user associated with the Course Enrollment.
- The ``course`` field is a ``CourseData`` that represents the course in which the user is enrolled.
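
As a rough illustration of the field typing described above, the sketch below models the same fields with standard-library dataclasses. ``UserSketch``, ``CourseSketch``, and ``EnrollmentSketch`` are hypothetical names, their inner fields are assumptions, and this is not the library's definition of ``CourseEnrollmentData``. Note that dataclass annotations document the intended types but do not enforce them at runtime.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class UserSketch:
    """Hypothetical stand-in for ``UserData``."""

    username: str
    email: str


@dataclass(frozen=True)
class CourseSketch:
    """Hypothetical stand-in for ``CourseData``."""

    course_key: str
    display_name: str


@dataclass(frozen=True)
class EnrollmentSketch:
    """Hypothetical stand-in mirroring the fields listed above."""

    user: UserSketch          # custom type instead of a bare username string
    course: CourseSketch      # custom type instead of a bare course id string
    mode: str                 # e.g. "verified", "audit", "honor"
    is_active: bool           # boolean, not "true"/"false" text
    creation_date: datetime   # datetime, not a pre-formatted string
    created_by: UserSketch


learner = UserSketch(username="alice", email="alice@example.com")
enrollment = EnrollmentSketch(
    user=learner,
    course=CourseSketch(course_key="course-v1:demo+101+2024", display_name="Demo"),
    mode="audit",
    is_active=True,
    creation_date=datetime(2024, 1, 15, 9, 30, tzinfo=timezone.utc),
    created_by=learner,
)

# A typed datetime lets consumers compute with the value without parsing text.
print(enrollment.creation_date.year, enrollment.user.username)
```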
|
||
Consumer-Centric Design
~~~~~~~~~~~~~~~~~~~~~~~

- When designing an event, consider the consumers that will be using it. What information do they need to react to the event? What data is necessary for them to process the event?
- You can't always predict the needs of consumers in the future. Ensure your design is discrete, flexible, and well-documented, so new and novel use cases can be developed.
- Design events carefully from the start to minimize breaking changes for consumers, although it is not always possible to avoid breaking changes.

Review comment: Will it be necessary to version events to handle event changes, or what is the plan for handling event changes?

Review comment: Events are versioned by definition, see this ADR for more info. As for the evolution of the events schema, this other ADR describes what the behavior is supposed to be: https://github.com/openedx/openedx-events/blob/main/docs/decisions/0006-event-schema-serialization-and-evolution.rst#decision, although according to this comment the ADR might be outdated -- I'll be working on updating it. The reality is that we haven't needed to change an event definition in any way that's breaking.

Review comment: @robrap: Can you help us figure out what needs to change from the ADR-0006? I could do it, but I need more context to do it effectively.

Review comment: Hello @mariajgrimaldi. I was off for the last two weeks, so just getting back to things. It seems like you are pretty up to date. I'll summarize what I think you are already saying.
Does this help? Let me know if you have any specific questions.

Review comment: Yes, it helps! Thank you. So I'll go ahead and create a new ADR considering the full transitive evolution strategy, so at least it is documented somewhere, noting that we haven't gone through any process of evolution just yet. I'll update the ADR in the coming weeks as a different effort. Thank you!
||
|
||
Some of these practices might not be applicable to all events, but they are a good starting point to ensure that events are consistent and maintainable over time. In short, design each event so that it is small, well-defined, and contains only relevant information.

In addition to these practices, review the Architectural Decision Records (ADRs) related to events to understand the naming, versioning, payload, and other practices that are specific to Open edX events.
|
||
.. _Event-Driven Microservices: https://www.oreilly.com/library/view/building-event-driven-microservices/9781492057888/
.. _Event-Driven article: https://martinfowler.com/articles/201701-event-driven.html
.. _Thin Events - The lean muscle of event-driven architecture: https://www.thoughtworks.com/insights/blog/architecture/thin-events-the-lean-muscle-of-event-driven-architecture

Review comment: I know this ADR is about the event design itself, but somewhere you may want to consider adding a note like the following:
Events that are split across multiple actions is an exceptional case where the same event queue/topic should be used to help maintain order across these events.

Review comment: This note's pretty useful! Thanks. I'll make sure to add it right away.