The difficulty in handling complex scenarios due to the hierarchical architecture of Planning and the modularization of functionalities #5033
Replies: 2 comments 2 replies
-
I would like to share the progress on this proposal. First, we have started by creating a prototype of the ranking algorithm. The role of the ranking is to select the best trajectory from those generated by the generator. By the time the trajectories reach the ranking stage, they are already guaranteed to comply with traffic rules, safety, and followability, so the process is similar to selecting a driving policy. For example, when deciding whether to choose a candidate trajectory that aggressively changes lanes or to continue driving straight, this decision is made by the ranking. We are testing two types of ranking algorithms: one based on inverse reinforcement learning and another based on a rule-based approach. The inverse reinforcement learning-based approach has not yet yielded results that are ready to be shared. On the other hand, for the rule-based approach, we are implementing the ranking with the hypothesis that it can function reasonably well with the following five criteria.
Currently, we have collected imitation data from manual driving and are able to visualize Safety, Comfortability, and Efficiency. Moving forward, we plan to design a scoring function based on these criteria and verify whether the appropriate trajectory is selected. |
Beta Was this translation helpful? Give feedback.
-
@yukkysaito san, Is it possible for someone to update us and present the work that has been done so far at Planning & Control WG? I have checked the papers you have shared and Frenetix looks like the old lattice planner that we had which I lost 2 years from my life I hope they are not similar. Also is this testing open source? Can others also join the testing efforts? I would like to learn more and join the efforts on testing in simulation. Let me know if that's a possibility and when it could be done. |
Beta Was this translation helpful? Give feedback.
-
Background
https://github.com/orgs/autowarefoundation/discussions/5032
Issues
The current Autoware planning architecture employs a parallel architecture that can switch according to the scenario, using a hierarchical architecture primarily for lane driving scenarios. Currently, most of the planning functions operate within this hierarchical architecture to handle lane driving scenarios.
One of the shortcomings of this hierarchical architecture is the discrepancy in decision-making that arises from separating behavior planning and motion planning. This issue was recognized as a challenge of the architecture since its proposal (following image).
https://github.com/tier4/AutowareArchitectureProposal.proj/blob/main/docs/design/software_architecture/Planning/DesignRationale.md
In practice, we have faced the shortcomings of the hierarchical architecture, such as implementing speed planning within behavior planning to improve decision-making accuracy. Additionally, based on the concept of micro-autonomy, we have modularized the Planning function into scene modules to enable the on/off switching and expansion of each function. However, in complex scenarios, the cooperation between scene modules becomes necessary, making it difficult for modules to operate independently.
As a result, the current Autoware architecture faces increased functional demands in complex scenarios, leading to higher coupling between scene modules and between behavior planning and motion planning. This makes it challenging to switch or expand each function, resulting in more complex code, increased bugs, and higher maintenance costs.
Proposal
To solve this problem, we propose building a higher-level concept, the Sampling-based Planner Framework, and treating the existing hierarchical architecture as part of the Sampler.
This will enhance the flexibility in developing the Sampler. In the Sampler, various implementations such as machine learning-based planners (e.g., End-to-End AI using World Models and Imitation Learning), optimization-based planners, and rule-based planners can be freely implemented and run simultaneously. Consequently, external planning can operate as part of the Sampler, and the introduction of machine learning-based Samplers becomes easier.
Introducing a machine learning-based Sampler, in particular, is expected to handle edge cases that rule-based approaches cannot address, thereby improving robustness.
Moreover, this approach aligns well with remote assistance, allowing for sampling near the area where remote instructions are given and potentially enhancing the evaluation function of the Selector near the remote instruction area in the future.
https://waymo.com/blog/2024/05/fleet-response/
TIER IV plans to prototype this architecture to verify its effectiveness, and if it works well, we intend to propose it as the new planning architecture.
References
DriveIRL: https://arxiv.org/pdf/2206.03004.pdf
Frenetix Motion Planner: https://arxiv.org/abs/2402.01443
Beta Was this translation helpful? Give feedback.
All reactions