Analysis and Improvement of Deep Reinforcement Learning-Based Mobile Robot Navigation in Dynamic Environments

dc.contributor.authorWilson, Sona Preethy
dc.contributor.facultyfi=Tekniikan ja innovaatiojohtamisen yksikkö|en=School of Technology and Innovations|
dc.contributor.organizationfi=Vaasan yliopisto|en=University of Vaasa|
dc.date.accessioned2026-06-08T13:30:00Z
dc.date.issued2026-05-13
dc.description.abstractAutonomous mobile robot navigation in dynamic environments has been studied widely because of the increasing necessity for safe and efficient movement in complex settings such as warehouses, hospitals, transport terminals, and public spaces. Even though classical navigation approaches have shown stronger performance in controlled static environments, real-world continuous applications require repeated replanning, map updates, and simplified assumptions about obstacle position and dynamics. These requirements often limit the application of classical approaches in complex scenarios. Therefore, deep reinforcement learning is progressively adopted as a promising approach for navigation tasks requiring continuous control and adaptation to changing environments. In this thesis, deep reinforcement learning-based mobile robot navigation in dynamic environments has been analyzed and improved with a focus on safety, stability, task completion, and generalization. The research problem is aligned with the difficulty of learning reliable navigation policies, mainly in dynamic environments, where the performance can be reduced due to sparse rewards, unsafe exploration, and limited transferability to unseen settings. The study has been rooted in reinforcement learning theory, the Markov decision processes, continuous control, and safety-aware reward design function. Notably, DDPG, TD3, and SAC have been used as proposed algorithms for continuous control. Using structured navigation environments, a simulation-based experimental framework has been developed. The baseline DDPG has first been evaluated and conducted both qualitative and quantitative analysis on its failure behavior. After this, an iterative reward refinement was performed to improve collision avoidance, goal reaching, and motion stability. Later, algorithmic enhancement was performed, where TD3 and SAC were compared under controlled static and dynamic settings, and SAC was selected for curriculum-based training because of its generalization behavior. Finally, a curriculum-based training has been implemented with progressive complex environments followed by three generalization assessments, including zero-shot evaluation, warm-up fine-tuning, and OOD stress sweep. The results show that the improved model is more stable and has effective navigation behavior. The curriculum-trained policy achieved consistent task completions and has shown reasonable transferability to unseen environments, although performance degrades in more difficult out-of distribution cases. Overall, the study has demonstrated that a combination of reward engineering, stable learning, and curriculum-based training can improve safe navigation in dynamic environments.
dc.description.notificationfi=Opinnäytetyö kokotekstinä PDF-muodossa.|en=Thesis fulltext in PDF format.|sv=Lärdomsprov tillgängligt som fulltext i PDF-format|
dc.format.contentfi=kokoteksti|en=fulltext|
dc.format.extent79
dc.identifier.urihttps://osuva.uwasa.fi/handle/11111/20739
dc.identifier.urnURN:NBN:fi-fe2026051344747
dc.language.isoeng
dc.rightsCC BY 4.0
dc.subject.degreeprogrammeMaster’s Programme in Smart Energy
dc.subject.disciplineAutomation and Robotics
dc.subject.ysoreinforcement learning
dc.subject.ysomachine learning
dc.subject.ysodeep learning
dc.subject.ysoalgorithms
dc.subject.ysolearning environment
dc.subject.ysoevaluation
dc.subject.ysorobotics
dc.subject.ysonavigation
dc.subject.ysosimulation
dc.subject.ysoefficiency (properties)
dc.titleAnalysis and Improvement of Deep Reinforcement Learning-Based Mobile Robot Navigation in Dynamic Environments
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|sv=Pro gradu -avhandling|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Uwasa_2026_Wilson_Sona-Preethy.pdf
Size:
2.35 MB
Format:
Adobe Portable Document Format