LLMs for Path Planning for Mobile Ground Robots

This project introduces a Python-based approach to robotic path planning that leverages the capabilities of Large Language Models (LLMs), particularly OpenAI’s GPT-3.5. The core innovation lies in enabling these advanced language models to interpret high-level, human-like instructions and translate them into precise, machine-readable commands for robotic navigation. The primary objective is to bridge the gap between natural language communication and complex robotic control, facilitating dynamic and adaptive path planning in a 2D simulation environment. The implementation integrates pygame for real-time visualization, matplotlib for plotting, and artificial potential fields for effective navigation strategies.

Within this system, real-time simulation occurs in an 800x800 space, constrained by boundaries using pygame. The navigation strategy employs artificial potential fields, balancing attractive potentials (goals) and repulsive potentials (obstacles) to guide the robot. A significant challenge in path planning, the local minima problem, is addressed by implementing solutions that enhance the robot’s navigational efficiency. Specifically, when a local minimum is detected, the system temporarily reassigns a corner of the space as a new goal to maneuver the robot out of its stuck position.

The integration of Large Language Models (LLMs) in robotics plays a pivotal role in this project. It enables robots to understand and act upon complex, human-like instructions, enhancing their ability to make real-time decisions based on their current state and environment. This integration significantly improves human-robot interaction, making interactions with robots more intuitive and efficient, thereby enhancing the overall user experience.

GPT-3.5 specifically plays a critical role in interpreting user input, processing high-level natural language instructions, and translating them into actionable commands for the robot’s navigation system. Its adaptability to varied instructions makes the robot’s path planning flexible and responsive to diverse scenarios. Customization options within the code allow for modifications to initial setups and scenarios, thereby accommodating different path planning requirements.

Moving forward, the project aims to expand its capabilities to handle multiple coordinating robots, implement advanced algorithms for various navigation scenarios, and develop a robustness framework for defining and measuring adaptability. Additionally, improvements in natural language understanding through enhanced LLM integration and a focus on user interface development are key areas for future enhancements.

Ayaz Ahmed
Ayaz Ahmed
Graduate Student

My research interests include control of autonomous systems, robotics, Mechatronics, Numerical Modeling and Simulation.