Researchers from the University of the West of England, led by Muhayy Ud Din, have developed a groundbreaking framework for autonomous maritime port inspection. The team, including Waseem Akram, Ahsan B. Bakht, and Irfan Hussain, has introduced an innovative approach that leverages the synergy between Large Language Models (LLMs) and Vision Language Models (VLMs) to enhance the capabilities of cooperative aerial and surface robotic platforms.
Maritime port inspection is crucial for maintaining safety, regulatory compliance, and operational efficiency in complex maritime environments. However, traditional inspection methods often rely on manual operations and conventional computer vision techniques, which lack scalability and contextual understanding. The researchers’ novel framework addresses these limitations by integrating LLMs and VLMs to enable autonomous inspection using Unmanned Aerial Vehicles (UAVs) and Unmanned Surface Vehicles (USVs).
The proposed framework replaces traditional state-machine mission planners with LLM-driven symbolic planning. This approach translates natural language mission instructions into executable symbolic plans, which include dependency graphs that encode operational constraints and ensure safe coordination between UAVs and USVs. This symbolic planning allows for more flexible and adaptive mission execution, as the system can dynamically adjust to changing conditions and priorities.
In parallel, the VLM module performs real-time semantic inspection and compliance assessment. By analyzing visual data from the UAVs and USVs, the VLM module can identify and interpret objects and activities within the port environment. This semantic understanding enables the system to generate structured reports with contextual reasoning, providing detailed insights into the port’s operational status and compliance with regulations.
The researchers validated their framework using the extended MBZIRC Maritime Simulator, which features realistic port infrastructure. They further assessed the system through real-world robotic inspection trials, demonstrating its effectiveness in practical scenarios. The lightweight design of the framework ensures its suitability for resource-constrained maritime platforms, making it a viable solution for enhancing the autonomy and intelligence of port inspection systems.
This innovative approach not only advances the development of autonomous inspection systems but also sets a new standard for the integration of advanced AI technologies in maritime operations. By combining the strengths of LLMs and VLMs, the researchers have created a robust framework that can significantly improve the efficiency, accuracy, and adaptability of maritime port inspections. Read the original research paper here.

