【Unboxing and Reviewing】---ROSMASTER M3 Pro ROS2 Robot with Multimodal AI Large Model

【Unboxing and Reviewing】---ROSMASTER M3 Pro ROS2 Robot with Multimodal AI Large Model

0 comentarios

Introduction

ROSMASTER M3 Pro is a highly integrated embodied intelligent robot platform developed by Yahboom specifically for ROS education, scientific research experiments, and AI application teaching. It utilizes Mecanum wheels and pendulum suspension chassis for omnidirectional movement. Developed based on the ROS2 Humble system, equipped with a 6DOF robotic arm and a binocular structured light depth camera to perform tasks such as visual recognition, 3D grasping, and precise handling. With dual TOF LiDAR, it enables stable and reliable SLAM mapping and autonomous navigation, as well as LiDAR obstacle avoidance and path planning. Unlike traditional ROS robots, the ROSMASTER M3 Pro deeply integrates cutting-edge AI large-scale model technology. Built-in speech recognition and natural language understanding modules, can realize voice command control, multimodal interaction with text/image/voice, task planning and execution, and dynamic environment perception. Whether used in AI courses, robotics algorithm teaching, or university research projects, the ROSMASTER M3 Pro provides a stable, powerful, and easily scalable experimental platform, making it an ideal choice for AI and robotics education.

Features

【Top-level Hardware Configuration】

  1. Raspberry Pi 5, Jetson NANO B01, Jetson ORIN NANO SUPER, Jetson ORIN NX SUPER development boards, four main control boards for choice.
  2.  An aluminum alloy chassis, 80mm Mecanum wheels, and a rear-wheel pendulum suspension structure allow for easy navigation across diverse terrains.
  3. Combining a depth camera and a 6DOF robotic arm, it enables 3D grasping, precise handling, and MoveIt simulation. 4. Two TOF ranging LiDAR on the front left and rear right provide 360° scanning, enhancing mapping and navigation accuracy.

【Fully Integrates Intelligent Perception and Semantic Understanding】

  1. Built-in speech recognition and natural language processing, combined with speakers, enable easy voice commands and question-and-answer interactions.
  2. Integrating multimodal interaction capabilities, including text, image, and voice, it can adjust actions in real time based on environmental changes, supporting free conversation interruptions and dynamic feedback reasoning.
  3. Integrating a large model and an extensible RAG knowledge system enhances task awareness and complex problem-solving capabilities.

【A full-stack robotics platform for teaching and research】

  1. Based on the ROS2 Humble system, compatible with a variety of robotics algorithms and AI course content.
  2. Integrated hardware and software, along with comprehensive documentation and teaching resources, facilitate efficient progress from introductory learning to scientific research experiments.
  3. From AI and SLAM to visual recognition and robot control, it's widely applicable to university teaching, scientific research experiments, and robotics competition platform development.

Unboxing & Shipping List

As shown below. These are all the parts for the ROSMASTER M3 PRO.

There are two option available: without display and with display. With display, debugging is more convenient.

Five main control board for choice: Raspberry Pi 5, JETSON NANO B01, NVIDIA JETSON ORIN NANO 8GB SUPER, JETSON ORIN NX 8GB/16GB Developer Kit

The course materials, product features, and control software for each main control board are essentially the same. Only affects the performance of the M3 PRO

Basic car body *1 

6DOF robotic arm *1

T-Mini Plus LiDAR *2

LiDAR bracket*4

DABAI DCW2 depth camera *1

Camera bracket *1

AI large model voice module *1

USB 3.0 HUB expansion board *1

Speaker *1

Wireless handle + AAA battery *1

Acrylic plate *1

9600mAh battery pack *1

12.6V charger (2A, DC4017) *1

Copper column screw package *1

Screwdriver *1

40*40*40mm block *1

30*30*30mm block (red/green/blue/yellow)

30*30*60mm block(red/green/blue/yellow)

30*30mm block(red/green/blue/yellow)

T-Mini Plus LiDAR line(30cm)  *1

T-Mini Plus LiDAR line(15cm) *1

Side elbow Type-C data cable(25cm)  *2

Upper elbow USB to USB cable(30cm)  *1 

Upper elbow Type-C data cable (1m) *1 

XH2.54 flat cable (10cm) *1

Note: If you choose with display, you will got following content.

Optional 7-inch touch screen [optional]  *1

7-inch touch screen  *1

Display touch screen bracket *1

Copper column screw package  *1

Product Parameter Details

About Structure Design and Hardware

1. Adopt Dual TOF laser LiDAR T-miniPlus

T-mini Plus laser LiDAR adopts TOF ranging principle, with a ranging range of 0.05m to 12m and a sampling frequency of up to 4000 times/s. With dual LiDAR data fusion filtering and diagonal staggered layout, it effectively improves the robot mapping and navigation accuracy and operating efficiency in complex environments.

2. Large-size Aluminum Alloy Chassis & Mecanum Wheel Pendulum Suspension 

ROSMASTER M3 Pro uses an aluminum alloy chassis, which is made of oxidized sandblasting technology, with 4PCS Mecanum wheels and a rear wheel pendulum suspension chassis design, which allows the robot to adapt to uneven ground. At the same time, it ensures that when the four wheels touch the ground, the wheels do not slip and affect the motor encoder recognition, which is convenient for users to carry out motion algorithm research and ROS function development.

3. Equipped with 6DOF robotic arm 

The robotic arm consists of 6 serial bus servos, with an overall repeatability accuracy of ±0.5mm. It can grab objects within a circle with a radius of 30cm around the center axis of the robotic arm, and supports the handling of objects weighing no more than 410g. It provides MoveIt2 simulation courses, and can complete voice-controlled handling, garbage sorting and other functions in combination with voice interaction.

4. Equipped with Binocular structured light depth camera 

Equipped with DABAIDCW2 binocular structured light depth camera, the depth measurement can reach 5M, with higher measurement accuracy, and can accurately calculate the distance, shape, height, volume and other information of objects, so as to realize high-level AI projects such as grasping, sorting and handling in 3D space.

5. AI large model voice module

AI voice large model module is the core hub connecting user voice input and intelligent model decision-making. The module is equipped with a high-sensitivity MEMS microphone and a cavity speaker, which can clearly pick up voice and has functions such as far-field pickup, echo cancellation, voice broadcast, and environmental noise reduction.

Important Functions

Creative Application of Multimodal Visual Model

ROSMASTER M3 Pro can accurately perceive the surrounding environment with its high-performance hardware configuration. By deeply integrating the multimodal large model and the dual-model reasoning architecture, it can understand the environment, plan actions and flexibly perform tasks, realize high-level human-machine collaboration and intelligent task processing, and bring a more natural and efficient human-machine interaction experience.

AI large model + SLAM map navigation and transportation

ROSMASTER M3 Pro integrates a multimodal AI large model, which allows it to understand user voice commands through the AI large model for multi-point navigation. After reaching the designated location, it deeply understands the semantic information of the surrounding environment, objects and events through the visual large model, and grasps the transportation target through the 6DOF robotic arm, which greatly improves the robot's intelligence, flexibility and user experience, and is closer to real life needs.

Large model intention understanding planning | Context-aware response 

By expanding the RAG knowledge base to realize user intention recognition and environmental context analysis, the robot can understand the user's potential needs, independently plan tasks and respond dynamically without issuing detailed instructions.

Environmental Perception

Through visual large model analysis, ROSMASTER M3 Pro can deeply understand the objects and spatial layout in different areas of the map.

Intelligent navigation integrates multi-point navigation 

ROSMASTER M3 Pro can transmit environmental data to the visual large model in real time for in-depth analysis, and plan dynamic paths based on different user voice commands, autonomously navigate to single or multiple designated areas, thereby achieving intelligent navigation.

Summary

The ROSMASTER M3 PRO represents a multi-dimensional, systematic, and groundbreaking upgrade to traditional ROS robots. It not only inherits the ROS robot's strengths in AI visual interaction, SLAM mapping and navigation and 3D spatial perception. Significantly expands its functionality and improves its performance, with 3D vision-based robotic arm, which significantly enhances its 3D grasping and handling capabilities.

Dual lidars arranged in a diagonally staggered configuration. Offer greater accuracy and speed compared to single-lidar mapping navigation. Thanks to its four innovative features M3 PRO achieves a level of intelligence far exceeding similar products and a more natural and fluid interactive experience.

It is ideal for robotics developers or geeks seeking cutting-edge technology.

 


ROSMASTER M3 Pro: The Future of Educational Robotics with AI Integration

Deja un comentario