ROS 2 Navigation Interface

This repository provides an interface for ROS 2 navigation utilizing voice commands. The system processes natural language input via Google's Gemini 2.5 Flash model and visualizes the robot's state on a React-based dashboard.

System Architecture

The interaction pipeline consists of four primary components:

Frontend: React application built with Vite, utilizing MediaRecorder for audio capture.
Backend: FastAPI server that coordinates API requests.
Speech processing: OpenAI Speech-to-Text transcribes the audio input.
Command synthesis: Gemini 2.5 Flash interpolates the transcript and outputs structured ROS 2 Twist messages.
Execution: The rosbridge_server transmits the parsed commands to the ROS 2 Humble environment (Nav2, Cartographer, AMCL).

The system includes a regex-based fallback parser to ensure operational continuity during API service interruptions.

graph TD
    User((User)) -->|Voice| Frontend[React Dashboard]
    Frontend -->|Audio Blob| Backend[FastAPI Server]
    Backend -->|STT| OpenAI[OpenAI Speech-to-Text]
    OpenAI -->|Transcript| Backend
    Backend -->|Gemini 2.5 Flash| AI[Google AI Studio]
    AI -->|JSON Action| Backend
    Backend -->|Response| Frontend
    Frontend -->|WebSocket| Bridge[rosbridge_server]
    Bridge -->|/cmd_vel| Robot[ROS 2 Robot/Sim]

Installation and Configuration

Prerequisites

Ubuntu 22.04 LTS (or WSL2)
ROS 2 Humble Hawksbill (Desktop Install)
Node.js (or Bun)
Python 3.10+

Setup Instructions

Repository mapping

git clone https://github.com/howdoiusekeyboard/ros2_navigation_project.git
cd ros2_navigation_project

Environment configuration Populate the backend environment file with required API keys.

cp backend/.env.example backend/.env
# Add OPENAI_API_KEY and GEMINI_API_KEY to the .env file

System initialization Execute the provided bash script to instantiate the simulation, backend server, and frontend dashboard concurrently.
```
./start_robot_dashboard.sh
```
Interface access The dashboard hosts on http://localhost:5173. A Chromium-based browser is required for full MediaRecorder compatibility.

Operation Guidelines

Verify connection state via the dashboard ("Connected to ROS 2").
Initiate voice capture utilizing the interface microphone control.
Issue spatial or directional commands (e.g., "rotate left 90 degrees", "proceed forward 2 meters", "halt").
Alternatively, use the text input field for manual command insertion.

Project Structure

src/: ROS 2 packages integrating Cartographer and Nav2 configurations.
backend/: Python backend utilizing FastAPI.
project/: React-based interactive dashboard.
scripts/: Operational scripts for initialization and debugging.

Documentation References

SETUP.md: Comprehensive environment preparation guide.
RECOVERY.md: Guidelines for restoring the system from failure states.

License

This project operates under the MIT License. Reference the LICENSE file for exact parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.claude		.claude
backend		backend
config		config
docs		docs
maps		maps
new_sem		new_sem
presentation_diagrams		presentation_diagrams
project		project
scripts		scripts
src		src
worlds		worlds
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
BACKUP_GUIDE.md		BACKUP_GUIDE.md
CLAUDE.md		CLAUDE.md
DEPENDENCIES.md		DEPENDENCIES.md
GEMINI.md		GEMINI.md
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
QUICK_START.md		QUICK_START.md
QUICK_START_REAL_ROBOT.md		QUICK_START_REAL_ROBOT.md
README.md		README.md
RECOVERY.md		RECOVERY.md
SETUP.md		SETUP.md
SYSTEM_STATUS.md		SYSTEM_STATUS.md
cleanup_demo.sh		cleanup_demo.sh
demo_real_nav.sh		demo_real_nav.sh
demo_real_robot.sh		demo_real_robot.sh
demo_week4.sh		demo_week4.sh
demo_weighted_xai.sh		demo_weighted_xai.sh
dev_helpers.sh		dev_helpers.sh
fix_sync.sh		fix_sync.sh
kill_all_services.sh		kill_all_services.sh
launch_complete_fixed.sh		launch_complete_fixed.sh
launch_final_working.sh		launch_final_working.sh
launch_weighted_demo.sh		launch_weighted_demo.sh
launch_xai_system.sh		launch_xai_system.sh
start_robot_dashboard.sh		start_robot_dashboard.sh
test_fusion_quick.sh		test_fusion_quick.sh
test_obstacle_avoidance.sh		test_obstacle_avoidance.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ROS 2 Navigation Interface

System Architecture

Installation and Configuration

Prerequisites

Setup Instructions

Operation Guidelines

Project Structure

Documentation References

License

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ROS 2 Navigation Interface

System Architecture

Installation and Configuration

Prerequisites

Setup Instructions

Operation Guidelines

Project Structure

Documentation References

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages