MAUDE

Multi-Agent Unified Dispatch Engine

Nemotron, Codestral, LLaVA, Whisper, Tailscale, SQLite, Docker, Python

MAUDE is a local AI assistant running entirely on the NVIDIA DGX Spark. It coordinates multiple specialized models — Nemotron for reasoning, Codestral for code generation, LLaVA for vision, and Mistral for general tasks — through a unified gateway architecture on port 30000.

The system features voice interaction with Whisper transcription, RAG-powered memory backed by a vector database, autonomous sandboxed code execution in Docker containers, and a cross-device client/server architecture over Tailscale VPN. A Mac, PC, or Linux client connects to the Spark for remote access from anywhere.

MAUDE also includes Forge, an autonomous builder that plans, executes, verifies, and fixes code in a sandboxed environment — with automatic model escalation from local Nemotron (free) to Codestral to Mistral when tasks require more capability. All inference stays on-device for full privacy.

Architecture

Multi-Model Gateway

A single Python gateway on port 30000 routes all traffic — multi-model LLM requests (Mistral, Codestral, Nemotron, LLaVA), WebSocket proxying for terminal and voice, file upload/download, and image analysis. Model names are resolved via short aliases at startup.
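As a rough illustration of alias resolution at startup, the sketch below builds a lookup table once and then maps short names to full model identifiers. The identifiers, the `build_resolver` helper, and the case-insensitive matching are assumptions made for the example; the text does not describe the gateway's actual alias table.

```python
from typing import Callable, Dict

# Hypothetical alias table. The short names come from the text; the full
# identifiers on the right are placeholders, not MAUDE's real model IDs.
MODEL_ALIASES: Dict[str, str] = {
    "nemotron": "nemotron-placeholder-id",
    "codestral": "codestral-placeholder-id",
    "llava": "llava-placeholder-id",
    "mistral": "mistral-placeholder-id",
}


def build_resolver(aliases: Dict[str, str]) -> Callable[[str], str]:
    """Build the lookup once (at startup) and return a resolve function."""
    table = {alias.lower(): full for alias, full in aliases.items()}
    known_full_names = set(table.values())

    def resolve(name: str) -> str:
        key = name.strip()
        # A full identifier passes through unchanged.
        if key in known_full_names:
            return key
        # Otherwise treat the input as a short alias (case-insensitive here,
        # which is an assumption of this sketch).
        try:
            return table[key.lower()]
        except KeyError:
            raise ValueError(f"unknown model: {name!r}") from None

    return resolve


resolve_model = build_resolver(MODEL_ALIASES)
```

Building the table once keeps per-request routing to a dictionary lookup, and an unknown name fails loudly instead of being forwarded to a backend.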

Forge — Autonomous Builder

Plan → Execute → Verify → Fix loop with mandatory verification. Builds software autonomously in a Docker sandbox (Ubuntu 24.04, Python 3.12, Node 22, Go 1.22). Budget-capped token usage with automatic model escalation. Local model tokens are free.
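The loop can be pictured roughly as follows. This is a minimal sketch, not Forge's implementation: the `plan`, `execute`, and `verify` callables, the `tries_per_tier` parameter, and the relative token costs are all hypothetical. Only the escalation order and the fact that local tokens are free come from the description.

```python
from typing import Any, Callable, List, Optional, Tuple

# Escalation order from the description. The per-token costs are placeholder
# values; the only stated fact is that local model tokens are free.
TIERS: List[Tuple[str, float]] = [
    ("nemotron", 0.0),
    ("codestral", 1.0),
    ("mistral", 2.0),
]


class BudgetExceeded(RuntimeError):
    """Raised when accumulated token cost passes the cap."""


def run_forge(
    task: str,
    plan: Callable[[str, str], Any],
    execute: Callable[[str, Any, Optional[str]], Tuple[Any, int]],
    verify: Callable[[Any], Tuple[bool, Optional[str]]],
    budget: float,
    tries_per_tier: int = 2,
) -> Tuple[Any, str, float]:
    """Return (artifact, model_used, cost_spent) for the first verified build."""
    spent = 0.0
    feedback: Optional[str] = None  # verifier output fed into the next attempt
    for model, cost_per_token in TIERS:
        steps = plan(model, task)  # Plan (planning cost is ignored in this sketch)
        for _ in range(tries_per_tier):
            # Execute on the first pass, Fix on later passes (feedback is set).
            artifact, tokens = execute(model, steps, feedback)
            spent += tokens * cost_per_token
            if spent > budget:
                raise BudgetExceeded(f"spent {spent} of {budget}")
            ok, feedback = verify(artifact)  # Verify runs on every attempt
            if ok:
                return artifact, model, spent
        # Tier exhausted without a verified result: escalate to the next model.
    raise RuntimeError("no tier produced a verified result")
```

Because verification sits inside the inner loop, no artifact is returned without passing it, and the free local tier is always tried before any tier that consumes budget.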

Cross-Machine Task Dispatch

MAUDE can dispatch tasks to remote clients over Tailscale. Target resolution supports exact client ID, hostname, or platform name. Clients poll every 10 seconds for new work and report results back.
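One way to read the target resolution rule is as a fallback chain: try the client ID first, then the hostname, then the platform. The sketch below assumes that precedence, a hypothetical `Client` record, and case-insensitive hostname and platform matching; none of these details are specified in the description.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Client:
    """Hypothetical record for a registered remote client."""
    client_id: str
    hostname: str
    platform: str


def resolve_target(target: str, clients: List[Client]) -> List[Client]:
    """Return the clients a dispatch target refers to (empty list if none)."""
    # 1. Exact client ID (case-sensitive, since the text says "exact").
    by_id = [c for c in clients if c.client_id == target]
    if by_id:
        return by_id
    # 2. Hostname, compared case-insensitively (an assumption of this sketch).
    wanted = target.lower()
    by_host = [c for c in clients if c.hostname.lower() == wanted]
    if by_host:
        return by_host
    # 3. Platform name, which may match several clients at once.
    return [c for c in clients if c.platform.lower() == wanted]
```

Returning a list keeps the platform case explicit: a target such as a platform name can legitimately match more than one machine, and the caller decides whether to fan out or pick one.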

Voice Mode (PersonaPlex)

Full-duplex voice conversation via Whisper transcription and TTS. Opus-encoded audio streams over WebSocket with scheduled-playback buffering. Supports camera capture during calls — LLaVA analyzes the photo and injects image context into the conversation.
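Scheduled-playback buffering is commonly done by assigning each incoming audio frame a start time on the playback clock, so frames play back to back even when they arrive in bursts. The sketch below shows that general technique only; the `PlaybackScheduler` class, the 100 ms lead, and the millisecond clock are assumptions, not details of MAUDE's voice pipeline.

```python
class PlaybackScheduler:
    """Assign gap-free start times to decoded audio frames (times in ms)."""

    def __init__(self, lead_ms: int = 100) -> None:
        # Safety margin added whenever the buffer has run dry, so the next
        # frame is not scheduled in the past.
        self.lead_ms = lead_ms
        self.next_start_ms = 0

    def schedule(self, now_ms: int, duration_ms: int) -> int:
        """Return the playback start time for a frame that arrived at now_ms."""
        # Queue directly after the previous frame when audio is still
        # buffered; otherwise restart from "now" plus the lead.
        start = max(self.next_start_ms, now_ms + self.lead_ms)
        self.next_start_ms = start + duration_ms
        return start
```

For example, with 20 ms frames arriving at 0 ms and 10 ms, the frames are scheduled at 100 ms and 120 ms, so network jitter smaller than the lead does not produce audible gaps.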

RAG Memory

Retrieval-augmented generation backed by a vector database. MAUDE stores and retrieves conversation context, project knowledge, and user preferences to maintain long-term memory across sessions.
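The retrieval step behind this kind of memory can be sketched with a tiny in-memory store that ranks entries by cosine similarity. Everything here is illustrative: the `MemoryStore` class is hypothetical, the vectors are hand-written stand-ins for real embeddings, and the description does not say which vector database or embedding model MAUDE uses.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class MemoryStore:
    """Toy stand-in for a vector database: linear scan over stored vectors."""

    def __init__(self) -> None:
        self.items: List[Tuple[str, List[float]]] = []

    def add(self, text: str, vector: Sequence[float]) -> None:
        self.items.append((text, list(vector)))

    def search(self, query_vector: Sequence[float], k: int = 3) -> List[str]:
        """Return the k stored texts most similar to the query vector."""
        ranked = sorted(
            self.items,
            key=lambda item: cosine(query_vector, item[1]),
            reverse=True,
        )
        return [text for text, _ in ranked[:k]]
```

In a full pipeline the retrieved texts would be prepended to the model prompt; a real vector database replaces the linear scan with an index but keeps the same add and search shape.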

Telegram Integration

Full Telegram bot for remote access. Send messages, receive responses, share files, and trigger tools from any device with Telegram installed.

Gallery

MAUDE Server TUI

MAUDE Client on Mac