Project Cue: Unified Soft Sensor Interface

Executive Summary

Cue is a unified sensor platform that intelligently routes user requests to the right AI model. An LLM acts as the decision engine — analyzing user intent, discovering relevant HuggingFace models, and orchestrating execution. Launch with vision and audio soft sensors; hard sensors (GPS, accelerometer, etc.) as future phases.

Core Value: Zero friction model selection. Users ask questions naturally; Cue handles the complexity.

Project Context

  • Primary Input: Vision (camera) and Audio (microphone) — leveraging sensors available on most modern devices

  • Model Source: HuggingFace model hub (dynamic discovery & loading)

  • Intelligence Layer: LLM-based router that interprets intent and selects optimal models

  • Deployment: Starting with cloud API/web interface, edge optimization later

Goals

  1. Build a working LLM router that maps user intent → HuggingFace models

  2. Implement vision and audio processing pipelines with dynamic model loading

  3. Create seamless UX where users never think about model selection

  4. Establish foundation for scaling to hard sensors and multi-modal queries

  5. Optimize for latency, cost, and model quality

Phases & Breakdown

Phase 1: Foundation & LLM Router (Weeks 1-3)

Task 1.1: LLM Router Architecture

  • 1.1.1: Design LLM prompt/system that interprets user intent

  • Input: natural language request

  • Output: structured task definition (sensor type, sub-task, parameters)

  • 1.1.2: Set up LLM integration (GPT-4, Claude, open model?)

  • 1.1.3: Build intent classification system (vision vs audio vs multi-modal)

  • 1.1.4: Create task parser that extracts model requirements from LLM output

Task 1.2: HuggingFace Integration

  • 1.2.1: Build HF API client for model discovery

  • Query by task type (object-detection, speech-recognition, etc.)

  • Filter by popularity/downloads/rating

  • Handle model card parsing

  • 1.2.2: Implement model loading & caching logic

  • 1.2.3: Create fallback mechanism (if primary model fails, try alternatives)