docs: add comprehensive README and project scaffolding

- README completo con istruzioni di installazione, configurazione e utilizzo - API Swagger/OpenAPI documentata - File env.example con variabili di configurazione - Dockerfile multi-stage ottimizzato - Docker Compose con Ollama e LLM Monitor - Struttura completa dell'app FastAPI (main.py, config, api routes) - Servizio client Ollama reusabile - Dashboard web HTML con TailwindCSS - Test suite con pytest - Makefile per comandi comuni - CONTRIBUTING.md per i contributori - LICENSE MIT - .editorconfig e .dockerignore - requirements.txt e requirements-dev.txt
2026-04-24 19:11:58 +02:00
commit 4b782ffdc8
28 changed files with 2087 additions and 0 deletions
@@ -0,0 +1,44 @@
 # Python
 __pycache__/
 *.pyc
 *.pyo
 *.egg-info/
 dist/
 build/
 *.egg
 .pytest_cache/
 .mypy_cache/
 .coverage
 .venv/
 venv/
 # IDE
 .vscode/
 .idea/
 *.swp
 *.swo
 # OS
 .DS_Store
 .gitignore
 Thumbs.db
 # Git
 .git/
 .gitignore
 # Documentation
 docs/
 *.md
 LICENSE
 CONTRIBUTING.md
 # Development
 node_modules/
 package-lock.json
 Makefile
 .env*
 # Test
 tests/
 pytest.ini
@@ -0,0 +1,35 @@
 # EditorConfig is awesome: https://EditorConfig.org
 # top-most EditorConfig file
 root = true
 # Unix-style newlines with a newline ending every file
 [*]
 end_of_line = lf
 insert_final_newline = true
 charset = utf-8
 # Python files
 [*.py]
 indent_style = space
 indent_size = 4
 max_line_length = 100
 # JSON files
 [*.json]
 indent_style = space
 indent_size = 2
 # YAML files
 [*.{yml,yaml}]
 indent_style = space
 indent_size = 2
 # Markdown files
 [*.md]
 trim_trailing_whitespace = false
 # Dockerfile
 [Dockerfile*]
 indent_style = space
 indent_size = 4
@@ -0,0 +1,15 @@
 # LLM Monitor - Local Development Environment
 # Copia questo file da env.example e personalizza per il tuo ambiente
 OLLAMA_HOST=http://localhost:11434
 OLLAMA_TIMEOUT=30
 API_HOST=0.0.0.0
 API_PORT=8000
 API_WORKERS=1
 CORS_ORIGINS=http://localhost:3000,http://localhost:5173,http://localhost:8000
 LOG_LEVEL=DEBUG
 ENVIRONMENT=development
@@ -0,0 +1,145 @@
 # Byte-compiled / optimized / DLL files
 __pycache__/
 *.py[cod]
 *$py.class
 # C extensions
 *.so
 # Distribution / packaging
 .Python
 build/
 develop-eggs/
 dist/
 downloads/
 eggs/
 .eggs/
 lib/
 lib64/
 parts/
 sdist/
 var/
 wheels/
 pip-wheel-metadata/
 share/python-wheels/
 *.egg-info/
 .installed.cfg
 *.egg
 MANIFEST
 # PyInstaller
 *.manifest
 *.spec
 # Unit test / coverage reports
 htmlcov/
 .tox/
 .nox/
 .coverage
 .coverage.*
 .cache
 nosetests.xml
 coverage.xml
 *.cover
 *.py,cover
 .hypothesis/
 .pytest_cache/
 # Translations
 *.mo
 *.pot
 # Django stuff:
 *.log
 local_settings.py
 db.sqlite3
 db.sqlite3-journal
 # Flask stuff:
 instance/
 .webassets-cache
 # Scrapy stuff:
 .scrapy
 # Sphinx documentation
 docs/_build/
 # PyBuilder
 target/
 # Jupyter Notebook
 .ipynb_checkpoints
 # IPython
 profile_default/
 ipython_config.py
 # pyenv
 .python-version
 # pipenv
 Pipfile.lock
 # PEP 582
 __pypackages__/
 # Celery stuff
 celerybeat-schedule
 celerybeat.pid
 # SageMath parsed files
 *.sage.py
 # Environments
 .env
 .venv
 env/
 venv/
 ENV/
 env.bak/
 venv.bak/
 # Spyder project settings
 .spyderproject
 .spyproject
 # Rope project settings
 .ropeproject
 # mkdocs documentation
 /site
 # mypy
 .mypy_cache/
 .dmypy.json
 dmypy.json
 # Pyre type checker
 .pyre/
 # IDE
 .vscode/
 .idea/
 *.swp
 *.swo
 *~
 # OS
 .DS_Store
 Thumbs.db
 # Node modules (per TailwindCSS)
 node_modules/
 package-lock.json
 # Build outputs
 app/web/static/css/output.css
 # Database
 *.db
 *.sqlite
 *.sqlite3
 # Uploads
 uploads/
@@ -0,0 +1,123 @@
 # Contribuire a LLM Monitor
 Grazie per l'interesse nel contribuire a LLM Monitor! Questo documento fornisce linee guida per contribuire al progetto.
 ## Codice di Condotta
 Questo progetto aderisce a un Codice di Condotta per garantire un ambiente inclusivo e rispettoso.
 ## Come Contribuire
 ### Segnalare Bug
 - **Verificare prima** se il bug non è già stato segnalato
 - **Includere dettagli**: sistema operativo, versione Python, stack trace
 - **Fornire un esempio ripetibile** se possibile
 ### Suggerire Miglioramenti
 - **Verificare prima** se il suggerimento non è già stato fatto
 - **Spiegare chiaramente** il caso d'uso e i benefici
 - **Fornire esempi** di come dovrebbe funzionare
 ### Pull Requests
 1. **Fork il repository**
 2. **Crea un branch**: `git checkout -b feature/my-feature`
 3. **Installa le dipendenze di sviluppo**:
   ```bash
   pip install -r requirements-dev.txt
   ```
 4. **Effettua i tuoi cambiamenti** seguendo lo [Style Guide](#style-guide)
 5. **Scrivi i test**: I test sono obbligatori per nuove funzionalità
 6. **Esegui i test**: `make test`
 7. **Formatta il codice**: `make format`
 8. **Esegui il linting**: `make lint`
 9. **Fai il commit**: `git commit -m "feat: descrizione della feature"`
 10. **Push**: `git push origin feature/my-feature`
 11. **Apri una PR** descrivendo i cambiamenti
 ## Style Guide
 ### Python
 - Usa **Black** per la formattazione: `make format`
 - Usa **isort** per l'organizzazione degli import
 - Segui **PEP 8**
 - Usa type hints per le funzioni nuove
 - Documenta con docstring (formato Google):
 ```python
 def my_function(param1: str, param2: int) -> bool:
    """
    Descrizione breve della funzione.
    Args:
        param1: Descrizione del primo parametro
        param2: Descrizione del secondo parametro
    Returns:
        Descrizione del valore ritornato
    Raises:
        ValueError: Quando succede
    """
    pass
 ```
 ### Commit Messages
 Usa il formato Conventional Commits:
 - `feat: aggiungi nuova feature`
 - `fix: correggi un bug`
 - `docs: aggiorna documentazione`
 - `style: formattazione, without semantic change`
 - `refactor: ristruttura codice`
 - `perf: migliora le performance`
 - `test: aggiungi o modifica test`
 - `chore: aggiorna dipendenze, etc`
 Esempio:
 ```
 feat: aggiungi endpoint per ottenere statistiche modelli
 - Nuovo endpoint GET /api/v1/models/stats
 - Ritorna conteggio, spazio totale e ultimi aggiornamenti
 - Include test di integrazione
 ```
 ### Codice
 - Mantieni le funzioni piccole e ben definite
 - Usa nomi descrittivi
 - Aggiungi commenti per la logica complessa
 - Evita magic numbers, usa costanti
 ## Testing
 - Tutti i PR devono includere test per nuove funzionalità
 - La copertura del codice deve rimanere ≥ 80%
 - Esegui i test prima di submitare:
  ```bash
  make test
  ```
 ## Documentazione
 - Aggiorna il README se cambi il comportamento
 - Aggiungi docstring a nuove funzioni
 - Aggiorna il CHANGELOG.md
 ## Processo di Review
 - I PR saranno reviewati il prima possibile
 - I feedback saranno forniti in buona fede
 - Le discussioni devono essere costruttive
 ## Licenza
 Contribuendo, accetti che i tuoi contributi siano licensiati sotto la MIT License.
 ---
 Domande? Apri un issue o contatta il maintainer!
@@ -0,0 +1,52 @@
 # Multi-stage build per LLM Monitor
 # Stage 1: Builder
 FROM python:3.11-slim as builder
 WORKDIR /app
 # Installare dipendenze di build
 RUN apt-get update && apt-get install -y --no-install-recommends \
    gcc \
    && rm -rf /var/lib/apt/lists/*
 # Copiare requirements
 COPY requirements.txt .
 # Installare Python packages in un virtualenv
 RUN python -m venv /opt/venv
 ENV PATH="/opt/venv/bin:$PATH"
 RUN pip install --no-cache-dir --upgrade pip setuptools wheel && \
    pip install --no-cache-dir -r requirements.txt
 # Stage 2: Runtime
 FROM python:3.11-slim
 WORKDIR /app
 # Installare dipendenze di runtime
 RUN apt-get update && apt-get install -y --no-install-recommends \
    curl \
    && rm -rf /var/lib/apt/lists/*
 # Copiare il virtualenv dal builder
 COPY --from=builder /opt/venv /opt/venv
 # Copiare codice dell'app
 COPY app/ /app/app/
 COPY main.py /app/
 COPY .env* /app/
 # Impostare PATH
 ENV PATH="/opt/venv/bin:$PATH"
 ENV PYTHONUNBUFFERED=1
 # Esporre la porta
 EXPOSE 8000
 # Health check
 HEALTHCHECK --interval=30s --timeout=10s --start-period=5s --retries=3 \
    CMD curl -f http://localhost:8000/api/v1/health || exit 1
 # Comando di avvio
 CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8000"]
@@ -0,0 +1,21 @@
 MIT License
 Copyright (c) 2024-2026 Luca Sacchi
 Permission is hereby granted, free of charge, to any person obtaining a copy
 of this software and associated documentation files (the "Software"), to deal
 in the Software without restriction, including without limitation the rights
 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 copies of the Software, and to permit persons to whom the Software is
 furnished to do so, subject to the following conditions:
 The above copyright notice and this permission notice shall be included in all
 copies or substantial portions of the Software.
 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
 SOFTWARE.
@@ -0,0 +1,54 @@
 .PHONY: help install dev prod test lint format clean docker-build docker-up docker-down
 help:
 	@echo "LLM Monitor - Makefile Commands"
 	@echo "================================"
 	@echo "make install       - Installa le dipendenze"
 	@echo "make dev          - Avvia in modalità sviluppo"
 	@echo "make prod         - Avvia in modalità produzione"
 	@echo "make test         - Esegui i test"
 	@echo "make lint         - Linting e type checking"
 	@echo "make format       - Formatta il codice"
 	@echo "make clean        - Pulisce cache e file temporanei"
 	@echo "make docker-build - Build dell'immagine Docker"
 	@echo "make docker-up    - Avvia i container con Docker Compose"
 	@echo "make docker-down  - Ferma i container con Docker Compose"
 install:
 	python3 -m venv venv
 	. venv/bin/activate && pip install -r requirements.txt -r requirements-dev.txt
 dev:
 	. venv/bin/activate && uvicorn main:app --reload --host 0.0.0.0 --port 8000
 prod:
 	. venv/bin/activate && uvicorn main:app --host 0.0.0.0 --port 8000 --workers 4
 test:
 	. venv/bin/activate && pytest tests/ -v --cov=app
 lint:
 	. venv/bin/activate && flake8 app/ tests/ main.py && mypy app/
 format:
 	. venv/bin/activate && black app/ tests/ main.py && isort app/ tests/ main.py
 clean:
 	find . -type d -name __pycache__ -exec rm -rf {} + 2>/dev/null || true
 	find . -type f -name "*.pyc" -delete
 	rm -rf .pytest_cache .mypy_cache .coverage htmlcov dist build *.egg-info
 docker-build:
 	docker build -t llm-monitor:latest .
 docker-up:
 	docker compose up -d
 docker-down:
 	docker compose down
 docker-logs:
 	docker compose logs -f llm-monitor
 docker-shell:
 	docker compose exec llm-monitor /bin/bash
@@ -0,0 +1,381 @@
 # LLM Monitor - Dashboard Ollama
 Una dashboard web moderna e intuitiva per monitorare e gestire i modelli LLM caricati in **Ollama**. Visualizza i modelli disponibili, i dettagli dei caricamenti e accedi ai dati via API Ollama direttamente da una web app elegante.
 ## 🎯 Caratteristiche
 - ✨ **Dashboard intuitiva** - Visualizza in tempo reale i modelli caricati in Ollama
 - 📊 **Monitoraggio modelli** - Dettagli completi di ogni modello (nome, dimensione, memoria, stato)
 - 🔌 **API REST documentata** - Documentazione interattiva con Swagger/OpenAPI
 - 🎨 **UI moderna** - Interfaccia elegante realizzata con TailwindCSS
 - 🐳 **Docker ready** - Container sempre acceso (until stopped)
 - ⚡ **Performance** - Costruito su FastAPI e uVicorn
 - 🔐 **Configurazione flessibile** - File `.env` per personalizzazione
 ## 📋 Requisiti
 - **Python** 3.10+
 - **Ollama** installato e in esecuzione
 - **Docker** (opzionale, per containerizzazione)
 - **Docker Compose** (opzionale)
 ## 🚀 Installazione Rapida
 ### 1. Clonare il repository
 ```bash
 git clone https://github.com/LucaSacchiNet/llm-monitor.git
 cd llm-monitor
 ```
 ### 2. Configurare l'ambiente
 Copia il file di esempio:
 ```bash
 cp env.example .env
 ```
 Modifica `.env` con i tuoi parametri (vedi [Configurazione](#configurazione)):
 ```bash
 nano .env
 ```
 ### 3. Installare le dipendenze
 #### Opzione A: Ambiente virtuale (Sviluppo)
 ```bash
 python3 -m venv venv
 source venv/bin/activate  # Su Windows: venv\Scripts\activate
 pip install -r requirements.txt
 ```
 #### Opzione B: Docker (Produzione)
 ```bash
 docker compose up -d
 ```
 ### 4. Avviare l'applicazione
 #### Modalità sviluppo
 ```bash
 python3 -m uvicorn main:app --reload --host 0.0.0.0 --port 8000
 ```
 #### Modalità produzione
 ```bash
 uvicorn main:app --host 0.0.0.0 --port 8000 --workers 4
 ```
 ## ⚙️ Configurazione
 Crea un file `.env` nella root del progetto (copia da `env.example`):
 ```env
 # Ollama Configuration
 OLLAMA_HOST=http://localhost:11434
 OLLAMA_TIMEOUT=30
 # API Configuration
 API_HOST=0.0.0.0
 API_PORT=8000
 API_WORKERS=4
 # CORS Configuration
 CORS_ORIGINS=http://localhost:3000,http://localhost:5173
 # Logging
 LOG_LEVEL=INFO
 # Environment
 ENVIRONMENT=development
 ```
 ### Variabili disponibili
 | Variabile | Default | Descrizione |
 |-----------|---------|-------------|
 | `OLLAMA_HOST` | `http://localhost:11434` | URL della API Ollama |
 | `OLLAMA_TIMEOUT` | `30` | Timeout (secondi) per le richieste |
 | `API_HOST` | `0.0.0.0` | Host su cui esporre l'API |
 | `API_PORT` | `8000` | Porta dell'API |
 | `API_WORKERS` | `4` | Worker processes |
 | `CORS_ORIGINS` | `http://localhost:3000` | Origini CORS consentite |
 | `LOG_LEVEL` | `INFO` | Livello di logging |
 | `ENVIRONMENT` | `development` | Ambiente (development/production) |
 ## 📚 API Swagger
 La documentazione interattiva dell'API è disponibile automaticamente:
 - **Swagger UI**: http://localhost:8000/docs
 - **ReDoc**: http://localhost:8000/redoc
 ### Endpoint principali
 #### Recuperare modelli caricati
 ```bash
 GET /api/v1/models
 ```
 **Risposta:**
 ```json
 {
  "models": [
    {
      "name": "llama2",
      "digest": "abc123...",
      "size": 3825922048,
      "modified_at": "2024-01-15T10:30:00Z"
    }
  ]
 }
 ```
 #### Dettagli di un modello specifico
 ```bash
 GET /api/v1/models/{model_name}
 ```
 #### Health check API Ollama
 ```bash
 GET /api/v1/health
 ```
 **Risposta:**
 ```json
 {
  "status": "healthy",
  "ollama_version": "0.1.0",
  "uptime_seconds": 3600
 }
 ```
 ### Test API con cURL
 ```bash
 # Ottenere i modelli
 curl http://localhost:8000/api/v1/models
 # Ottenere info su un modello
 curl http://localhost:8000/api/v1/models/llama2
 # Health check
 curl http://localhost:8000/api/v1/health
 ```
 ## 🐳 Docker
 ### Build dell'immagine
 ```bash
 docker build -t llm-monitor:latest .
 ```
 ### Eseguire il container
 ```bash
 docker run -d \
  --name llm-monitor \
  -p 8000:8000 \
  --env-file .env \
  --network host \
  llm-monitor:latest
 ```
 > ⚠️ **Nota**: `--network host` consente al container di accedere a Ollama su localhost
 ### Docker Compose
 Usa il file `docker-compose.yml` fornito:
 ```bash
 # Avviare i servizi
 docker compose up -d
 # Visualizzare i log
 docker compose logs -f llm-monitor
 # Fermare i servizi
 docker compose down
 # Riavviare
 docker compose restart llm-monitor
 ```
 ### Container sempre acceso
 Il container Ollama rimarrà in esecuzione fino al suo arresto manuale:
 ```bash
 # Fermare
 docker compose stop ollama
 # oppure
 docker stop llm-monitor
 # Riavviare
 docker compose start ollama
 # oppure
 docker start llm-monitor
 ```
 ## 📁 Struttura del Progetto
 ```
 llm-monitor/
 ├── main.py                 # Entry point dell'applicazione
 ├── requirements.txt        # Dipendenze Python
 ├── env.example            # Esempio di configurazione
 ├── Dockerfile             # Configurazione Docker
 ├── docker-compose.yml     # Composizione servizi
 ├── README.md              # Questo file
 ├── .gitignore
 │
 ├── app/
 │   ├── __init__.py
 │   ├── config.py          # Configurazione (variabili ambiente)
 │   ├── main.py            # Inizializzazione FastAPI
 │   │
 │   ├── api/
 │   │   ├── __init__.py
 │   │   ├── models.py      # Endpoint modelli
 │   │   ├── health.py      # Endpoint health
 │   │   └── v1/
 │   │       └── __init__.py
 │   │
 │   ├── services/
 │   │   ├── __init__.py
 │   │   ├── ollama.py      # Client Ollama
 │   │   └── cache.py       # Cache in-memory (opzionale)
 │   │
 │   └── web/
 │       ├── __init__.py
 │       ├── static/        # Assets statici (CSS compilato TailwindCSS)
 │       └── templates/     # Template HTML
 │
 └── tests/
    ├── __init__.py
    ├── test_api.py
    └── test_ollama.py
 ```
 ## 🛠️ Sviluppo
 ### Setup locale
 ```bash
 # Clonare il repo
 git clone https://github.com/LucaSacchiNet/llm-monitor.git
 cd llm-monitor
 # Ambiente virtuale
 python3 -m venv venv
 source venv/bin/activate
 # Installare dipendenze + dev
 pip install -r requirements.txt
 pip install -r requirements-dev.txt  # black, pytest, flake8, etc.
 ```
 ### Comandi utili
 ```bash
 # Formattare codice
 black app/ tests/ main.py
 # Linting
 flake8 app/ tests/ main.py
 # Test
 pytest tests/ -v
 # Test con coverage
 pytest tests/ --cov=app
 # Hot reload durante sviluppo
 uvicorn main:app --reload
 ```
 ### Compilare TailwindCSS
 ```bash
 # Installare dipendenze Node (opzionale)
 npm install
 # Generare CSS in modalità développement
 npm run tailwind:dev
 # Build per produzione
 npm run tailwind:build
 ```
 ## 🐛 Troubleshooting
 ### Errore: "Cannot connect to Ollama"
 - Verificare che Ollama sia in esecuzione: `curl http://localhost:11434/api/tags`
 - Controllare che l'indirizzo in `.env` sia corretto (`OLLAMA_HOST`)
 - Se usi Docker, assicurati che il container abbia accesso a Ollama (vedi [Docker](#docker))
 ### Errore: "Port 8000 already in use"
 ```bash
 # Cambiare la porta in .env
 API_PORT=8001
 # Oppure liberare la porta
 lsof -ti :8000 | xargs kill -9
 ```
 ### Dashboard lenta
 - Verificare lo stato di Ollama
 - Aumentare `OLLAMA_TIMEOUT` in `.env`
 - Controllare i log: `docker compose logs -f llm-monitor`
 ## 📄 Dipendenze Principali
 - **FastAPI** - Framework web moderno
 - **uVicorn** - Server ASGI ad alte prestazioni
 - **Pydantic** - Validazione dati
 - **Requests** - Client HTTP
 - **Jinja2** - Template HTML
 - **TailwindCSS** - Utility-first CSS
 ## 📜 Licenza
 Questo progetto è distribuito sotto licenza **MIT**. Vedi il file `LICENSE` per dettagli.
 ## 🤝 Contribuire
 Le pull request sono benvenute! Per cambiamenti importanti, apri prima un issue per discutere i cambiamenti proposti.
 ### Processo di contribuzione
 1. Fork il repository
 2. Crea un branch (`git checkout -b feature/amazing-feature`)
 3. Commit i cambiamenti (`git commit -m 'Add amazing feature'`)
 4. Push al branch (`git push origin feature/amazing-feature`)
 5. Apri una Pull Request
 ## 📞 Supporto
 Per domande o segnalazioni di bug, apri un **Issue** nel repository.
 ---
 **Fatto con ❤️ da [LucaSacchi.Net](https://lucasacchi.net)**
 **Versione**: 1.0.0  
 **Ultima modifica**: Aprile 2026  
 **Status**: 🟢 In Development
@@ -0,0 +1,5 @@
 """
 LLM Monitor - Package principale
 """
 __version__ = "1.0.0"
@@ -0,0 +1,3 @@
 """
 API routes
 """
@@ -0,0 +1,70 @@
 """
 Health check endpoints
 """
 from fastapi import APIRouter, HTTPException
 from pydantic import BaseModel
 from datetime import datetime
 import requests
 import logging
 from app.config import settings
 logger = logging.getLogger(__name__)
 router = APIRouter()
 class HealthResponse(BaseModel):
    status: str
    ollama_status: str
    timestamp: datetime
    class Config:
        json_schema_extra = {
            "example": {
                "status": "healthy",
                "ollama_status": "online",
                "timestamp": "2024-01-15T10:30:00Z"
            }
        }
@router.get("/health", response_model=HealthResponse)
 async def health_check():
    """
    Health check dell'API e dello stato di Ollama
    Returns:
        HealthResponse: Status dell'API e di Ollama
    """
    try:
        # Check Ollama
        response = requests.get(
            f"{settings.OLLAMA_HOST}/api/tags",
            timeout=settings.OLLAMA_TIMEOUT
        )
        ollama_status = "online" if response.status_code == 200 else "offline"
    except Exception as e:
        logger.warning(f"Ollama health check failed: {e}")
        ollama_status = "offline"
    return HealthResponse(
        status="healthy",
        ollama_status=ollama_status,
        timestamp=datetime.utcnow()
    )
@router.get("/ready")
 async def ready():
    """
    Readiness probe per Kubernetes/Docker
    """
    try:
        response = requests.get(
            f"{settings.OLLAMA_HOST}/api/tags",
            timeout=5
        )
        if response.status_code == 200:
            return {"status": "ready"}
        else:
            raise HTTPException(status_code=503, detail="Service unavailable")
    except Exception as e:
        logger.error(f"Readiness check failed: {e}")
        raise HTTPException(status_code=503, detail="Service unavailable")
@@ -0,0 +1,232 @@
 """
 Models endpoints - Gestione dei modelli Ollama
 """
 from fastapi import APIRouter, HTTPException
 from pydantic import BaseModel
 from typing import List, Optional
 from datetime import datetime
 import requests
 import logging
 from app.config import settings
 logger = logging.getLogger(__name__)
 router = APIRouter()
 class ModelInfo(BaseModel):
    """Informazioni su un modello"""
    name: str
    digest: str
    size: int
    modified_at: datetime
    class Config:
        json_schema_extra = {
            "example": {
                "name": "llama2",
                "digest": "abc123def456...",
                "size": 3825922048,
                "modified_at": "2024-01-15T10:30:00Z"
            }
        }
 class ModelsResponse(BaseModel):
    """Risposta con lista di modelli"""
    models: List[ModelInfo]
    total: int
    class Config:
        json_schema_extra = {
            "example": {
                "models": [
                    {
                        "name": "llama2",
                        "digest": "abc123def456...",
                        "size": 3825922048,
                        "modified_at": "2024-01-15T10:30:00Z"
                    }
                ],
                "total": 1
            }
        }
@router.get("/models", response_model=ModelsResponse)
 async def get_models():
    """
    Recupera l'elenco di tutti i modelli caricati in Ollama
    Returns:
        ModelsResponse: Lista dei modelli disponibili
    Raises:
        HTTPException: Se Ollama non è disponibile
    """
    try:
        response = requests.get(
            f"{settings.OLLAMA_HOST}/api/tags",
            timeout=settings.OLLAMA_TIMEOUT
        )
        if response.status_code != 200:
            raise HTTPException(
                status_code=502,
                detail="Ollama non disponibile"
            )
        data = response.json()
        models_data = data.get("models", [])
        models = [
            ModelInfo(
                name=model.get("name", "unknown"),
                digest=model.get("digest", ""),
                size=model.get("size", 0),
                modified_at=datetime.fromisoformat(
                    model.get("modified_at", "").replace("Z", "+00:00")
                ) if model.get("modified_at") else datetime.utcnow()
            )
            for model in models_data
        ]
        return ModelsResponse(
            models=models,
            total=len(models)
        )
    except requests.exceptions.Timeout:
        raise HTTPException(
            status_code=504,
            detail="Timeout: Ollama non ha risposto in tempo"
        )
    except requests.exceptions.ConnectionError:
        raise HTTPException(
            status_code=502,
            detail="Impossible connettersi a Ollama"
        )
    except Exception as e:
        logger.error(f"Error fetching models: {e}")
        raise HTTPException(
            status_code=500,
            detail="Errore nel recupero dei modelli"
        )
@router.get("/models/{model_name}", response_model=ModelInfo)
 async def get_model(model_name: str):
    """
    Recupera le informazioni di un modello specifico
    Args:
        model_name: Nome del modello da cercare
    Returns:
        ModelInfo: Informazioni del modello
    Raises:
        HTTPException: Se il modello non esiste o Ollama non è disponibile
    """
    try:
        response = requests.get(
            f"{settings.OLLAMA_HOST}/api/tags",
            timeout=settings.OLLAMA_TIMEOUT
        )
        if response.status_code != 200:
            raise HTTPException(
                status_code=502,
                detail="Ollama non disponibile"
            )
        data = response.json()
        models_data = data.get("models", [])
        # Cercare il modello
        for model in models_data:
            if model.get("name") == model_name:
                return ModelInfo(
                    name=model.get("name", "unknown"),
                    digest=model.get("digest", ""),
                    size=model.get("size", 0),
                    modified_at=datetime.fromisoformat(
                        model.get("modified_at", "").replace("Z", "+00:00")
                    ) if model.get("modified_at") else datetime.utcnow()
                )
        raise HTTPException(
            status_code=404,
            detail=f"Modello '{model_name}' non trovato"
        )
    except HTTPException:
        raise
    except Exception as e:
        logger.error(f"Error fetching model: {e}")
        raise HTTPException(
            status_code=500,
            detail="Errore nel recupero del modello"
        )
@router.post("/models/{model_name}/pull")
 async def pull_model(model_name: str):
    """
    Scarica/carica un modello in Ollama
    Args:
        model_name: Nome del modello da scaricare
    Returns:
        dict: Status del download
    """
    try:
        response = requests.post(
            f"{settings.OLLAMA_HOST}/api/pull",
            json={"name": model_name},
            timeout=None  # Pull può essere lungo
        )
        if response.status_code not in [200, 201]:
            raise HTTPException(
                status_code=502,
                detail="Errore nel pull del modello"
            )
        return {"status": "pulling", "model": model_name}
    except Exception as e:
        logger.error(f"Error pulling model: {e}")
        raise HTTPException(
            status_code=500,
            detail="Errore nel pull del modello"
        )
@router.delete("/models/{model_name}")
 async def delete_model(model_name: str):
    """
    Elimina un modello da Ollama
    Args:
        model_name: Nome del modello da eliminare
    Returns:
        dict: Confirmazione eliminazione
    """
    try:
        response = requests.delete(
            f"{settings.OLLAMA_HOST}/api/delete",
            json={"name": model_name},
            timeout=settings.OLLAMA_TIMEOUT
        )
        if response.status_code not in [200, 204]:
            raise HTTPException(
                status_code=502,
                detail="Errore nell'eliminazione del modello"
            )
        return {"status": "deleted", "model": model_name}
    except Exception as e:
        logger.error(f"Error deleting model: {e}")
        raise HTTPException(
            status_code=500,
            detail="Errore nell'eliminazione del modello"
        )
@@ -0,0 +1,34 @@
 """
 Configurazione dell'applicazione tramite variabili di ambiente
 """
 from pydantic_settings import BaseSettings
 from typing import List
 class Settings(BaseSettings):
    """Configurazione dell'applicazione"""
    # Ollama
    OLLAMA_HOST: str = "http://localhost:11434"
    OLLAMA_TIMEOUT: int = 30
    # API
    API_HOST: str = "0.0.0.0"
    API_PORT: int = 8000
    API_WORKERS: int = 4
    # CORS
    CORS_ORIGINS: str = "http://localhost:3000,http://localhost:5173,http://localhost:8000"
    # Logging
    LOG_LEVEL: str = "INFO"
    # Environment
    ENVIRONMENT: str = "development"
    class Config:
        env_file = ".env"
        env_file_encoding = "utf-8"
 # Istanza globale della configurazione
 settings = Settings()
@@ -0,0 +1,3 @@
 """
 Services - Business logic
 """
@@ -0,0 +1,116 @@
 """
 Ollama client service
 """
 import requests
 import logging
 from typing import List, Dict, Optional
 from app.config import settings
 logger = logging.getLogger(__name__)
 class OllamaClient:
    """Client per interagire con l'API Ollama"""
    def __init__(self, host: str = None, timeout: int = None):
        self.host = host or settings.OLLAMA_HOST
        self.timeout = timeout or settings.OLLAMA_TIMEOUT
    def get_models(self) -> List[Dict]:
        """
        Recupera l'elenco dei modelli da Ollama
        Returns:
            List[Dict]: Lista dei modelli
        """
        try:
            response = requests.get(
                f"{self.host}/api/tags",
                timeout=self.timeout
            )
            response.raise_for_status()
            return response.json().get("models", [])
        except Exception as e:
            logger.error(f"Error getting models from Ollama: {e}")
            return []
    def get_model(self, model_name: str) -> Optional[Dict]:
        """
        Recupera informazioni su un modello specifico
        Args:
            model_name: Nome del modello
        Returns:
            Dict: Informazioni del modello, o None se non trovato
        """
        try:
            models = self.get_models()
            for model in models:
                if model.get("name") == model_name:
                    return model
            return None
        except Exception as e:
            logger.error(f"Error getting model {model_name}: {e}")
            return None
    def is_available(self) -> bool:
        """
        Verifica se Ollama è disponibile
        Returns:
            bool: True se disponibile, False altrimenti
        """
        try:
            response = requests.get(
                f"{self.host}/api/tags",
                timeout=5
            )
            return response.status_code == 200
        except Exception:
            return False
    def pull_model(self, model_name: str) -> bool:
        """
        Scarica/carica un modello
        Args:
            model_name: Nome del modello
        Returns:
            bool: True se ha successo
        """
        try:
            response = requests.post(
                f"{self.host}/api/pull",
                json={"name": model_name},
                timeout=None
            )
            return response.status_code in [200, 201]
        except Exception as e:
            logger.error(f"Error pulling model {model_name}: {e}")
            return False
    def delete_model(self, model_name: str) -> bool:
        """
        Elimina un modello
        Args:
            model_name: Nome del modello
        Returns:
            bool: True se ha successo
        """
        try:
            response = requests.delete(
                f"{self.host}/api/delete",
                json={"name": model_name},
                timeout=self.timeout
            )
            return response.status_code in [200, 204]
        except Exception as e:
            logger.error(f"Error deleting model {model_name}: {e}")
            return False
 # Istanza globale del client Ollama
 ollama_client = OllamaClient()
@@ -0,0 +1,3 @@
 """
 Web templates and static files
 """
@@ -0,0 +1,224 @@
 <!DOCTYPE html>
 <html lang="it">
 <head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>LLM Monitor - Dashboard Ollama</title>
    <script src="https://cdn.tailwindcss.com"></script>
    <style>
        @keyframes spin {
            to { transform: rotate(360deg); }
        }
        .animate-spin {
            animation: spin 1s linear infinite;
        }
    </style>
 </head>
 <body class="bg-gray-900 text-white">
    <div class="min-h-screen flex flex-col">
        <!-- Header -->
        <header class="bg-gray-800 border-b border-gray-700 sticky top-0 z-50">
            <div class="max-w-7xl mx-auto px-4 py-6">
                <div class="flex items-center justify-between">
                    <div class="flex items-center gap-3">
                        <div class="w-10 h-10 bg-gradient-to-br from-purple-500 to-pink-500 rounded-lg flex items-center justify-center font-bold text-lg">
                            🦙
                        </div>
                        <h1 class="text-2xl font-bold">LLM Monitor</h1>
                    </div>
                    <div class="flex items-center gap-4">
                        <div id="health-status" class="flex items-center gap-2">
                            <div id="status-indicator" class="w-3 h-3 bg-gray-500 rounded-full"></div>
                            <span id="status-text" class="text-sm text-gray-400">Controllo...</span>
                        </div>
                    </div>
                </div>
            </div>
        </header>
        <!-- Main Content -->
        <main class="flex-1">
            <div class="max-w-7xl mx-auto px-4 py-8">
                <!-- Stats Cards -->
                <div class="grid grid-cols-1 md:grid-cols-3 gap-6 mb-8">
                    <div class="bg-gray-800 rounded-lg p-6 border border-gray-700">
                        <div class="text-gray-400 text-sm font-medium">Modelli Caricati</div>
                        <div id="models-count" class="text-4xl font-bold mt-2">-</div>
                    </div>
                    <div class="bg-gray-800 rounded-lg p-6 border border-gray-700">
                        <div class="text-gray-400 text-sm font-medium">Spazio Totale</div>
                        <div id="total-size" class="text-4xl font-bold mt-2">-</div>
                    </div>
                    <div class="bg-gray-800 rounded-lg p-6 border border-gray-700">
                        <div class="text-gray-400 text-sm font-medium">Status Ollama</div>
                        <div id="ollama-status" class="text-4xl font-bold mt-2">-</div>
                    </div>
                </div>
                <!-- Models Section -->
                <div class="bg-gray-800 rounded-lg border border-gray-700 p-6">
                    <div class="flex items-center justify-between mb-6">
                        <h2 class="text-xl font-bold">Modelli Disponibili</h2>
                        <button onclick="loadModels()" class="bg-purple-600 hover:bg-purple-700 px-4 py-2 rounded-lg text-sm font-medium transition">
                            🔄 Aggiorna
                        </button>
                    </div>
                    <!-- Models List -->
                    <div id="models-container" class="space-y-4">
                        <div class="text-center py-8">
                            <div class="animate-spin inline-block w-8 h-8 border-4 border-gray-600 border-t-purple-500 rounded-full"></div>
                            <p class="text-gray-400 mt-4">Caricamento modelli...</p>
                        </div>
                    </div>
                </div>
                <!-- API Documentation Section -->
                <div class="mt-8 bg-blue-900 bg-opacity-20 border border-blue-700 rounded-lg p-6">
                    <h3 class="text-lg font-bold mb-4">📚 Documentazione API</h3>
                    <p class="text-gray-300 mb-4">La API è documentata e testabile direttamente da:</p>
                    <div class="flex gap-3 flex-wrap">
                        <a href="/docs" target="_blank" class="inline-block bg-blue-600 hover:bg-blue-700 px-4 py-2 rounded-lg text-sm font-medium transition">
                            Swagger UI
                        </a>
                        <a href="/redoc" target="_blank" class="inline-block bg-blue-600 hover:bg-blue-700 px-4 py-2 rounded-lg text-sm font-medium transition">
                            ReDoc
                        </a>
                    </div>
                </div>
            </div>
        </main>
        <!-- Footer -->
        <footer class="bg-gray-800 border-t border-gray-700 mt-12">
            <div class="max-w-7xl mx-auto px-4 py-6 text-center text-gray-400 text-sm">
                <p>LLM Monitor v1.0.0 • Fatto con ❤️ da <a href="https://lucasacchi.net" target="_blank" class="text-purple-400 hover:text-purple-300">LucaSacchi.Net</a></p>
            </div>
        </footer>
    </div>
    <script>
        const API_BASE = "/api/v1";
        // Formattare bytes in formato leggibile
        function formatBytes(bytes) {
            if (bytes === 0) return "0 B";
            const k = 1024;
            const sizes = ["B", "KB", "MB", "GB"];
            const i = Math.floor(Math.log(bytes) / Math.log(k));
            return (bytes / Math.pow(k, i)).toFixed(2) + " " + sizes[i];
        }
        // Formattare data
        function formatDate(dateString) {
            const date = new Date(dateString);
            return date.toLocaleDateString("it-IT", {
                year: "numeric",
                month: "short",
                day: "numeric",
                hour: "2-digit",
                minute: "2-digit"
            });
        }
        // Verificare health
        async function checkHealth() {
            try {
                const response = await fetch(`${API_BASE}/health`);
                if (response.ok) {
                    const data = await response.json();
                    const statusEl = document.getElementById("status-indicator");
                    const statusText = document.getElementById("status-text");
                    const ollamaStatus = data.ollama_status;
                    if (ollamaStatus === "online") {
                        statusEl.className = "w-3 h-3 bg-green-500 rounded-full";
                        statusText.className = "text-sm text-green-400";
                        statusText.textContent = "Ollama Online";
                        document.getElementById("ollama-status").innerHTML = "🟢 Online";
                    } else {
                        statusEl.className = "w-3 h-3 bg-red-500 rounded-full";
                        statusText.className = "text-sm text-red-400";
                        statusText.textContent = "Ollama Offline";
                        document.getElementById("ollama-status").innerHTML = "🔴 Offline";
                    }
                }
            } catch (error) {
                console.error("Health check error:", error);
                document.getElementById("status-indicator").className = "w-3 h-3 bg-red-500 rounded-full";
                document.getElementById("status-text").textContent = "Errore connessione";
            }
        }
        // Caricare modelli
        async function loadModels() {
            try {
                const response = await fetch(`${API_BASE}/models`);
                if (!response.ok) throw new Error("Errore nel caricamento");
                const data = await response.json();
                const models = data.models || [];
                // Aggiornare conteggio
                document.getElementById("models-count").textContent = models.length;
                // Calcolare spazio totale
                const totalSize = models.reduce((sum, m) => sum + m.size, 0);
                document.getElementById("total-size").textContent = formatBytes(totalSize);
                // Renderizzare modelli
                if (models.length === 0) {
                    document.getElementById("models-container").innerHTML = `
                        <div class="text-center py-8 text-gray-400">
                            <p>Nessun modello caricato</p>
                        </div>
                    `;
                } else {
                    document.getElementById("models-container").innerHTML = models.map(model => `
                        <div class="bg-gray-700 rounded-lg p-4 border border-gray-600 hover:border-purple-500 transition">
                            <div class="flex items-start justify-between mb-3">
                                <h3 class="text-lg font-semibold">${model.name}</h3>
                                <span class="bg-purple-600 px-3 py-1 rounded text-xs font-medium">Caricato</span>
                            </div>
                            <div class="grid grid-cols-2 gap-4 text-sm">
                                <div>
                                    <p class="text-gray-400">Dimensione</p>
                                    <p class="font-semibold">${formatBytes(model.size)}</p>
                                </div>
                                <div>
                                    <p class="text-gray-400">Ultimo aggiornamento</p>
                                    <p class="font-semibold">${formatDate(model.modified_at)}</p>
                                </div>
                            </div>
                            <div class="mt-3">
                                <p class="text-gray-400 text-xs">Digest</p>
                                <p class="font-mono text-xs bg-gray-800 p-2 rounded mt-1 break-all">${model.digest.substring(0, 64)}...</p>
                            </div>
                        </div>
                    `).join("");
                }
            } catch (error) {
                console.error("Error loading models:", error);
                document.getElementById("models-container").innerHTML = `
                    <div class="text-center py-8 text-red-400">
                        <p>❌ Errore nel caricamento dei modelli</p>
                        <p class="text-sm mt-2">${error.message}</p>
                    </div>
                `;
            }
        }
        // Inizializzazione
        document.addEventListener("DOMContentLoaded", () => {
            checkHealth();
            loadModels();
            // Refresh ogni 30 secondi
            setInterval(() => {
                checkHealth();
                loadModels();
            }, 30000);
        });
    </script>
 </body>
 </html>
@@ -0,0 +1,69 @@
 version: '3.8'
 services:
  # Ollama Service
  ollama:
    image: ollama/ollama:latest
    container_name: ollama-server
    ports:
      - "11434:11434"
    environment:
      OLLAMA_HOST: 0.0.0.0:11434
    volumes:
      - ollama_data:/root/.ollama
    restart: unless-stopped
    # Keep container running until stopped
    stdin_open: true
    tty: true
    networks:
      - llm-monitor-network
  # LLM Monitor Dashboard
  llm-monitor:
    build:
      context: .
      dockerfile: Dockerfile
    container_name: llm-monitor-app
    ports:
      - "8000:8000"
    environment:
      # Carica variabili da .env
      OLLAMA_HOST: http://ollama:11434
      OLLAMA_TIMEOUT: 30
      API_HOST: 0.0.0.0
      API_PORT: 8000
      API_WORKERS: 4
      CORS_ORIGINS: http://localhost:3000,http://localhost:5173,http://localhost:8000
      LOG_LEVEL: INFO
      ENVIRONMENT: production
    env_file:
      - .env
    depends_on:
      - ollama
    restart: unless-stopped
    stdin_open: true
    tty: true
    networks:
      - llm-monitor-network
    # Health check
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8000/api/v1/health"]
      interval: 30s
      timeout: 10s
      retries: 3
      start_period: 10s
 volumes:
  ollama_data:
    driver: local
 networks:
  llm-monitor-network:
    driver: bridge
 # Istruzioni di avvio:
 # docker compose up -d          # Avvia i servizi
 # docker compose logs -f        # Visualizza i log
 # docker compose down           # Ferma i servizi
 # docker compose stop ollama    # Ferma solo Ollama
 # docker compose start ollama   # Riavvia Ollama
@@ -0,0 +1,52 @@
 # LLM Monitor - Environment Configuration Example
 # Copy this file to .env and adjust values for your environment
 # ===========================================
 # Ollama Configuration
 # ===========================================
 # URL base dell'API Ollama
 OLLAMA_HOST=http://localhost:11434
 # Timeout per le richieste a Ollama (secondi)
 OLLAMA_TIMEOUT=30
 # ===========================================
 # API Configuration
 # ===========================================
 # Host su cui esporre l'API
 API_HOST=0.0.0.0
 # Porta su cui esporre l'API
 API_PORT=8000
 # Numero di worker processes per uVicorn
 API_WORKERS=4
 # ===========================================
 # CORS Configuration
 # ===========================================
 # Origini CORS consentite (separare con virgola)
 CORS_ORIGINS=http://localhost:3000,http://localhost:5173,http://localhost:8000
 # ===========================================
 # Logging
 # ===========================================
 # Livello di logging (DEBUG, INFO, WARNING, ERROR, CRITICAL)
 LOG_LEVEL=INFO
 # ===========================================
 # Environment
 # ===========================================
 # Ambiente di esecuzione (development, production)
 ENVIRONMENT=development
 # ===========================================
 # Security (opzionale)
 # ===========================================
 # Secret key per sessioni (genera con: python -c "import secrets; print(secrets.token_hex(32))")
 # SECRET_KEY=your-secret-key-here
 # ===========================================
 # Database (se necessario in futuro)
 # ===========================================
 # DATABASE_URL=sqlite:///./llm-monitor.db
@@ -0,0 +1,82 @@
 """
 LLM Monitor - Dashboard per controllare i modelli caricati in Ollama
 Entry point dell'applicazione FastAPI
 """
 import logging
 from fastapi import FastAPI
 from fastapi.staticfiles import StaticFiles
 from fastapi.responses import FileResponse
 from fastapi.middleware.cors import CORSMiddleware
 from pathlib import Path
 import os
 # Configurazione logging
 logging.basicConfig(level=logging.INFO)
 logger = logging.getLogger(__name__)
 # Importare le rotte
 from app.api.health import router as health_router
 from app.api.models import router as models_router
 from app.config import settings
 # Creare l'app FastAPI
 app = FastAPI(
    title="LLM Monitor API",
    description="Dashboard per il monitoraggio dei modelli LLM in Ollama",
    version="1.0.0",
    docs_url="/docs",
    redoc_url="/redoc",
    openapi_url="/openapi.json"
 )
 # Configurare CORS
 app.add_middleware(
    CORSMiddleware,
    allow_origins=settings.CORS_ORIGINS.split(","),
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
 )
 # Registrare le rotte API
 app.include_router(health_router, prefix="/api/v1", tags=["health"])
 app.include_router(models_router, prefix="/api/v1", tags=["models"])
 # Servire i file statici
 static_path = Path(__file__).parent / "app" / "web" / "static"
 if static_path.exists():
    app.mount("/static", StaticFiles(directory=static_path), name="static")
 # Servire la dashboard web
 templates_path = Path(__file__).parent / "app" / "web" / "templates"
@app.get("/")
 async def root():
    """Redirect alla dashboard"""
    return FileResponse(templates_path / "index.html")
@app.get("/dashboard")
 async def dashboard():
    """Dashboard principale"""
    return FileResponse(templates_path / "index.html")
 # Event hooks
@app.on_event("startup")
 async def startup_event():
    logger.info("🚀 LLM Monitor avviato")
    logger.info(f"📊 Ollama host: {settings.OLLAMA_HOST}")
@app.on_event("shutdown")
 async def shutdown_event():
    logger.info("🛑 LLM Monitor arrestato")
 if __name__ == "__main__":
    import uvicorn
    uvicorn.run(
        "main:app",
        host=settings.API_HOST,
        port=settings.API_PORT,
        reload=settings.ENVIRONMENT == "development",
        log_level=settings.LOG_LEVEL.lower()
    )
@@ -0,0 +1,13 @@
 {
  "name": "llm-monitor",
  "version": "1.0.0",
  "description": "Dashboard per controllare i modelli caricati in Ollama",
  "private": true,
  "scripts": {
    "tailwind:dev": "tailwindcss -i app/web/static/css/input.css -o app/web/static/css/output.css --watch",
    "tailwind:build": "tailwindcss -i app/web/static/css/input.css -o app/web/static/css/output.css --minify"
  },
  "devDependencies": {
    "tailwindcss": "^3.4.0"
  }
 }
@@ -0,0 +1,26 @@
 # Development Dependencies
 # Testing
 pytest==7.4.3
 pytest-cov==4.1.0
 pytest-asyncio==0.21.1
 # Code Quality
 black==23.12.0
 flake8==6.1.0
 isort==5.13.2
 mypy==1.7.1
 # Linting
 pylint==3.0.3
 # Documentation
 mkdocs==1.5.3
 mkdocs-material==9.5.0
 # Debug
 ipython==8.18.1
 ipdb==0.13.13
 # Pre-commit hooks
 pre-commit==3.5.0
@@ -0,0 +1,34 @@
 # LLM Monitor Requirements
 # Core Web Framework
 fastapi==0.104.1
 uvicorn[standard]==0.24.0
 pydantic==2.5.0
 pydantic-settings==2.1.0
 # HTTP Client
 requests==2.31.0
 httpx==0.25.1
 # Template Engine
 jinja2==3.1.2
 # Database & ORM (opzionale)
 # sqlalchemy==2.0.23
 # alembic==1.12.1
 # Utilities
 python-dotenv==1.0.0
 python-multipart==0.0.6
 # Async
 aiohttp==3.9.1
 # Logging & Monitoring
 python-json-logger==2.0.7
 # CORS
 fastapi-cors==0.0.6
 # API Documentation
 # (Swagger/ReDoc sono inclusi di default in FastAPI)
@@ -0,0 +1,3 @@
 """
 Test suite for LLM Monitor
 """
@@ -0,0 +1,32 @@
 """
 Pytest configuration and fixtures
 """
 import pytest
 from fastapi.testclient import TestClient
 from main import app
@pytest.fixture
 def client():
    """FastAPI test client"""
    return TestClient(app)
@pytest.fixture
 def mock_models_response():
    """Mock response from Ollama API"""
    return {
        "models": [
            {
                "name": "llama2",
                "digest": "91ab89b1b9117e34fb2ff4a5bff07b2e1fa1f1d2d3e4f5a6b7c8d9e0f1a2b3c",
                "size": 3825922048,
                "modified_at": "2024-01-15T10:30:00.000Z"
            },
            {
                "name": "mistral",
                "digest": "a1b2c3d4e5f6a7b8c9d0e1f2a3b4c5d6e7f8a9b0c1d2e3f4a5b6c7d8e9f0a1",
                "size": 4096000000,
                "modified_at": "2024-01-14T15:45:00.000Z"
            }
        ]
    }
@@ -0,0 +1,94 @@
 """
 Test API endpoints
 """
 import pytest
 from unittest.mock import patch, MagicMock
 def test_health_check(client):
    """Test health endpoint"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_get.return_value = mock_response
        response = client.get("/api/v1/health")
        assert response.status_code == 200
        data = response.json()
        assert "status" in data
        assert data["status"] == "healthy"
 def test_ready_endpoint(client):
    """Test readiness probe"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_get.return_value = mock_response
        response = client.get("/api/v1/ready")
        assert response.status_code == 200
        assert response.json() == {"status": "ready"}
 def test_get_models(client, mock_models_response):
    """Test getting models list"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_models_response
        mock_get.return_value = mock_response
        response = client.get("/api/v1/models")
        assert response.status_code == 200
        data = response.json()
        assert "models" in data
        assert "total" in data
        assert data["total"] == 2
        assert len(data["models"]) == 2
        assert data["models"][0]["name"] == "llama2"
 def test_get_models_ollama_offline(client):
    """Test getting models when Ollama is offline"""
    with patch("requests.get") as mock_get:
        mock_get.side_effect = Exception("Connection refused")
        response = client.get("/api/v1/models")
        assert response.status_code == 500
 def test_get_specific_model(client, mock_models_response):
    """Test getting specific model"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_models_response
        mock_get.return_value = mock_response
        response = client.get("/api/v1/models/llama2")
        assert response.status_code == 200
        data = response.json()
        assert data["name"] == "llama2"
 def test_get_nonexistent_model(client, mock_models_response):
    """Test getting nonexistent model"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_models_response
        mock_get.return_value = mock_response
        response = client.get("/api/v1/models/nonexistent")
        assert response.status_code == 404
 def test_root_endpoint(client):
    """Test root endpoint redirects to dashboard"""
    response = client.get("/", follow_redirects=False)
    assert response.status_code in [200, 307]
 def test_openapi_schema(client):
    """Test OpenAPI schema is available"""
    response = client.get("/openapi.json")
    assert response.status_code == 200
    schema = response.json()
    assert "info" in schema
    assert "paths" in schema
    assert "/api/v1/health" in schema["paths"]
    assert "/api/v1/models" in schema["paths"]
@@ -0,0 +1,122 @@
 """
 Test Ollama client service
 """
 import pytest
 from unittest.mock import patch, MagicMock
 from app.services.ollama import OllamaClient
@pytest.fixture
 def ollama_client():
    """Create OllamaClient instance"""
    return OllamaClient(host="http://localhost:11434", timeout=30)
 def test_get_models(ollama_client):
    """Test getting models from Ollama"""
    mock_data = {
        "models": [
            {"name": "llama2", "digest": "abc123", "size": 3825922048},
            {"name": "mistral", "digest": "def456", "size": 4096000000}
        ]
    }
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_data
        mock_get.return_value = mock_response
        models = ollama_client.get_models()
        assert len(models) == 2
        assert models[0]["name"] == "llama2"
 def test_get_models_error(ollama_client):
    """Test get models when error occurs"""
    with patch("requests.get") as mock_get:
        mock_get.side_effect = Exception("Connection error")
        models = ollama_client.get_models()
        assert models == []
 def test_get_model(ollama_client):
    """Test getting specific model"""
    mock_data = {
        "models": [
            {"name": "llama2", "digest": "abc123", "size": 3825922048}
        ]
    }
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_data
        mock_get.return_value = mock_response
        model = ollama_client.get_model("llama2")
        assert model is not None
        assert model["name"] == "llama2"
 def test_get_nonexistent_model(ollama_client):
    """Test getting nonexistent model"""
    mock_data = {"models": []}
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_response.json.return_value = mock_data
        mock_get.return_value = mock_response
        model = ollama_client.get_model("nonexistent")
        assert model is None
 def test_is_available(ollama_client):
    """Test checking if Ollama is available"""
    with patch("requests.get") as mock_get:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_get.return_value = mock_response
        assert ollama_client.is_available() is True
 def test_is_available_offline(ollama_client):
    """Test checking if Ollama is available when offline"""
    with patch("requests.get") as mock_get:
        mock_get.side_effect = Exception("Connection refused")
        assert ollama_client.is_available() is False
 def test_pull_model(ollama_client):
    """Test pulling a model"""
    with patch("requests.post") as mock_post:
        mock_response = MagicMock()
        mock_response.status_code = 200
        mock_post.return_value = mock_response
        result = ollama_client.pull_model("llama2")
        assert result is True
 def test_pull_model_error(ollama_client):
    """Test pull model when error occurs"""
    with patch("requests.post") as mock_post:
        mock_post.side_effect = Exception("Error")
        result = ollama_client.pull_model("llama2")
        assert result is False
 def test_delete_model(ollama_client):
    """Test deleting a model"""
    with patch("requests.delete") as mock_delete:
        mock_response = MagicMock()
        mock_response.status_code = 204
        mock_delete.return_value = mock_response
        result = ollama_client.delete_model("llama2")
        assert result is True
 def test_delete_model_error(ollama_client):
    """Test delete model when error occurs"""
    with patch("requests.delete") as mock_delete:
        mock_delete.side_effect = Exception("Error")
        result = ollama_client.delete_model("llama2")
        assert result is False