docs: comprehensive documentation for NotebookLM-RAG integration

Update documentation to reflect new integration features: README.md: - Add 'Integrazione NotebookLM + RAG' section after Overview - Update DocuMente component section with new endpoints - Add notebooklm_sync.py and notebooklm_indexer.py to architecture - Add integration API examples - Add link to docs/integration.md SKILL.md: - Add RAG Integration to Capabilities table - Update Autonomy Rules with new endpoints - Add RAG Integration section to Quick Reference - Add Sprint 2 changelog with integration features - Update Skill Version to 1.2.0 docs/integration.md (NEW): - Complete integration guide with architecture diagram - API reference for all sync and query endpoints - Usage examples and workflows - Best practices and troubleshooting - Performance considerations and limitations - Roadmap for future features All documentation now accurately reflects the unified NotebookLM + RAG agent capabilities.
2026-04-06 18:01:50 +02:00
parent a5029aef20
commit 568489cae4
3 changed files with 628 additions and 3 deletions
@@ -31,6 +31,9 @@ Questo repository contiene **due sistemi AI complementari**:
 ## Panoramica
 ### Integrazione
 - [docs/integration.md](./docs/integration.md) - Guida integrazione NotebookLM + RAG
 ### NotebookLM Agent
 Interfaccia API e webhook per **Google NotebookLM** che permette:
@@ -51,10 +54,45 @@ Sistema **Retrieval-Augmented Generation** standalone con:
 **Ideale per:** Knowledge management, Document analysis, Research assistant
 ---
 ## Integrazione NotebookLM + RAG
 Ora puoi sincronizzare i tuoi notebook di NotebookLM nel sistema RAG di DocuMente, permettendo di:
 - **Effettuare ricerche semantiche** sui contenuti dei tuoi notebook
 - **Combinare documenti locali e notebook** nelle stesse query
 - **Usare tutti i provider LLM** disponibili per interrogare i notebook
 - **Filtrare per notebook specifici** durante le ricerche
 ### Architettura
 ```
 NotebookLM → NotebookLMIndexerService → Qdrant Vector Store
                                               ↓
                                    RAGService (query con filtri)
                                               ↓
                                    Multi-Provider LLM Response
 ```
 ### Come funziona
 1. **Sincronizzazione**: I contenuti dei notebook vengono estratti, divisi in chunks e indicizzati in Qdrant
 2. **Metadati**: Ogni chunk mantiene informazioni sul notebook e la fonte di origine
 3. **Ricerca**: Le query RAG possono filtrare per notebook_id specifici
 4. **Risposta**: Il LLM riceve contesto dai notebook selezionati
 ---
 ---
 ## Componenti
 ### Integrazione
 - [docs/integration.md](./docs/integration.md) - Guida integrazione NotebookLM + RAG
 ### NotebookLM Agent
 ```
@@ -101,16 +139,46 @@ src/agentic_rag/
 │       ├── documents.py  # Upload documenti
 │       ├── query.py      # Query RAG
 │       ├── chat.py       # Chat conversazionale
-│       └── providers.py  # Gestione provider LLM
+│       ├── providers.py  # Gestione provider LLM
 │       └── notebooklm_sync.py  # [NUOVO] Sync NotebookLM
 ├── services/             # Business logic
 │   ├── rag_service.py    # Core RAG logic
 │   ├── vector_store.py   # Qdrant integration
-│   └── document_service.py
+│   ├── document_service.py
 │   └── notebooklm_indexer.py  # [NUOVO] Indexing service
 └── core/                 # Configurazioni
    ├── config.py        # Multi-provider config
    └── llm_factory.py   # LLM factory pattern
 ```
 **Endpoint API NotebookLM Integration:**
 - `POST /api/v1/notebooklm/sync/{notebook_id}` - Sincronizza un notebook da NotebookLM
 - `GET /api/v1/notebooklm/indexed` - Lista notebook sincronizzati
 - `DELETE /api/v1/notebooklm/sync/{notebook_id}` - Rimuovi sincronizzazione
 - `GET /api/v1/notebooklm/sync/{notebook_id}/status` - Verifica stato sincronizzazione
 - `POST /api/v1/query/notebooks` - Query solo sui notebook
 **Query con filtri notebook:**
 ```bash
 # Ricerca in notebook specifici
 POST /api/v1/query
 {
  "question": "Quali sono i punti chiave?",
  "notebook_ids": ["uuid-1", "uuid-2"],
  "include_documents": true  # Include anche documenti locali
 }
 # Ricerca solo nei notebook
 POST /api/v1/query/notebooks
 {
  "question": "Trova informazioni su...",
  "notebook_ids": ["uuid-1"],
  "k": 10
 }
 ```
 ---
 **Provider LLM Supportati:**
 | Provider | Modelli Principali | Stato |
@@ -217,6 +285,9 @@ DEBUG=false
 ## Avvio
 ### Integrazione
 - [docs/integration.md](./docs/integration.md) - Guida integrazione NotebookLM + RAG
 ### NotebookLM Agent
 ```bash
@@ -292,6 +363,39 @@ curl -X POST http://localhost:8000/api/v1/query \
    "provider": "openai",
    "model": "gpt-4o-mini"
  }'
 ### Integrazione NotebookLM + RAG
 **Sincronizzare un notebook:**
 ```bash
 # Sincronizza un notebook da NotebookLM al vector store
 curl -X POST http://localhost:8000/api/v1/notebooklm/sync/{notebook_id}
 # Lista notebook sincronizzati
 curl http://localhost:8000/api/v1/notebooklm/indexed
 # Rimuovi sincronizzazione
 curl -X DELETE http://localhost:8000/api/v1/notebooklm/sync/{notebook_id}
 ```
 **Query sui notebook:**
 ```bash
 # Query solo sui notebook (senza documenti locali)
 curl -X POST http://localhost:8000/api/v1/query/notebooks   -H "Content-Type: application/json"   -d '{
    "question": "Quali sono le conclusioni principali?",
    "notebook_ids": ["uuid-del-notebook"],
    "k": 10,
    "provider": "openai"
  }'
 # Query mista (documenti + notebook)
 curl -X POST http://localhost:8000/api/v1/query   -H "Content-Type: application/json"   -d '{
    "question": "Confronta le informazioni tra i documenti e i notebook",
    "notebook_ids": ["uuid-1", "uuid-2"],
    "include_documents": true,
    "provider": "anthropic"
  }'
 ```
 ---
@@ -351,6 +455,9 @@ documente/
 ## Documentazione
 ### Integrazione
 - [docs/integration.md](./docs/integration.md) - Guida integrazione NotebookLM + RAG
 ### NotebookLM Agent
 - [SKILL.md](./SKILL.md) - Skill definition per agenti AI
 - [prd.md](./prd.md) - Product Requirements Document
@@ -31,6 +31,7 @@ Interfaccia agentica per Google NotebookLM tramite API REST e webhook. Automatiz
 | **Generazione** | Audio (podcast), Video, Slide, Infografiche, Quiz, Flashcard, Report, Mappe mentali, Tabelle |
 | **Artifacts** | Monitorare stato, scaricare in vari formati |
 | **Webhook** | Registrare endpoint, ricevere notifiche eventi |
 | **RAG Integration** | Sincronizzare notebook, ricerche semantiche, query multi-notebook |
 ---
@@ -73,6 +74,10 @@ http://localhost:8000/health
 | `GET /api/v1/notebooks/{id}/chat/history` | Read-only |
 | `GET /api/v1/notebooks/{id}/artifacts` | Read-only |
 | `GET /api/v1/notebooks/{id}/artifacts/{id}/status` | Read-only |
 | `GET /api/v1/notebooklm/indexed` | Read-only |
 | `GET /api/v1/notebooklm/sync/{id}/status` | Read-only |
 | `POST /api/v1/query` | Read-only (ricerca) |
 | `POST /api/v1/query/notebooks` | Read-only (ricerca) |
 | `GET /health` | Health check |
 | `POST /api/v1/webhooks/{id}/test` | Test non distruttivo |
@@ -86,6 +91,8 @@ http://localhost:8000/health
 | `POST /api/v1/notebooks/{id}/generate/*` | Lungo, può fallire |
 | `GET /api/v1/notebooks/{id}/artifacts/{id}/download` | Scrive filesystem |
 | `POST /api/v1/webhooks` | Configura endpoint |
 | `POST /api/v1/notebooklm/sync/{id}` | Indicizza dati (tempo/risorse) |
 | `DELETE /api/v1/notebooklm/sync/{id}` | Rimuove dati indicizzati |
 ---
@@ -242,6 +249,46 @@ curl http://localhost:8000/api/v1/notebooks/{id}/artifacts/{artifact_id}/downloa
  -o artifact.mp3
 ```
 ### RAG Integration
 ```bash
 # Sincronizzare notebook nel vector store
 curl -X POST http://localhost:8000/api/v1/notebooklm/sync/{notebook_id} \
  -H "X-API-Key: your-key"
 # Lista notebook sincronizzati
 curl http://localhost:8000/api/v1/notebooklm/indexed \
  -H "X-API-Key: your-key"
 # Query sui notebook (solo contenuto notebook)
 curl -X POST http://localhost:8000/api/v1/query/notebooks \
  -H "X-API-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "question": "Quali sono i punti chiave?",
    "notebook_ids": ["uuid-1", "uuid-2"],
    "k": 10,
    "provider": "openai"
  }'
 # Query mista (documenti + notebook)
 curl -X POST http://localhost:8000/api/v1/query \
  -H "X-API-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "question": "Confronta le informazioni tra documenti e notebook",
    "notebook_ids": ["uuid-1"],
    "include_documents": true,
    "provider": "anthropic"
  }'
 # Rimuovere sincronizzazione
 curl -X DELETE http://localhost:8000/api/v1/notebooklm/sync/{notebook_id} \
  -H "X-API-Key: your-key"
 ```
 ---
 ### Webhook Management
 ```bash
@@ -555,7 +602,7 @@ curl http://localhost:8000/api/v1/notebooks -H "X-API-Key: your-key"
 ---
-**Skill Version:** 1.1.0  
+**Skill Version:** 1.2.0  
 **API Version:** v1  
 **Last Updated:** 2026-04-06
@@ -585,3 +632,38 @@ curl http://localhost:8000/api/v1/notebooks -H "X-API-Key: your-key"
 - Chat functionality
 - Content generation (audio, video, etc.)
 - Webhook system
 ---
 ## Changelog Sprint 2
 ### 2026-04-06 - NotebookLM + RAG Integration
 **Implemented:**
 - ✅ `POST /api/v1/notebooklm/sync/{id}` - Sync notebook to RAG vector store
 - ✅ `GET /api/v1/notebooklm/indexed` - List synced notebooks
 - ✅ `DELETE /api/v1/notebooklm/sync/{id}` - Remove notebook from RAG
 - ✅ `GET /api/v1/notebooklm/sync/{id}/status` - Check sync status
 - ✅ `POST /api/v1/query/notebooks` - Query only notebook content
 - ✅ Enhanced `POST /api/v1/query` - Filter by notebook_ids
 **Features:**
 - NotebookLMIndexerService for content extraction and indexing
 - Vector store integration with Qdrant
 - Metadata preservation (notebook_id, source_id, source_title)
 - Multi-notebook queries
 - Hybrid search (documents + notebooks)
 - Support for all LLM providers in notebook queries
 - Comprehensive test coverage (428 lines of tests)
 **Architecture:**
 - Service layer: NotebookLMIndexerService
 - API routes: notebooklm_sync.py
 - Enhanced RAGService with notebook filtering
 - Extended VectorStoreService with filter support
 **Documentation:**
 - ✅ Updated README.md with integration overview
 - ✅ Created docs/integration.md with full guide
 - ✅ Updated SKILL.md with new capabilities
 - ✅ API examples and best practices
@@ -0,0 +1,436 @@
 # Guida Integrazione NotebookLM + RAG
 Questo documento descrive l'integrazione tra **NotebookLM Agent** e **DocuMente RAG**, che permette di eseguire ricerche semantiche (RAG) sui contenuti dei notebook di Google NotebookLM.
 ---
 ## Indice
 - [Overview](#overview)
 - [Architettura](#architettura)
 - [Come Funziona](#come-funziona)
 - [API Reference](#api-reference)
 - [Esempi di Utilizzo](#esempi-di-utilizzo)
 - [Best Practices](#best-practices)
 - [Troubleshooting](#troubleshooting)
 ---
 ## Overview
 L'integrazione colma il divario tra **gestione notebook** (NotebookLM Agent) e **ricerca semantica** (DocuMente RAG), permettendo di:
 - 🔍 **Ricercare** nei contenuti dei notebook con semantic search
 - 🧠 **Usare LLM multi-provider** per interrogare i notebook
 - 📊 **Combinare** notebook e documenti locali nelle stesse query
 - 🎯 **Filtrare** per notebook specifici
 - ⚡ **Indicizzare** automaticamente i contenuti
 ### Use Cases
 1. **Research Assistant**: "Cosa dicono tutti i miei notebook sull'intelligenza artificiale?"
 2. **Knowledge Mining**: "Trova tutte le fonti che parlano di Python nei miei notebook di programmazione"
 3. **Cross-Notebook Analysis**: "Confronta le conclusioni tra il notebook A e il notebook B"
 4. **Document + Notebook Search**: "Quali informazioni ho sia nei documenti PDF che nei notebook?"
 ---
 ## Architettura
 ```
 ┌─────────────────────────────────────────────────────────────────┐
 │                        NotebookLM Agent                         │
 │  ┌─────────────┐    ┌─────────────┐    ┌─────────────────────┐ │
 │  │  Notebooks  │───▶│   Sources   │───▶│   Full Text Get     │ │
 │  └─────────────┘    └─────────────┘    └─────────────────────┘ │
 └─────────────────────────────────────────────────────────────────┘
                              │
                              │ Extract Content
                              ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                   NotebookLMIndexerService                      │
 │  ┌─────────────┐    ┌─────────────┐    ┌─────────────────────┐ │
 │  │   Chunking  │───▶│  Embedding  │───▶│   Metadata Store    │ │
 │  └─────────────┘    └─────────────┘    └─────────────────────┘ │
 └─────────────────────────────────────────────────────────────────┘
                              │
                              │ Index to Vector Store
                              ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                         Qdrant Vector Store                     │
 │  ┌───────────────────────────────────────────────────────────┐  │
 │  │  Collection: "documents"                                  │  │
 │  │  Points with metadata:                                    │  │
 │  │    - notebook_id, source_id, source_title                 │  │
 │  │    - notebook_title, source_type                          │  │
 │  │    - source: "notebooklm"                                 │  │
 │  └───────────────────────────────────────────────────────────┘  │
 └─────────────────────────────────────────────────────────────────┘
                              │
                              │ Query with Filters
                              ▼
 ┌─────────────────────────────────────────────────────────────────┐
 │                          RAGService                             │
 │  ┌─────────────┐    ┌─────────────┐    ┌─────────────────────┐ │
 │  │    Query    │───▶│   Search    │───▶│   LLM Generation    │ │
 │  └─────────────┘    └─────────────┘    └─────────────────────┘ │
 └─────────────────────────────────────────────────────────────────┘
 ```
 ---
 ## Come Funziona
 ### 1. Sincronizzazione
 Quando sincronizzi un notebook:
 1. **Estrazione**: Ottiene tutte le fonti dal notebook via `notebooklm-py`
 2. **Full Text**: Recupera il testo completo di ogni fonte (se disponibile)
 3. **Chunking**: Divide i contenuti in chunks di ~1024 caratteri
 4. **Embedding**: Genera embeddings vettoriali usando OpenAI
 5. **Storage**: Salva in Qdrant con metadata completi
 ### 2. Metadata Structure
 Ogni chunk memorizzato contiene:
 ```json
 {
  "text": "contenuto del chunk...",
  "notebook_id": "uuid-del-notebook",
  "source_id": "uuid-della-fonte",
  "source_title": "Titolo della Fonte",
  "source_type": "url|file|youtube|drive",
  "notebook_title": "Titolo del Notebook",
  "source": "notebooklm"
 }
 ```
 ### 3. Query
 Quando esegui una query:
 1. **Embedding**: La domanda viene convertita in embedding
 2. **Search**: Qdrant cerca i chunk più simili
 3. **Filter**: Se specificati, filtra per `notebook_id`
 4. **Context**: I chunk vengono formattati come contesto
 5. **Generation**: Il LLM genera la risposta basata sul contesto
 ---
 ## API Reference
 ### Sync Endpoints
 #### POST `/api/v1/notebooklm/sync/{notebook_id}`
 Sincronizza un notebook da NotebookLM al vector store.
 **Response:**
 ```json
 {
  "sync_id": "uuid-della-sync",
  "notebook_id": "uuid-del-notebook",
  "notebook_title": "Titolo Notebook",
  "status": "success",
  "sources_indexed": 5,
  "total_chunks": 42,
  "message": "Successfully synced 5 sources with 42 chunks"
 }
 ```
 #### GET `/api/v1/notebooklm/indexed`
 Lista tutti i notebook sincronizzati.
 **Response:**
 ```json
 {
  "notebooks": [
    {
      "notebook_id": "uuid-1",
      "notebook_title": "AI Research",
      "sources_count": 10,
      "chunks_count": 150,
      "last_sync": "2026-01-15T10:30:00Z"
    }
  ],
  "total": 1
 }
 ```
 #### DELETE `/api/v1/notebooklm/sync/{notebook_id}`
 Rimuove un notebook dal vector store.
 **Response:**
 ```json
 {
  "notebook_id": "uuid-del-notebook",
  "deleted": true,
  "message": "Successfully removed index..."
 }
 ```
 #### GET `/api/v1/notebooklm/sync/{notebook_id}/status`
 Verifica lo stato di sincronizzazione di un notebook.
 **Response:**
 ```json
 {
  "notebook_id": "uuid-del-notebook",
  "status": "indexed",
  "sources_count": 5,
  "chunks_count": 42,
  "last_sync": "2026-01-15T10:30:00Z"
 }
 ```
 ### Query Endpoints
 #### POST `/api/v1/query` (with notebook filter)
 Esegue una RAG query con possibilità di filtrare per notebook.
 **Request:**
 ```json
 {
  "question": "Quali sono i punti chiave?",
  "notebook_ids": ["uuid-1", "uuid-2"],
  "include_documents": true,
  "k": 10,
  "provider": "openai",
  "model": "gpt-4o"
 }
 ```
 **Response:**
 ```json
 {
  "question": "Quali sono i punti chiave?",
  "answer": "Secondo i documenti e i notebook analizzati...",
  "provider": "openai",
  "model": "gpt-4o",
  "sources": [
    {
      "text": "Contenuto del chunk...",
      "source_type": "notebooklm",
      "notebook_id": "uuid-1",
      "notebook_title": "AI Research",
      "source_title": "Introduction to AI"
    }
  ],
  "user": "anonymous",
  "filters_applied": {
    "notebook_ids": ["uuid-1", "uuid-2"],
    "include_documents": true
  }
 }
 ```
 #### POST `/api/v1/query/notebooks`
 Esegue una query **solo** sui notebook (esclude documenti locali).
 **Request:**
 ```json
 {
  "question": "Trova informazioni su...",
  "notebook_ids": ["uuid-1"],
  "k": 10,
  "provider": "anthropic"
 }
 ```
 ---
 ## Esempi di Utilizzo
 ### Esempio 1: Sincronizzazione e Query Base
 ```bash
 # 1. Sincronizza un notebook
 curl -X POST http://localhost:8000/api/v1/notebooklm/sync/abc-123
 # 2. Query sul notebook sincronizzato
 curl -X POST http://localhost:8000/api/v1/query/notebooks \
  -H "Content-Type: application/json" \
  -d '{
    "question": "Quali sono le tecnologie AI menzionate?",
    "notebook_ids": ["abc-123"]
  }'
 ```
 ### Esempio 2: Ricerca Multi-Notebook
 ```bash
 # Query su più notebook contemporaneamente
 curl -X POST http://localhost:8000/api/v1/query \
  -H "Content-Type: application/json" \
  -d '{
    "question": "Confronta gli approcci di machine learning descritti",
    "notebook_ids": ["notebook-1", "notebook-2", "notebook-3"],
    "k": 15,
    "provider": "anthropic"
  }'
 ```
 ### Esempio 3: Workflow Completo
 ```bash
 #!/bin/bash
 # 1. Ottieni lista notebook da NotebookLM
 NOTEBOOKS=$(curl -s http://localhost:8000/api/v1/notebooks)
 # 2. Sincronizza il primo notebook
 NOTEBOOK_ID=$(echo $NOTEBOOKS | jq -r '.data.items[0].id')
 echo "Sincronizzazione notebook: $NOTEBOOK_ID"
 SYNC_RESULT=$(curl -s -X POST "http://localhost:8000/api/v1/notebooklm/sync/$NOTEBOOK_ID")
 echo "Risultato: $SYNC_RESULT"
 # 3. Attendi che la sincronizzazione sia completata (se asincrona)
 sleep 2
 # 4. Esegui query sul notebook
 curl -X POST http://localhost:8000/api/v1/query/notebooks \
  -H "Content-Type: application/json" \
  -d "{
    \"question\": \"Riassumi i punti principali\",
    \"notebook_ids\": [\"$NOTEBOOK_ID\"],
    \"provider\": \"openai\"
  }"
 ```
 ---
 ## Best Practices
 ### 1. **Sincronizzazione Selettiva**
 Non sincronizzare tutti i notebook, solo quelli rilevanti per le ricerche.
 ```bash
 # Sincronizza solo i notebook attivi
 for notebook_id in "notebook-1" "notebook-2"; do
  curl -X POST "http://localhost:8000/api/v1/notebooklm/sync/$notebook_id"
 done
 ```
 ### 2. **Gestione Chunks**
 Ogni fonte viene divisa in chunks di ~1024 caratteri. Se un notebook ha molte fonti grandi, considera:
 - Aumentare `k` nelle query (default: 5, max: 50)
 - Filtrare per notebook specifici per ridurre il contesto
 ### 3. **Provider Selection**
 Usa provider diversi per tipologie di query diverse:
 - **OpenAI GPT-4o**: Query complesse, analisi dettagliate
 - **Anthropic Claude**: Sintesi lunghe, analisi testuali
 - **Mistral**: Query veloci, risposte concise
 ### 4. **Refresh Periodico**
 I notebook cambiano nel tempo. Considera di:
 - Rimuovere e risincronizzare periodicamente
 - Aggiungere un job schedulato per il refresh
 ```bash
 # Cron job per refresh settimanale
 0 2 * * 0 /path/to/sync-notebooks.sh
 ```
 ### 5. **Monitoraggio**
 Traccia quali notebook sono sincronizzati:
 ```bash
 # Lista e verifica stato
 curl http://localhost:8000/api/v1/notebooklm/indexed | jq '.'
 ```
 ---
 ## Troubleshooting
 ### Problema: Sincronizzazione fallita
 **Sintomi**: Errore 500 durante la sincronizzazione
 **Causa**: NotebookLM potrebbe non avere il testo completo disponibile per alcune fonti
 **Soluzione**:
 1. Verifica che il notebook esista: `GET /api/v1/notebooks/{id}`
 2. Controlla che le fonti siano indicizzate: NotebookLM mostra "Ready"
 3. Alcune fonti (YouTube, Drive) potrebbero non avere testo estratto
 ### Problema: Query non trova risultati
 **Sintomi**: Risposta "I don't have enough information..."
 **Verifica**:
 ```bash
 # 1. Il notebook è sincronizzato?
 curl http://localhost:8000/api/v1/notebooklm/sync/{notebook_id}/status
 # 2. Quanti chunks ci sono?
 curl http://localhost:8000/api/v1/notebooklm/indexed
 ```
 **Soluzione**:
 - Aumenta `k` nella query
 - Verifica che il contenuto sia stato effettivamente estratto
 - Controlla che l'embedding model sia configurato correttamente
 ### Problema: Rate Limiting
 **Sintomi**: Errori 429 durante sincronizzazione
 **Soluzione**:
 - NotebookLM ha rate limits aggressivi
 - Aggiungi delay tra le sincronizzazioni
 - Sincronizza durante ore di basso traffico
 ```python
 # Aggiungi delay
 import asyncio
 for notebook_id in notebook_ids:
    await sync_notebook(notebook_id)
    await asyncio.sleep(5)  # Attendi 5 secondi
 ```
 ---
 ## Performance Considerations
 ### Dimensione dei Chunks
 - **Default**: 1024 caratteri
 - **Trade-off**: 
  - Chunks più grandi = più contesto ma meno precisione
  - Chunks più piccoli = più precisione ma meno contesto
 ### Numero di Notebook
 - **Consigliato**: < 50 notebook sincronizzati contemporaneamente
 - **Ottimale**: Filtra per notebook specifici nelle query
 ### Refresh Strategy
 - **Full Refresh**: Rimuovi tutto e risincronizza (lento ma pulito)
 - **Incremental**: Aggiungi solo nuove fonti (più veloce ma può avere duplicati)
 ---
 ## Limitazioni Conosciute
 1. **Testo Completo**: Non tutte le fonti di NotebookLM hanno testo completo disponibile (es. alcuni PDF, YouTube)
 2. **Sync Non Automatica**: La sincronizzazione è manuale via API, non automatica
 3. **Storage**: I chunks duplicano lo storage (contenuto sia in NotebookLM che in Qdrant)
 4. **Embedding Model**: Attualmente usa OpenAI per embeddings (configurabile in futuro)
 ---
 ## Roadmap
 - [ ] **Auto-Sync**: Sincronizzazione automatica quando i notebook cambiano
 - [ ] **Incremental Sync**: Aggiornamento solo delle fonti modificate
 - [ ] **Multi-Embedder**: Supporto per altri modelli di embedding
 - [ ] **Semantic Chunking**: Chunking basato su significato anziché lunghezza
 - [ ] **Cross-Reference**: Link tra fonti simili in notebook diversi
 ---
 **Versione**: 1.0.0  
 **Ultimo Aggiornamento**: 2026-04-06