PflegeLotse
GDPR-compliant RAG assistant for German long-term-care law (SGB XI) — source-grounded answers and an eval harness.
Problem & context
Care law is complex — and answers must be verifiable
Care providers and families lose time searching the SGB XI. A generic chatbot hallucinates — unacceptable for legal questions. What is needed is an assistant that answers only from the statute, cites every claim, and abstains honestly when unsure.
Solution
Retrieval-augmented generation with strict grounding
Hybrid retrieval via pgvector, source-grounded answers, anti-hallucination and abstention.
Architecture
Clean Architecture, four layers
Entities & rules — framework-free
Use cases: ingest, retrieve, answer
pgvector, E5 embeddings, Mistral/Ollama
FastAPI + Jinja2/HTMX
Process history
From plan to deploy — six phases
- 01
Setup & architecture
IN PROGRESSClean-architecture skeleton, Docker, CI. ADR-0001: Python/HTMX over Next.js.
- 02
Data & ingestion
PLANNEDLoad SGB XI (public domain), chunk, E5 embeddings → pgvector.
- 03
Retrieval & grounding
PLANNEDHybrid retrieval, grounded answers, abstention.
- 04
Eval harness
PLANNEDMeasure recall@k, faithfulness, abstention.
- 05
UI & HITL
PLANNEDJinja2/HTMX, source display, disclaimer (RDG).
- 06
Deploy & docs
PLANNEDDocker deploy, README with metrics, ADRs, datenschutz.md.
Results
Made measurable
Will be filled with real numbers after the eval phase — and then feeds into the CV.
Stack & compliance
GDPR & EU AI Act: no personal data, EU or local LLM, citations instead of free generation. Disclaimer: no legal advice (RDG).