Added production-grade validation tooling and documentation:
- ADDED: validate-connectivity.yml playbook with comprehensive checks
  * Ping test, sudo verification, Docker status
  * NFS mount validation, disk usage warnings
  * Proxmox-specific checks (version, cluster status)
  * System uptime reporting
  * Passes ansible-lint production profile
- ADDED: validate-environment.sh health check script
  * 10-point diagnostic validation
  * Color-coded status output
  * Reports all 4 nodes operational
- ADDED: QUICK-REFERENCE.md comprehensive command guide
  * Ad-hoc commands, playbook operations
  * Vault management, linting workflows
  * Inventory targeting examples
  * Integration guides (VSCode, Git)
- ADDED: Ansible Vault secrets template (encrypted)
  * group_vars/all/vault.yml with placeholder secrets
  * AES256 encrypted with vault password
  * Template for sudo, Proxmox, Gitea, NFS credentials
- UPDATED: plan-ansibleSetup.md progress report
  * Phase completion status (Phases 1-4 complete)
  * Deviations documented (hosts.ini format, PVE01 added)
  * Next steps and recommendations
- UPDATED: README.md Ansible section
  * Production-ready status badge
  * Quick validation command
  * Links to new documentation
Environment Status: 🟢 PRODUCTION READY
All 4 nodes responding, linting passed, documentation complete
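
For readers unfamiliar with the pattern, a health check script with color-coded status output can be structured roughly as follows. This is a minimal sketch only and not the contents of validate-environment.sh: the run_check helper, the counters, and the three placeholder checks are hypothetical names chosen for illustration, and a real script would presumably call out to Ansible, Docker, or NFS tooling instead.

```shell
#!/bin/sh
# Hypothetical sketch of a color-coded health check runner.
# Nothing here is taken from validate-environment.sh.

passed=0
failed=0

# run_check LABEL COMMAND [ARGS...]
# Runs COMMAND quietly, prints a green PASS or red FAIL line,
# and updates the counters.
run_check() {
  label=$1
  shift
  if "$@" >/dev/null 2>&1; then
    printf '\033[0;32m[PASS]\033[0m %s\n' "$label"
    passed=$((passed + 1))
  else
    printf '\033[0;31m[FAIL]\033[0m %s\n' "$label"
    failed=$((failed + 1))
  fi
}

# Placeholder checks (assumptions for illustration only).
run_check "shell is available" command -v sh
run_check "root directory exists" test -d /
run_check "deliberately failing example" false

printf 'Summary: %d passed, %d failed\n' "$passed" "$failed"
```

Keeping each check behind a single helper means a new diagnostic is one extra line, and the summary counters stay accurate without further bookkeeping.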
Documentation
Overview
This directory contains all technical documentation for the Castaldi Family Homelab infrastructure.
Quick Reference
📘 Runbooks & Guides
- TECHNICAL_RUNBOOK.md - Complete infrastructure reference, emergency procedures, and maintenance schedule
- SECURITY_AUDIT_REPORT.md - 🔴 Security audit findings, exposed credentials, and remediation steps
Knowledge Base Articles (KBAs/)
Structured troubleshooting articles following the incident → resolution format.
GitOps & Deployment
- KBA-001: Komodo GitOps Stack Deployment Failures
Troubleshooting guide for Git-linked stack pull/deploy failures, canonicalize errors, and Docker image tag issues.
Standard Operating Procedures (SOPs/)
Step-by-step guides for operational tasks and migrations.
Infrastructure Deployment
- SOP-002: Initial Infrastructure Deployment
Complete guide for deploying the homelab from scratch, including secure repository setup, Ansible control node configuration, core service deployment, and GitOps integration.
Stack Management
- SOP-001: Migrate Stack from UI to Git
Complete guide for converting Komodo stacks from UI-defined to Git-based deployment, including secrets management and verification steps.
Document Conventions
- KBA-XXX: Troubleshooting articles with clear problem/solution format (stored in KBAs/)
- SOP-XXX: Procedural guides for operational tasks (stored in SOPs/)
- Runbooks: Infrastructure reference and emergency procedures (root level)
Contributing
When documenting new issues or procedures:
- KBAs: Create in the KBAs/ folder for troubleshooting scenarios with clear diagnosis → resolution
- SOPs: Create in the SOPs/ folder for repeatable operational procedures and migrations
- Update Runbook: Add new emergency procedures to TECHNICAL_RUNBOOK.md
- Update Repository Memory: Store critical lessons in /memories/repo/
- Commit Messages: Use conventional commits (e.g., docs(kba): add KBA-002 for...)
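
As a rough illustration of the commit message convention, the subject line can be checked against the "type(scope): summary" shape before committing. The sketch below is an assumption-laden example, not tooling that exists in this repository: the is_conventional function name and the list of accepted types are hypothetical and should be adjusted to whatever the project actually allows.

```shell
#!/bin/sh
# Hypothetical helper: checks that the first line of a commit message
# looks like "type(scope): summary" or "type: summary".
# The accepted type list is an assumption, not a project rule.
is_conventional() {
  printf '%s\n' "$1" | head -n 1 |
    grep -Eq '^(feat|fix|docs|chore|refactor|test|ci|build|perf|style|revert)(\([a-z0-9._/-]+\))?!?: .+'
}

# Example usage with one conforming and one non-conforming message.
for msg in "docs(kba): add troubleshooting article" "Added new article"; do
  if is_conventional "$msg"; then
    echo "ok:  $msg"
  else
    echo "bad: $msg"
  fi
done
```

A check like this could be wired into a commit-msg Git hook, though whether that is worthwhile for a small documentation repository is a judgment call.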
Last Updated: April 12, 2026