642 changes: 642 additions & 0 deletions .claude/commands/analyze-prp-results.md

Large diffs are not rendered by default.

456 changes: 427 additions & 29 deletions .claude/commands/execute-prp.md

Large diffs are not rendered by default.

506 changes: 468 additions & 38 deletions .claude/commands/generate-prp.md

Large diffs are not rendered by default.

197 changes: 197 additions & 0 deletions .claude/commands/validate-prp.md
@@ -0,0 +1,197 @@
# Validate PRP

## PRP File: $ARGUMENTS

Pre-flight validation of a PRP to ensure all context and dependencies are available before execution.

## Validation Process

1. **Parse PRP**
- Read the specified PRP file
- Extract all file references, URLs, and dependencies
- Parse validation checklist items

2. **Context Validation**
- Check all referenced files exist
- Validate all URLs are accessible
- Verify environment dependencies are available
- Check for required API keys/credentials

3. **Codebase Analysis**
- Scan for similar patterns mentioned in PRP
- Validate existing examples are current
- Check for architectural consistency

4. **Dependency Check**
- Verify all required libraries are installed
- Check version compatibility
- Validate external service connectivity

5. **Risk Assessment**
- Analyze failure patterns mentioned in PRP
- Assess complexity and confidence score
- Identify potential bottlenecks
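Steps 1–2 above can be sketched in Python. This is a sketch only — the function names are illustrative, and it assumes the `file:`/`url:` reference conventions used by the gates below:

```python
import re
from pathlib import Path

def extract_references(prp_path: str) -> dict:
    """Extract file and URL references from a PRP markdown file."""
    content = Path(prp_path).read_text()
    # Matches the 'file: path' and 'url: http...' conventions used in PRPs
    files = re.findall(r'file:\s*(\S+)', content)
    urls = re.findall(r'url:\s*(\S+)', content)
    return {"files": files, "urls": urls}

def check_files(files: list) -> list:
    """Return the subset of referenced files that are missing on disk."""
    return [f for f in files if not Path(f).is_file()]
```

The missing-file list feeds directly into the validation report and the auto-fix suggestions.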

## Validation Gates

### File References
```bash
# Check all referenced files exist
echo "Validating file references..."
for file in $(grep -o 'file: [^[:space:]]*' "$PRP_FILE" | cut -d' ' -f2); do
  if [ ! -f "$file" ]; then
    echo "❌ Missing file: $file"
    exit 1
  else
    echo "✅ Found: $file"
  fi
done
```

### URL Accessibility
```bash
# Check all referenced URLs are accessible
echo "Validating URL references..."
for url in $(grep -o 'url: [^[:space:]]*' "$PRP_FILE" | cut -d' ' -f2); do
  # --fail makes curl exit non-zero on HTTP errors (e.g. 404), not just network failures
  if curl --silent --fail --head "$url" > /dev/null; then
    echo "✅ Accessible: $url"
  else
    echo "⚠️ Cannot access: $url"
  fi
done
```

### Environment Dependencies
```bash
# Check environment setup
echo "Validating environment dependencies..."

# Check Python dependencies
if command -v python3 &> /dev/null; then
  echo "✅ Python3 available"

  # Check specific imports mentioned in PRP
  python3 -c "
import re
import sys

# Read PRP file and extract import statements
with open('$PRP_FILE', 'r') as f:
    content = f.read()

# Find import statements in code blocks
imports = re.findall(r'^(?:import|from)\s+([a-zA-Z_][a-zA-Z0-9_]*)', content, re.MULTILINE)
unique_imports = set(imports)

failed_imports = []
for module in unique_imports:
    try:
        __import__(module)
        print(f'✅ Module available: {module}')
    except ImportError:
        failed_imports.append(module)
        print(f'⚠️ Module missing: {module}')

if failed_imports:
    print(f'❌ Missing modules: {failed_imports}')
    sys.exit(1)
"
else
  echo "❌ Python3 not available"
  exit 1
fi
```

### API Connectivity
```bash
# Check external API connectivity
echo "Validating API connectivity..."

# Check common APIs mentioned in PRP
if grep -q "api.openai.com" "$PRP_FILE"; then
  if [ -n "$OPENAI_API_KEY" ]; then
    echo "✅ OpenAI API key configured"
  else
    echo "⚠️ OpenAI API key not set"
  fi
fi

if grep -q "api.anthropic.com" "$PRP_FILE"; then
  if [ -n "$ANTHROPIC_API_KEY" ]; then
    echo "✅ Anthropic API key configured"
  else
    echo "⚠️ Anthropic API key not set"
  fi
fi

# Add more API checks as needed
```
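The per-API blocks above generalize to a table-driven check. A sketch — the hostname-to-variable map is an assumption, to be extended as new APIs appear in PRPs:

```python
import os

# Hypothetical mapping of API hostnames to the env vars holding their keys
API_KEY_VARS = {
    "api.openai.com": "OPENAI_API_KEY",
    "api.anthropic.com": "ANTHROPIC_API_KEY",
}

def check_api_keys(prp_content: str, env=os.environ) -> dict:
    """For each API mentioned in the PRP, report whether its key is set."""
    return {
        host: bool(env.get(var))
        for host, var in API_KEY_VARS.items()
        if host in prp_content
    }
```

Adding a new API then means adding one dictionary entry rather than another copy-pasted `if` block.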

## Validation Report

Generate a comprehensive validation report with:

1. **Context Completeness Score** (0-100)
2. **Dependency Readiness** (Ready/Issues/Blocked)
3. **Risk Assessment** (Low/Medium/High)
4. **Recommended Actions** (before execution)
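One way to combine the individual gate results into the 0–100 readiness score — the weights here are illustrative, not prescribed:

```python
def readiness_score(files_found: int, files_total: int,
                    urls_ok: int, urls_total: int,
                    deps_ok: int, deps_total: int) -> int:
    """Weighted 0-100 readiness score; missing files weigh heaviest."""
    def ratio(ok: int, total: int) -> float:
        return ok / total if total else 1.0  # nothing referenced counts as a pass
    score = (50 * ratio(files_found, files_total)
             + 20 * ratio(urls_ok, urls_total)
             + 30 * ratio(deps_ok, deps_total))
    return round(score)
```

Weighting files over URLs reflects that a missing local file blocks execution outright, while an unreachable URL may only degrade context.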

## Output Format

```
🔍 PRP Validation Report
========================

📁 Context Validation: [PASS/FAIL]
- Files referenced: X/X found
- URLs accessible: X/X responding
- Examples current: [YES/NO]

🔧 Dependencies: [READY/ISSUES/BLOCKED]
- Python modules: X/X available
- External services: X/X accessible
- API keys: X/X configured

⚠️ Risk Assessment: [LOW/MEDIUM/HIGH]
- Complexity score: X/10
- Failure patterns: X identified
- Mitigation strategies: X documented

📊 Readiness Score: XX/100

🎯 Recommended Actions:
[ ] Install missing dependencies
[ ] Configure missing API keys
[ ] Update stale examples
[ ] Review risk mitigation strategies

Status: [READY_TO_EXECUTE/NEEDS_ATTENTION/BLOCKED]
```

## Auto-Fix Suggestions

When validation fails, provide actionable suggestions:

```bash
# Auto-generate fixes where possible
if [ "$STATUS" != "READY_TO_EXECUTE" ]; then
  echo "🔧 Auto-fix suggestions:"
  echo "pip install missing-module-1 missing-module-2"
  echo "export MISSING_API_KEY=your_key_here"
  echo "git checkout HEAD -- outdated-example.py"
fi
```

## Integration with Execute Command

The validate command should be called automatically by `execute-prp` before implementation begins:

```bash
# In execute-prp.md, add this as step 0:
echo "Running pre-execution validation..."
if ! validate-prp "$PRP_FILE"; then
  echo "❌ Validation failed. Please fix issues before execution."
  exit 1
fi
```
118 changes: 104 additions & 14 deletions CLAUDE.md
@@ -1,9 +1,17 @@
### 🔄 Enhanced Project Awareness & Context Engineering
- **Always read `PLANNING.md`** at the start of a new conversation to understand the project's architecture, goals, style, and constraints.
- **Check `TASK.md`** before starting a new task. If the task isn't listed, add it with a brief description and today's date.
- **Use enhanced Context Engineering system** with failure pattern awareness and validation loops.
- **Use consistent naming conventions, file structure, and architecture patterns** as described in `PLANNING.md`.
- **Use venv_linux** (the virtual environment) whenever executing Python commands, including for unit tests.

### 🧠 Context Engineering Enhanced Rules
- **Always validate PRPs before execution** using `validate-prp` command to catch issues early.
- **Use failure pattern awareness** from the knowledge base to prevent common mistakes.
- **Follow multi-level validation** approach: syntax → unit tests → integration → performance.
- **Run post-implementation analysis** to capture learnings and improve future implementations.
- **Update knowledge base** with new patterns and metrics after each implementation.

### 🧱 Code Structure & Modularity
- **Never create a file longer than 500 lines of code.** If a file approaches this limit, refactor by splitting it into modules or helper files.
- **Organize code into clearly separated modules**, grouped by feature or responsibility.
@@ -12,27 +20,50 @@
- `tools.py` - Tool functions used by the agent
- `prompts.py` - System prompts
- **Use clear, consistent imports** (prefer relative imports within packages).
- **Use python_dotenv and load_env()** for environment variables.
- **Always implement proper error handling** with specific exception types and meaningful error messages.

### 🧪 Testing & Reliability Enhanced
- **Always create Pytest unit tests for new features** (functions, classes, routes, etc).
- **Follow test-driven development** when implementing complex features.
- **After updating any logic**, check whether existing unit tests need to be updated. If so, do it.
- **Tests should live in a `/tests` folder** mirroring the main app structure.
- Include at least:
- 1 test for expected use
- 1 edge case
- 1 failure case
- 1 async context test (if applicable)
- **Use proper test isolation** to prevent test pollution and ensure consistent results.
- **Mock external dependencies** appropriately but avoid over-mocking.

### ⚡ Performance & Quality Standards
- **Always use async/await consistently** - never mix sync and async contexts.
- **Implement proper connection pooling** for database and external API connections.
- **Use connection timeouts** for all external API calls.
- **Implement retry logic with exponential backoff** for transient failures.
- **Monitor memory usage** and implement proper cleanup for long-running processes.
- **Use proper type hints** throughout the codebase for better maintainability.
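The retry rule above, as a minimal async sketch — the attempt count, delays, and exception choices are illustrative defaults, not a prescribed API:

```python
import asyncio
import random

async def with_retry(coro_factory, attempts: int = 4, base_delay: float = 0.5):
    """Retry a coroutine with exponential backoff and jitter on transient errors."""
    for attempt in range(attempts):
        try:
            return await coro_factory()
        except (ConnectionError, TimeoutError):
            if attempt == attempts - 1:
                raise  # out of retries; surface the failure to the caller
            # Exponential backoff with jitter: 0.5s, 1s, 2s, ... plus noise
            delay = base_delay * (2 ** attempt) + random.uniform(0, 0.1)
            await asyncio.sleep(delay)
```

Jitter matters when many workers retry at once: without it, they all hammer the recovering service on the same schedule.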

### 🔧 Validation & Quality Assurance
- **Run ruff check and fix** before committing any code.
- **Run mypy for type checking** and fix all type errors.
- **Run security scanning with bandit** for security vulnerabilities.
- **Ensure test coverage is above 80%** for new features.
- **Use pre-commit hooks** if available to enforce quality standards.

### ✅ Task Completion Enhanced
- **Mark completed tasks in `TASK.md`** immediately after finishing them.
- **Run post-implementation analysis** using `analyze-prp-results` command.
- **Update knowledge base** with any new patterns or gotchas discovered.
- **Add new sub-tasks or TODOs** discovered during development to `TASK.md` under a "Discovered During Work" section.
- **Document any deviations** from the original PRP and reasons for changes.

### 📎 Style & Conventions Enhanced
- **Use Python** as the primary language with modern Python 3.9+ features.
- **Follow PEP8**, use type hints, and format with `ruff` (preferred) or `black`.
- **Use `pydantic` for data validation** and leverage v2 features properly.
- Use `FastAPI` for APIs and `SQLAlchemy` (async) for ORM if applicable.
- **Use proper async patterns** - async/await throughout, proper session management.
- Write **docstrings for every function** using the Google style:
```python
def example():
@@ -44,16 +75,75 @@

    Returns:
        type: Description.

    Raises:
        ValueError: When invalid input provided.
    """
```

### 📚 Documentation & Explainability Enhanced
- **Update `README.md`** when new features are added, dependencies change, or setup steps are modified.
- **Comment non-obvious code** and ensure everything is understandable to a mid-level developer.
- **Document architectural decisions** and include rationale for complex implementations.
- When writing complex logic, **add an inline `# Reason:` comment** explaining the why, not just the what.
- **Keep `.env.example` up to date** with all required environment variables and descriptions.

### 🛡️ Security & Best Practices
- **Never hardcode secrets** - always use environment variables.
- **Validate all input data** using Pydantic models or similar validation.
- **Use proper authentication and authorization** patterns.
- **Implement proper logging** without exposing sensitive information.
- **Use HTTPS for all external API calls** and never disable SSL verification.
- **Implement proper rate limiting** for API endpoints.
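The input-validation rule can look like this with Pydantic v2 — a sketch with hypothetical model fields, not a prescribed schema:

```python
from typing import Optional

from pydantic import BaseModel, Field, ValidationError

class CreateUserRequest(BaseModel):
    """Validated request body; rejects malformed input before it reaches logic."""
    username: str = Field(min_length=3, max_length=32)
    email: str = Field(pattern=r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
    age: int = Field(ge=13)

def parse_request(payload: dict) -> Optional[CreateUserRequest]:
    """Return a validated model, or None if validation fails."""
    try:
        return CreateUserRequest(**payload)
    except ValidationError:
        return None  # caller maps this to a 422 response, without echoing secrets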

### 🧠 AI Behavior Rules Enhanced
- **Never assume missing context. Ask questions if uncertain.**
- **Use the knowledge base** to learn from previous implementations and avoid known pitfalls.
- **Never hallucinate libraries or functions** – only use known, verified Python packages.
- **Always confirm file paths and module names** exist before referencing them in code or tests.
- **Never delete or overwrite existing code** unless explicitly instructed to or if part of a task from `TASK.md`.
- **Follow the enhanced PRP execution process** with proper validation at each step.
- **Learn from failures** and update the knowledge base with new patterns.

### 🔄 Continuous Improvement
- **Analyze each implementation** for patterns and improvements.
- **Share learnings** by updating failure patterns and success metrics.
- **Iterate on templates** based on real-world usage and outcomes.
- **Monitor success rates** and adjust approaches based on data.
- **Celebrate successes** and learn from failures without blame.

### 🚨 Critical Failure Patterns to Avoid
Based on historical data, always be aware of these common failure patterns:

1. **Async Context Mixing** - Never mix sync and async code contexts
2. **Environment Variable Issues** - Always validate config and provide defaults
3. **Import Path Errors** - Verify all imports and dependencies before implementation
4. **Database Session Management** - Use proper async session patterns
5. **API Rate Limiting** - Implement proper retry logic and rate limiting
6. **Pydantic v2 Breaking Changes** - Use correct v2 syntax and imports
7. **Test Isolation Issues** - Ensure proper test cleanup and isolation
8. **JSON Serialization Errors** - Use proper serialization with Pydantic
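Pattern 1 in a minimal sketch — never call blocking I/O directly inside a coroutine; offload it instead (the function names are illustrative stand-ins):

```python
import asyncio
import time

def slow_blocking_fetch() -> str:
    """Stand-in for a sync library call (e.g. a blocking HTTP or DB driver)."""
    time.sleep(0.05)  # blocks the calling thread
    return "data"

async def fetch_wrong() -> str:
    # ANTI-PATTERN: the sync call blocks the event loop; every other task stalls
    return slow_blocking_fetch()

async def fetch_right() -> str:
    # Run the sync call in a worker thread so the event loop stays responsive
    return await asyncio.to_thread(slow_blocking_fetch)
```

Both versions return the same value; the difference only shows under concurrency, which is why this pattern slips through simple tests.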

### 📊 Success Metrics Awareness
Be aware of typical implementation metrics for different feature types:
- **API Integration**: ~35 min, 85% success rate
- **CLI Applications**: ~20 min, 95% success rate
- **Database Operations**: ~25 min, 92% success rate
- **Web Applications**: ~45 min, 82% success rate
- **Agent Systems**: ~60 min, 75% success rate

Use these as guidelines for complexity assessment and time estimation.

### 🎯 Quality Gates
Before considering any implementation complete:
- [ ] All tests pass with good coverage
- [ ] No linting errors (ruff, mypy, bandit)
- [ ] Code follows project patterns and conventions
- [ ] Documentation is complete and accurate
- [ ] Environment variables documented
- [ ] Error handling is comprehensive
- [ ] Performance meets requirements
- [ ] Security best practices followed
- [ ] Knowledge base updated with learnings

Remember: **The goal is not just working code, but maintainable, reliable, and learnable implementations that improve the entire development process.**