Add HTTP/SSE transport with graceful degradation for public deployment
New Features:
- HTTP/SSE server (server-http.js) for network access
- Express-based web server with MCP SSE transport
- Rate limiting (50 req/min per IP)
- Request timeouts (30s)
- Concurrent request limiting (max 10)
- Circuit breaker pattern for failure handling
- Memory monitoring (450MB threshold)
- Gzip compression for responses
- CORS support for cross-origin requests
- Health check endpoint (/health)

Infrastructure:
- Updated package.json with new dependencies (express, cors, compression, rate-limit)
- New npm script: start:http for the HTTP server
- Comprehensive deployment guide (DEPLOYMENT.md)
- Updated README with deployment instructions

Graceful Degradation:
- Automatically rejects requests when at capacity
- Circuit breaker opens after 5 failures
- Memory-aware request handling
- Per-IP rate limiting to prevent abuse

The original stdio server (index.js) remains unchanged for local use.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
DEPLOYMENT.md (new file, 422 lines)
# Deployment Guide

This guide provides step-by-step instructions for deploying the HPR Knowledge Base MCP Server to various hosting platforms.

## Prerequisites

- Git installed locally
- Account on your chosen hosting platform
- Node.js 18+ installed locally for testing
## Quick Start

1. Install dependencies:
   ```bash
   npm install
   ```

2. Test locally:
   ```bash
   npm run start:http
   ```

3. Verify the server is running:
   ```bash
   curl http://localhost:3000/health
   ```
## Deployment Options

### Option 1: Render.com (Recommended)

**Cost**: Free tier available; $7/mo for an always-on service

**Steps**:

1. Create a Render account at https://render.com

2. Push your code to GitHub/GitLab/Bitbucket

3. In the Render Dashboard:
   - Click "New +" → "Web Service"
   - Connect your repository
   - Configure:
     - **Name**: `hpr-knowledge-base`
     - **Environment**: `Node`
     - **Build Command**: `npm install`
     - **Start Command**: `npm run start:http`
     - **Instance Type**: Free or Starter ($7/mo)

4. Add environment variables (optional):
   - `PORT` is set automatically by Render

5. Click "Create Web Service"

6. Your server will be available at `https://hpr-knowledge-base.onrender.com`

**Health Check Configuration**:
- Path: `/health`
- Success Codes: 200

**Auto-scaling**: Available on paid plans

---
### Option 2: Railway.app

**Cost**: $5 free credit/month, then pay-per-usage (~$5-10/mo for small services)

**Steps**:

1. Create a Railway account at https://railway.app

2. Install the Railway CLI (optional):
   ```bash
   npm install -g @railway/cli
   railway login
   ```

3. Deploy via the CLI:
   ```bash
   railway init
   railway up
   ```

4. Or deploy via the Dashboard:
   - Click "New Project"
   - Choose "Deploy from GitHub repo"
   - Select your repository
   - Railway auto-detects Node.js and runs `npm install`

5. Add the start command in Railway:
   - Go to project settings
   - Add a custom start command: `npm run start:http`

6. Your service URL will be generated automatically

**Advantages**:
- Pay only for what you use
- Scales to zero when idle
- Simple deployment process

---
### Option 3: Fly.io

**Cost**: Free tier (256MB RAM), ~$3-5/mo beyond that

**Steps**:

1. Install the Fly CLI:
   ```bash
   curl -L https://fly.io/install.sh | sh
   ```

2. Log in to Fly:
   ```bash
   fly auth login
   ```

3. Create `fly.toml` in the project root:
   ```toml
   app = "hpr-knowledge-base"

   [build]
     builder = "heroku/buildpacks:20"

   [env]
     PORT = "8080"

   [http_service]
     internal_port = 8080
     force_https = true
     auto_stop_machines = true
     auto_start_machines = true
     min_machines_running = 0

   [[vm]]
     cpu_kind = "shared"
     cpus = 1
     memory_mb = 256
   ```

4. Create a `Procfile`:
   ```
   web: npm run start:http
   ```

5. Deploy:
   ```bash
   fly launch --no-deploy
   fly deploy
   ```

6. Your app will be at `https://hpr-knowledge-base.fly.dev`

**Advantages**:
- Global edge deployment
- Auto-scaling
- Good free tier

---
### Option 4: Vercel (Serverless)

**Cost**: Generous free tier (100GB bandwidth); Pro at $20/mo

**Note**: Serverless functions have cold starts and are less ideal for MCP's persistent connection model, but they can work.

**Steps**:

1. Install the Vercel CLI:
   ```bash
   npm install -g vercel
   ```

2. Create `vercel.json`:
   ```json
   {
     "version": 2,
     "builds": [
       {
         "src": "server-http.js",
         "use": "@vercel/node"
       }
     ],
     "routes": [
       {
         "src": "/(.*)",
         "dest": "server-http.js"
       }
     ]
   }
   ```

3. Deploy:
   ```bash
   vercel
   ```

4. Your app will be at `https://hpr-knowledge-base.vercel.app`

**Limitations**:
- 10-second timeout on the Hobby plan
- Cold starts may affect the first request
- Less suitable for SSE persistent connections

---
### Option 5: Self-Hosted VPS

**Cost**: $4-6/mo (DigitalOcean, Hetzner, Linode)

**Steps**:

1. Create a VPS with Ubuntu 22.04

2. SSH into your server:
   ```bash
   ssh root@your-server-ip
   ```

3. Install Node.js:
   ```bash
   curl -fsSL https://deb.nodesource.com/setup_20.x | bash -
   apt-get install -y nodejs
   ```

4. Clone your repository:
   ```bash
   git clone https://github.com/yourusername/hpr-knowledge-base.git
   cd hpr-knowledge-base
   npm install
   ```

5. Install PM2 for process management:
   ```bash
   npm install -g pm2
   ```

6. Start the server:
   ```bash
   pm2 start server-http.js --name hpr-mcp
   pm2 save
   pm2 startup
   ```

7. Set up an nginx reverse proxy:
   ```bash
   apt-get install nginx
   ```

   Create `/etc/nginx/sites-available/hpr-mcp`:
   ```nginx
   server {
       listen 80;
       server_name your-domain.com;

       location / {
           proxy_pass http://localhost:3000;
           proxy_http_version 1.1;
           proxy_set_header Upgrade $http_upgrade;
           proxy_set_header Connection 'upgrade';
           proxy_set_header Host $host;
           proxy_cache_bypass $http_upgrade;
       }
   }
   ```

   Enable the site:
   ```bash
   ln -s /etc/nginx/sites-available/hpr-mcp /etc/nginx/sites-enabled/
   nginx -t
   systemctl restart nginx
   ```

8. (Optional) Add SSL with Let's Encrypt:
   ```bash
   apt-get install certbot python3-certbot-nginx
   certbot --nginx -d your-domain.com
   ```

---
## Configuration

### Environment Variables

- `PORT`: Server port (default: 3000)
- All other configuration lives in `server-http.js`

### Adjusting Limits

Edit these constants in `server-http.js`:

```javascript
const MAX_CONCURRENT_REQUESTS = 10;   // Max concurrent connections
const REQUEST_TIMEOUT_MS = 30000;     // Request timeout (30s)
const RATE_LIMIT_MAX_REQUESTS = 50;   // Requests per minute per IP
const MEMORY_THRESHOLD_MB = 450;      // Memory limit before rejecting requests
const CIRCUIT_BREAKER_THRESHOLD = 5;  // Failures before the circuit opens
```
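
As a rough illustration of how `CIRCUIT_BREAKER_THRESHOLD` drives graceful degradation, the pattern can be sketched as a small state machine. This is a hedged sketch only: the class name, the cool-down constant, and the half-open handling are assumptions, not the actual code in `server-http.js`.

```javascript
// Illustrative sketch of a circuit breaker; names and the cool-down
// value are assumptions, not the server's real implementation.
const CIRCUIT_BREAKER_THRESHOLD = 5;     // failures before the circuit opens
const CIRCUIT_RESET_TIMEOUT_MS = 30000;  // assumed cool-down before retrying

class CircuitBreaker {
  constructor(threshold = CIRCUIT_BREAKER_THRESHOLD, resetMs = CIRCUIT_RESET_TIMEOUT_MS) {
    this.threshold = threshold;
    this.resetMs = resetMs;
    this.failures = 0;
    this.openedAt = null; // timestamp when the circuit opened, or null
  }

  // Reject fast while open; allow a retry once the cool-down elapses.
  isOpen(now = Date.now()) {
    if (this.openedAt === null) return false;
    if (now - this.openedAt >= this.resetMs) {
      this.openedAt = null; // half-open: let the next request through
      this.failures = 0;
      return false;
    }
    return true;
  }

  recordFailure(now = Date.now()) {
    this.failures += 1;
    if (this.failures >= this.threshold) this.openedAt = now;
  }

  recordSuccess() {
    this.failures = 0;
  }
}
```

After the threshold of consecutive failures is hit, requests are rejected immediately instead of queuing up, which gives the server room to recover before the next retry is allowed.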
## Monitoring

### Health Checks

All platforms should use the `/health` endpoint:

```bash
curl https://your-server.com/health
```
### Logs

**Render/Railway/Fly**: Check the platform dashboard for logs

**Self-hosted**: Use PM2:

```bash
pm2 logs hpr-mcp
```

### Metrics to Monitor

- Response time
- Error rate
- Memory usage
- Active connections
- Rate limit hits
## Troubleshooting

### High Memory Usage

The server loads ~35MB of data on startup. If memory usage exceeds 450MB:
- Increase server RAM
- Reduce `MAX_CONCURRENT_REQUESTS`
- Check for memory leaks
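
The memory-aware handling behind the 450MB limit can be sketched as a guard that compares heap usage against the threshold before accepting work. This is an assumed shape, not the actual code: `MEMORY_THRESHOLD_MB` mirrors the documented constant, while the function names and the Express-style middleware are illustrative.

```javascript
// Illustrative sketch: shed load once heap usage crosses a threshold.
// MEMORY_THRESHOLD_MB mirrors the documented constant; the middleware
// shape is an assumption about how server-http.js might use it.
const MEMORY_THRESHOLD_MB = 450;

function memoryPressure(thresholdMb = MEMORY_THRESHOLD_MB) {
  const usedMb = process.memoryUsage().heapUsed / (1024 * 1024);
  return usedMb > thresholdMb;
}

// Express-style middleware: reject with 503 when under pressure.
function memoryGuard(req, res, next) {
  if (memoryPressure()) {
    res.status(503).json({ error: "Server at capacity, try again later" });
    return;
  }
  next();
}
```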
### Circuit Breaker Opening

If the circuit breaker opens frequently:
- Check the error logs
- Verify that the data files are not corrupted
- Increase `CIRCUIT_BREAKER_THRESHOLD`

### Rate Limiting Issues

If legitimate users hit rate limits:
- Increase `RATE_LIMIT_MAX_REQUESTS`
- Implement authentication for higher limits
- Consider using API keys

### Connection Timeouts

If requests time out:
- Increase `REQUEST_TIMEOUT_MS`
- Check server performance
- Verify network connectivity
## Security Considerations
|
||||
|
||||
1. **CORS**: Currently allows all origins. Restrict in production:
|
||||
```javascript
|
||||
app.use(cors({
|
||||
origin: 'https://your-allowed-domain.com'
|
||||
}));
|
||||
```
|
||||
|
||||
2. **Rate Limiting**: Adjust based on expected traffic
|
||||
|
||||
3. **HTTPS**: Always use HTTPS in production
|
||||
|
||||
4. **Environment Variables**: Use platform secrets for sensitive config
|
||||
|
||||
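
The per-IP rate limiting above is handled by the `express-rate-limit` dependency; as a hedged sketch of the underlying idea, a fixed-window counter per IP looks roughly like this (class and field names are illustrative, and the real package's API and windowing strategy may differ):

```javascript
// Illustrative fixed-window per-IP limiter, sketching the behavior the
// express-rate-limit dependency provides. Constants mirror the documented
// defaults (50 req/min per IP); the implementation details are assumptions.
const RATE_LIMIT_MAX_REQUESTS = 50;  // requests per window per IP
const RATE_LIMIT_WINDOW_MS = 60000;  // one-minute window

class RateLimiter {
  constructor(max = RATE_LIMIT_MAX_REQUESTS, windowMs = RATE_LIMIT_WINDOW_MS) {
    this.max = max;
    this.windowMs = windowMs;
    this.hits = new Map(); // ip -> { count, windowStart }
  }

  // Returns true if the request is allowed, false if the IP is over budget.
  allow(ip, now = Date.now()) {
    const entry = this.hits.get(ip);
    if (!entry || now - entry.windowStart >= this.windowMs) {
      this.hits.set(ip, { count: 1, windowStart: now });
      return true;
    }
    entry.count += 1;
    return entry.count <= this.max;
  }
}
```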
## Updating the Server

### Platform Deployments

Most platforms auto-deploy on git push. Otherwise:

**Render**: Push to git; auto-deploys

**Railway**: `railway up` or push to git

**Fly**: `fly deploy`

**Vercel**: `vercel --prod`

### Self-Hosted

```bash
cd hpr-knowledge-base
git pull
npm install
pm2 restart hpr-mcp
```
## Cost Estimates

| Platform | Free Tier       | Paid Tier | Best For          |
|----------|-----------------|-----------|-------------------|
| Render   | Yes (sleeps)    | $7/mo     | Always-on, simple |
| Railway  | $5 credit       | ~$5-10/mo | Pay-per-use       |
| Fly.io   | 256MB RAM       | ~$3-5/mo  | Global deployment |
| Vercel   | 100GB bandwidth | $20/mo    | High traffic      |
| VPS      | No              | $4-6/mo   | Full control      |
## Support

For deployment issues:
- Check the platform documentation
- Review the server logs
- Test locally first
- Open an issue on GitHub
## Next Steps

After deployment:

1. Test the `/health` endpoint
2. Try the `/sse` endpoint with an MCP client
3. Monitor logs for errors
4. Set up alerts for downtime
5. Document your deployment URL