Troubleshooting Guide
Common issues and solutions for SNAP deployments.
Installation Issues
Docker Compose Won’t Start
Problem: Services fail to start with Docker Compose
Solutions:
# Confirm that Docker and Docker Compose are installed
docker --version
docker-compose --version
# Check port availability
netstat -tulpn | grep :8000
netstat -tulpn | grep :3000
# View detailed logs
docker-compose logs snapapi
docker-compose logs snapui
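The port checks above can be scripted. The following is a minimal sketch, not part of SNAP: `port_in_use` is a hypothetical helper name, and it assumes the local address sits in the fourth column, which holds for the output of `ss -ltn` and `netstat -tln`.

```shell
# port_in_use PORT
# Reads `ss -ltn` or `netstat -tln` output on stdin and succeeds if any
# listening socket is bound to PORT. Assumes column 4 is the local address.
port_in_use() {
  awk -v port="$1" '
    $4 ~ (":" port "$") { found = 1 }
    END { exit (found ? 0 : 1) }
  '
}

# Usage (ports 8000 and 3000 are the ones checked in this guide):
#   ss -ltn | port_in_use 8000 && echo "port 8000 is already taken"
#   ss -ltn | port_in_use 3000 && echo "port 3000 is already taken"
```

If a port is reported as taken before the services start, another process is likely holding it and Docker Compose will fail to bind.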
Permission Issues
Problem: Permission denied errors
Solutions:
# Add your user to the docker group, then start a shell that picks up the new group
sudo usermod -aG docker "$USER"
newgrp docker
# Check volume permissions
ls -la snapapi-checkpoints/
Cluster Connection Issues
Cluster Not Found
Problem: Cannot connect to cluster
Solutions:
- Verify kubeconfig:
kubectl cluster-info
kubectl get nodes
- Check network connectivity:
curl -k https://your-cluster-api:6443/version
- Verify credentials:
kubectl auth can-i get pods
Permission Denied
Problem: Insufficient permissions
Solutions:
- Check RBAC permissions:
kubectl auth can-i create pods
kubectl auth can-i get nodes
- Verify service account:
kubectl get serviceaccounts
kubectl describe serviceaccount snap-service-account
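The individual `kubectl auth can-i` checks in this section can be run as one pass. This is a rough sketch: `check_rbac` is a hypothetical helper, and the verb and resource pairs are only the ones this guide mentions, so the full set SNAP needs may be larger.

```shell
# check_rbac
# Runs `kubectl auth can-i` for each verb/resource pair and reports which
# ones the current identity lacks. Succeeds only if every check says "yes".
check_rbac() (
  missing=0
  for check in "create pods" "get pods" "get nodes"; do
    verb=${check% *}       # text before the space
    resource=${check#* }   # text after the space
    answer=$(kubectl auth can-i "$verb" "$resource" 2>/dev/null)
    if [ "$answer" = "yes" ]; then
      echo "ok      $verb $resource"
    else
      echo "MISSING $verb $resource"
      missing=1
    fi
  done
  [ "$missing" -eq 0 ]
)

# Usage:
#   check_rbac || echo "ask a cluster admin to grant the missing permissions"
```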
Checkpointing Issues
Checkpoint Creation Fails
Problem: Checkpoint creation fails
Solutions:
- Check pod status:
kubectl get pods
kubectl describe pod <pod-name>
- Verify node permissions:
kubectl describe node <node-name>
- Check CRI-O version:
kubectl get nodes -o jsonpath='{.items[*].status.nodeInfo.containerRuntimeVersion}'
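The runtime strings printed by the command above look like `cri-o://1.28.1`. A small comparison helper can flag nodes that run a different runtime or an older release. This is a sketch under assumptions: `runtime_at_least` is a hypothetical name, and the 1.25 threshold in the usage lines is an assumption based on upstream Kubernetes checkpointing notes, so check it against the version SNAP actually requires.

```shell
# runtime_at_least RUNTIME_VERSION MIN_MAJOR MIN_MINOR
# Succeeds if RUNTIME_VERSION (e.g. "cri-o://1.28.1") is CRI-O and its
# major.minor version is at least MIN_MAJOR.MIN_MINOR.
runtime_at_least() (
  case "$1" in
    cri-o://*) ;;
    *) exit 1 ;;             # not CRI-O at all
  esac
  version=${1#*://}          # drop the "cri-o://" prefix
  major=${version%%.*}       # text before the first dot
  rest=${version#*.}
  minor=${rest%%.*}          # text between the first and second dot
  [ "$major" -gt "$2" ] || { [ "$major" -eq "$2" ] && [ "$minor" -ge "$3" ]; }
)

# Usage (threshold 1.25 is an assumption):
#   for v in $(kubectl get nodes -o jsonpath='{.items[*].status.nodeInfo.containerRuntimeVersion}'); do
#     runtime_at_least "$v" 1 25 || echo "unsupported runtime: $v"
#   done
```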
Checkpoint Too Large
Problem: Checkpoint exceeds storage limits
Solutions:
- Increase storage (this only works if the PVC's StorageClass has allowVolumeExpansion enabled):
kubectl patch pvc <pvc-name> -p '{"spec":{"resources":{"requests":{"storage":"50Gi"}}}}'
- Enable compression:
# Configure checkpoint compression in SNAP settings
Registry Issues
Registry Push Fails
Problem: Cannot push images to registry
Solutions:
- Verify credentials:
docker login your-registry.com
- Check network connectivity:
curl -k https://your-registry.com/v2/
- Review registry logs:
docker-compose logs snapapi | grep -i registry
Authentication Failed
Problem: Registry authentication fails
Solutions:
- Test manual login:
docker login your-registry.com
- Check token validity:
curl -H "Authorization: Bearer <token>" https://your-registry.com/v2/
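The status code returned by the `/v2/` endpoint usually narrows down a registry problem. The sketch below maps the common codes to a likely cause. `explain_registry_status` is a hypothetical helper, and the interpretations follow the general registry HTTP API conventions rather than any specific registry product.

```shell
# explain_registry_status HTTP_CODE
# Prints a short diagnosis for the status code returned by GET /v2/.
explain_registry_status() {
  case "$1" in
    200) echo "ok: registry reachable and credentials accepted" ;;
    401) echo "unauthorized: credentials missing, wrong, or expired" ;;
    403) echo "forbidden: credentials valid but access is denied" ;;
    404) echo "not found: endpoint does not serve the v2 registry API" ;;
    000) echo "no response: connection or TLS failure" ;;
    *)   echo "unexpected status $1" ;;
  esac
}

# Usage (curl prints 000 when it gets no HTTP response at all):
#   code=$(curl -s -o /dev/null -w '%{http_code}' \
#     -H "Authorization: Bearer <token>" https://your-registry.com/v2/)
#   explain_registry_status "$code"
```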
Performance Issues
Slow Checkpointing
Problem: Checkpointing takes too long
Solutions:
- Check resource usage:
kubectl top nodes
kubectl top pods
- Optimize storage:
- Use SSD storage
- Increase I/O capacity
- Enable compression
- Reduce checkpoint size:
- Exclude unnecessary files
- Use incremental checkpoints
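When checking resource usage, it can help to filter the `kubectl top pods` output down to the pods above a memory threshold. This is a minimal sketch: `pods_over_memory` is a hypothetical helper, and it assumes the default three-column output with memory reported in Mi.

```shell
# pods_over_memory LIMIT_MI
# Reads `kubectl top pods` output on stdin and prints the name and memory
# of every pod using more than LIMIT_MI mebibytes.
pods_over_memory() {
  awk -v limit="$1" '
    $1 == "NAME" { next }                         # skip the header row
    $3 ~ /Mi$/ && $3 + 0 > limit { print $1, $3 }
  '
}

# Usage (the 512 Mi threshold is an arbitrary example):
#   kubectl top pods | pods_over_memory 512
```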
High Memory Usage
Problem: SNAP consumes too much memory
Solutions:
- Check memory usage against limits:
docker stats --no-stream snapapi snapui
- Optimize configuration:
- Reduce concurrent operations
- Limit checkpoint retention
- Optimize JVM settings
Network Issues
API Not Accessible
Problem: Cannot access SNAP API
Solutions:
- Check service status:
curl http://localhost:8000/health
- Verify firewall rules:
sudo ufw status
sudo iptables -L
- Check port binding:
netstat -tulpn | grep :8000
WebSocket Connection Fails
Problem: Real-time updates not working
Solutions:
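- Check WebSocket endpoint (a working endpoint answers with 101 Switching Protocols):
curl -i -N --http1.1 --max-time 5 -H "Connection: Upgrade" -H "Upgrade: websocket" -H "Sec-WebSocket-Version: 13" -H "Sec-WebSocket-Key: dGhlIHNhbXBsZSBub25jZQ==" http://localhost:8000/ws/progress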
- Verify proxy configuration:
- Check reverse proxy settings
- Ensure WebSocket support
- Verify CORS configuration
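A WebSocket handshake succeeds only when the server answers with status 101. The helper below checks the status line of a raw HTTP response. It is a sketch: `ws_handshake_ok` is a hypothetical name, and the key in the usage lines is the sample nonce from the WebSocket specification, not a SNAP value.

```shell
# ws_handshake_ok
# Reads an HTTP response (headers included, e.g. from `curl -i`) on stdin
# and succeeds if the status line reports 101 Switching Protocols.
ws_handshake_ok() {
  awk '
    NR == 1 {
      sub(/\r$/, "", $2)     # tolerate a bare "HTTP/1.1 101" line with CRLF
      ok = ($2 == "101")
    }
    END { exit (ok ? 0 : 1) }
  '
}

# Usage (--max-time stops curl, since an upgraded connection stays open):
#   curl -s -i -N --http1.1 --max-time 5 \
#     -H "Connection: Upgrade" -H "Upgrade: websocket" \
#     -H "Sec-WebSocket-Version: 13" \
#     -H "Sec-WebSocket-Key: dGhlIHNhbXBsZSBub25jZQ==" \
#     http://localhost:8000/ws/progress | ws_handshake_ok \
#     || echo "handshake failed: check the proxy's Upgrade header handling"
```

If the API answers 101 when called directly but not through the reverse proxy, the proxy is the likely place where the `Upgrade` and `Connection` headers are being dropped.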
Log Analysis
Viewing Logs
# SNAP API logs
docker-compose logs snapapi
# SNAP UI logs
docker-compose logs snapui
# Kubernetes logs
kubectl logs -n snap <pod-name>
# System logs
journalctl -u docker
Common Log Patterns
- ERROR: Critical issues requiring attention
- WARN: Potential problems or deprecated features
- INFO: Normal operation information
- DEBUG: Detailed debugging information
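A quick count per level shows whether a log is dominated by errors or warnings. The sketch below assumes each log line contains its level as a literal upper-case word, which this guide implies but does not specify. `count_log_levels` is a hypothetical helper.

```shell
# count_log_levels
# Reads log text on stdin and prints "LEVEL count" for each level, in a
# fixed order. A line is counted once, under the most severe level it names.
count_log_levels() {
  awk '
    /ERROR/ { n["ERROR"]++; next }
    /WARN/  { n["WARN"]++;  next }
    /INFO/  { n["INFO"]++;  next }
    /DEBUG/ { n["DEBUG"]++ }
    END {
      split("ERROR WARN INFO DEBUG", levels, " ")
      for (i = 1; i <= 4; i++) printf "%s %d\n", levels[i], n[levels[i]]
    }
  '
}

# Usage:
#   docker-compose logs snapapi | count_log_levels
```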
Getting Help
Diagnostic Information
When reporting issues, include:
- SNAP version
- Cluster information
- Error messages
- Log files
- System configuration
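Most of the items above can be gathered with the commands used elsewhere in this guide. The following is a rough sketch of a collection script. `collect_diagnostics` and the output file names are hypothetical, and the SNAP version is not captured because this guide does not name a command for it, so add that by hand.

```shell
# collect_diagnostics OUTDIR
# Saves the output of each diagnostic command to OUTDIR/<name>.txt.
# Tools that are not installed are noted instead of causing a failure.
collect_diagnostics() (
  outdir=$1
  mkdir -p "$outdir" || exit 1

  # capture NAME COMMAND [ARGS...]
  # Keeps stdout and stderr together so error messages end up in the bundle.
  capture() {
    name=$1
    shift
    if command -v "$1" >/dev/null 2>&1; then
      "$@" >"$outdir/$name.txt" 2>&1 || true
    else
      echo "$1: not installed" >"$outdir/$name.txt"
    fi
  }

  capture system uname -a
  capture docker-version docker --version
  capture compose-version docker-compose --version
  capture cluster-info kubectl --request-timeout=10s cluster-info
  capture nodes kubectl --request-timeout=10s get nodes -o wide
  capture snapapi-logs docker-compose logs --tail=500 snapapi
  capture snapui-logs docker-compose logs --tail=500 snapui
)

# Usage:
#   collect_diagnostics ./snap-diagnostics
#   tar -czf snap-diagnostics.tar.gz snap-diagnostics
```

Review the collected files before sharing them, since logs can contain registry hostnames, tokens, or other sensitive values.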
Support Channels
- Documentation: Check relevant guides
- GitHub Issues: Report bugs and request features
- Community Forum: Get help from other users
- Enterprise Support: Direct support for enterprise customers