This project is a backend-focused cloud monitoring system that detects anomalies in AWS EC2 metrics using Amazon CloudWatch and AWS Lambda. It is designed to emulate core functionalities found in enterprise monitoring tools like Datadog or AWS CloudWatch Alarms β with AI-based anomaly detection logic built into a serverless function.
- π― Real-time metric analysis using AWS CloudWatch (
CPUUtilization
) - π§ Anomaly detection Lambda function using Python and threshold logic
- π οΈ Lambda deployed via .zip package β fully cloud-native
- π CloudWatch metrics integration (with plans for automated remediation)
- Auto-remediation Lambda to restart EC2 instances on anomalies
- Trigger flow between detection and remediation via
boto3
or SNS - Alerting via email/SNS/Slack
- Optional frontend dashboard (Streamlit or S3-hosted HTML)
- CI/CD via GitHub Actions for automated deployment
[EC2 Instance] β [CloudWatch Metrics] β [Anomaly Detection Lambda] β Logs to Console (for now)
cloudops-ai-monitoring/ βββ lambda/ β βββ anomaly_detector/ β βββ main.py β βββ requirements.txt βββ README.md
Technology | Role |
---|---|
AWS EC2 | Simulated infrastructure to monitor |
AWS CloudWatch | Collects system metrics like CPU usage |
AWS Lambda | Runs anomaly detection logic serverlessly |
Python (boto3) | AWS SDK to fetch metrics and handle logic |
GitHub | Version control and project hosting |
- EC2 instance sends CPU metrics to CloudWatch
- Lambda function is triggered manually or on schedule
- It fetches the last 5β10 minutes of CPU usage
- Uses logic (e.g.,
CPU > 80%
) to flag anomalies - Logs results β ready to trigger remediation in next phase
This is a real-world simulation of cloud infrastructure monitoring + smart automation. It demonstrates:
- π Observability
- π§ AI-driven backend logic
- β‘ Serverless operations
- π§βπ» DevOps + Cloud Engineering principles
This project is part of my cloud portfolio as an aspiring cloud/DevOps engineer. I built it using AWS Free Tier tools and Python, and Iβm actively working on adding automated remediation and alerting features.
π¬ Connect with me on LinkedIn
This project is open-source and available under the MIT License.