We are looking for a skilled DevOps Engineer to join our technology team and take ownership of our CI/CD pipelines, infrastructure automation, and system reliability. You will work closely with developers, QA, and operations to streamline the deployment lifecycle and ensure high availability, scalability, and performance of our applications and ERP systems. Experience with cloud platforms (e.g., DigitalOcean, AWS, or Azure) and containerization is highly preferred.
· Manage and optimize Kubernetes clusters, Docker containers, and cloud resources.
· Maintain infrastructure as code (IaC) for consistency and scalability.
· Enforce security practices throughout the DevOps lifecycle (DevSecOps).
· Set up and maintain CI/CD pipelines using tools like GitLab CI, Jenkins, GitHub Actions, or Bitbucket Pipelines.
· Automate testing, building, and deployment processes for staging and production environments.
· Ensure zero-downtime deployments and rollbacks with blue-green or canary strategies.
· Implement monitoring and alerting tools (e.g., Sentry, Prometheus, Grafana, ELK/EFK, Datadog).
· Proactively monitor application health and infrastructure performance.
· Participate in on-call rotations and troubleshoot production issues.
· Enforce infrastructure and pipeline security best practices.
· Manage secrets and credentials securely using tools like Vault, AWS Secrets Manager, or SOPS.
· Ensure systems follow compliance and auditing requirements.
· Work closely with developers, QA, and product teams to ensure smooth integration and delivery.
· Document infrastructure, pipelines, and procedures.
· Provide internal support and DevOps mentoring when needed.
Server Maintenance & Optimization:
· Regularly conduct server maintenance, including system updates, patching, and performance optimization.
· Ensure infrastructure is configured for high availability, scalability, and reliability.
· Perform regular backups and disaster recovery tests to minimize system downtime.
Security Testing & Hardening:
· Implement and enforce security best practices, including firewall configurations, access control, and user permissions.
· Regularly conduct vulnerability scans and security assessments to identify and address security risks.
· Apply security patches and updates promptly to mitigate vulnerabilities.
Daily Infrastructure Monitoring & Checks:
· Perform daily system checks, including monitoring server health, disk usage, and performance metrics.
· Monitor and analyze logs and alerts for potential issues or anomalies in production systems.
· Respond to and resolve incidents quickly to minimize service disruption.
· Collaborate with cross-functional teams to troubleshoot and resolve issues related to server uptime, performance, or security.