Cloud Engineer
Cloud Engineer
Clearance Required: Top Secret
Location: On-Site, Pentagon, Washington, D.C.
Position Type: Full-Time
Company: VivSoft Technologies
About the company:
VivSoft is an emerging technology company that specializes in using modern technologies to solve our clients' toughest mission challenges. We are focused on Cloud, Enterprise DevSecOps, Artificial Intelligence, and Digital Customer Experience to drive mission-enabling digital transformation. Our passion is building mission-focused, open, scalable solutions. We are a diverse team of strategists, engineers, designers, and creators experienced in building high-performance software and AI factory accelerators by embracing automation.
Job Summary:
The Cloud Engineer is responsible for operating, maintaining, and optimizing Kubernetes-based hosting platforms across multi-cloud environments. This role emphasizes automation, reliability engineering, and system resilience to ensure the availability and scalability of critical infrastructure. The engineer will manage infrastructure as code (IaC), monitor performance, and implement backup and disaster recovery strategies to meet a 99.7% uptime objective.
Key Responsibilities:
- Operate and maintain Kubernetes clusters across AWS, Azure, and/or GCP.
- Develop and manage multi-cloud infrastructure using IaC tools such as Terraform, CloudFormation, or Pulumi.
- Automate system monitoring, alerting, and incident response to ensure reliability and uptime.
- Collaborate with DevOps and development teams to stage and maintain data pipelines and analytics infrastructure.
- Implement and routinely test backup, disaster recovery, and failover procedures.
- Monitor system performance, conduct root cause analysis, and optimize for efficiency and resilience.
- Enforce security and compliance standards across infrastructure components.
- Create and maintain operational documentation and runbooks.
Required Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent work experience.
- 5+ years of experience in cloud operations, DevOps, or site reliability engineering.
- Hands-on experience with Kubernetes administration and Helm chart deployments.
- Strong proficiency with IaC tools (Terraform, CloudFormation, or similar).
- Experience managing cloud infrastructure in AWS, Azure, or GCP (multi-cloud experience preferred).
- Proficiency with scripting languages (e.g., Python, Bash, or Go).
- Familiarity with observability tools (e.g., Prometheus, Grafana, ELK, Datadog).
- Experience implementing high availability, failover, and DR strategies.
- Knowledge of CI/CD pipelines and GitOps practices.
- Strong troubleshooting and performance optimization skills.
- Comprehensive Medical, Dental, and Visions Plans (Healthcare benefits are 100% employer-paid for employees only)
- Life Insurance
- Paid Time Off (Flexible/Combined PTO, Bereavement Leave, 11 Company Paid Holidays)
- 401K Retirement Plan with employer match
- Professional Development Training Reimbursement
- Flexible/remote work schedules
Salary Range: $125,000-145,000