Description:
Seeking two Cloud Engineers to join the team on a project-based engagement. These two roles will be instrumental in writing Terraform code as we architect disaster recovery (DR) environments and tests. It is imperative that candidates have a strong background in DR in addition to Azure security and compliance. The Leap (Audit) Platform is deployed across 9 Azure regions, including China, and each day 30,000+ users rely on the platform across their network of 150+ member firms. The platform was built for the cloud: it is fully automated with Terraform and uses MongoDB on the back end. This is an excellent opportunity to join a great organization with strong leadership.
Azure Cloud Engineer
The primary focus of this role is to function as an Azure cloud SME (Subject Matter Expert) and provide cloud operational support across all aspects of our application environments, including but not limited to compute health and monitoring, automated build and deploy, and data backup and retention. Much of the work is geared toward implementation for Azure enablement. Other team members handle high-level designs; this role focuses on the implementation side.
The right candidate will possess:
- Over 5 years of Azure and general cloud experience, with specific expertise in Azure Virtual Machines, Azure Kubernetes Service (AKS), Azure Key Vault, Microsoft Entra ID (formerly Azure Active Directory), Azure Monitor, Azure Storage, Azure Backup, Azure Logic Apps, Azure SQL, and Azure Recovery Services vaults
Your Day-to-Day Essential Duties:
Knowledge and experience working with Blob Storage, MongoDB, and MS SQL database architectures
Experience with Cloud automation, Terraform and other tools to implement, maintain and improve CI/CD processes and tools.
Experience with API-based applications and containerization, including Kubernetes
Ability to implement, support and execute within disaster recovery scenarios involving our cloud hosted applications
Ensure all work, including measures to build, enhance, stabilize, automate, and harden, falls within current security and compliance standards
Understanding of appropriate role-based hierarchy and access management guidelines across the Azure solution and environments
Part of a team that troubleshoots applications, middleware, infrastructure, networks, tools, patching
Maintains and provides updates on ongoing operations and project tasks.
Build enhancements within an existing software architecture and suggest improvements to the architecture.
Assists in defining the appropriate operational planning.
Strong communication skills, analytical skills, thorough understanding of product development.
Collaborates on architectural design reviews and changes.
Own, define and improve metrics, KPIs, SLOs and visualizations for systems.
Act as an escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
Drive quality accountability within the organization with well-defined processes, metrics, and goals for process quality. This includes leading effective postmortems and ensuring actions are followed-up.
Build and maintain robust, actionable alerting and monitoring systems and workflows. Influence across boundaries and at all levels of the organization.
Work closely with development teams to improve services, deployments and releases.
Troubleshoot production issues and continued documentation of runbooks.
What You Bring to the Team:
Possess a very high attention to detail and organization with the passion and ability to create order out of disorder with excellence and efficiency.
A desire to automate everything. Whether that be infrastructure as code or tooling to eliminate toil, automation should be a core focus of your mindset and the elimination of repetitive tasks should be a constant desire in the role.
Natural curiosity. You aren’t simply satisfied with something working; you want to know why it works and how it works.
A mindset of total ownership: you aren’t afraid to dig into things you’ve never worked on before, from the browser all the way to the persistence layer. You’ve got a solid foundation in debugging and can jump into any problem you’re asked to help with.
An architectural mind. You understand the fundamentals of distributed computing and look for ways to make systems more resilient, self-healing, and eliminate the need for human intervention as much as possible.
Very strong communication and interpersonal skills allowing the candidate to work well in a team environment and deliver excellent customer service.
The ability to convey the importance of site reliability in both business and technical terms to a wide variety of audiences that range from non-technical to the most technical of engineers. Drive stakeholder buy-in of key metrics such as SLAs/SLOs for all supported systems.
Ability to maintain SLAs through the implementation of proactive issue detection and reporting
Experience developing scripts or tools for automating administrative tasks.
Additional items to screen:
For DR, we are not looking at any specific tooling. More of what we are looking for is:
How have candidates dealt with DR for custom software applications in the past?
Have they utilized backups, Active-Active, Active-Passive?
Are they building multi-region applications or just keeping everything in one region?
Are they utilizing geo-redundancy for resources where possible?
Are you looking at someone coming more from a DevSecOps background or traditional Cloud/DevOps?
We are seeking more modern cloud/DevOps. DevSecOps is good as we do have to deal with quite a bit around security, but it is more important they have foundational skills in the realm of:
Infrastructure as Code
Networking
Identity and Access Management (Azure AD/Entra)
CI/CD pipelines that deploy custom application code (software development).
Though I think the more common terms around this would be Platform Engineering and Release Engineering these days. No need to get hung up on titles considering so many companies use so many different titles to mean the same thing.
In terms of Azure Security / Compliance – what specific tooling experience are you seeking? You mentioned securing keys, passwords, certificates, etc.
Azure Defender is nice to have but can be taught if necessary. Candidates should know how to work with key vaults, as Key Vault is a core service in the Azure cloud offering. We also use applications like Wiz (infrastructure vulnerability management) and Veracode (static code analysis). Both are nice to have, but we would prefer that candidates have a proper understanding of things like:
Are you looking for candidates with specific experience with SIEM tools? If so, is there a preference from a tooling standpoint?
No SIEM tooling required at this time.
Additional items to note:
- Strong grasp of advanced Terraform concepts, private networking, and CI/CD pipeline builds, plus a willingness to keep learning and to leave environments better than they were found.
Top Skills Details:
Azure, Cloud, Kubernetes, Terraform, Automation, DevOps, Windows, AKS, Azure DevOps, YAML
Additional Skills & Qualifications:
Soft Skills:
The most important thing above most technical skills is that the candidates have the right attitude or outlook on the work. What is meant by that is outlined below:
They take ownership and pride in the work they do. When something is not correct or if there is an issue, they are not afraid to pick that item up and make it right.
A hunger for knowledge. The work requires constant education and if you are not continuing to learn then you are falling behind.
They do not need to be told every detail of what needs to be done. There is a lot of work to get done, and if these engineers need someone watching over their shoulder regularly then it just adds more work rather than less.
Experience Level:
Intermediate Level
Benefits:
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:
Medical, dental & vision
Critical Illness, Accident, and Hospital
401(k) Retirement Plan – Pre-tax and Roth post-tax contributions available
Life Insurance (Voluntary Life & AD&D for the employee and dependents)
Short and long-term disability
Health Spending Account (HSA)
Transportation benefits
Employee Assistance Program
Time Off/Leave (PTO, Vacation or Sick Leave)
Posting close date September 30, 2024.
About TEKsystems:
We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.
The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.