Deloitte Site Reliability/Dev Ops Python Engineer in Seattle, Washington
Are you an experienced, passionate pioneer in technology? A system professional who wants to work in a collaborative environment. As an experienced Site Reliability/DevOps Engineer, you will have the ability to share new ideas and collaborate on projects as a consultant without the extensive demands of travel. If so, consider an opportunity with Deloitte under our Project Delivery Talent Model. Project Delivery Model (PDM) is a talent model that is tailored specifically for long-term, onsite client service delivery. PDM practitioners are local to client project locations, minimizing extensive travel, and providing you with a full career path within the firm. This position is based in New York, San Francisco, Los Angeles, or Seattle.
Work you will do
Able to troubleshoot complicated, cross-platform issues handling OS, Networking, Database in a cloud-based environment and handle live production incidents, debug/troubleshoot application, and infrastructure issues, follow, and implement SRE best practices.
Managing infrastructure services, responsible for including, but not limited to, deployment, operation, and troubleshooting.
Continue to automate, scale, and manage cloud infrastructure.
Work with team to establish service level objectives and monitor to ensure the objectives are met.
Execute automation for known cloud-operations tasks and create new automation for new situations or issues you encounter; automate everything.
Collaborate with a great team to maintain, monitor, and improve amazing cloud-based infrastructure that solves real-world problems for end-users.
Facilitate blame-free root cause analysis meetings in the event of a production-systems incident so that the team can learn from mistakes and improve our systems and run books.
Be vigilant about security and adhere to best practices to secure our cloud infrastructure and real-time platform.
Design, write and deliver software and automation to dramatically improve availability, scalability, latency, and efficiency of infrastructure.
Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually
Monitor application performance, take steps to improve overall application performance and stability.
The US Cloud Engineering Offering focuses on enabling our client's end-to-end journey from On-Premise to Cloud, with opportunities in the areas of Cloud Strategy and Op Model Transformation, Cloud Development & Integration, Cloud Migration, and Cloud Infrastructure & Managed Services. Cloud Engineering supports our clients as they improve agility, resilience and identify opportunities to reduce IT operations spend through automation by enabling Cloud. We accelerate our clients toward a technology-driven future, leveraging vendor solutions, Deloitte-developed
Bachelor's degree, preferably in Computer Science, Information Technology, Computer Engineering, or related IT discipline; or equivalent experience
4+ years of Dev Ops or Site Reliability Engineering experience at the enterprise level
4+ years of strong development/scripting skills in Python
3+ years of software engineering in large scale production environment
Experience with Infrastructure As Code (Terraform, Cloud Formation, Ansible)
Familiarity with Linux systems and command line system administration
Experience with cloud providers like AWS, GCP, Azure, etc., their systems, products, and APIs
Experience securing corporate networks, cloud networks, and VPNs
Strong interpersonal and communication skills
Experience supporting dedicated/on-prem environments, along with planning the migration of those environments to a cloud provider
Willing to take rotating on-call responsibility
Must live near or willing to relocate to New York, San Francisco, Los Angeles, or Seattle
Ability to travel 10%, on average, based on the work you do and the clients and industries/sectors you serve.
Limited sponsorship may be available
Bachelor's Degree Computer Science, Information Technology, Computer Engineering, or related IT discipline
Consulting background. Experience working at client sites.
Understand Linux groups and good experience with containerization (Docker), orchestration (Kubernetes), Mesos, Flink, Linux/Open-Source Environment
Software engineering experience in SQL, my SQL
Hands on experience in configuration management of server farms (using tools such as Puppet, Chef)
Data Streaming tools such as Spark and Kafka
Java skills such as Elastic Search, NoSQL, Mongo DB
Familiarity with Apache RocketMQ
Administrator experience working with batch processing and tools in the Hadoop technical stack (Yarn, Hive, HDFS)
Experience in supporting/managing systems at scale (10s thousands to 100s thousands of instances)
Experience in In-memory data structure store administration with Redis