4621 – Sr. Site Reliability Engineer I
August 8, 2024 12:42 amSummary:
As a Sr. Site Reliability Engineer, you are instrumental in helping make our client’s Kubernetes-centric ProArchive application resilient. This position will coordinate with multiple teams to develop a migration plan for various components and services as well as implement best practices for our client’s tech stack. A person in this position will have a passion for getting things done for various functions, including automation, CI/CD, infra components, middleware, etc. You’ll work closely with our client’s Dev Engineering, QA, and Platform Engineering groups to manage their current on-prem deployments and on-prem & cloud-native infrastructures.
How will you contribute?
– Help define technology choices, best practices and process for the team.
– Develop and maintain documentation standard for the team.
– Develop new tools and libraries for broader use by SaaS Operations and Engineering teams. Enable engineering teams to discover and understand problems quicker.
– Work with product architects and make suggestions for architectural changes and design platform component roadmaps.
– Act as a subject matter expert (SME) for components and functions desired. Develop the skill as required, to become SME for components in need.
– Assist engineering teams in deep troubleshooting and application code review to find opportunities to improve performance and scalability.
– Work closely with Engineering and peer SRE teams to design and use client’s coding standards and best practices.
– Respond to incidents coordinated by SRE and Incident Response teams. Act as a Incident Commander during incidents.
– Participate in escalation and off-hours on-call schedule.
– Adopt and embrace qualities of an SRE as defined in the team charter. Help set them for the rest of the team.
– Mentor and train junior members of the team. Design training curriculum for the team.
What will you bring?
– Minimum 7+ years industry experience
– BS in CS or equivalent combination of education and experience
– Strong experience operating Kubernetes in production environments – EKS Anywhere is preferred
– Experience with middleware systems (Kafka, AMQ, Redis, Memcache, etc.)
– Experience managing CI/CD systems (Flux, Concourse)
– Experience deploying and/or operating Observability stack (Splunk, Datadog, Grafana)
– Experience with large scale systems
– Familiarity with working with PostgreSQL and MongoDB
– Background working in a multi-platform environment (Linux, Windows)
– Familiarity of programming/scripting languages (i.e. Python, Bash, PowerShell, Go, etc.)
– Familiarity with Agile/Scrum/Kanban methodologies
Strong interpersonal skills with a can-do attitude and sense of urgency for a high growth/fast paced environment
-Curious mind, wanting to learn new technologies and share with others.
– The ability to think outside of the box to resolve issues and create solutions
About our client’s culture:
They look to hire lifelong learners with a passion for innovating with purpose, humility and humor. Collaboration is at the heart of everything they do. They work closely with the most popular communications platforms and the world’s leading cloud infrastructure platforms. They use the latest in AI/ML technology to help their customers break new ground at scale. They are a global organization that values diversity, and they believe that providing opportunities for everyone to be their authentic self is key to their success. Our client’s leadership, culture, and commitment to developing their people have all garnered Comparably.com Best Places to Work Awards.
You must sign in to apply for this position.
← Back to Job Listings
Lexicon Solutions is a full-service staffing company specializing in contract, contract-to-hire, direct placement, and payroll services. Located in the Portland metro area, we are at the heart of technology in the Pacific Northwest. Lexicon Solutions has been voted by the Portland Business Journal as one of Portland's Top Staffing Firms from 2009 - 2023, and as one of Oregon’s Most Admired Companies in 2022 and 2023.
Lexicon Solutions is proud to offer a comprehensive benefits package, including the following:
- Major PPO (Pre-tax) medical/dental cafeteria plan.
- AFLAC supplemental insurance.
- Complementary care.
- Individual supplemental term life policies.
- Paid holidays and PTO.
- Direct deposit payroll option.
To view other Lexicon Solutions job opportunities, please visit our website at: www.lexiconsolutions.com/jobs