Site Reliability Engineer Job at Ascendion, McLean, VA

  • Ascendion
  • McLean, VA

Job Description

About the Role:

  • We are seeking a highly motivated and experienced Site Reliability Engineering (SRE) Consultant to join our growing team. You will play a critical role in ensuring the reliability, performance, and scalability of our cloud-based applications and infrastructure. You will be a key contributor in implementing SRE best practices, driving automation, and fostering a culture of reliability within the engineering organization. This role requires a strong understanding of SRE principles, hands-on AWS experience, and a passion for building and maintaining highly available systems.

Responsibilities:

  • Design, implement, and maintain SRE practices and frameworks, including defining and tracking SLIs, SLOs, and SLAs.
  • Champion the adoption of observability best practices, implementing robust logging, tracing, and monitoring solutions.
  • Develop and maintain dashboards and alerts using tools like New Relic and Splunk to proactively identify and address potential issues.
  • Contribute to the development and execution of chaos engineering experiments to improve system resilience.
  • Automate operational tasks and processes using scripting languages like Python, particularly in the context of microservices architectures.
  • Collaborate with development teams to improve the reliability and performance of applications throughout the software development lifecycle.
  • Lead incident response efforts, performing root cause analysis and implementing preventative measures.
  • Mentor and guide other engineers on SRE principles and best practices.
  • Drive the improvement of our CI/CD pipelines using tools like Jenkins to enhance deployment frequency and reliability.
  • Proactively identify and address performance bottlenecks and scalability challenges.
  • Stay up to date with the latest SRE trends and technologies.

Qualifications:

  • Extensive hands-on experience with SRE concepts and practices, including SLI/SLO/SLA management, logging/tracing, observability, golden signals, chaos engineering, and alerting/monitoring.
  • Deep understanding of AWS services and infrastructure.
  • Proven leadership experience in a technical environment.
  • Strong programming skills, preferably in Python, with experience developing and deploying microservices.
  • Proficiency with DevOps practices and tools, particularly CI/CD and Jenkins.
  • Experience creating and maintaining dashboards using New Relic and Splunk.
  • Excellent communication and collaboration skills.
  • Strong problem-solving and analytical abilities.
  • Passion for building and maintaining highly reliable systems.
