Premier Employers are industry leaders that have forged exclusive partnerships with Meytier to forward our shared mission to offset bias in hiring, and are only visible to members of the Meytier community.
EXCLUSIVELY ON MEYTIER
You're in luck. This opportunity exclusively available through Meytier.
As the Manager of Site Reliability Engineering (SRE), you will play a critical role in ensuring the performance, reliability, and scalability of our systems. Leveraging the principles of Site Reliability Engineering pioneered by Google, you will lead a team of talented engineers in implementing best practices for application performance monitoring, toil reduction, and system stability. Your focus will extend to both complex cloud-based and on-premises applications, ensuring high system uptime and availability. Collaboration with other SRE teams, departments, and business units across the organization will be essential to achieving our goals.
Additionally, your role will involve deep-diving with technologists and discussing strategic, long-term goals to drive innovation and growth. Experience with AWS and Azure technologies, as well as proficiency in industry standard tools is crucial for success in this role.
Key Responsibilities:
Lead and mentor a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and excellence.
Develop and implement strategies for application performance monitoring proactively identify and resolve performance bottlenecks.
Drive initiatives to reduce toil and automate repetitive tasks, allowing the team to focus on high-impact projects that improve system reliability and scalability.
Collaborate closely with cross-functional teams including software engineering, infrastructure, and product management to design, deploy, and maintain highly available and resilient systems.
Establish and enforce best practices for incident management, post-mortem analysis, and continuous improvement, ensuring that lessons learned are applied to prevent future outages.
Implement robust monitoring and alerting systems using tools like Data Dog, ELK, and Open Telemetry to track system uptime and availability for complex cloud and on-premises applications, with a focus on meeting or exceeding defined service level objectives (SLOs) and service level agreements (SLAs).
Foster collaboration and knowledge sharing with other SRE teams and departments across the organization, leveraging their expertise and resources to drive improvements in system reliability and performance.
Engage in deep discussions with technologists to understand the intricacies of our systems and discuss strategic, long-term goals to drive innovation and growth.
Utilize expertise in AWS and Azure technologies to architect, deploy, and optimize cloud-based solutions, ensuring scalability, reliability, and cost-effectiveness.
Desired Profile:
Bachelor’s degree in Computer Science, Engineering, or related field
Proven experience leading a team of Site Reliability Engineers in a fast-paced and dynamic environment.
Deep understanding of application performance monitoring principles and tools, with hands-on experience in designing and implementing monitoring solutions.
Strong background in system architecture, infrastructure automation, and cloud technologies, with expertise in AWS and Azure.
Expertise in incident management, with the ability to effectively lead and coordinate response efforts during critical incidents.
Experience managing system uptime and availability for complex cloud-based and on-premises applications, with a track record of meeting or exceeding defined SLOs and SLAs.
Excellent communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams and influence decision-making at all levels of the organization.
Strong problem-solving skills and a passion for driving continuous improvement and innovation.
Compensation Range:190-200K base
Is this job not quite the right fit? No worries, Meytier has hundreds of active, open jobs. Browse more opportunities here. If you’d like to connect with a Meytier champion for help in your job search, create an account here.
{"group":"Organization","title":"Manager Software Engineering - Site Reliability Engineering (SRE)","skills":"<ul><li>Bachelor’s degree in Computer Science, Engineering, or related field</li><li>Proven experience leading a team of Site Reliability Engineers in a fast-paced and dynamic environment.</li><li>Deep understanding of application performance monitoring principles and tools, with hands-on experience in designing and implementing monitoring solutions.</li><li>Strong background in system architecture, infrastructure automation, and cloud technologies, with expertise in AWS and Azure.</li><li>Expertise in incident management, with the ability to effectively lead and coordinate response efforts during critical incidents.</li><li>Experience managing system uptime and availability for complex cloud-based and on-premises applications, with a track record of meeting or exceeding defined SLOs and SLAs.</li><li> Excellent communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams and influence decision-making at all levels of the organization.</li><li>Strong problem-solving skills and a passion for driving continuous improvement and innovation.</li></ul>","zohoId":"","endDate":"2024-07-11T18:30:00.000Z","isDraft":false,"jobType":"Full Time","job_url":"2881-citizens-manager-software-engineering-site-reliability-engineering-sre","agencyId":1,"benefits":"<p>We offer competitive pay, comprehensive medical, dental and vision coverage, retirement benefits, maternity/paternity leave, flexible work arrangements, education reimbursement, wellness programs and more. Note, Citizens’ paid time off policy exceeds the mandatory, paid sick or paid time-away policy of very local and state jurisdiction in the United States. </p>","betaMode":false,"clientId":"35","location":[{"lat":41.5800945,"lon":-71.4774291,"zip":"","city":"","text":"Rhode Island, USA","state":"Rhode Island","country":"United States","is_city":false,"is_state":true,"is_country":false,"state_code":"RI","countryCode":"US","isLocationSet":true,"isLocationResolved":true}],"eeocFound":true,"maxSalary":"","minSalary":"","questions":[],"startDate":"2024-06-26T18:30:00.000Z","hiringSPOC":"Palak Mundra","hiringTags":[],"onBehalfOf":"49","companyName":"Meytier","description":" ","isHybridJob":false,"isRemoteJob":false,"salaryRange":"<p><span style=\"color: rgb(34, 34, 34);\">190-200K base</span></p>","titleSkills":[{"keyword":"software engineering","node_id":"i10028","removed":false,"node_ptr":[["meytier_root","information technology","systems engineering","software engineering"]],"priority":-1,"alignedTF":true,"must_have":false,"node_name":"software engineering","extractedTF":true,"not_a_skill":false,"nice_to_have":false,"nodeAlignedWt":20,"is_industry_term":false,"gender_threshold_yn":"balanced","final_node_fft_weights":{"systems engineering":1},"final_node_skarea_basetype":""},{"keyword":"site reliability engineering","node_id":"i10642","removed":false,"node_ptr":[["meytier_root","information technology","it management","it infrastructure & networking","it infrastructure","site reliability"]],"priority":-1,"alignedTF":true,"must_have":false,"node_name":"site reliability","extractedTF":true,"not_a_skill":false,"nice_to_have":false,"nodeAlignedWt":26,"is_industry_term":false,"gender_threshold_yn":"unknown","final_node_fft_weights":{"it infrastructure":1},"final_node_skarea_basetype":""},{"keyword":"sre","node_id":"i10642","removed":false,"node_ptr":[["meytier_root","information technology","it management","it infrastructure & networking","it infrastructure","site reliability"]],"alignedTF":true,"must_have":false,"node_name":"site reliability","extractedTF":true,"not_a_skill":false,"nice_to_have":false,"nodeAlignedWt":26,"is_industry_term":false,"gender_threshold_yn":"unknown","final_node_fft_weights":{"it infrastructure":1},"final_node_skarea_basetype":""}],"otherCohorts":[],"benefitsFound":true,"hiringManager":"User / Info not available","maxExperience":25,"minExperience":15,"type_of_slate":"job","hiringFunction":["Technology & IT Delivery"],"isOnPremiseJob":true,"onBehalfOfName":"Citizens","otherlocations":[{"lat":32.7558935,"lon":-111.6709584,"zip":"85123","city":"Arizona City","text":"Arizona City, AZ, USA","state":"Arizona","country":"United States","is_city":true,"is_state":false,"is_country":false,"state_code":"AZ","countryCode":"US","isLocationSet":true,"nearByHexCodes":["8448e83ffffffff","8448e81ffffffff","8448e87ffffffff","8448eb9ffffffff","8448e95ffffffff","8448e9dffffffff","8448e8bffffffff"],"loc_h3_hex_res4":"8448e83ffffffff","isLocationResolved":true},{"lat":40.0583238,"lon":-74.4056612,"zip":"","city":"","text":"New Jersey, USA","state":"New Jersey","country":"United States","is_city":false,"is_state":true,"is_country":false,"state_code":"NJ","countryCode":"US","isLocationSet":true,"isLocationResolved":true},{"lat":35.2270869,"lon":-80.8431267,"zip":"","city":"Charlotte","text":"Charlotte, NC, USA","state":"North Carolina","country":"United States","is_city":true,"is_state":false,"is_country":false,"state_code":"NC","countryCode":"US","isLocationSet":true,"nearByHexCodes":["8444dabffffffff","8444da9ffffffff","8444da1ffffffff","8444da3ffffffff","8444dbdffffffff","8444d87ffffffff","8444d85ffffffff"],"loc_h3_hex_res4":"8444dabffffffff","isLocationResolved":true},{"lat":33.4483771,"lon":-112.0740373,"zip":"","city":"Phoenix","text":"Phoenix, AZ, USA","state":"Arizona","country":"United States","is_city":true,"is_state":false,"is_country":false,"state_code":"AZ","countryCode":"US","isLocationSet":true,"nearByHexCodes":["8429b6dffffffff","8448ebbffffffff","8448eb3ffffffff","8429b65ffffffff","8429b61ffffffff","8429b69ffffffff","8448e97ffffffff"],"loc_h3_hex_res4":"8429b6dffffffff","isLocationResolved":true},{"lat":31.9685988,"lon":-99.9018131,"zip":"","city":"","text":"Texas, USA","state":"Texas","country":"United States","is_city":false,"is_state":true,"is_country":false,"state_code":"TX","countryCode":"US","isLocationSet":true,"isLocationResolved":true}],"blindHiringMode":false,"experienceLevel":"Mid / Senior","numberOfOpenings":"1","otherCohortsName":"","responsibilities":"<ul><li>Lead and mentor a team of Site Reliability Engineers, fostering a culture of collaboration, innovation, and excellence.</li><li>Develop and implement strategies for application performance monitoring proactively identify and resolve performance bottlenecks.</li><li>Drive initiatives to reduce toil and automate repetitive tasks, allowing the team to focus on high-impact projects that improve system reliability and scalability.</li><li> Collaborate closely with cross-functional teams including software engineering, infrastructure, and product management to design, deploy, and maintain highly available and resilient systems.</li><li>Establish and enforce best practices for incident management, post-mortem analysis, and continuous improvement, ensuring that lessons learned are applied to prevent future outages.</li><li>Implement robust monitoring and alerting systems using tools like Data Dog, ELK, and Open Telemetry to track system uptime and availability for complex cloud and on-premises applications, with a focus on meeting or exceeding defined service level objectives (SLOs) and service level agreements (SLAs).</li><li>Foster collaboration and knowledge sharing with other SRE teams and departments across the organization, leveraging their expertise and resources to drive improvements in system reliability and performance.</li><li>Engage in deep discussions with technologists to understand the intricacies of our systems and discuss strategic, long-term goals to drive innovation and growth.</li><li>Utilize expertise in AWS and Azure technologies to architect, deploy, and optimize cloud-based solutions, ensuring scalability, reliability, and cost-effectiveness.</li></ul>","extractedSkillIds":["i10028","i10642"],"maxSeniorityLevel":6,"minSeniorityLevel":3,"otherJobReference":"","sharpenedJobTitle":"Manager Software Engineering - Site Reliability Engineering (SRE)","job_category_group":"2","growthOppurtunities":[],"educationQualification":"Baccalaureate Degree","skillSenNormalizedTitle":"","extractSkillsFromHereToo":true,"normalizedTitleSkillsObj":{},"companyTeamJobIntroduction":"<p><strong>About Role:</strong></p><p>As the Manager of Site Reliability Engineering (SRE), you will play a critical role in ensuring the performance, reliability, and scalability of our systems. Leveraging the principles of Site Reliability Engineering pioneered by Google, you will lead a team of talented engineers in implementing best practices for application performance monitoring, toil reduction, and system stability. Your focus will extend to both complex cloud-based and on-premises applications, ensuring high system uptime and availability. Collaboration with other SRE teams, departments, and business units across the organization will be essential to achieving our goals.</p><p><br></p><p>Additionally, your role will involve deep-diving with technologists and discussing strategic, long-term goals to drive innovation and growth. Experience with AWS and Azure technologies, as well as proficiency in industry standard tools is crucial for success in this role.</p>","dNIEEOCTextFocusOtherControl":"<p>At Citizens we value diversity, equity and inclusion, and treat everyone with respect and professionalism. Employment decisions are based solely on experience, performance, and ability. Citizens, its parent, subsidiaries, and related companies (Citizens) provide equal employment and advancement opportunities to all colleagues and applicants for employment without regard to age, ancestry, color, citizenship, physical or mental disability, perceived disability or history or record of a disability, ethnicity, gender, gender identity or expression (including transgender individuals who are transitioning, have transitioned, or are perceived to be transitioning to the gender with which they identify), genetic information, genetic characteristic, marital or domestic partner status, victim of domestic violence, family status/parenthood, medical condition, military or veteran status, national origin, pregnancy/childbirth/lactation, colleague’s or a dependent’s reproductive health decision making, race, religion, sex, sexual orientation, or any other category protected by federal, state and/or local laws</p>","expertise_coreskill_or_product":["software engineering"],"displayJobDescriptionSimpleForm":true,"expertise_coreskill_or_product_id":["i10028"],"job_id":"2881"}