Senior Infrastructure Reliability Lead – Hybrid/On-Prem & Cloud |SRE|


Are you an experienced Platform Engineerin Lead within SRE? This role focuses on maintaining and optimising the organisation’s most critical technology infrastructure, ensuring operational stability while swiftly resolving complex technical issues. The position blends advanced technical expertise with leadership, strategic planning, and stakeholder engagement to deliver continuous improvement and resilience. ️ Core Responsibilities Advanced Technical Support – Provide expert-level assistance to resolve complex incidents for assigned clients or client groups. Shape and enhance support models to boost service quality for customers and stakeholders Proactive Maintenance & Monitoring – Carry out preventative hardware/software maintenance and use monitoring metrics to identify, prevent, and resolve potential issues before they impact operations Knowledge Management – Maintain and expand a knowledge base with detailed case documentation, enabling self-service, faster resolutions, and effective information sharing across teams ️ Root Cause Analysis – Investigate system logs, error messages, and user reports to identify underlying issues in hardware, software, and networks; apply targeted fixes, replacements, reconfigurations, or software reinstalls ️ Automation & Efficiency – Implement process automations and fine-tune monitoring tools, thresholds, and alerts to increase stability, improve responsiveness, and reduce manual workload ️ Risk Management – Identify, mitigate, and escalate potential service-impacting risks through formal processes, supporting business continuity and governance requirements Capacity & Resiliency Planning – Oversee capacity management and implement resiliency strategies aligned with operational and front-office needs Strategic & Leadership Dimensions Strategic Planning & Policy Management – Contribute to defining strategy, shaping requirements, and recommending change initiatives. Oversee resources, budgets, and processes to align with business goals Team & Talent Development – Define roles, plan for future organisational needs, coach and mentor team members, and lead specialists to balance short- and long-term priorities Subject Matter Expertise – Provide technical direction, lead multi-year projects, guide structured work assignments, and ensure collaboration with other specialist areas when needed. Stakeholder Engagement – Build strong relationships with internal and external partners, influencing decisions through data-driven insights and negotiation skills Analytical & Problem-Solving Excellence – Apply deep analytical thinking, research, and complex option comparisons to define challenges and craft innovative solutions Governance & Control – Strengthen operational controls, assess risks, and ensure alignment with the organisation’s control and governance agenda Cross-Functional Collaboration – Stay aligned with evolving business strategies by actively engaging with other functional areas to provide tailored, business-aligned support

Požadujeme:

What you should already have: According to the Czech labour law, you need to hold a valid work permit. Proven experience on a SRE (Site Reliability Engineer) lead role ️ Strong technical experience in cloud/hybrid environments Key Technologies & Skills Strong Linux/Unix on-premise expertise, with advanced scripting skills in Python and Bash/Shell ️ Proficiency in database technologies including PostgreSQL, MS SQL, and Oracle Experience implementing observability & monitoring solutions (Elastic, Geneos ITRS, Grafana) Skilled in containerisation and virtualisation technologies Strong grasp of ITIL principles with a proactive, improvement-driven mindset Previous leadership or high-level technical experience in a similar environment
  • počet míst - 1

Nabízíme:

  • Annual bonus
  • 5 weeks of holiday/year
  • 60 sick days per year
  • pension contribution
  • courses
  • home office
  • modern offices
  • work-life balance
  • cafeteria points
  • meal vouchers
Odpovědět

 

informace

Zadavatel:

Personální agentura

Pracoviště:

Praha

Typ smluvního vztahu:

Práce na plný úvazek

Zařazeno v oborech:

IT / Vývoj softwaru
IT konzultant

Požadované vzdělání:

Vysokoškolské

Plat:

Dle domluvy

Datum zadání:

18.8.2025

Podobné nabídky

Hledaní práce

Site Reliability Engineer |SRE| Grafton Recruitment s.r.o. | Praha


/* Not affection functionality */