United to grow leaders
in a digital world.

Senior Site Reliability Engineer - temporary employment (all genders)

  • Full Time
  • Barcelona, Province of Barcelona, Spain
  • MacGyver
  • Eagle eye

We are seeking an experienced Site Reliability Engineer to play a pivotal role in bridging the gap between software engineering and operations. This role emphasizes designing robust solutions, mentoring teams, and driving performance improvements for both internal and client systems through expertise in automation, scalability, and system reliability.

As a Site Reliability Engineer, you will be responsible for owning the uptime and performance of critical infrastructure and applications while working closely with clients to align reliability goals with their business objectives.

Curious about the connection between UDG and MMT? Both companies are part of the global agency group MSQ and kick off great projects together.

What Motivates Us

At MMT we care not only about what work is done but also about how we get it done. The MMT Behaviours are part of our DNA; they help us stand out from the crowd, build trusted solutions for our clients and build a better future.

Our team can use the behaviours below to check in on how they are performing.

  • Build trust. Don’t let it rust
    We build high levels of trust with our clients and our colleagues, and we work to maintain that trust over time.
  • Adopt a growth mindset
    We are energised by change; we continually explore new approaches to achieve great results. We push our own boundaries to grow our skills and capabilities.
  • Go and see for yourself 
    We walk in the shoes of others (our clients, their customers and our fellow MMTers) so we understand their challenges.
  • Bring challenge and solution in equal measure 
    We challenge the status quo and develop practical solutions to build a better future for all.
  • Build fast, Measure early, Learn often
    We think lean, deliver value fast and continuously improve.
  • Run towards the fire
    We roll up our sleeves and tackle challenges head-on, supporting our clients and colleagues even when it’s not our direct responsibility.
  • Nurture our community 
    We take steps to positively impact our colleagues, clients, community and environment.

What You Do

System Reliability & Performance

  • Own uptime and performance of critical infrastructure and apps
  • Design scalable, fault-tolerant architectures; optimize efficiency
  • Define/govern NFRs (availability, performance, maintainability) for systems
  • Identify opportunities for optimization, scalability, and cost reduction

Automation & Infrastructure

  • Design/implement automation for monitoring, incident response, and repetitive tasks
  • Use IaC/CaC (Terraform, ARM, CloudFormation) for provisioning and data pipelines
  • Design/deploy Docker/Kubernetes solutions on cloud platforms
  • Manage CI/CD pipelines for cloud-native apps (GitHub Actions, Azure DevOps)

Cloud Infrastructure & Operations

  • Contribute to architecture; implement Azure/AWS with high availability
  • Optimize resources for performance, cost, security
  • Apply cloud-native best practices; ensure compliance
  • Integrate monitoring/alerting (Datadog, CloudWatch, App Insights) for multi-cloud observability

Incident Management & Analysis

  • Lead incident response; conduct root cause analyses and blameless postmortems
  • Collaborate to embed observability in the development lifecycle
  • Build executive/developer dashboards for key metrics

What You Bring Along

Technical Expertise

  • Proven experience in running and maintaining production systems with expertise in triaging and solving incidents
  • Proficiency in automation and configuration management tools (e.g., Terraform, Ansible)
  • Expertise in cloud platforms, particularly Azure and AWS, and their associated tools
  • Strong programming skills, with a primary focus on Python, for developing automation scripts, creating custom tooling, and optimizing operational workflows
  • Experience with modern observability platforms such as Datadog
  • Strong network fundamentals with hands-on experience in Palo Alto next-generation firewalls, including configuration, monitoring, and troubleshooting in enterprise environments
  • Experience with Microsoft Identity solutions including Azure Active Directory (Entra ID), identity governance, and integration with enterprise authentication systems 

Skills & Experience

  • A solid foundation in system architecture, with a focus on scalability and reliability
  • Exceptional problem-solving skills and a data-driven mindset
  • Familiarity with CI/CD pipelines and tools like GitHub Actions and Azure DevOps 

Desirable Requirements

  • Experience with container orchestration tools such as Kubernetes
  • Knowledge of security best practices in cloud and hybrid environments
  • Experience with identity and access management solutions (e.g., Okta, Active Directory, CyberArk), including role-based access control and authentication protocols
  • Experience with network monitoring tools and infrastructure-as-code approaches to network configuration (e.g., Terraform for cloud networking, Ansible for network devices)