United to grow leaders
in a digital world.

Senior Site Reliability Engineer - temporary employment (all genders)

Full Time
Barcelona, Provinz Barcelona, Spanien
MacGyver
Eagle eye

We are seeking an experienced Site Reliability Engineer to play a pivotal role in bridging the gap between software engineering and operations. This role emphasizes designing robust solutions, mentoring teams, and driving performance improvements for both internal and client systems through expertise in automation, scalability, and system reliability.

As a Site Reliability Engineer, you will be responsible for owning the uptime and performance of critical infrastructure and applications while working closely with clients to align reliability goals with their business objectives.

You are curious about the connection between UDG and MMT? Both companies are part of global agency group MSQ and kick off great projects together.

What Motivates Us

At MMT we don’t only care about what work is done, we also care about how we get things done. The MMT Behaviours are part of our DNA and what makes us stand out from the crowd, build trusted solutions for our clients and build a better future.

Our team can use the below behaviours to check in on how they are performing.

Build trust. Don’t let it rust
We build high levels of trust with our clients and our colleagues, and we work to maintain that trust over time.
Adopt a growth mindset
We are energised by change; we continually explore new approaches to achieve great results. We push our own boundaries to grow our skills and capabilities.
Go and see for yourself
We walk in the shoes of others; our clients, their customers and our fellow MMTers so we understand their challenges.
Bring challenge and solution in equal measure
We challenge the status quo and develop practical solutions to build a better future for all.
Build fast, Measure early, Learn often
We think lean, deliver value fast & continuously improve.
Run towards the fire
We roll up our sleeves and tackle challenges head on, supporting our clients and colleagues even when it’s not our direct responsibility.
Nurture our community
We take steps to positively impact our colleagues, clients, community and environment.

What You Do

System Reliability & Performance

Own uptime and performance of critical infrastructure and apps
Design scalable, fault-tolerant architectures; optimize efficiency
Define/govern NFRs (availability, performance, maintainability) for systems
Identify opportunities for optimization, scalability, and cost reduction

Automation & Infrastructure

Design/implement automation for monitoring, incident response, and repetitive tasks
Use IaC/CaC (Terraform, ARM, CloudFormation) for provisioning and data pipelines
Design/deploy Docker/Kubernetes solutions on cloud platforms
Manage CI/CD pipelines for cloud-native apps (GitHub Actions, Azure DevOps)

Cloud Infrastructure & Operations

Contribute to architecture; implement Azure/AWS with high availability
Optimize resources for performance, cost, security
Apply cloud-native best practices; ensure compliance
Integrate monitoring/alerting (Datadog, CloudWatch, App Insights) for multi-cloud observability

Incident Management & Analysis

Lead incident response; root cause analyses; blameless postmortems
Collaborate to embed observability in the development lifecycle
Build executive/developer dashboards for key metrics

What You Bring Along

Technical Expertise

Proven experience in running and maintaining production systems with expertise in triaging and solving incidents
Proficiency in automation and configuration management tools (e.g., Terraform, Ansible)
Expertise in cloud platforms, particularly Azure and AWS, and their associated tools
Strong programming skills, with a primary focus on Python, for developing automation scripts, creating custom tooling, and optimizing operational workflows
Experience with modern observability platforms such as Datadog
Strong network fundamentals with hands-on experience in Palo Alto next-generation firewalls, including configuration, monitoring, and troubleshooting in enterprise environments
Experience with Microsoft Identity solutions including Azure Active Directory (Entra ID), identity governance, and integration with enterprise authentication systems

Skills & Experience

A solid foundation in system architecture, with a focus on scalability and reliability
Exceptional problem-solving skills and a data-driven mindset
Familiarity with CI/CD pipelines and tools like GitHub Actions and Azure DevOps

Desirable Requirements

Experience with container orchestration tools such as Kubernetes
Knowledge of security best practices in cloud and hybrid environments
Experience with identity and access management solutions (e.g. Okta, Active Directory, Cyber Ark) including role-based access control and authentication protocols
Experience with network monitoring tools and infrastructure-as-code approaches to network configuration (e.g., Terraform for cloud networking, Ansible for network devices)

Our DNA is digital. And yours?

We are the leading digital full-service agency in Germany.
We love innovative technology, creativity and our 400+ employees for what they achieve each day.
Everything we do is human-centered. That is how we achieve excellent results for users, our clients and ourselves.

Sounds good to you? Then go ahead and send us your online application or contact Marc from our recruiting team*

APPLY NOW

T +49 6131 57609 - 6600

* We promise we won’t ask you any strange questions.