Senior Site Reliability Engineer, Cloud & DevOps, Infrastructure Automation Expert
Aktualisiert am 21.03.2025
Profil
Mitarbeiter eines Dienstleisters
Verfügbar ab: 12.03.2025
Verfügbar zu: 100%
davon vor Ort: 100%
Skill-Profil eines fest angestellten Mitarbeiters des Dienstleisters
Portuguese
English
German

Einsatzorte

Einsatzorte

Deutschland
nicht möglich

Projekte

Projekte

1 year 1 month
2024-03 - now

Development of self-service ML platforms

Senior Platform Engineer (Freelance) Kubernetes KEDA ArgoCD ...
Senior Platform Engineer (Freelance)
  • Built self-service ML platforms and infrastructure automation solutions.
  • Designed an internal ML platform with EKS and KEDA, reducing GPU costs by $800K through dynamic scaling
  • Developed Go and Python-based platform APIs and custom operators for ML workflow automation
  • Created self-service infrastructure, allowing ML teams to deploy models with zero DevOps overhead
  • Responsibilities
    • ?ML platform development, infrastructure automation, cost optimization
Kubernetes KEDA ArgoCD AWS (EKS/ Bedrock/ SageMaker) Porter.run Hugging Face Terraform OpenTofu Prometheus Go Python
5 months
2023-10 - 2024-02

Infrastructure redesign and compliance with safety regulations

Senior Site Reliability Engineer AWS EKS Control Tower ...
Senior Site Reliability Engineer
  • Led infrastructure transformation and security compliance initiatives for a global fintech platform managing multi-billion-dollar assets
  • Architected multi-region EKS clusters with Istio service mesh for financial services applications
  • Built an observability stack using Prometheus and Grafana, including custom Go exporters
  • Implemented a Zero Trust architecture and GitOps workflow, achieving ISO 27001 compliance
  • Responsibilities
    • ?Infrastructure transformation, security compliance, observability
AWS EKS Control Tower Transit Gateway Aurora Kubernetes Istio Terraform OpenTofu Wiz (Security) GitOps
Moonfare GmbH
1 year 4 months
2022-06 - 2023-09

Scaling and operation of a critical search infrastructure

Senior Site Reliability Engineer AWS (EKS) GCP (GKE) Kubernetes ...
Senior Site Reliability Engineer
  • Scaled and operated critical search infrastructure for a platform processing millions of daily hotel searches
  • Migrated search infrastructure from AWS to GKE, reducing infrastructure costs by 30%
  • Optimized Auction Search reliability with a custom SLO/SLI framework during a 3x traffic growth period
  • Implemented a global disaster recovery strategy using Cloudflare edge caching for LATAM and APAC regions
  • Responsibilities
    • ?Cloud migration, reliability engineering, cost reduction
AWS (EKS) GCP (GKE) Kubernetes Istio Terraform OpenTofu Kafka Zero Trust Security Prometheus Go
Trivago N.V.
5 months
2022-02 - 2022-06

Design and implementation of Kubernetes operators

Senior DevOps Engineer AWS Kubernetes Helm ...
Senior DevOps Engineer
  • Designed and implemented custom Kubernetes operators for automated infrastructure management
  • Led infrastructure optimization initiatives and cloud cost reduction efforts
  • Responsibilities
    • ?Infrastructure automation, cost optimization
AWS Kubernetes Helm Terraform OpenTofu Pulumi Ansible Go Python
Collaboration Factory AG
5 years 1 month
2016-12 - 2021-12

AI platform for process automation

Senior DevOps Engineer AWS (EKS) Azure (Container Apps/ OCR) Kubernetes ...
Senior DevOps Engineer
  • Led a team of 5 engineers managing an AI process automation platform handling 30K+ daily transactions
  • Migrated workloads to AWS EKS using Flux/Helm and modernized Windows apps to Azure cloud-native services
  • Developed Go-based CLI tools and Kubernetes operators for internal system management
  • Implemented SAML SSO and improved security across all services
  • Responsibilities
    • ?DevOps team leadership, cloud migration, automation
AWS (EKS) Azure (Container Apps/ OCR) Kubernetes Helm Terraform OpenTofu GitLab CI Go
Cognotekt GmbH
3 years 10 months
2013-02 - 2016-11

Cloud migrations and automation projects

Senior Cloud Consultant AWS EC2 VPC ...
Senior Cloud Consultant
  • Led large-scale cloud migrations and infrastructure automation projects for Fortune 500 clients
  • Managed AWS/Azure migrations for 20+ companies using Docker and the 12-factor methodology
  • Developed Python automation tools and Infrastructure as Code (IaC) solutions for DevOps workflows
  • Responsibilities
    • ?Cloud consulting, automation, enterprise cloud migrations
AWS EC2 VPC S3 RDS Lambda ECS Elastic Beanstalk CloudFormation Azure Docker Terraform Puppet Chef Ansible Jenkins Python
Gennovacap
4 years 1 month
2009-01 - 2013-01

Infrastructure and system integration initiatives

Systems Administrator FreeBSD Linux VMware ...
Systems Administrator
  • Led infrastructure and systems integration initiatives supporting educational institutions across Brazil
  • Managed VMware/Xen virtualization and implemented SSO across multiple university networks
  • Integrated Moodle and Sakai LMS with university ERPs to optimize data flow
  • Responsibilities
    • ?Infrastructure management, system integration, university network administration
FreeBSD Linux VMware Xen JBoss Moodle Sakai LMS MySQL MS SQL Server LDAP SSO Git SVN Nagios Zabbix
MEC - Ministério da Educação, Brazil

Aus- und Weiterbildung

Aus- und Weiterbildung

2004 ? 2008
Study - Computer Science
Federal University of Alagoas, Brazil
Degree: Bachelor

2009 ? 2010
MBA in Project Management
CESMAC, Brazil

2002 ? 2003
Information Technology Technician
Federal Institute of Alagoas, Brazil

CERTIFICATIONS
  • Cloud Platform Engineering
    • CKA - Certified Kubernetes Administrator (CNCF)
    • Professional Cloud Architect (GCP)
    • DevOps Engineer Professional (AWS)
  • Cloud Development
    • AWS Developer Associate
    • AWS SysOps Administrator Associate
    • ?Terraform Associate 002 (HashiCorp)

Position

Position

  • Senior Site Reliability Engineer
  • Cloud & DevOps
  • Infrastructure Automation Expert

Kompetenzen

Kompetenzen

Produkte / Standards / Erfahrungen / Methoden

  • Cloud & Infrastructure 
    • AWS (Control Tower, EKS, Lambda, ECS, S3, CloudWatch, IAM, Route53)
    • GCP
    • Kubernetes (EKS, GKE)
    • Istio
    • Docker
    • Helm
    • Terraform
    • OpenTofu
    • Pulumi
    • CloudFormation
    • CDK
    • Ansible
  • DevOps & CI/CD 
    • GitHub Actions
    • GitLab CI
    • Jenkins
    • CircleCI
    • ArgoCD
  • Security & Observability 
    • Wiz
    • Zero Trust
    • ISO 27001
    • Prometheus
    • Grafana
    • OpenTelemetry
    • Thanos
    • Loki
    • AlertManager
    • PagerDuty
  • Programming & Automation
    • Go
    • Python
    • Shell
    • SQL
    • gRPC
    • Protocol Buffers
    • REST APIs
  • Reliability & Performance
    • Chaos Engineering (Chaos Monkey)
    • k6
    • JMeter
    • OpsGenie
    • ServiceNow
  • Data & Systems 
    • Linux (RHEL, Ubuntu, Debian)
    • Puppet
    • Chef
    • MySQL
    • PostgreSQL
    • Aurora
    • DynamoDB
    • Redis
    • Elasticsearch
    • Kafka
    • Kinesis
  • ML & Development 
    • AWS SageMaker
    • Bedrock
    • Kubeflow
    • MLflow
    • Ray
    • Seldon Core
    • KServe
    • Porter.run
  • Stakeholder Management 
    • Strategic Planning
    • Budget Optimization
    • Developer Enablement
    • Mentorship Programs
  • Compliance & Security
    • Disaster Recovery
    • High-Availability Systems
    • SLO/SLI Definition

Einsatzorte

Einsatzorte

Deutschland
nicht möglich

Projekte

Projekte

1 year 1 month
2024-03 - now

Development of self-service ML platforms

Senior Platform Engineer (Freelance) Kubernetes KEDA ArgoCD ...
Senior Platform Engineer (Freelance)
  • Built self-service ML platforms and infrastructure automation solutions.
  • Designed an internal ML platform with EKS and KEDA, reducing GPU costs by $800K through dynamic scaling
  • Developed Go and Python-based platform APIs and custom operators for ML workflow automation
  • Created self-service infrastructure, allowing ML teams to deploy models with zero DevOps overhead
  • Responsibilities
    • ?ML platform development, infrastructure automation, cost optimization
Kubernetes KEDA ArgoCD AWS (EKS/ Bedrock/ SageMaker) Porter.run Hugging Face Terraform OpenTofu Prometheus Go Python
5 months
2023-10 - 2024-02

Infrastructure redesign and compliance with safety regulations

Senior Site Reliability Engineer AWS EKS Control Tower ...
Senior Site Reliability Engineer
  • Led infrastructure transformation and security compliance initiatives for a global fintech platform managing multi-billion-dollar assets
  • Architected multi-region EKS clusters with Istio service mesh for financial services applications
  • Built an observability stack using Prometheus and Grafana, including custom Go exporters
  • Implemented a Zero Trust architecture and GitOps workflow, achieving ISO 27001 compliance
  • Responsibilities
    • ?Infrastructure transformation, security compliance, observability
AWS EKS Control Tower Transit Gateway Aurora Kubernetes Istio Terraform OpenTofu Wiz (Security) GitOps
Moonfare GmbH
1 year 4 months
2022-06 - 2023-09

Scaling and operation of a critical search infrastructure

Senior Site Reliability Engineer AWS (EKS) GCP (GKE) Kubernetes ...
Senior Site Reliability Engineer
  • Scaled and operated critical search infrastructure for a platform processing millions of daily hotel searches
  • Migrated search infrastructure from AWS to GKE, reducing infrastructure costs by 30%
  • Optimized Auction Search reliability with a custom SLO/SLI framework during a 3x traffic growth period
  • Implemented a global disaster recovery strategy using Cloudflare edge caching for LATAM and APAC regions
  • Responsibilities
    • ?Cloud migration, reliability engineering, cost reduction
AWS (EKS) GCP (GKE) Kubernetes Istio Terraform OpenTofu Kafka Zero Trust Security Prometheus Go
Trivago N.V.
5 months
2022-02 - 2022-06

Design and implementation of Kubernetes operators

Senior DevOps Engineer AWS Kubernetes Helm ...
Senior DevOps Engineer
  • Designed and implemented custom Kubernetes operators for automated infrastructure management
  • Led infrastructure optimization initiatives and cloud cost reduction efforts
  • Responsibilities
    • ?Infrastructure automation, cost optimization
AWS Kubernetes Helm Terraform OpenTofu Pulumi Ansible Go Python
Collaboration Factory AG
5 years 1 month
2016-12 - 2021-12

AI platform for process automation

Senior DevOps Engineer AWS (EKS) Azure (Container Apps/ OCR) Kubernetes ...
Senior DevOps Engineer
  • Led a team of 5 engineers managing an AI process automation platform handling 30K+ daily transactions
  • Migrated workloads to AWS EKS using Flux/Helm and modernized Windows apps to Azure cloud-native services
  • Developed Go-based CLI tools and Kubernetes operators for internal system management
  • Implemented SAML SSO and improved security across all services
  • Responsibilities
    • ?DevOps team leadership, cloud migration, automation
AWS (EKS) Azure (Container Apps/ OCR) Kubernetes Helm Terraform OpenTofu GitLab CI Go
Cognotekt GmbH
3 years 10 months
2013-02 - 2016-11

Cloud migrations and automation projects

Senior Cloud Consultant AWS EC2 VPC ...
Senior Cloud Consultant
  • Led large-scale cloud migrations and infrastructure automation projects for Fortune 500 clients
  • Managed AWS/Azure migrations for 20+ companies using Docker and the 12-factor methodology
  • Developed Python automation tools and Infrastructure as Code (IaC) solutions for DevOps workflows
  • Responsibilities
    • ?Cloud consulting, automation, enterprise cloud migrations
AWS EC2 VPC S3 RDS Lambda ECS Elastic Beanstalk CloudFormation Azure Docker Terraform Puppet Chef Ansible Jenkins Python
Gennovacap
4 years 1 month
2009-01 - 2013-01

Infrastructure and system integration initiatives

Systems Administrator FreeBSD Linux VMware ...
Systems Administrator
  • Led infrastructure and systems integration initiatives supporting educational institutions across Brazil
  • Managed VMware/Xen virtualization and implemented SSO across multiple university networks
  • Integrated Moodle and Sakai LMS with university ERPs to optimize data flow
  • Responsibilities
    • ?Infrastructure management, system integration, university network administration
FreeBSD Linux VMware Xen JBoss Moodle Sakai LMS MySQL MS SQL Server LDAP SSO Git SVN Nagios Zabbix
MEC - Ministério da Educação, Brazil

Aus- und Weiterbildung

Aus- und Weiterbildung

2004 ? 2008
Study - Computer Science
Federal University of Alagoas, Brazil
Degree: Bachelor

2009 ? 2010
MBA in Project Management
CESMAC, Brazil

2002 ? 2003
Information Technology Technician
Federal Institute of Alagoas, Brazil

CERTIFICATIONS
  • Cloud Platform Engineering
    • CKA - Certified Kubernetes Administrator (CNCF)
    • Professional Cloud Architect (GCP)
    • DevOps Engineer Professional (AWS)
  • Cloud Development
    • AWS Developer Associate
    • AWS SysOps Administrator Associate
    • ?Terraform Associate 002 (HashiCorp)

Position

Position

  • Senior Site Reliability Engineer
  • Cloud & DevOps
  • Infrastructure Automation Expert

Kompetenzen

Kompetenzen

Produkte / Standards / Erfahrungen / Methoden

  • Cloud & Infrastructure 
    • AWS (Control Tower, EKS, Lambda, ECS, S3, CloudWatch, IAM, Route53)
    • GCP
    • Kubernetes (EKS, GKE)
    • Istio
    • Docker
    • Helm
    • Terraform
    • OpenTofu
    • Pulumi
    • CloudFormation
    • CDK
    • Ansible
  • DevOps & CI/CD 
    • GitHub Actions
    • GitLab CI
    • Jenkins
    • CircleCI
    • ArgoCD
  • Security & Observability 
    • Wiz
    • Zero Trust
    • ISO 27001
    • Prometheus
    • Grafana
    • OpenTelemetry
    • Thanos
    • Loki
    • AlertManager
    • PagerDuty
  • Programming & Automation
    • Go
    • Python
    • Shell
    • SQL
    • gRPC
    • Protocol Buffers
    • REST APIs
  • Reliability & Performance
    • Chaos Engineering (Chaos Monkey)
    • k6
    • JMeter
    • OpsGenie
    • ServiceNow
  • Data & Systems 
    • Linux (RHEL, Ubuntu, Debian)
    • Puppet
    • Chef
    • MySQL
    • PostgreSQL
    • Aurora
    • DynamoDB
    • Redis
    • Elasticsearch
    • Kafka
    • Kinesis
  • ML & Development 
    • AWS SageMaker
    • Bedrock
    • Kubeflow
    • MLflow
    • Ray
    • Seldon Core
    • KServe
    • Porter.run
  • Stakeholder Management 
    • Strategic Planning
    • Budget Optimization
    • Developer Enablement
    • Mentorship Programs
  • Compliance & Security
    • Disaster Recovery
    • High-Availability Systems
    • SLO/SLI Definition

Vertrauen Sie auf Randstad

Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung

Fragen?

Rufen Sie uns an +49 89 500316-300 oder schreiben Sie uns:

Das Freelancer-Portal

Direktester geht's nicht! Ganz einfach Freelancer finden und direkt Kontakt aufnehmen.