Data Engineer, Cloud (GCP, AWS, Azure), AI, ML, LLM, RAG, LangChain, Python, SQL, CI/CD, Docker, AI Product Owner, Data Scientist
Updated on 13.06.2025
Profile
Employee of a service provider
Remote work
Available from: 15.06.2025
Availability: 100%
of which on-site: 100%
Skill profile of a permanently employed staff member of the service provider
English
Native speaker
German
Advanced
Hindi
Native speaker

Work locations

Germany, Switzerland, Austria
possible

Projects

8 months
2025-04 - present

Data Engineering & AI/ML & Cloud Solutions

Freelancer - Data Engineering & AI/ML & Cloud Solutions

  • Developed LLM-based semantic matching engine with Llama-3.1 and Qdrant vector database
  • Built Python FastAPI backend with Docker containerization and Kubernetes deployment
  • Implemented GitLab CI/CD pipelines for automated testing, building, and deployment
  • Created a Python web-scraping pipeline with automated scheduling for German job market data
  • Developed ML algorithms for resume-job matching using NLP and vector embeddings
  • Built RAG system for German tax documents with OCR extraction using Tesseract and OpenCV

  • Data Engineering & ML Engineering: ETL/ELT Pipelines, Apache Spark, Kafka, Airflow, MLOps, LLM Fine-tuning (Llama 2, LoRA), LangChain, Scikit-learn, TensorFlow, PyTorch
  • Cloud & DevOps: GCP (Vertex AI, BigQuery, Dataflow, GKE), AWS, Kubernetes, Docker, Terraform, CI/CD (Jenkins, GitLab), Infrastructure as Code
  • Programming: Python (Pandas, NumPy, FastAPI), SQL, Scala, R
  • Databases & Storage: Snowflake, PostgreSQL, MongoDB, Delta Lake, Vector DBs (Qdrant, ChromaDB)
  • Automation & Processing: OCR (Tesseract, Google Vision API), Document Processing, Real-time Streaming, Data Quality Frameworks
Freelance
Germany & Remote
5 years
2020-04 - 2025-03

Commerzbank AG

Senior Data Engineer & AI/ML Specialist
  • Architected end-to-end data platform on GCP with BigQuery, Dataflow, and real-time Kafka streaming (1M+ events/hour)
  • Built MLOps pipeline on Vertex AI with automated CI/CD for model training and deployment
  • Implemented Delta Lake data lakehouse on Databricks with Kubernetes-based microservices architecture
  • Deployed auto-scaling GKE clusters, reducing infrastructure costs by 40%
  • Fine-tuned Llama 2 model with LoRA for bank-specific document analysis, processing 10K+ PDFs daily
  • Developed LangChain-based RAG architecture with ChromaDB vector storage and automated OCR workflows
  • Built application using NLP for customer segmentation and predictive analytics
  • Developed PySpark-based credit risk models with automated feature engineering and CI/CD deployment
  • Implemented real-time fraud detection system with Elasticsearch, Logstash & Kibana
  • Built automated data quality framework using Git & Jenkins pipelines
  • Achieved 99.9% uptime for critical data pipelines through robust error handling and monitoring

  • AI & ML: Llama 2, LangChain, LoRA, Hugging Face Transformers, Spark NLP, Custom GPTs
  • Cloud & DevOps: GCP, AWS, Kubernetes, Terraform, FastAPI, Docker, CI/CD, Cloud Functions
  • Data Engineering: PySpark, Hive, Kafka, SQL, Delta Lake, Databricks, ChromaDB, PostgreSQL
  • Blockchain & Visualization: Ethereum, Solidity, Tableau, Power BI, Elastic, Kibana, Python
Commerzbank AG
Frankfurt am Main
3 years 9 months
2016-07 - 2020-03

Deloitte Consulting - Europe

Data Engineer & Analytics Consultant

  • Architected Hive-based data warehouse with automated ETL pipelines processing 30M+ records daily
  • Implemented Elasticsearch search platform with automated indexing and NLP text enhancement
  • Built cloud-native data pipelines on AWS with Lambda, Step Functions, and CodePipeline CI/CD
  • Developed anomaly detection algorithms with statistical clustering and automated alerting
  • Created ML-based fraud detection models with ensemble methods and automated deployment
  • Built automated compliance workflows using Kubernetes microservices architecture
  • Designed real-time analytics dashboards using Tableau

  • AI & ML: Llama 2, LangChain, LoRA, Hugging Face Transformers, Spark NLP, Custom GPTs
  • Cloud & DevOps: GCP, AWS, Kubernetes, Terraform, FastAPI, Docker, CI/CD, Cloud Functions
  • Data Engineering: PySpark, Hive, Kafka, SQL, Delta Lake, Databricks, ChromaDB, PostgreSQL
  • Blockchain & Visualization: Ethereum, Solidity, Tableau, Power BI, Elastic, Kibana, Python
Deloitte
Europe, USA
2 years 10 months
2013-09 - 2016-06

CGI Management Consultants

Data Scientist & Python Developer

  • Developed end-to-end ML pipelines with automated training, validation, and deployment workflows
  • Implemented predictive models for insurance claims forecasting using regression & classification algorithms
  • Created sentiment analysis engine with R and MapReduce on Hadoop HDFS

  • AI & ML: Llama 2, LangChain, LoRA, Hugging Face Transformers, Spark NLP, Custom GPTs
  • Cloud & DevOps: GCP, AWS, Kubernetes, Terraform, FastAPI, Docker, CI/CD, Cloud Functions
  • Data Engineering: PySpark, Hive, Kafka, SQL, Delta Lake, Databricks, ChromaDB, PostgreSQL
  • Blockchain & Visualization: Ethereum, Solidity, Tableau, Power BI, Elastic, Kibana, Python
CGI Management Consultants
Germany, India

Education and training

1 year 6 months
2023-09 - 2025-02

Business Administration

Master of Business Administration
IE Business School, Madrid
  • Financial Accounting
  • Managerial Accounting
  • Corporate Finance
  • Marketing Management
  • Strategy & Competitive Advantage
  • Economics for Business
  • Operations & Supply Chain Management

  • Organizational Behavior
  • Leadership & Change Management
  • Human Resources & Talent Management

  • Digital Transformation
  • Data Analytics for Decision Making
  • Artificial Intelligence & Business Applications
  • Entrepreneurship & Innovation

  • Global Macroeconomics
  • International Business Strategy
  • Geopolitics & Business

  • Negotiation & Conflict Resolution
  • Communication & Public Speaking
  • Ethical Decision-Making & Corporate Social Responsibility
1 year 4 months
2021-12 - 2023-03

Data Science

Master of Data Science
Liverpool John Moores University

  • Probability and Statistics
  • Machine Learning
  • Deep Learning
  • Data Mining
  • Big Data Technologies
  • Data Visualization
  • Database Management Systems (SQL & NoSQL)
  • Data Engineering
  • Natural Language Processing (NLP)
  • Time Series Analysis

  • Python for Data Science
  • R for Data Science
  • Advanced Algorithms
  • Cloud Computing for Data Science
  • Distributed Systems & Computing

  • Linear Algebra
  • Calculus for Machine Learning
  • Optimization Techniques

  • Reinforcement Learning
  • Computer Vision
  • Bayesian Statistics
  • Bioinformatics
  • Financial Data Science
  • Cybersecurity & Data Privacy

  • Data-Driven Decision Making
  • AI & Ethics
  • Business Analytics
  • Product Analytics
  • Data Governance & Compliance
4 years 1 month
2009-06 - 2013-06

Computer Science

Bachelor of Computer Science
BK Birla Institute of Engineering & Technology
  • Linear Algebra
  • Calculus
  • Probability & Statistics
  • Numerical Methods

  • Programming Fundamentals (C, C++, Java, Python)
  • Object-Oriented Programming (OOP)
  • Data Structures & Algorithms
  • Software Engineering
  • Web Development (HTML, CSS, JavaScript)

  • Computer Organization & Architecture
  • Operating Systems
  • Computer Networks
  • Embedded Systems

  • Database Management Systems (SQL, NoSQL)
  • Data Mining
  • Artificial Intelligence (AI)
  • Machine Learning (ML)

Position

Senior Data Engineer, AI/ML Engineer, and Cloud Architect with 7+ years of experience developing scalable data platforms, AI systems, and cloud-native architectures. Expertise in end-to-end ML pipelines, real-time data processing, and automated CI/CD workflows on GCP.

Skills

Products / Standards / Experience / Methods

Artificial Intelligence & Machine Learning
Expert
Big Data & Data Engineering
Expert
Cloud Computing & DevOps
Expert
Business Intelligence & Analytics
Expert
Data Science
Expert

Programming languages

Python
Expert

Databases

SQL & NoSQL, Hive, Elasticsearch
Expert
