? Developed LLM-based semantic matching engine with Llama-3.1 and Qdrant
vector database
? Built Python FastAPI backend with Docker containerization and Kubernetes
deployment
? Implemented GitLab CI/CD pipelines for automated testing, building, and
deployment
? Created a Python web-scraping pipeline with automated scheduling for German job
market data
? Developed ML algorithms for resume-job matching using NLP and vector
embeddings
? Built RAG system for German tax documents with OCR extraction using
Tesseract and OpenCV
? Architected end-to-end data platform on GCP with BigQuery, Dataflow, and real-
time Kafka streaming (1M+ events/hour)
? Built MLOps pipeline on Vertex AI with automated CI/CD for model training and
deployment
? Implemented Delta Lake data lakehouse on Databricks with Kubernetes-based
microservices architecture
? Deployed auto-scaling GKE clusters reducing infrastructure costs by 40%
? Fine-tuned LLAMA2 model with LoRA for bank-specific document analysis
processing 10K+ PDFs daily
? Developed LangChain-based RAG architecture with ChromaDB vector storage
and automated OCR workflows
? Built application using NLP for customer segmentation and predictive analytics
? Developed PySpark-based credit risk models with automated feature engineering
and CI/CD deployment
? Implemented real-time fraud detection system with Elastic Search, Logstash &
Kibana
? Built automated data quality framework using Git & Jenkins pipelines
? Achieved 99.9% uptime for critical data pipelines through robust error handling
and monitoring
? Architected Hive-based data warehouse with automated ETL pipelines processing
30M+ records daily
? Implemented Elasticsearch search platform with automated indexing and NLP
text enhancement
? Built cloud-native data pipelines on AWS with Lambda, Step Functions, and
CodePipeline CI/CD
? Developed anomaly detection algorithms with statistical clustering and automated
alerting
? Created ML-based fraud detection models with ensemble methods and automated
deployment
? Built automated compliance workflows using Kubernetes microservices
architecture
? Designed real-time analytics dashboards using Tableau
Developed end-to-end ML pipelines with automated training, validation, and
deployment workflows
? Implemented predictive models for insurance claims forecasting using regression
& classification algorithms
? Created sentiment analysis engine with R and MapReduce on Hadoop HDFS
Senior Data Engineer, KI/ML Engineer und Cloud Architect mit über 7 Jahren Erfahrung in der Entwicklung skalierbarer Datenplattformen, KI-Systeme und Cloud-nativer Architekturen. Expertise in End-to-End-ML-Pipelines, Echtzeit-Datenverarbeitung und automatisierten CI/CD-Workflows auf GCP
Senior Data Engineer, AI/ML Engineer, and Cloud Architect with 7+ years of experience developing scalable data platforms, AI systems, and cloud-native architectures. Expertise in end-to-end ML pipelines, real-time data processing, and automated CI/CD workflows on GCP
? Developed LLM-based semantic matching engine with Llama-3.1 and Qdrant
vector database
? Built Python FastAPI backend with Docker containerization and Kubernetes
deployment
? Implemented GitLab CI/CD pipelines for automated testing, building, and
deployment
? Created a Python web-scraping pipeline with automated scheduling for German job
market data
? Developed ML algorithms for resume-job matching using NLP and vector
embeddings
? Built RAG system for German tax documents with OCR extraction using
Tesseract and OpenCV
? Architected end-to-end data platform on GCP with BigQuery, Dataflow, and real-
time Kafka streaming (1M+ events/hour)
? Built MLOps pipeline on Vertex AI with automated CI/CD for model training and
deployment
? Implemented Delta Lake data lakehouse on Databricks with Kubernetes-based
microservices architecture
? Deployed auto-scaling GKE clusters reducing infrastructure costs by 40%
? Fine-tuned LLAMA2 model with LoRA for bank-specific document analysis
processing 10K+ PDFs daily
? Developed LangChain-based RAG architecture with ChromaDB vector storage
and automated OCR workflows
? Built application using NLP for customer segmentation and predictive analytics
? Developed PySpark-based credit risk models with automated feature engineering
and CI/CD deployment
? Implemented real-time fraud detection system with Elastic Search, Logstash &
Kibana
? Built automated data quality framework using Git & Jenkins pipelines
? Achieved 99.9% uptime for critical data pipelines through robust error handling
and monitoring
? Architected Hive-based data warehouse with automated ETL pipelines processing
30M+ records daily
? Implemented Elasticsearch search platform with automated indexing and NLP
text enhancement
? Built cloud-native data pipelines on AWS with Lambda, Step Functions, and
CodePipeline CI/CD
? Developed anomaly detection algorithms with statistical clustering and automated
alerting
? Created ML-based fraud detection models with ensemble methods and automated
deployment
? Built automated compliance workflows using Kubernetes microservices
architecture
? Designed real-time analytics dashboards using Tableau
Developed end-to-end ML pipelines with automated training, validation, and
deployment workflows
? Implemented predictive models for insurance claims forecasting using regression
& classification algorithms
? Created sentiment analysis engine with R and MapReduce on Hadoop HDFS
Senior Data Engineer, KI/ML Engineer und Cloud Architect mit über 7 Jahren Erfahrung in der Entwicklung skalierbarer Datenplattformen, KI-Systeme und Cloud-nativer Architekturen. Expertise in End-to-End-ML-Pipelines, Echtzeit-Datenverarbeitung und automatisierten CI/CD-Workflows auf GCP
Senior Data Engineer, AI/ML Engineer, and Cloud Architect with 7+ years of experience developing scalable data platforms, AI systems, and cloud-native architectures. Expertise in end-to-end ML pipelines, real-time data processing, and automated CI/CD workflows on GCP