Senior Machine Learning Engineer with over 7 years of experience, specializing in LLMs, RAG, and intelligent document processing.
Updated on 11.04.2025
Profile
Freelancer / Self-employed
Remote work
Available from: 11.04.2025
Availability: 100%
of which on-site: 10%
Large Language Models (LLMs) & RAG
AWS Cloud Engineering
End-to-End AI Product Development
LangChain
Weaviate
MLOps
Model Deployment
Intelligent Document Processing
NLP
Text Classification
Docker
CI/CD
Infrastructure as Code
Serverless Architectures
Data Engineering
Real-Time Audio Processing
Time-Series Forecasting
Statistical Analysis
German: native
Polish: native
English: CEFR C1.1

Work locations

Berlin (+100km)
Germany, Switzerland, Austria
possible

Projects

1 year 1 month
2024-03 - 2025-03

Development of a RAG knowledge platform

SENIOR MACHINE LEARNING ENGINEER
  • Led the development of a RAG knowledge platform using AWS CDK, Lambda, Cognito, API Gateway, S3, and Bedrock, optimizing OpenSearch with LLM-based semantic splitting, resulting in significant time savings for marketing and medical professionals.
  • Built a Python package for extracting predefined keywords from candidate resumes, utilizing Bedrock, Jinja, and YAML, enhancing candidate selection efficiency and quality.
  • Built a full-stack solution (Next.js, AWS Bedrock, ECS) for automated ESG-relevant data extraction and document classification from diverse formats (PDFs, images, DOCX) with LangChain, reducing manual processing time from 40 hours to 15 minutes.
Large Language Models, AWS, Machine Learning, IaC, Model Finetuning, Software Architecture, Lambda, Cognito, API Gateway, S3, Bedrock, Python package, Jinja, YAML, Next.js, ECS
Public Cloud Group GmbH
Berlin
3 months
2023-12 - 2024-02

Redesigned an LLM classification pipeline

MACHINE LEARNING ENGINEER

  • Redesigned an LLM classification pipeline to reduce token usage (80k → 50-90% savings) and improve efficiency, using LiteLLM, Instructor, and Pydantic for structured output. Implemented token caching, parallel processing, and background task management, significantly cutting costs and runtime for large-scale part classification.
  • Implemented Supabase authentication with role-based access control. Integrated Weaviate as a vector store for efficient semantic search across documents. Improved frontend user visibility and enhanced RAG responses by modifying Instructor Pydantic models to include inline citations, linking sources to documents with role-based access controls.

Large Language Models, Data Science
6 months
2023-07 - 2023-12

Formulated and executed a comprehensive AI strategy

CHIEF AI OFFICER
  • Formulated and executed a comprehensive AI strategy, developing and deploying an OpenAI-based large language model architecture on Heroku to significantly enhance the generation of high-quality presentation slides from text inputs.
  • Fine-tuned LLM modules using synthetically generated data, resulting in faster and more reliable outputs.
Large Language Models, AI Strategy, Model Finetuning, Software Architecture
Cultway (Zapdeck) GmbH
Berlin
1 year 5 months
2022-07 - 2023-11

DATA & ANALYTICS

DATA & SOFTWARE ENGINEER
  • Developed an asynchronous processing system for real-time language translation, leveraging AWS Transcribe, Translate, and Polly to manage continuous audio streams.
  • Co-created an AI-powered translation platform built on AWS services and SageMaker Augmented AI.
AWS, Machine Learning, Infrastructure as Code, CI/CD, Data Platforms, NLP, Data Engineering
AllCloud GmbH
Berlin
9 months
2021-10 - 2022-06

AI & DATA ANALYTICS

MACHINE LEARNING CONSULTANT
  • Developed and automated a SageMaker-based recommender model pipeline, leveraging CDK, Step Functions, S3, and ECR. Enabled model retraining and deployment on new data, with Lambda for performance checks and automated endpoint updates for real-time application recommendations.
  • Executed a ransomware identification POC using software log data. Developed and refined LR and XGBoost models. Raised XGBoost accuracy to 90%, F1 score to 0.87, and ROC AUC to 0.93. Implemented cross-validation and hyperparameter tuning to mitigate overfitting.
CDK, Step Functions, S3, ECR, LR, XGBoost
Deloitte GmbH
Berlin
1 year 4 months
2020-01 - 2021-04

Marketing and advertising campaigns

DATA SCIENTIST
  • Ensured that overall marketing and advertising campaigns met their plan values, using time-series predictions built in Python and embedded in Tableau.
  • Led A/B testing initiatives: designed test groups, set up monitoring systems, and provided statistical evaluations using Bayesian models.
Tableau, Python
Axel Springer SE
Berlin

Education and training

1 month
2023-08 - 2023-08

AWS Certified Data Analytics - Specialty

Amazon Web Services
1 month
2022-10 - 2022-10

AWS Certified Solutions Architect - Associate

Amazon Web Services
1 month
2022-06 - 2022-06

AWS Certified Cloud Practitioner

Amazon Web Services
1 month
2022-04 - 2022-04

Deploying Machine Learning Models in Production

Coursera
1 month
2022-02 - 2022-02

Machine Learning Modeling Pipelines in Production

Coursera
1 month
2021-11 - 2021-11

Machine Learning Data Lifecycle in Production

Coursera
2 years 7 months
2019-03 - 2021-09

Industrial Engineering

B.Eng. (GPA 1.8), Fresenius University of Applied Sciences

  • Process Management
  • Product Management
  • IT Management Consulting
  • Bachelor thesis: on request

1 month
2020-11 - 2020-11

Machine Learning by Andrew Ng (Stanford University)

Coursera

Competencies

Top skills

Large Language Models (LLMs) & RAG, AWS Cloud Engineering, End-to-End AI Product Development, LangChain, Weaviate, MLOps, Model Deployment, Intelligent Document Processing, NLP, Text Classification, Docker, CI/CD, Infrastructure as Code, Serverless Architectures, Data Engineering, Real-Time Audio Processing, Time-Series Forecasting, Statistical Analysis

Products / Standards / Experience / Methods

Profile

Senior Machine Learning Engineer with 7+ years of experience, specializing in LLMs, RAG, and Intelligent Document Processing, delivering scalable AI solutions in AWS Cloud for fast-paced, ambiguous scenarios.


SKILLS

Technologies:

  • Python
  • PySpark
  • SQL
  • AWS Services 
    • Lambda
    • S3
    • DynamoDB
    • ECR
    • Elastic Beanstalk
    • Redshift
    • Glue
    • SQS
    • SNS
    • Step Functions
    • CDK
    • SageMaker
    • Rekognition
    • Comprehend
    • Transcribe
    • Cognito
    • API Gateway
    • Bedrock
    • OpenSearch
  • Weaviate
  • LangChain
  • Docker
  • Apache Airflow
  • Flask
  • FastAPI
  • Git
  • Bitbucket 
  • Jira


ML & AI:

  • Large Language Models (LLMs)
  • Conversational AI
  • Retrieval-Augmented Generation (RAG)
  • NLP
  • Model Deployment & Optimization
  • Feature Engineering
  • Scikit-Learn


SWE:

  • API Design
  • Serverless Architectures
  • OOP
  • Agile Frameworks
  • Scalable Systems Design
  • Web Development
  • Frontend Development


Data Science:

  • Statistical Analysis
  • A/B Testing
  • Time-Series Forecasting
  • Exploratory Data Analysis (EDA)
