Data Science / Machine Learning Operations / Deep learning / AI
Updated on 19.10.2023
Profile
Freelancer / Self-employed
Remote work
Available from: 19.10.2023
Availability: 100%
of which on site: 100%
Data Scientist
Machine Learning
Artificial Intelligence
Model Pipelines
MLOps
German
Fluent
English
Very Fluent
French
Native
Ngo?Ba
Mother Tongue

Work Locations

Germany, Switzerland, Austria
possible

Projects

2 years 2 months
2019-09 - 2021-10

Project "IREne"

Expert Data Scientist
  • Development of interactive Dashboards to assist the Risk Management Departments in the visualization of Key Risk Indicators
  • Digitalization of the existing manual processes and improving the ability of the Business Experts to analyse large amounts of data
  • Conceiving and implementing the data architecture needed to improve dashboard performance with very large amounts of data
  • Assisting in the screening of technology stacks and recommending an optimal choice for the project
  • Implementing an outlier-robust alternative for time series forecasting

Use Case "IREne"
  • Conception and implementation of dashboards with R Shiny to analyse large amounts of data
  • Suggest and implement an alternative data layer to improve the performance of the required dashboards
  • Implementing data loading procedures and performing data migrations
  • Statistical modeling of time series to improve the quality of the prediction
R Studio R Shiny MariaDB MS SQL Server MongoDB SVN
Consulting, Banking
4 months
2021-06 - 2021-09

Automatic Time Series analysis

Expert Data Scientist
Implementation of statistical functions to automatically analyse patterns in univariate time series, including:
  • Missing values imputation
  • Automated data transformation for enhancing time series modeling quality
  • Multiple seasonality detection for time series at different frequencies
  • Univariate forecasts as input for multivariate cross-correlation time series analysis
  • Early detection of structural breakpoints in time series

Practical Work 

  • Conception and Implementation of the methods with R
  • Expert Knowledge Transfer in the approaches to automatic time series analysis
  • Statistical modeling and application of the methods on real life data

R Studio Docker
Consulting
1 year
2018-02 - 2019-01

Project "Apollo"

Lead Expert Data Scientist
  • Assist in the screening of predictive analytics tools and suggestions for an optimal choice for the project
  • Conception and implementation of a data driven Fraud Detection System in real time
  • Development of an interactive Dashboard to assist the Risk Manager in the visualization of all the components of the Fraud Detection Module
  • Business Analytics and Requirement Engineering for the development architecture
  • Conception and implementation of the core architecture of the Fraud Detection System
  • Digitalization of the existing manual processes
  • Deployment of the Fraud Detection System and assistance with end-to-end testing

Use Case "Apollo"
  • Conception and implementation of a data-driven Fraud Detection System for the real-time assessment of invoices
  • Extraction and processing of client data through an API, and optimization of the DB schema to monitor the fraud score and improve the responsiveness of the Risk Dashboard
  • Statistical modeling of Expert knowledge to incorporate it in a machine learning algorithm for Fraud detection
R Studio R Shiny Python MariaDB MySQL AWS Docker
Consulting, FinTech
4 months
2017-10 - 2018-01

Project "Backwelt"

Lead Expert Data Scientist (Predictive Analytics / Time Series Analytics)
  • Assist in the screening of predictive analytics tools and suggestions for an optimal choice for the project
  • Conception and development of a time series model to predict the daily quantity of bakery products to be sold
  • Development of an interactive Dashboard for Visualization of the results
  • Requirement Engineering for the development and production environment for the predictive models
  • Advising the client regarding the deployment of the models
  • Mentoring employees at the client site in the use of the infrastructure and the understanding of the models for a future takeover of the project

Use Case "Backwelt"

  • Conception and development of predictive time series models to optimize the quantities of different fresh bakery products (>75 products) in 61 pilot stores of the client. In an extreme worst-case scenario, our model could achieve a 38% increase in revenue (seven-digit range)
R Studio R Shiny MS SQL Server
Consulting, Food
1 year 1 month
2015-12 - 2016-12

Project "Entstörung"

Senior Data Scientist Consultant / Lead Predictive Analytics
  • Deliver in depth expertise in the area of Analytics to harness knowledge from massive amounts of data to optimise business processes
  • Conception of business relevant reports with Microstrategy and R
  • Presentation of complex mathematical and statistical findings to clients from all business levels in the company
  • Introducing and leading Advanced Analytics in the Business Intelligence Team
  • Establishment of an organization structure and concepts for Advanced Analytics

Practical Work

  • Establish concepts and organization structure for data science
  • Establish requirement management guidelines for data analysis
  • Using Advanced Analytics insights to optimize processes
  • Solving complex business relevant problems using Predictive Analytics
  • Use Case: Apply machine learning to predict whether a customer will call the hotline again within 2 weeks of their previous call, in order to improve customer satisfaction. Use deep learning on large amounts of technical data on the hardware of the client, together with additional data on customer behavior.

R Studio Microstrategy TOAD Sybase JIRA
1&1 Telecommunication (Telecommunication)
1 year 3 months
2014-10 - 2015-12

Project "eAnalytics"

Senior Data Scientist Consultant / Lead Predictive Analytics
  • Assist in the screening of predictive analytics tools and suggestions for an optimal choice for the project
  • Conceive and implement predictive models to solve some business use cases and assess their benefits with R and SAS
  • Provide predictive analytics know-how and assist in requirement engineering for the development platform hosting the predictive models in the build phase
  • Provide know-how for the operative deployment of predictive models

Practical Work

  • Involved in the Predictive Analytics RfP (Request for Proposal)
  • Use case "Weigh and measure"
    • Conception and implementation of a classification method to determine the subset of shipments to be weighed and measured, with the goal of weighing at most 10% of the shipments while still detecting 50% of all deviations in weight or volume
  • Use case "Predict booking"
    • Apply time series analysis to forecast the overall weight and volume on a plane several days before departure, using historical data on booked shipments
  • Use case "Quotation Analysis"
    • Statistical analysis of special discount offers on specific flights (so-called quotations) to determine the probability that an offer was not accepted by the client because of the proposed price

R Studio SAS Visual Analytics TOAD
Lufthansa Cargo (Logistics)
2 months
2014-03 - 2014-04

Proof of Concept: Big Data Analytics in the automotive industry

Senior Data Scientist
  • Analysis of Big data from the automobile industry to detect patterns

Practical Work

  • Perform the analysis with SAS and R integration in SAS
  • Compare the performance of predictive analytics methods such as market basket analysis with that of their counterparts in the SAP PAL (Predictive Analysis Library) of SAP HANA

SAP HANA SAS R Studio
MSG Systems AG (Automotive)
9 months
2013-06 - 2014-02

Statistical modeling of non-linear dynamic processes

Postdoc Research Fellow / Senior Data Scientist
  • Development and programming in R of a statistical online monitoring system for the automatic detection and classification of structural breakpoints in time series with Bayesian a priori information

Practical Work 

  • Earlier detection of outliers and level shifts
  • More accurate predictions of the time series by using knowledge of the detected structural breakpoints in real time
  • Programming of statistical packages in R (Ongoing)
  • Scientific Publication of the findings (Ongoing)

R Studio
Faculty of Statistics of TU Dortmund - SFB 823
11 months
2012-08 - 2013-06

Multivariate Analysis (End)

Data Scientist Consultant
  • Development and programming (with JAVA and R) of statistical methods and packages for automatic multivariate time series analysis for business analytics

Practical Work 

  • Build a stable system that uses a vector autoregressive moving average (VARMA) model to analyse and predict multivariate time series after analysing cointegration
  • Collaboration with computer scientists to build a graphical user interface
  • Test and monitor the goodness of fit of the statistical model

JAVA R
Tonbeller AG (Business Intelligence)
4 months
2012-11 - 2013-02

Classification of structural breakpoints using their a posteriori probabilities of occurrence

Doctoral Research Fellow / Data Scientist
  • Develop a predictive model to classify structural change points in a time series using a posteriori probabilities at each time point and given additional information

Practical Work

  • Improve the method of Harrison and Stevens (1971) for time series online monitoring
  • Compute a posteriori probabilities of occurrence of a state change at each point in time
  • Find a predictive analytics method that uses the a posteriori probabilities and further information to optimally classify the state changes at each point in time

Results

  • A linear discriminant analysis (LDA) rule to classify the state changes in real time
  • A second rule that updates the classification one point after the first classification using additional information available
  • Reduction of the classification error rate by more than 50% compared with standard classification methods
  • Improvement of the accuracy of predictions of the time series

R
Faculty of Statistics of TU Dortmund
11 months
2011-06 - 2012-04

Robust normality test

Doctoral Research Fellow / Data Scientist
  • Conception of a robust statistical test for normality in the presence of outliers, which severely affect most tests of normality

Practical Work

  • Develop a new robust Shapiro-Wilk test of normality in the presence of outliers
  • Mathematical derivation of the asymptotic null distribution of the new test in order to obtain critical values
  • Derive a robust test that outperforms the most robust test of normality in the presence of outliers

R
Faculty of Statistics of TU Dortmund
1 year
2011-01 - 2011-12

Multivariate analysis

Data Science Consultant
  • Development and programming (with JAVA and R) of statistical methods and packages for automatic multivariate time series analysis for business analytics

Practical Work

  • Collaborate with computer scientists to build a graphical user interface
  • Conceive and implement a vector autoregressive moving average model to capture the cross-correlation between interrelated time series with cointegration
  • Compute predictions for the whole system of time series (fewer than 6 time series)

JAVA R
Tonbeller AG (Business Intelligence)
1 year 3 months
2009-08 - 2010-10

Bivariate Analysis

Data Scientist Consultant
  • Development and programming (with JAVA and R) of statistical methods and packages for automatic bivariate time series analysis for business analytics

Practical Work

  • Collaborate with computer scientists to build a graphical user interface
  • Cross-correlation and causal analysis for two time series
  • ARMA and ARMAX modeling to capture the correlation between the time series
  • Conceive and implement a graphical user interface with an integrated goodness-of-fit monitor

JAVA R
Tonbeller AG (Business Intelligence)
1 month
2010-08 - 2010-08

Predictive Model for milk consumption and milk price

Data Scientist Consultant
  • Analyse the performance of the bivariate analysis algorithm on real data and compute predictions for milk consumption and milk price


Practical Work

  • Causal and cross correlation analysis first for milk consumption and butter stock, secondly for milk price and whey powder price
  • ARMAX models for milk consumption with butter stock as proxy-variable and for milk price with whey powder price as proxy-variable
  • One-step and two-step predictions for both time series


Result

Better prediction results than with the univariate analysis

JAVA R
Tonbeller AG (Business Intelligence)
7 months
2009-01 - 2009-07

Univariate Analysis

Data Scientist Consultant
  • Conception and programming (with JAVA and R) of statistical methods and packages for the automatic time series analysis

Practical Work

  • Collaborate with computer scientists to build a graphic user interface
  • Conceive and implement a system to automatically extract seasonal and trend effects in a time series
  • Conceive and implement a system to monitor, control and detect state changes in time series using a state space model with Bayesian parameters

JAVA R
Tonbeller AG (Business Intelligence)

Education and Training

4 years 9 months
2008-10 - 2013-06

Doctoral Degree (PhD) in Statistics

Faculty of Statistics of the Technical University Dortmund, Germany
Dissertation Thesis: on Request
3 years
2005-10 - 2008-09

M.Sc. Data Science

Faculty of Statistics of the Technical University Dortmund, Germany
Master Thesis on Request
1 year 9 months
2004-01 - 2005-09

Zertifikat Deutsch and Deutsche Sprachprüfung für den Hochschulzugang (DSH)

Goethe Institute Yaoundé, Cameroon and PDL Dortmund, Germany

5 years
1998-10 - 2003-09

B.Sc. Mathematics

Faculty of Science of the University of Dschang, Cameroon

Position

Freelancer & Entrepreneur / Data Scientist & MLOps Expert

Skills

Top-Skills

Data Scientist, Machine Learning, Artificial Intelligence, Model Pipelines, MLOps

Products / Standards / Experience / Methods

Analytics Domain Expertise
  • Data Science
  • Deep Learning, Machine Learning & MLOps
  • Machine Learning Data Lifecycle and Pipelines
  • Big Data Analytics
  • Predictive Analytics
  • Time Series Analytics and Econometrics
  • Data ETL
  • Data Exploration and Visualization / Dashboarding

Business Intelligence Expertise
  • R / R Studio / R Shiny 
  • Python / Spyder / Jupyter Notebook 
  • SAS Enterprise Guide, DI Studio and Enterprise Miner (Average User)
  • Microstrategy (Average User)
  • Tableau, JMP, Qlik and SAS Visual Analytics (Introduction and exposure within the project eAnalytics)
  • Eclipse 
  • TOAD

Cloud Computing and MLOps
  • Google Cloud Platform, BigQuery
  • TensorFlow Architecture (TFX, TFMA, TFDV)
  • Kubeflow
  • Docker
  • Kubernetes
  • Continuous Integration and Continuous Delivery (CI/CD)
  • Machine Learning Data, Modeling and Monitoring Pipelines
  • Python, Pandas, NumPy, Scikit-learn, Keras, Matplotlib

Soft Skills 
  • Presentation Skills
  • Knowledge Sharing
  • Team building
  • Conflict Management
  • Moderation
  • Leading projects
  • Project management
  • Mentoring

Professional Experience
09/2019 - present
Role: Freelancer & Entrepreneur / Data Scientist & MLOps Expert

07/2017 - 01/2019
Role: Managing Director & Chief Data Scientist
Customer: Smart Data Analytics

03/2014 - 06/2017
Role: Senior IT Consultant - Senior Data Scientist
Customer: MSG Systems AG

07/2013 - 02/2014
Role: PostDoc Research Fellow - Data Science
Customer: TU Dortmund

01/2009 - 06/2013
Role: Data Scientist / Consultant Statistical Expert
Customer: Tonbeller AG

10/2008 - 06/2013
Role: Doctoral Research Fellow - Data Science
Customer: TU Dortmund

06/2007 - 09/2008
Role: Research Assistant - Data Science
Customer: TU Dortmund

Operating Systems

MS Windows
Linux
macOS (Macintosh)
MS Office

Programming Languages

R
Python
SAS
Advanced programmer for SAS 9
SAS DI Studio
JAVA
SQL
SPSS
C
Pascal
lisp-stat
Arc
Yale
Arena
Prolog

Databases

Sybase
MySQL
MariaDB
MS SQL Server
MongoDB

Industries

  • Logistics
  • Automotive
  • Food
  • Bank - FinTech
  • Telecommunication
  • Education
