(Senior/Lead) Data Science, Machine Learning, AI (LLM Automation) | Data Pipeline & BI Dashboarding | Solutions Architect | SQL · Python · RAGs
Aktualisiert am 09.10.2025
Profil
Freiberufler / Selbstständiger
Remote-Arbeit
Verfügbar ab: 15.10.2025
Verfügbar zu: 100%
davon vor Ort: 100%
Machine Learning (Pricing - Churn - Elasticity - Regression - Classification)
AI (LLMs - N8N - RAGs - embeddings)
Data Engineering & BI (dbt - Airflow - AWS - SQL - S3 - Looker - python - git)
SQL
S3
Looker
python
git
dbt
Tableau
regression
classification
jupyter notebook
a/b testing
airflow
REST
pricing
churn
llm
Data Engineer
Strategieberatung
Databricks
Claude code
english
Muttersprache
German
Verhandlungssicher
Hindi
Muttersprache

Einsatzorte

Einsatzorte

München (+500km)
Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

4 months
2025-02 - 2025-05

CLIP Photo Quality Scoring & Re-Ranking

Data Scientist - Project Lead Python CLIP/embeddings Experiment design ...
Data Scientist - Project Lead

  • Built a photo-quality signal independent of price/availability (photos, reviews, facilities, min-stay).
  • Implemented OpenAI CLIP image scoring (0?100) for apartment photos. 
  • Re-ranked the main/hero photo per listing using the CLIP score. 
  • Designed and ran A/B tests focused on CTR and CTB. 
  • Observed higher CTR with stable conversion to booking. 
  • Result: +4% lift in net eGMV.


Looker Google Colab Atlassian Confluence Atlassian JIRA Git AWS amazon Redshift airflow Python
Python CLIP/embeddings Experiment design SQL Looker OpenAPI duckdb
Vacation rental booking platform
Munich
4 months
2024-10 - 2025-01

LLM-Driven NPS Insights Pipeline

Data Scientist - Project Lead Python OpenAI (LLMs/RAG concepts) Airflow ...
Data Scientist - Project Lead
  • Designed and implemented an LLM-driven NPS analysis pipeline using Python, OpenAI, Airflow, Redshift, S3, and DuckDB. 
  • Automated ingestion, processing, and scheduling of NPS responses with monitoring via Airflow. 
  • Applied LLM prompts to structure free-text feedback into themes and actionable insights. 
  • Unified storage and querying across S3 -> Redshift, with fast ad-hoc slicing in DuckDB. 
  • Reduced manual analysis effort by ~70% across 200+ NPS visualisations. 
  • Enabled faster detection of customer pain points and improved visibility for stakeholders.
Looker OpenAPI AWS JIRA Atlassian Confluence Git
Python OpenAI (LLMs/RAG concepts) Airflow Redshift S3 DuckDB SQL Looker
Vacation rental booking platform
Munich
1 year 3 months
2023-08 - 2024-10

Medallion Data Platform (S3 -> dbt -> Redshift -> Looker)

Lead Data Engineer dbt Airflow Redshift ...
Lead Data Engineer

  • Designed a medallion data architecture (Bronze/Silver/Gold) with S3 -> dbt -> Redshift  -> Looker. 
  • Built dbt models (transformations, tests) and a reusable semantic layer (LookML views/explores). 
  • Orchestrated end-to-end pipelines in Airflow, including dependencies, retries, and SLAs. Implemented data quality checks and automated alerting via Opsgenie and Slack. 
  • Standardized source-to-gold lineage and documentation for reliable downstream use. 
  • Delivered and maintained Looker dashboards for company KPIs, host/guest behavior, and finance reporting. 
  • Enabled experiment and operational analytics by serving trusted, timely datasets to product and BI teams.

Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip
dbt Airflow Redshift S3 SQL Data modelling Monitoring Looker (LookML) git
Vacation rental booking platform
Munich
8 months
2023-01 - 2023-08

A/B Testing Pipeline & Experiment Analytics

Product Analyst - Lead SQL Redshift Python ...
Product Analyst - Lead
  • Built a centralised A/B testing pipeline on Redshift?S3 to track historical and live experiments end-to-end. 
  • Standardised exposure/event schemas and a metric catalog (e.g., value per user, GMV per user, CTR/CTB, conversion, session trends). 
  • Automated significance calculations (p-values) and experiment readouts; scheduled via Airflow. 
  • Delivered Looker dashboards for experiment monitoring, guardrails, and decision summaries.
  • Supported full lifecycle: design templates, data checks, analysis, and stakeholder readouts. 
  • Applied the framework to validate initiatives such as CLIP photo re-ranking (+4% net eGMV with stable conversion) and pricing changes. 
Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip
SQL Redshift Python Statistical testing Looker Product analytics AB testing
Vacation rental booking platform
Munich
1 year 1 month
2022-03 - 2023-03

Host Churn Prediction & Retention Triggers

Data Scientist - Lead Python XGBoost Feature engineering ...
Data Scientist - Lead
  • Built and deployed an XGBoost churn model (AUC 0.81) for host risk prediction. 
  • Implemented automated scoring and trigger rules to identify the top 4% highest-risk hosts. 
  • Launched targeted credit interventions, A/B-tested for effectiveness. 
  • Achieved ~10% churn reduction, retaining ~?1M+ in annual revenue.
Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip Jupyter Notebook Pipedrive
Python XGBoost Feature engineering Airflow SQL Experimentation Consulting Strategy
Vacation rental booking platform
Munich
4 months
2021-11 - 2022-02

Dynamic Pricing Rollout (A/B-Validated)

Data Analytics - Team Lead Experiment design Statistical testing SQL ...
Data Analytics - Team Lead
  • Implemented dynamic pricing based on a LightGBM elasticity model (AUC 0.72).
  • Ran A/B experiments against the legacy rule-based pricing to validate impact. 
  • Monitored pricing KPIs and experiment results via Tableau dashboards.
Confluence JIRA GIT Slack Python VS Code Datagrip Tableau Google big query
Experiment design Statistical testing SQL Python executive communications
Digital insurance platform
Bengaluru
1 year 7 months
2020-07 - 2022-01

Loss Ratio Improvement & Risk Clustering

Risk Analytics Lead Python SQL Clustering/segmentation ...
Risk Analytics Lead
  • Identified high-risk segments (e.g., fuel type, website behaviour, pincode) to target pricing actions. 
  • Integrated external police/theft data to build pincode-level risk clusters. 
  • Implemented location-based pricing adjustments, improving portfolio quality. 
  • Achieved ~12% loss-ratio reduction through segmentation and repricing. 
  • Delivered monitoring via Tableau dashboards and recurring portfolio reviews.
Google Big Query Data bricks JIRA VS code Python Slack Tableau
Python SQL Clustering/segmentation Tableau Analysis
Digital insurance platform
Bengaluru
1 year 1 month
2020-10 - 2021-10

Insurance Pricing Elasticity Model

Pricing Analytics Lead LightGBM Python SQL ...
Pricing Analytics Lead
  • Developed a LightGBM pricing elasticity model (AUC 0.72) to estimate conversion sensitivity to price changes.
  • Increased premiums ~6% (~?5M+/year) while maintaining conversion levels. 
  • A/B-tested model-driven pricing against rule-based strategy to validate uplift and guardrails. 
  • Delivered executive readouts and monitoring via Tableau; data processing in Python/SQL on Google BigQuery.
Google Big Query Data bricks JIRA VS code Python Slack Tableau
LightGBM Python SQL BigQuery Tableau Experiment design.
Digital insurance platform
Bengaluru
1 year
2019-06 - 2020-05

90-Day Activation Prediction

Risk Analyst SQL Python XG boost
Risk Analyst
  • Built 90-day activation prediction models using logistic regression and LightGBM (AUC 0.68). 
  • Performed EDA, feature engineering/selection, and model calibration on qualification and behavioural signals. 
  • Generated target lists for campaigns; partnered with account management to execute and track impact. 
  • Drove ~8% uplift in onboarding rates via targeted interventions.
Looker AWS Athena Redshift Confluence JIRA GIT Ops-Genie Slack Python DBT VS Code Datagrip
SQL Python XG boost
online business lender owned by American Express (Amex)
Bengaluru, India /Atlanta USA
1 year
2019-01 - 2019-12

Underwriting Strategy Optimisation

Decision Analyst · Risk Strategy SQL Python Logistic regression ...
Decision Analyst · Risk Strategy
  • Assessed old vs. new qualification strategies across industry, FICO, model scores, and line bands.
  • Performed univariate analyses on activation, utilisation, risk change, and population shifts. 
  • Compared bad rates and net credit margin at overall and granular segment levels. 
  • Recommended cutoff and policy adjustments to improve approval quality. 
  • Outcome: ~9% reduction in bad debt via refined underwriting segmentation.
Tableau MS SQL Jupyter
SQL Python Logistic regression Product Analytics
online business lender owned by American Express (Amex)
Bengaluru/Atlanta USA
7 months
2018-12 - 2019-06

Portfolio Monitoring & Automated Triggers

Decision Analyst · Risk Strategy SQL Python KPI design ...
Decision Analyst · Risk Strategy
  • Built monthly portfolio waterfalls to track acquisitions, activations, utilisation, and roll rates. 
  • Monitored cashflow and portfolio performance with standardised KPIs and variance analysis. 
  • Implemented automated triggers/alerts to flag significant shifts in risk, activation, and collections. 
  • Produced executive readouts summarising drivers, anomalies, and recommended actions.
Tableau MS SQL Execl Jupyter
SQL Python KPI design Automation
online business lender owned by American Express (Amex)
Bengaluru, India/ Atlanta, USA
3 months
2018-11 - 2019-01

Budget Forecasting (ARIMA)

Decision Scientist Python Statsmodels (ARIMA) SQL ...
Decision Scientist
  • Built time-series budget forecasts using ARIMA, based on historical retail spend and seasonality. 
  • Achieved ~82% forecasting accuracy, validated on holdout periods. 
  • Delivered monthly forecast reports with variance analysis and driver commentary. 
  • Documented assumptions and handover for business and finance teams.
r SQL python
Python Statsmodels (ARIMA) SQL reporting
Global chocolate manufacturer
Bengaluru
1 year 2 months
2017-09 - 2018-10

SAP -> Azure Data Lake Migration

Decision Scientist Azure (Data Lake Pipelines) SQL/U-SQL ...
Decision Scientist
  • Migrated SAP data to Azure Data Lake, designing dimensional models (facts/dimensions) for analytics. 
  • Built ETL/ELT pipelines using SQL/U-SQL, SSIS, and Azure Pipelines; implemented data quality checks. 
  • Managed QA -> pre-prod -> prod support, defect resolution, and runbook documentation. 
  • Delivered Tableau dashboards on governed datasets for business stakeholders. 
  • Outcome: Improved reporting efficiency and ~22% infrastructure cost reduction
U-SQL Tableau Excel SAP
Azure (Data Lake Pipelines) SQL/U-SQL SSIS Tableau Data modelling Data quality
Global chocolate manufacturer
Bengaluru
1 year 8 months
2016-01 - 2017-08

Educhimp ? IIT-JEE EdTech (Founder Project)

Co-Founder · Analytics & Growth Strategy Product development Team leading
Co-Founder · Analytics & Growth
  • Co-founded Educhimp, an IIT-JEE preparation platform. 
  • Created revision video lectures for Mathematics, Physics, and Chemistry. 
  • Ran digital marketing campaigns on Facebook and Instagram.
Strategy Product development Team leading
New Delhi

Aus- und Weiterbildung

Aus- und Weiterbildung

1 year 1 month
2020-11 - 2021-11

MSc in Environmental & Sustainable Development

Indira Gandhi National Open University, New Delhi, India
Indira Gandhi National Open University, New Delhi, India
4 years 2 months
2013-05 - 2017-06

B.Tech in Electronics & Communication Engineering

Maharaja Agrasen Institute of Technology, Delhi (GGSIP University), New Delhi, India
Maharaja Agrasen Institute of Technology, Delhi (GGSIP University), New Delhi, India

Kompetenzen

Kompetenzen

Top-Skills

Machine Learning (Pricing - Churn - Elasticity - Regression - Classification) AI (LLMs - N8N - RAGs - embeddings) Data Engineering & BI (dbt - Airflow - AWS - SQL - S3 - Looker - python - git) SQL S3 Looker python git dbt Tableau regression classification jupyter notebook a/b testing airflow REST pricing churn llm Data Engineer Strategieberatung Databricks Claude code

Aufgabenbereiche

Project Management
Experte
Analytics
Experte
Data Engineering
Experte
Data Science
Experte
Stakeholder communication
Experte
Hiring
Fortgeschritten
Mentoring
Fortgeschritten
Strategy
Experte
Executive reporting
Experte
Cross-functional project delivery
Experte
AI
Experte
LLMs
Experte
Requirement Gathering
Experte

Produkte / Standards / Erfahrungen / Methoden

Confluence
Experte
JIRA
Experte
GIT
Experte
Claude Code
Experte
Cursor
Fortgeschritten
Miro
Experte
AWS
Experte
Azure
Experte
Google big query
Experte
Data bricks
Experte
Airflow
Experte
Looker
Experte
Tableau
Fortgeschritten
Excel
Experte
MS office 365
Fortgeschritten
Athena
Experte
Redshift
Experte

PROFILE

Senior data & ML analyst with 8+ years across fintech, insurance and travel. Developed & deployed ML/LLM solutions for pricing, customer retention, NPS and content quality, delivering ?10M+ impact, playing a key role in cross-functional projects and advising C-suite stakeholders.


TECHNICAL & LEADERSHIP EXPERTISE

  • Technical: SQL, Python (pandas,scikit-learn), dbt, Airflow, AWS (S3, Athena, Redshift), Git, REST APIs
  • AI/ML: XGBoost, LightGBM, logistic/linear models, A/B testing, TensorFlow, LLMs (RAG, embeddings, evaluation), Transformers
  • BI: Looker (LookML), Tableau, Jupyter Notebook, Excel
  • Leadership: hiring & mentorship, cross-functional delivery, C-suite communication, cost & risk analytics


EXPERIENCE

2022 - Present

Role: Senior Data Analyst 

Customer: HOLIDU


Tasks:

Managed, mentored & hired analysts | Travel Tech


Apartment quality & photo ranking

  • Developed quality signal for partners (photos, reviews, facilities, min-stay), kept separate from price/availability; evaluated on CTR, CTB, and normalised Net eGMV
  • Implemented an OpenAI CLIP photo score (0?100) for the apartments and re-ranked the main photo
  • Designed and ran A/B experiments focused on CTR and CTB, confirming higher click-through rates while maintaining stable conversion to booking, which drove a +4% lift in net eGMV


NPS text analytics with LLMs

  • Developed an NPS analysis pipeline using Python, OpenAI, Redshift, S3, DuckDB, and Airflow
  • Reduced manual effort by 70% in analysing 200+ NPS visualisations, enabling faster detection of customer pain points


Data modelling, dashboarding & experimentation

  • Designed end-to-end data pipelines in Airflow using a medallion architecture (S3 ? dbt ? Redshift), with data quality checks and automated alerts via Opsgenie and Slack
  • Developed and maintained dashboards in Looker covering key company metrics, host and guest behavior patterns, and financial reporting, supporting both operational and strategic decision-making
  • AB test infrastructure: Built a Redshift?S3 data setup for the guest team to track past and ongoing A/B tests, giving clear visibility into metrics like value per user, GMV per user, session trends, and experiment significance (p-values)


Churn prediction & pricing experiments

  • Built and deployed an XGBoost churn model (AUC 0.81) with automated scoring and triggers, identifying the top 4% highest-risk hosts
  • Implemented targeted credit interventions validated through A/B testing, reducing churn by 10% and retaining ~?1M+ in annual revenue.


2020 - 2022

Role: Team Lead Data Analyst

Customer: ACKO


Tasks:

Managed & hired analysts and data scientists | Insurance Tech


Dynamic Pricing Optimisation -?5M+ Revenue Growth

  • Developed a lightGBM pricing elasticity model (AUC=0.72) that increased premiums by 6% (~?5M+/year) while maintaining conversion rates
  • Ran A/B experiments to validate dynamic pricing strategies against traditional rule-based methods


Loss ratio improvement:

Identified high-risk segments (e.g., fuel type, website behavior,pincode) to cut loss ratios by 12%.


2018 - 2020

Role: Decision Analyst 

Customer: KABBAGE (Acquired by American Express)


Tasks:

USA?s lending fintech

  • Analysed and optimised underwriting strategies, cutting bad debt by 9% through granular risk segmentation (industry, FICO, line bands) and refining qualification criteria
  • Built logistic regression and LightGBM models (AUC 0.68) to predict 90-day customer activation, driving an 8% uplift in onboarding rates via targeted campaigns


2017 - 2018

Role: Decision Scientist

Customer: MU-SIGMA


Tasks:

Fortune 500 analytics consulting | Retail/CPG focus

  • Worked on Azure ETL migration, cutting infrastructure costs by 22% and enhancing reporting efficiency

Betriebssysteme

windows
Experte
mac
Experte
linux
Fortgeschritten

Programmiersprachen

SQL
Experte
Python
Experte
R
Experte
dbt
Experte
C++
Fortgeschritten
HTML
Fortgeschritten
Java
Fortgeschritten

Branchen

Branchen

  • Banking
  • Finance
  • Retail
  • Consultancy
  • Travel
  • Insurance

Einsatzorte

Einsatzorte

München (+500km)
Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

4 months
2025-02 - 2025-05

CLIP Photo Quality Scoring & Re-Ranking

Data Scientist - Project Lead Python CLIP/embeddings Experiment design ...
Data Scientist - Project Lead

  • Built a photo-quality signal independent of price/availability (photos, reviews, facilities, min-stay).
  • Implemented OpenAI CLIP image scoring (0?100) for apartment photos. 
  • Re-ranked the main/hero photo per listing using the CLIP score. 
  • Designed and ran A/B tests focused on CTR and CTB. 
  • Observed higher CTR with stable conversion to booking. 
  • Result: +4% lift in net eGMV.


Looker Google Colab Atlassian Confluence Atlassian JIRA Git AWS amazon Redshift airflow Python
Python CLIP/embeddings Experiment design SQL Looker OpenAPI duckdb
Vacation rental booking platform
Munich
4 months
2024-10 - 2025-01

LLM-Driven NPS Insights Pipeline

Data Scientist - Project Lead Python OpenAI (LLMs/RAG concepts) Airflow ...
Data Scientist - Project Lead
  • Designed and implemented an LLM-driven NPS analysis pipeline using Python, OpenAI, Airflow, Redshift, S3, and DuckDB. 
  • Automated ingestion, processing, and scheduling of NPS responses with monitoring via Airflow. 
  • Applied LLM prompts to structure free-text feedback into themes and actionable insights. 
  • Unified storage and querying across S3 -> Redshift, with fast ad-hoc slicing in DuckDB. 
  • Reduced manual analysis effort by ~70% across 200+ NPS visualisations. 
  • Enabled faster detection of customer pain points and improved visibility for stakeholders.
Looker OpenAPI AWS JIRA Atlassian Confluence Git
Python OpenAI (LLMs/RAG concepts) Airflow Redshift S3 DuckDB SQL Looker
Vacation rental booking platform
Munich
1 year 3 months
2023-08 - 2024-10

Medallion Data Platform (S3 -> dbt -> Redshift -> Looker)

Lead Data Engineer dbt Airflow Redshift ...
Lead Data Engineer

  • Designed a medallion data architecture (Bronze/Silver/Gold) with S3 -> dbt -> Redshift  -> Looker. 
  • Built dbt models (transformations, tests) and a reusable semantic layer (LookML views/explores). 
  • Orchestrated end-to-end pipelines in Airflow, including dependencies, retries, and SLAs. Implemented data quality checks and automated alerting via Opsgenie and Slack. 
  • Standardized source-to-gold lineage and documentation for reliable downstream use. 
  • Delivered and maintained Looker dashboards for company KPIs, host/guest behavior, and finance reporting. 
  • Enabled experiment and operational analytics by serving trusted, timely datasets to product and BI teams.

Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip
dbt Airflow Redshift S3 SQL Data modelling Monitoring Looker (LookML) git
Vacation rental booking platform
Munich
8 months
2023-01 - 2023-08

A/B Testing Pipeline & Experiment Analytics

Product Analyst - Lead SQL Redshift Python ...
Product Analyst - Lead
  • Built a centralised A/B testing pipeline on Redshift?S3 to track historical and live experiments end-to-end. 
  • Standardised exposure/event schemas and a metric catalog (e.g., value per user, GMV per user, CTR/CTB, conversion, session trends). 
  • Automated significance calculations (p-values) and experiment readouts; scheduled via Airflow. 
  • Delivered Looker dashboards for experiment monitoring, guardrails, and decision summaries.
  • Supported full lifecycle: design templates, data checks, analysis, and stakeholder readouts. 
  • Applied the framework to validate initiatives such as CLIP photo re-ranking (+4% net eGMV with stable conversion) and pricing changes. 
Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip
SQL Redshift Python Statistical testing Looker Product analytics AB testing
Vacation rental booking platform
Munich
1 year 1 month
2022-03 - 2023-03

Host Churn Prediction & Retention Triggers

Data Scientist - Lead Python XGBoost Feature engineering ...
Data Scientist - Lead
  • Built and deployed an XGBoost churn model (AUC 0.81) for host risk prediction. 
  • Implemented automated scoring and trigger rules to identify the top 4% highest-risk hosts. 
  • Launched targeted credit interventions, A/B-tested for effectiveness. 
  • Achieved ~10% churn reduction, retaining ~?1M+ in annual revenue.
Looker AWS Athena Redshift Confluence JIRA GIT Slack Python DBT VS Code Datagrip Jupyter Notebook Pipedrive
Python XGBoost Feature engineering Airflow SQL Experimentation Consulting Strategy
Vacation rental booking platform
Munich
4 months
2021-11 - 2022-02

Dynamic Pricing Rollout (A/B-Validated)

Data Analytics - Team Lead Experiment design Statistical testing SQL ...
Data Analytics - Team Lead
  • Implemented dynamic pricing based on a LightGBM elasticity model (AUC 0.72).
  • Ran A/B experiments against the legacy rule-based pricing to validate impact. 
  • Monitored pricing KPIs and experiment results via Tableau dashboards.
Confluence JIRA GIT Slack Python VS Code Datagrip Tableau Google big query
Experiment design Statistical testing SQL Python executive communications
Digital insurance platform
Bengaluru
1 year 7 months
2020-07 - 2022-01

Loss Ratio Improvement & Risk Clustering

Risk Analytics Lead Python SQL Clustering/segmentation ...
Risk Analytics Lead
  • Identified high-risk segments (e.g., fuel type, website behaviour, pincode) to target pricing actions. 
  • Integrated external police/theft data to build pincode-level risk clusters. 
  • Implemented location-based pricing adjustments, improving portfolio quality. 
  • Achieved ~12% loss-ratio reduction through segmentation and repricing. 
  • Delivered monitoring via Tableau dashboards and recurring portfolio reviews.
Google Big Query Data bricks JIRA VS code Python Slack Tableau
Python SQL Clustering/segmentation Tableau Analysis
Digital insurance platform
Bengaluru
1 year 1 month
2020-10 - 2021-10

Insurance Pricing Elasticity Model

Pricing Analytics Lead LightGBM Python SQL ...
Pricing Analytics Lead
  • Developed a LightGBM pricing elasticity model (AUC 0.72) to estimate conversion sensitivity to price changes.
  • Increased premiums ~6% (~?5M+/year) while maintaining conversion levels. 
  • A/B-tested model-driven pricing against rule-based strategy to validate uplift and guardrails. 
  • Delivered executive readouts and monitoring via Tableau; data processing in Python/SQL on Google BigQuery.
Google Big Query Data bricks JIRA VS code Python Slack Tableau
LightGBM Python SQL BigQuery Tableau Experiment design.
Digital insurance platform
Bengaluru
1 year
2019-06 - 2020-05

90-Day Activation Prediction

Risk Analyst SQL Python XG boost
Risk Analyst
  • Built 90-day activation prediction models using logistic regression and LightGBM (AUC 0.68). 
  • Performed EDA, feature engineering/selection, and model calibration on qualification and behavioural signals. 
  • Generated target lists for campaigns; partnered with account management to execute and track impact. 
  • Drove ~8% uplift in onboarding rates via targeted interventions.
Looker AWS Athena Redshift Confluence JIRA GIT Ops-Genie Slack Python DBT VS Code Datagrip
SQL Python XG boost
online business lender owned by American Express (Amex)
Bengaluru, India /Atlanta USA
1 year
2019-01 - 2019-12

Underwriting Strategy Optimisation

Decision Analyst · Risk Strategy SQL Python Logistic regression ...
Decision Analyst · Risk Strategy
  • Assessed old vs. new qualification strategies across industry, FICO, model scores, and line bands.
  • Performed univariate analyses on activation, utilisation, risk change, and population shifts. 
  • Compared bad rates and net credit margin at overall and granular segment levels. 
  • Recommended cutoff and policy adjustments to improve approval quality. 
  • Outcome: ~9% reduction in bad debt via refined underwriting segmentation.
Tableau MS SQL Jupyter
SQL Python Logistic regression Product Analytics
online business lender owned by American Express (Amex)
Bengaluru/Atlanta USA
7 months
2018-12 - 2019-06

Portfolio Monitoring & Automated Triggers

Decision Analyst · Risk Strategy SQL Python KPI design ...
Decision Analyst · Risk Strategy
  • Built monthly portfolio waterfalls to track acquisitions, activations, utilisation, and roll rates. 
  • Monitored cashflow and portfolio performance with standardised KPIs and variance analysis. 
  • Implemented automated triggers/alerts to flag significant shifts in risk, activation, and collections. 
  • Produced executive readouts summarising drivers, anomalies, and recommended actions.
Tableau MS SQL Execl Jupyter
SQL Python KPI design Automation
online business lender owned by American Express (Amex)
Bengaluru, India/ Atlanta, USA
3 months
2018-11 - 2019-01

Budget Forecasting (ARIMA)

Decision Scientist Python Statsmodels (ARIMA) SQL ...
Decision Scientist
  • Built time-series budget forecasts using ARIMA, based on historical retail spend and seasonality. 
  • Achieved ~82% forecasting accuracy, validated on holdout periods. 
  • Delivered monthly forecast reports with variance analysis and driver commentary. 
  • Documented assumptions and handover for business and finance teams.
r SQL python
Python Statsmodels (ARIMA) SQL reporting
Global chocolate manufacturer
Bengaluru
1 year 2 months
2017-09 - 2018-10

SAP -> Azure Data Lake Migration

Decision Scientist Azure (Data Lake Pipelines) SQL/U-SQL ...
Decision Scientist
  • Migrated SAP data to Azure Data Lake, designing dimensional models (facts/dimensions) for analytics. 
  • Built ETL/ELT pipelines using SQL/U-SQL, SSIS, and Azure Pipelines; implemented data quality checks. 
  • Managed QA -> pre-prod -> prod support, defect resolution, and runbook documentation. 
  • Delivered Tableau dashboards on governed datasets for business stakeholders. 
  • Outcome: Improved reporting efficiency and ~22% infrastructure cost reduction
U-SQL Tableau Excel SAP
Azure (Data Lake Pipelines) SQL/U-SQL SSIS Tableau Data modelling Data quality
Global chocolate manufacturer
Bengaluru
1 year 8 months
2016-01 - 2017-08

Educhimp ? IIT-JEE EdTech (Founder Project)

Co-Founder · Analytics & Growth Strategy Product development Team leading
Co-Founder · Analytics & Growth
  • Co-founded Educhimp, an IIT-JEE preparation platform. 
  • Created revision video lectures for Mathematics, Physics, and Chemistry. 
  • Ran digital marketing campaigns on Facebook and Instagram.
Strategy Product development Team leading
New Delhi

Aus- und Weiterbildung

Aus- und Weiterbildung

1 year 1 month
2020-11 - 2021-11

MSc in Environmental & Sustainable Development

Indira Gandhi National Open University, New Delhi, India
Indira Gandhi National Open University, New Delhi, India
4 years 2 months
2013-05 - 2017-06

B.Tech in Electronics & Communication Engineering

Maharaja Agrasen Institute of Technology, Delhi (GGSIP University), New Delhi, India
Maharaja Agrasen Institute of Technology, Delhi (GGSIP University), New Delhi, India

Kompetenzen

Kompetenzen

Top-Skills

Machine Learning (Pricing - Churn - Elasticity - Regression - Classification) AI (LLMs - N8N - RAGs - embeddings) Data Engineering & BI (dbt - Airflow - AWS - SQL - S3 - Looker - python - git) SQL S3 Looker python git dbt Tableau regression classification jupyter notebook a/b testing airflow REST pricing churn llm Data Engineer Strategieberatung Databricks Claude code

Aufgabenbereiche

Project Management
Experte
Analytics
Experte
Data Engineering
Experte
Data Science
Experte
Stakeholder communication
Experte
Hiring
Fortgeschritten
Mentoring
Fortgeschritten
Strategy
Experte
Executive reporting
Experte
Cross-functional project delivery
Experte
AI
Experte
LLMs
Experte
Requirement Gathering
Experte

Produkte / Standards / Erfahrungen / Methoden

Confluence
Experte
JIRA
Experte
GIT
Experte
Claude Code
Experte
Cursor
Fortgeschritten
Miro
Experte
AWS
Experte
Azure
Experte
Google big query
Experte
Data bricks
Experte
Airflow
Experte
Looker
Experte
Tableau
Fortgeschritten
Excel
Experte
MS office 365
Fortgeschritten
Athena
Experte
Redshift
Experte

PROFILE

Senior data & ML analyst with 8+ years across fintech, insurance and travel. Developed & deployed ML/LLM solutions for pricing, customer retention, NPS and content quality, delivering ?10M+ impact, playing a key role in cross-functional projects and advising C-suite stakeholders.


TECHNICAL & LEADERSHIP EXPERTISE

  • Technical: SQL, Python (pandas,scikit-learn), dbt, Airflow, AWS (S3, Athena, Redshift), Git, REST APIs
  • AI/ML: XGBoost, LightGBM, logistic/linear models, A/B testing, TensorFlow, LLMs (RAG, embeddings, evaluation), Transformers
  • BI: Looker (LookML), Tableau, Jupyter Notebook, Excel
  • Leadership: hiring & mentorship, cross-functional delivery, C-suite communication, cost & risk analytics


EXPERIENCE

2022 - Present

Role: Senior Data Analyst 

Customer: HOLIDU


Tasks:

Managed, mentored & hired analysts | Travel Tech


Apartment quality & photo ranking

  • Developed quality signal for partners (photos, reviews, facilities, min-stay), kept separate from price/availability; evaluated on CTR, CTB, and normalised Net eGMV
  • Implemented an OpenAI CLIP photo score (0?100) for the apartments and re-ranked the main photo
  • Designed and ran A/B experiments focused on CTR and CTB, confirming higher click-through rates while maintaining stable conversion to booking, which drove a +4% lift in net eGMV


NPS text analytics with LLMs

  • Developed an NPS analysis pipeline using Python, OpenAI, Redshift, S3, DuckDB, and Airflow
  • Reduced manual effort by 70% in analysing 200+ NPS visualisations, enabling faster detection of customer pain points


Data modelling, dashboarding & experimentation

  • Designed end-to-end data pipelines in Airflow using a medallion architecture (S3 ? dbt ? Redshift), with data quality checks and automated alerts via Opsgenie and Slack
  • Developed and maintained dashboards in Looker covering key company metrics, host and guest behavior patterns, and financial reporting, supporting both operational and strategic decision-making
  • AB test infrastructure: Built a Redshift?S3 data setup for the guest team to track past and ongoing A/B tests, giving clear visibility into metrics like value per user, GMV per user, session trends, and experiment significance (p-values)


Churn prediction & pricing experiments

  • Built and deployed an XGBoost churn model (AUC 0.81) with automated scoring and triggers, identifying the top 4% highest-risk hosts
  • Implemented targeted credit interventions validated through A/B testing, reducing churn by 10% and retaining ~?1M+ in annual revenue.


2020 - 2022

Role: Team Lead Data Analyst

Customer: ACKO


Tasks:

Managed & hired analysts and data scientists | Insurance Tech


Dynamic Pricing Optimisation -?5M+ Revenue Growth

  • Developed a lightGBM pricing elasticity model (AUC=0.72) that increased premiums by 6% (~?5M+/year) while maintaining conversion rates
  • Ran A/B experiments to validate dynamic pricing strategies against traditional rule-based methods


Loss ratio improvement:

Identified high-risk segments (e.g., fuel type, website behavior,pincode) to cut loss ratios by 12%.


2018 - 2020

Role: Decision Analyst 

Customer: KABBAGE (Acquired by American Express)


Tasks:

USA?s lending fintech

  • Analysed and optimised underwriting strategies, cutting bad debt by 9% through granular risk segmentation (industry, FICO, line bands) and refining qualification criteria
  • Built logistic regression and LightGBM models (AUC 0.68) to predict 90-day customer activation, driving an 8% uplift in onboarding rates via targeted campaigns


2017 - 2018

Role: Decision Scientist

Customer: MU-SIGMA


Tasks:

Fortune 500 analytics consulting | Retail/CPG focus

  • Worked on Azure ETL migration, cutting infrastructure costs by 22% and enhancing reporting efficiency

Betriebssysteme

windows
Experte
mac
Experte
linux
Fortgeschritten

Programmiersprachen

SQL
Experte
Python
Experte
R
Experte
dbt
Experte
C++
Fortgeschritten
HTML
Fortgeschritten
Java
Fortgeschritten

Branchen

Branchen

  • Banking
  • Finance
  • Retail
  • Consultancy
  • Travel
  • Insurance

Vertrauen Sie auf Randstad

Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung

Fragen?

Rufen Sie uns an +49 89 500316-300 oder schreiben Sie uns:

Das Freelancer-Portal

Direktester geht's nicht! Ganz einfach Freelancer finden und direkt Kontakt aufnehmen.