- Lead the consulting on new data architecture (Data Warehouse vs Data Lakehouse) for a German manufacturing company & facilitated process for adoption of DLH architecture on Azure + Databricks
- Planned and developed a proof of concept logical data warehouse on Azure (Synapse, ADLS)
Databricks Certified Developer - Apache Spark 2.x for Python
2018
Freie Universität Berlin, Technische Universität Berlin
M.Sc. Physics
2015
Ruprecht-Karls-Universität Heidelberg
B.Sc. Physics
Big Data Engineer with 5+ years of international industry experience and a proven track record of designing data intensive pipelines as well as data mining algorithms. I am a Databricks certified Spark developer who is enthusiastic about scalable data architectures that drive measurable business value.
Skills
Data Engineering
Hadoop (HDFS, YARN), Spark, SQL, BigQuery, Vertica, Postgres, Elasticsearch, MongoDB, Scylla, Celery, Redis, bash scripting, Airflow, Jenkins, Docker, Flask, Django
Data Science
Tensorflow, Jupyter, gensim, spaCy, Hugging Face, Pandas, Numpy
BI
Tableau
DevOps
GCP, AWS, Azure, Debian, Ubuntu, Nginx, Terraform
SIDE PROJECTS & HACKATHONS
2019 - 2020
Kunde: rssBriefing
Tasks:
Built an briefing web app in Python powered by NLP models: (URL auf Anfrage)
2019 - 2019
Kunde: DEEP BERLIN hackathon
Tasks:
Contributed to CV object detection and classification team, built a sliding window module
- Lead the consulting on new data architecture (Data Warehouse vs Data Lakehouse) for a German manufacturing company & facilitated process for adoption of DLH architecture on Azure + Databricks
- Planned and developed a proof of concept logical data warehouse on Azure (Synapse, ADLS)
Databricks Certified Developer - Apache Spark 2.x for Python
2018
Freie Universität Berlin, Technische Universität Berlin
M.Sc. Physics
2015
Ruprecht-Karls-Universität Heidelberg
B.Sc. Physics
Big Data Engineer with 5+ years of international industry experience and a proven track record of designing data intensive pipelines as well as data mining algorithms. I am a Databricks certified Spark developer who is enthusiastic about scalable data architectures that drive measurable business value.
Skills
Data Engineering
Hadoop (HDFS, YARN), Spark, SQL, BigQuery, Vertica, Postgres, Elasticsearch, MongoDB, Scylla, Celery, Redis, bash scripting, Airflow, Jenkins, Docker, Flask, Django
Data Science
Tensorflow, Jupyter, gensim, spaCy, Hugging Face, Pandas, Numpy
BI
Tableau
DevOps
GCP, AWS, Azure, Debian, Ubuntu, Nginx, Terraform
SIDE PROJECTS & HACKATHONS
2019 - 2020
Kunde: rssBriefing
Tasks:
Built an briefing web app in Python powered by NLP models: (URL auf Anfrage)
2019 - 2019
Kunde: DEEP BERLIN hackathon
Tasks:
Contributed to CV object detection and classification team, built a sliding window module