ProfileThe candidate received his doctorate in computer science from the University of Konstanz at the age of 25. During his academic career, he worked in the areas of dimensionality reduction and graph embedding, and his work has been recognized by the scientific community. As a (senior) data scientist, the candidate focuses on LLMs/RAGs, recommender systems, knowledge graphs, and classical machine learning. His most notable work concerns the development and implementation of a recommender system for the ARD audio library. He is also a fiction author.
Computer SkillsPython
- Numpy
- Pandas
- scikit-learn
- TensorFlow
- Keras
- PyTorch
- PyG
- Networkx
- Matplotlib/ Seaborn
- XGBoost/ LightGBM/ CatBoost
- SpaCy/ NLTK
Other- AWS:
- Azure:
- Agile Development:
- Kubernetes
- Streamlit
- Spark
ScriptingTypesettingRecommender Systems- collaborative filtering (linear models, matrix-factorization)
- ensemble models
- bandit-based models
- graph-based recommenders
- neural-network recommenders
Software
Falcone ? a graph profiling software based on multidimensional scaling techniques, written in Java
Deep Learning Frameworks
- TensorFlow & Keras: Extensive experience in building and deploying neural network models.
- PyTorch: Proficient in model development and experimentation, with a focus on dynamic computation graphs.
- PyTorch Geometric (PyG): Skilled in implementing graph neural networks for complex relational data.
- PyTorch Lightning: Familiar with this framework for scalable and efficient deeplearning model training.
Vector Search Databases
- Milvus: Knowledgeable in building similarity search applications using this highly scalable platform.
- Pinecone: Experience in implementing vector search for machine learning models in production.
Databases & Data Warehousing
- Athena: Proficient in serverless queries with SQL on large-scale data directly in S3.
- Snowflake: Skilled in utilizing this cloud data platform for scalable analytics.
- Redis: Experienced in using this in-memory database for caching and real-time analytics.
- PostgreSQL: Strong understanding of relational database management and development
Cloud Computing Platforms
- AWS: Extensive experience with Sagemaker for ML model development and deployment; Redshift for data warehousing; Lambda for serverless computing; Personalize recipes for Recommender models.
- Google Cloud Platform(GCP): Proficient with BigQuery for data warehousing; developed a Recommender system on this cloud provider.
Knowledge Graphs
- Neo4j: In-depth experience with this graph database for building knowledge graphs and complex queries.
Data Engineering and Stream Processing
- Apache Kafka
- Advanced proficiency in stream processing systems for building faulttolerant, scalable real-time data pipelines.
- Analytics Databases
- Apache Druid: Experienced in real-time analytics with Druid, enabling interactive queries and insights on large-scale datasets.
- Data Visualization Tools
- ?Grafana: Experienced in deploying Grafana for comprehensive monitoring and visualization of metrics and logs across various data sources.
Work Experience09/2023 - todaySenior Data Scientist (Freelancer)
08/2021 - 08/2023Senior Data Scientist
Bayerischer Rundfunk/.pub, Munich (Germany)
- research on state-of-the-art developments in Recommender Systems in media (audio-, video- and textual content)
- implementation of a Recommender System powering ARD Audiothek (one one Germany?s most popular audio-on-demand platforms). The deployed production model has 15% higher precision than the previous
- NLP projects: entity recognition and redundancy removal
06/2020 - 07/2021Data Scientist
Yewno/Entropy387, Sarajevo (BiH)/San Francisco (US)
- research on and implementation of graph-based stock-market prediction models
- pricing engine implementation (achieved a 10% improvement of MAE with respect to the previous model)
06/2018 - 05/2019Post-Doc
Roma Tre University, Rome, Italy
- research on Graph Morphing Algorithms
- implementation of Graph Drawing Algorithms
04/2014 - 05/2016Software Engineer
Visteon (former Johnson Controls), Karlsruhe, Germany
- (re)implementation of a testing software
- implementation of Finite State Machines
- code generation