ProfileBioinformatics Data Scientist with 9+ years of applied research and healthcare industry in precision medicine and clinical genomics. Skilled in building and evaluating ML models - regression, classification, generative AI (VAEs), and transformers - and in developing workflows for biomarker discovery and translational research. Thrive in international, cross-functional, and creative environments.
SkillsTechnical skills
- ML & AI: statistical tests - incl. t-tests, Wilcoxon tests; regression - linear, logistic, Cox/survival analysis, elastic net/lasso/ridge; classification - random forest, XGBoost, SVM, LDA; clustering - K-means, EM, probabilistic models - Hidden Markov Models (HMMs), linear Gaussian state-space models; dimensionality reduction & factorization - PCA, t-SNE, MOFA, NMF; sampling/optimization - replica exchange monte carlo; deep learning - variational autoencoders (VAEs), CNNs, transformers, retrieval-augmented generation (RAG), federated learning
- MLOps: workflow languages - Nextflow, Snakemake; SLURM, Grid Engine, Docker/Singularity, Kubernetes, AWS, DigitalOcean, workflow engines - Galaxy
- Version Control and Software Management: Linux/Unix, git, svn, conda, GNU Guix
- Bioinformatic Tools: samtools, BEDtools, GATK, IGV, Bowtie2, BWA, Bismark, BLAST
- Omics Databases: TCGA, Roadmap Epigenomics, TEMPUS, GTEx, Ensembl, NCBI, GenomOncology
Soft Skills
- Cross-functional collaboration and independent project leadership across cross-disciplinary teams, clear communication of complex concepts to technical and non-technical audiences, adaptability and agility in evolving scientific and technical environments, analytical thinking and problem solving, project planning and mentoring, scientific writing and international presentation