Knowledge management system in the regulatory environment of the decommissioning of nuclear facilities
Data Scientist / ResearcherBig DataLarge Language ModelsRAG...
Data Scientist / Researcher
Data Scientist/Researcher in research project Alisa34 ? Automation of regulatory processes with AI (2025, University of Applied Science of South Westphalia)
Development of an AI-supported framework for the semi-automated processing of authorisation processes in the field of nuclear decommissioning. With the help of open-source large language models (LLMs), neurosymbolic modelling and agent-based systems, regulatory documents are to be evaluated more efficiently, knowledge mapped consistently and decisions supported in a well-founded manner. The project is being implemented together with an industry partner, which ensures direct practical relevance and the subsequent application of the developed solutions in real approval processes.
My Contribution to Success:
Document processing:
Extraction and structuring of regulatory content (e.g. laws, safety assessments, technical instructions) from heterogeneous formats (PDF, Word, HTML, scanned documents) using OCR, document layout detection and classic NLP techniques.
Semantic search and retrieval:
Development of a retrieval framework for semantic document search and knowledge linking (incl. ontology connection), using classic RAG approaches and GraphRAG methods
Training of LLMs:
Domain-specific fine-tuning of LLMs for better indexing of German-language regulatory texts and for context-sensitive answers to questions in the authorisation process.
MongoDBNeo4j
Big DataLarge Language ModelsRAGKnowledge GraphsOntologiesNatural Language ProcessingPythonAgentsAgent-WorkflowsAgentic RAGLinuxOCRDocument ProcessingFinetuningGraph DatabaseLangchainGitHuggingface Transformersvector databasesgraph
databasesNeo4jMongoDB AtlasFAISSNoSQLDSPYSpacyOllamaPerceptivenessstructured problem solvingteamworkLLM FinetuningPrompt EngineeringGraphRAGSemantic SearchGraph TheorySoftware EngineeringStakeholder ManagementOntology
FH Südwestfalen
Remote
5 months
2024-12 - 2025-04
Master Thesis - GraphRAG in data-driven decommissioning of nuclear facilities
StudentLangchainPythonGit...
Student
Development and evaluation of GraphRAG approaches to improve knowledge access and decision support in the field of nuclear decommissioning. The aim was to make regulatory, legal and safetyrelated documents more efficiently accessible - while complying with high data protection requirements and using purely local open source models. The focus was on modelling document contexts, ontologies and retrieval strategies in a safety-critical, knowledge-intensive field of application.
My Contribution to Success
Design, implementation and comparison of two GraphRAG architectures with classic RAG pipelines to improve semantic retrieval in regulatory domains.
Development of a locally running, data protection-compliant RAG infrastructure based on open source LLMs including PDF parsing, OCR, chunking, keyword normalisation and Neo4j graph modelling
Results and Benefits
The work shows how GraphRAG methods can improve knowledge access in highly complex and regulated domains - with a focus on transparency, explainability and data protection. The methods developed are transferable to other safety-critical industries (e.g. pharmaceuticals, energy, aviation) and are incorporated into current industry and research projects, in particular for the further development of RAG systems for German-speaking specialised domains.
LangchainPythonGitHuggingface TransformersVector DatabasesGraph
DatabasesNeo4jMongoDB AtlasFAISSNoSQLDSPYSpacyOllamaPerceptivenessstructured problem solvingteamworkLLM FinetuningPrompt EngineeringRAGSemantic SearchGraph TheorySoftware EngineeringStakeholder ManagementNatural Language Processing (NLP)
University of Applied Science of South Westphalia
3 years
2022-04 - 2025-03
Implementation of an MES
Consultant for digital Production & IoTOPC-UARESTC#...
Consultant for digital Production & IoT
Implementation of an MES system for digitalisation of production processes. Development and support of the interfaces
As part of the project, a Manufacturing Execution System (MES) was implemented to digitalise and optimise production processes and create end-to-end transparency in real time. The aim was to ensure seamless communication between production facilities, IT systems and the new MES in order to utilise production data more efficiently and automate processes.
The main task was to develop and support the interfaces that linked the MES with existing systems such as SAP and the production facilities via OPC-UA.
My Contribution to Success
Interface development:
Implementation of OPC-UA interfaces for connecting production systems to the MES, integration with SAP for bidirectional data transfer and SQL database connections.
Technical support:
Monitoring, optimisation and troubleshooting of the interfaces during the project and operational phase, ensuring performance and availability.
Cooperation:
Close coordination with team members and external partners, support with tests and user training in handling the interfaces.
Results and Benefits
Optimised production processes:
By integrating the MES, processes could be organised more efficiently and monitored in real time.
Seamless system communication:
The interfaces developed enabled smooth data transfer between machines, IT systems and the MES.
Future-proof architecture:
The solution is flexibly scalable and offers a stable basis for further digitalisation measures.
OPC-UARESTC#SAPSQLABAPStrong communication skillsproblem-solving skillsteamworkInterface developmentsystem integrationreal-time data processing
undisclosed metal processing company group
7 months
2024-08 - 2025-02
Consulting on RAG Application and LLMs
Data Scientist, ConsultantAzureLangchainPython...
Data Scientist, Consultant
The aim of the project was to demonstrate the potential and use cases of Retrieval Augmented Generation (RAG) and Large Language Models (LLMs) to the company. A proof of concept (PoC) was to show how these technologies can be used profitably to increase efficiency and added value in various business areas.
My Contribution to Success
Consulting and conception:
Analysis of business requirements, consulting on the selection and use of technologies such as Langchain, Huggingface Transformers and Vector & Graph Databases, as well as support with the architecture concept on Azure.
Coordination and implementation:
Close cooperation with the data scientist on PoC implementation and advice on best practices in the use of RAG and LLMs.
Results and Benefits
Proof of value creation:
The PoC successfully demonstrated that RAGs and LLMs can deliver significant efficiency and insight gains for business processes.
Technological basis:
Development of a scalable architecture that enables the use of LLMs and RAGs to be extended to other areas of the company.
Strategic consulting:
Clear recommendations for the long-term use of the technologies, customised to the individual requirements of the company
Consultant for digital Production & IoTC#OPC-UAREST...
Consultant for digital Production & IoT
Development of a digital shift book to optimise communication at production level
As part of the digitalisation of production, a digital shift book was developed that replaces manual processes with a paperless solution and significantly improves communication and the flow of information at production level. The aim of the project was to implement a centralised, intuitive platform for recording and transferring shift information that is directly integrated into existing IT and OT systems.
The project was of crucial importance to promote efficiency, traceability and transparency in the production processes.
My Contribution to Success
Interface development:
Implementation of REST interfaces to integrate the existing MES and OPC-UA for connecting production machines for real-time data.
System integration:
Ensuring seamless integration into the IT and OT infrastructure in close cooperation with the development of the SAP UI5 interface and SAP customisation.
Requirements management:
Translation of production requirements into technical solutions through regular coordination with the specialist departments.
Results and Benefits
Efficient communication:
Significantly improved shift handovers and optimised information flow at production level.
Internal demand:
Success led to interest from other production companies within the group.
Paperless processes:
Replaced manual documentation, reduced errors and labour.
C#OPC-UARESTSQLSAP-UI5SAPABAPGiteaTeamworkstrong communication skillsstructured problem solvingRequirements managementinterface development
undisclosed metal processing company group
2 years 11 months
2022-04 - 2025-02
Responsible for Developing and Maintaining the Data Integration Plattform
Consultant - Digital Production & IoTC#OPC UAMQTT...
Consultant - Digital Production & IoT
Consultancy and implementation of IT solutions, mainly in the production environment. From evaluation of feasibility to finished solutions.
Development of Proof-of-Concepts with LLMs and RAGs
Data Scientist, AI EngineerPythonAWSFAISS...
Data Scientist, AI Engineer
The aim of the project was to develop proof-of-concepts (PoCs) that translate current research findings on large language models (LLMs) and retrieval augmented generation (RAG) into practical applications. The PoCs served as presentation and test platforms to demonstrate the performance of modern AI technologies and secure funding and corporate support.
A particular focus was on the use of open-source models and the customisation of the systems for outof-domain use cases that deviated from the data in typical researchpublications.
My Contribution to Success
Development of PoCs:
Creation of interactive AI demos with the Langchain tech stack and Streamlit combined with retrieval mechanisms via vector and graph databases, as well as the use of LLMs. Implementation of PDF parsing to make unstructured data usable.
Results and Benefits
Funding and cooperation:
PoCs played a key role in securing funding totalling EUR 4 million and gaining the support of other companies.
Innovative solutions:
Successful adaptation of LLMs and RAG approaches to ?out-of-domain? data, expanding the range of possible applications.
Interdisciplinary collaboration:
Successful coordination with researchers and scientific leaders to develop innovative PoCs and demonstrate their potential.
PythonAWSFAISSPostgreSQLGitStreamlitLangchainLanggraphLangsmithNeo4JPytorchHuggingface TransformersSpacyNLTKPDF parsing librariesLearning abilityperceptivenesscreativityinnovation mentalityRetrieval augmented generationdata integrationAI-supported search and processing
systemsopen-source large language models
South Westphalia University of Applied Sciences
5 months
2024-05 - 2024-09
Development of RAG and Agent Prototypes for funding acquisition
Data ScientistBig DataLarge Language ModelsRAG...
Data Scientist
Development of RAG applications and training of LLMs.
Research contribution to RAG and LLM.
Successful contribution to the acquisition of millions in funding.
MongoDBNeo4jPostgreSQL
Big DataLarge Language ModelsRAGKnowledge GraphsOntologiesNatural Language ProcessingPythonAgentsAgent-WorkflowsAgentic RAGLinuxOCRDocument ProcessingFinetuningGraph Database
University of Applied Science of South Westphalia
4 months
2024-05 - 2024-08
Acquisition and Evaluation of Machine and Process Data
Bachelor StudentOPC-UASQLC#...
Bachelor Student
Acquisition and evaluation of machine and process data for process optimisation
The aim of the project was to develop a system for systematically recording and analysing machine and process data in order to make production processes and machine commissioning more efficient. Databased insights were used to uncover optimisation potential and create an improved understanding of complex production processes.
My Contribution to Success
Requirements management and coordination:
Coordination with stakeholders to define objectives and organise collaboration in an interdisciplinary team.
Technical implementation:
Development of a solution for real-time data acquisition with OPC-UA, storage in an SQL database and visualisation of the results with Microsoft PowerBI.
Process optimisation:
Analysis of the data to identify potential improvements for production and commissioning processes together with process optimisers.
Communication:
Presentation of the results and recommendations in an understandable form for technical teams and management.
Results and Benefits
Improved process understanding:
The system created deeper insights into production processes and supported data-driven decisions.
Increased efficiency during commissioning:
The commissioning of complex machines was made significantly faster and less error-prone.
Action-orientated dashboards:
The visualisation of data enabled a quick understanding and facilitated the implementation of optimisations.
Consultant for digital production & IoT, project managerC#RESTSAP...
Consultant for digital production & IoT, project manager
Migration of an IT/OT integration platform with development of a redundancy system to optimise availability and reliability
As part of a strategically important project, an integration platform was successfully migrated from version 3 to version 4 and a redundancy system was implemented at the same time in order to sustainably optimise availability and reliability. The platform is a central component of the company's digitalisation strategy and supports the implementation of Industry 4.0 initiatives, particularly in the connection between IT and OT systems.
The project was characterised by a high level of complexity, as the new platform version revealed technical weaknesses during implementation. These challenges were mastered with ease thanks to practical solutions and a consistent focus on the project objectives.
My Contribution to Success
Project management:
Planning, control and implementation of the project with responsibility for time and resource management and on-time delivery.
Technical implementation:
Migration and integration of the platform with IT and OT systems using C#, OPC-UA, ABAP, SQL, MongoDB and REST, implementation a redundancy system for high availability.
Problem-solving expertise:
Creative development of customised solutions for the challenges of the new platform version; development of a monitoring system with Grafana and InfluxDB.
Communication:
Coordination with specialist departments (production, controlling, facility management, etc.) and experts from key technologies; transparent reporting to stakeholders.
Results and Benefits
Increased system availability:
Redundancy system raises stability to a new level.
Digitalisation:
platform as the basis for Industry 4.0 applications.
Data ScientistLangchainHuggingface TransformersFAISS...
Data Scientist
As part of this project, a retrieval augmented generation (RAG) system was developed that can efficiently process and provide time-sensitive information in the context of the German Football League. The aim was to implement a powerful RAG system that combines both classic keyword-based approaches and modern methods such as time-aware retrieving and semantic search technologies.
My Contribution to Success
Hybrid RAG:
Implementation of a hybrid RAG with a time-sensitive component that evaluates retrieval results based on the time information in the query and the creation time of the documents. A keyword search and a vector retrieval and a workflow with huggingface transformers and Langchain were integrated for this purpose.
NLP and NER:
Use of Spacy and NLTK to identify football-specific entities and time-based contextualisation.
Prompt Engineering:
Development of precise prompts for relevant and accurate answers.
Results and Benefits
Powerful RAG system:
The developed system combines semantic, keyword and time-based search methods and provides relevant, time-sensitive answers.
Real-time functionality:
By optimising the retrieval and generation processes, a high level of efficiency and accuracy has been achieved that is suitable for real-time applications.
Flexibility and scalability:
The modular architecture allows easy customisation to other sports or dataintensive use cases.
LangchainHuggingface TransformersFAISSSpacyNLTKOpen-Source Large
Language Models (LLMs)GitPerceptivenessstakeholder managementself-organisationability to innovateKeyword SearchRAGNamed Entity Recognition (NER)Prompt EngineeringAugmented
SBERTRe-Ranking
South Westphalia University of Applied Sciences, undisclosed Cooperation Company
3 months
2023-04 - 2023-06
Analysis of Job Advertisements
Data ScientistPythonScrapyBeautifulSoup...
Data Scientist
As part of this university project, a system was developed to analyse job advertisements from various job portals. The aim was to gain valuable insights into the most frequently required skills in certain occupational groups and to analyse the salary structures of different sectors and companies. These analyses should help to facilitate targeted further training measures and create transparency about salary structures.
The core of the project involved collecting data using a web crawler and processing and analysing this data using modern NLP methods and language models.
My Contribution to Success
Web crawler development:
Automated collection and normalisation of relevant data such as skills, salaries and industry information using Scrapy and BeautifulSoup.
Data analysis:
Cleansing and transformation of data with Pandas; extraction of key terms with SpaCy, NLTK and with the help of language models.
Visualising results:
Creating reports and visualisations to present findings and possible applications.
Results and Benefits
Targeted further training:
Analysis revealed in-demand skills and qualification gaps.
Transparency in salaries:
Sound salary benchmarks of occupations across industries and companies.
Scalability:
Flexible system for additional data sources and specific analyses.
PythonScrapyBeautifulSoupGitPandasSpaCyNLTKHuggingface
TransformersLangChainData extractiondata analysistext processingnatural language processing
South Westphalia University of Applied Sciences
4 months
2021-11 - 2022-02
FDA-compliant Software for Machine Communication
Software developerC#Gitcryptography...
Software developer
The aim of the project was to develop a software solution that complies with the FDA's 21 CFR Part 11 guidelines and enables communication between pharmaceutical machines and third-party systems such as MES.
The software not only enabled secure and efficient integration, but also laid the foundation for a new, high-revenue business area for the machine manufacturer.
My Contribution to Success
Software development and architecture:
Development of a modular, scalable and secure software solution in C#, including communication interfaces for MES and third-party systems as well as implementation of cryptography for data integrity.
FDA compliance:
Implementation of the requirements in accordance with 21 CFR Part 11 with audit trails, electronic signatures and audit-proof traceability.
Requirements management:
Coordination with stakeholders to record and validate regulatory and technical requirements and translate them into a suitable software solution.
Results and Benefits
New sales potential:
The software solution led to the development of a new business area that generates significant sales.
FDA compliance:
Successful development of legally compliant software that fulfils the strict requirements of the pharmaceutical industry.
Efficient integration:
The solution enables secure and efficient communication between machines and third-party systems such as MES and ERP systems.
C#GitcryptographySelf-organisationstakeholder managementproblem-solving skillsolving skills
Methods: R
undisclosed mechanical engineering company for the pharmaceutical industry
Aus- und Weiterbildung
Aus- und Weiterbildung
2 years 7 months
2022-10 - 2025-04
Data Science (while working)
Master of Science, University of Applied Science Southwestphalia
Master of Science
University of Applied Science Southwestphalia
Thesis about GraphRAG in the regulatory environment of the decommissioning of nuclear facilities for rights, guidelines and technical instructions.
3 years
2017-09 - 2020-08
Electronic Engineering
Bachelor of Science, University of Applied Science Osnabrück
Bachelor of Science
University of Applied Science Osnabrück
Specialising in energy and automation technology.
Thesis about data mining and analysis of machine and process data
Certificates:
ERP4Students: SAP BW/4HANA ? Data flow and BusinessObjects reporting
ERP4Stundents: SAP Analytics Cloud ? Analysis, Planning and Integration
Microsoft Certified: Azure AI Fundamentals (AI ? 900)
Kompetenzen
Kompetenzen
Top-Skills
Data ScientistMachine LearningDeep LearningLarge Language ModelsAgentic AIRAGKnowledge GraphGraph DatabaseNatural Language ProcessingDocument ProcessingWissensdatenbankRetrieval Augmented GenerationLLMRESTBeratungStrategiePythonSQLNeo4jMongoDBOPC UA
Produkte / Standards / Erfahrungen / Methoden
Attributes:
Result orientated
Reliable
Strong communication
About me:
I am a freelance data scientist specialising in the development of intelligent data and knowledge systems - from classic natural language processing to modern solutions with large language models (LLMs) and retrieval augmented generation (RAG/GraphRAG).
I support companies in making information accessible in a structured way, optimising processes based on data and enabling well-founded decisions.
In projects, I have developed innovative technologies for funding projects worth millions, supported companies in setting up modern AI solutions and translated complex challenges into comprehensible, data-driven solutions. My way of working is characterised by ambition, creativity and a clear focus on measurable results.
In addition to my freelance work, I work as a data scientist in the GAIA-Lab, an application-oriented research project for the development of trustworthy AI systems in safety-critical fields of application.
There I contribute my expertise in NLP, GraphRAG and Large Language Models.
Skills & Experiences:
Senior Level:
Python
Langchain
Huggingface-Transformers
SentencesTransformers
REST
OPC-UA
Scarpy
RAG, Large Language Models (LLM)
Natural Language Processing (NLP)
Pandas
Requirements management
GraphRAG
Junior Level:
C#
C++
C
SQL
PyTorch
scikit-learn
Azure
AWS
SQL
Neo4J
Branchen
Branchen
Pharmaceutical industry
Nuclear technology
Energy industry
Automotive industry
Furniture industry
Metal processing for furniture
Mechanical engineering for the pharmaceutical industry
Electrical industry
Einsatzorte
Einsatzorte
Deutschland, Schweiz, Österreich
möglich
Projekte
Projekte
2 months
2025-03 - now
Knowledge management system in the regulatory environment of the decommissioning of nuclear facilities
Data Scientist / ResearcherBig DataLarge Language ModelsRAG...
Data Scientist / Researcher
Data Scientist/Researcher in research project Alisa34 ? Automation of regulatory processes with AI (2025, University of Applied Science of South Westphalia)
Development of an AI-supported framework for the semi-automated processing of authorisation processes in the field of nuclear decommissioning. With the help of open-source large language models (LLMs), neurosymbolic modelling and agent-based systems, regulatory documents are to be evaluated more efficiently, knowledge mapped consistently and decisions supported in a well-founded manner. The project is being implemented together with an industry partner, which ensures direct practical relevance and the subsequent application of the developed solutions in real approval processes.
My Contribution to Success:
Document processing:
Extraction and structuring of regulatory content (e.g. laws, safety assessments, technical instructions) from heterogeneous formats (PDF, Word, HTML, scanned documents) using OCR, document layout detection and classic NLP techniques.
Semantic search and retrieval:
Development of a retrieval framework for semantic document search and knowledge linking (incl. ontology connection), using classic RAG approaches and GraphRAG methods
Training of LLMs:
Domain-specific fine-tuning of LLMs for better indexing of German-language regulatory texts and for context-sensitive answers to questions in the authorisation process.
MongoDBNeo4j
Big DataLarge Language ModelsRAGKnowledge GraphsOntologiesNatural Language ProcessingPythonAgentsAgent-WorkflowsAgentic RAGLinuxOCRDocument ProcessingFinetuningGraph DatabaseLangchainGitHuggingface Transformersvector databasesgraph
databasesNeo4jMongoDB AtlasFAISSNoSQLDSPYSpacyOllamaPerceptivenessstructured problem solvingteamworkLLM FinetuningPrompt EngineeringGraphRAGSemantic SearchGraph TheorySoftware EngineeringStakeholder ManagementOntology
FH Südwestfalen
Remote
5 months
2024-12 - 2025-04
Master Thesis - GraphRAG in data-driven decommissioning of nuclear facilities
StudentLangchainPythonGit...
Student
Development and evaluation of GraphRAG approaches to improve knowledge access and decision support in the field of nuclear decommissioning. The aim was to make regulatory, legal and safetyrelated documents more efficiently accessible - while complying with high data protection requirements and using purely local open source models. The focus was on modelling document contexts, ontologies and retrieval strategies in a safety-critical, knowledge-intensive field of application.
My Contribution to Success
Design, implementation and comparison of two GraphRAG architectures with classic RAG pipelines to improve semantic retrieval in regulatory domains.
Development of a locally running, data protection-compliant RAG infrastructure based on open source LLMs including PDF parsing, OCR, chunking, keyword normalisation and Neo4j graph modelling
Results and Benefits
The work shows how GraphRAG methods can improve knowledge access in highly complex and regulated domains - with a focus on transparency, explainability and data protection. The methods developed are transferable to other safety-critical industries (e.g. pharmaceuticals, energy, aviation) and are incorporated into current industry and research projects, in particular for the further development of RAG systems for German-speaking specialised domains.
LangchainPythonGitHuggingface TransformersVector DatabasesGraph
DatabasesNeo4jMongoDB AtlasFAISSNoSQLDSPYSpacyOllamaPerceptivenessstructured problem solvingteamworkLLM FinetuningPrompt EngineeringRAGSemantic SearchGraph TheorySoftware EngineeringStakeholder ManagementNatural Language Processing (NLP)
University of Applied Science of South Westphalia
3 years
2022-04 - 2025-03
Implementation of an MES
Consultant for digital Production & IoTOPC-UARESTC#...
Consultant for digital Production & IoT
Implementation of an MES system for digitalisation of production processes. Development and support of the interfaces
As part of the project, a Manufacturing Execution System (MES) was implemented to digitalise and optimise production processes and create end-to-end transparency in real time. The aim was to ensure seamless communication between production facilities, IT systems and the new MES in order to utilise production data more efficiently and automate processes.
The main task was to develop and support the interfaces that linked the MES with existing systems such as SAP and the production facilities via OPC-UA.
My Contribution to Success
Interface development:
Implementation of OPC-UA interfaces for connecting production systems to the MES, integration with SAP for bidirectional data transfer and SQL database connections.
Technical support:
Monitoring, optimisation and troubleshooting of the interfaces during the project and operational phase, ensuring performance and availability.
Cooperation:
Close coordination with team members and external partners, support with tests and user training in handling the interfaces.
Results and Benefits
Optimised production processes:
By integrating the MES, processes could be organised more efficiently and monitored in real time.
Seamless system communication:
The interfaces developed enabled smooth data transfer between machines, IT systems and the MES.
Future-proof architecture:
The solution is flexibly scalable and offers a stable basis for further digitalisation measures.
OPC-UARESTC#SAPSQLABAPStrong communication skillsproblem-solving skillsteamworkInterface developmentsystem integrationreal-time data processing
undisclosed metal processing company group
7 months
2024-08 - 2025-02
Consulting on RAG Application and LLMs
Data Scientist, ConsultantAzureLangchainPython...
Data Scientist, Consultant
The aim of the project was to demonstrate the potential and use cases of Retrieval Augmented Generation (RAG) and Large Language Models (LLMs) to the company. A proof of concept (PoC) was to show how these technologies can be used profitably to increase efficiency and added value in various business areas.
My Contribution to Success
Consulting and conception:
Analysis of business requirements, consulting on the selection and use of technologies such as Langchain, Huggingface Transformers and Vector & Graph Databases, as well as support with the architecture concept on Azure.
Coordination and implementation:
Close cooperation with the data scientist on PoC implementation and advice on best practices in the use of RAG and LLMs.
Results and Benefits
Proof of value creation:
The PoC successfully demonstrated that RAGs and LLMs can deliver significant efficiency and insight gains for business processes.
Technological basis:
Development of a scalable architecture that enables the use of LLMs and RAGs to be extended to other areas of the company.
Strategic consulting:
Clear recommendations for the long-term use of the technologies, customised to the individual requirements of the company
Consultant for digital Production & IoTC#OPC-UAREST...
Consultant for digital Production & IoT
Development of a digital shift book to optimise communication at production level
As part of the digitalisation of production, a digital shift book was developed that replaces manual processes with a paperless solution and significantly improves communication and the flow of information at production level. The aim of the project was to implement a centralised, intuitive platform for recording and transferring shift information that is directly integrated into existing IT and OT systems.
The project was of crucial importance to promote efficiency, traceability and transparency in the production processes.
My Contribution to Success
Interface development:
Implementation of REST interfaces to integrate the existing MES and OPC-UA for connecting production machines for real-time data.
System integration:
Ensuring seamless integration into the IT and OT infrastructure in close cooperation with the development of the SAP UI5 interface and SAP customisation.
Requirements management:
Translation of production requirements into technical solutions through regular coordination with the specialist departments.
Results and Benefits
Efficient communication:
Significantly improved shift handovers and optimised information flow at production level.
Internal demand:
Success led to interest from other production companies within the group.
Paperless processes:
Replaced manual documentation, reduced errors and labour.
C#OPC-UARESTSQLSAP-UI5SAPABAPGiteaTeamworkstrong communication skillsstructured problem solvingRequirements managementinterface development
undisclosed metal processing company group
2 years 11 months
2022-04 - 2025-02
Responsible for Developing and Maintaining the Data Integration Plattform
Consultant - Digital Production & IoTC#OPC UAMQTT...
Consultant - Digital Production & IoT
Consultancy and implementation of IT solutions, mainly in the production environment. From evaluation of feasibility to finished solutions.
Development of Proof-of-Concepts with LLMs and RAGs
Data Scientist, AI EngineerPythonAWSFAISS...
Data Scientist, AI Engineer
The aim of the project was to develop proof-of-concepts (PoCs) that translate current research findings on large language models (LLMs) and retrieval augmented generation (RAG) into practical applications. The PoCs served as presentation and test platforms to demonstrate the performance of modern AI technologies and secure funding and corporate support.
A particular focus was on the use of open-source models and the customisation of the systems for outof-domain use cases that deviated from the data in typical researchpublications.
My Contribution to Success
Development of PoCs:
Creation of interactive AI demos with the Langchain tech stack and Streamlit combined with retrieval mechanisms via vector and graph databases, as well as the use of LLMs. Implementation of PDF parsing to make unstructured data usable.
Results and Benefits
Funding and cooperation:
PoCs played a key role in securing funding totalling EUR 4 million and gaining the support of other companies.
Innovative solutions:
Successful adaptation of LLMs and RAG approaches to ?out-of-domain? data, expanding the range of possible applications.
Interdisciplinary collaboration:
Successful coordination with researchers and scientific leaders to develop innovative PoCs and demonstrate their potential.
PythonAWSFAISSPostgreSQLGitStreamlitLangchainLanggraphLangsmithNeo4JPytorchHuggingface TransformersSpacyNLTKPDF parsing librariesLearning abilityperceptivenesscreativityinnovation mentalityRetrieval augmented generationdata integrationAI-supported search and processing
systemsopen-source large language models
South Westphalia University of Applied Sciences
5 months
2024-05 - 2024-09
Development of RAG and Agent Prototypes for funding acquisition
Data ScientistBig DataLarge Language ModelsRAG...
Data Scientist
Development of RAG applications and training of LLMs.
Research contribution to RAG and LLM.
Successful contribution to the acquisition of millions in funding.
MongoDBNeo4jPostgreSQL
Big DataLarge Language ModelsRAGKnowledge GraphsOntologiesNatural Language ProcessingPythonAgentsAgent-WorkflowsAgentic RAGLinuxOCRDocument ProcessingFinetuningGraph Database
University of Applied Science of South Westphalia
4 months
2024-05 - 2024-08
Acquisition and Evaluation of Machine and Process Data
Bachelor StudentOPC-UASQLC#...
Bachelor Student
Acquisition and evaluation of machine and process data for process optimisation
The aim of the project was to develop a system for systematically recording and analysing machine and process data in order to make production processes and machine commissioning more efficient. Databased insights were used to uncover optimisation potential and create an improved understanding of complex production processes.
My Contribution to Success
Requirements management and coordination:
Coordination with stakeholders to define objectives and organise collaboration in an interdisciplinary team.
Technical implementation:
Development of a solution for real-time data acquisition with OPC-UA, storage in an SQL database and visualisation of the results with Microsoft PowerBI.
Process optimisation:
Analysis of the data to identify potential improvements for production and commissioning processes together with process optimisers.
Communication:
Presentation of the results and recommendations in an understandable form for technical teams and management.
Results and Benefits
Improved process understanding:
The system created deeper insights into production processes and supported data-driven decisions.
Increased efficiency during commissioning:
The commissioning of complex machines was made significantly faster and less error-prone.
Action-orientated dashboards:
The visualisation of data enabled a quick understanding and facilitated the implementation of optimisations.
Consultant for digital production & IoT, project managerC#RESTSAP...
Consultant for digital production & IoT, project manager
Migration of an IT/OT integration platform with development of a redundancy system to optimise availability and reliability
As part of a strategically important project, an integration platform was successfully migrated from version 3 to version 4 and a redundancy system was implemented at the same time in order to sustainably optimise availability and reliability. The platform is a central component of the company's digitalisation strategy and supports the implementation of Industry 4.0 initiatives, particularly in the connection between IT and OT systems.
The project was characterised by a high level of complexity, as the new platform version revealed technical weaknesses during implementation. These challenges were mastered with ease thanks to practical solutions and a consistent focus on the project objectives.
My Contribution to Success
Project management:
Planning, control and implementation of the project with responsibility for time and resource management and on-time delivery.
Technical implementation:
Migration and integration of the platform with IT and OT systems using C#, OPC-UA, ABAP, SQL, MongoDB and REST, implementation a redundancy system for high availability.
Problem-solving expertise:
Creative development of customised solutions for the challenges of the new platform version; development of a monitoring system with Grafana and InfluxDB.
Communication:
Coordination with specialist departments (production, controlling, facility management, etc.) and experts from key technologies; transparent reporting to stakeholders.
Results and Benefits
Increased system availability:
Redundancy system raises stability to a new level.
Digitalisation:
platform as the basis for Industry 4.0 applications.
Data ScientistLangchainHuggingface TransformersFAISS...
Data Scientist
As part of this project, a retrieval augmented generation (RAG) system was developed that can efficiently process and provide time-sensitive information in the context of the German Football League. The aim was to implement a powerful RAG system that combines both classic keyword-based approaches and modern methods such as time-aware retrieving and semantic search technologies.
My Contribution to Success
Hybrid RAG:
Implementation of a hybrid RAG with a time-sensitive component that evaluates retrieval results based on the time information in the query and the creation time of the documents. A keyword search and a vector retrieval and a workflow with huggingface transformers and Langchain were integrated for this purpose.
NLP and NER:
Use of Spacy and NLTK to identify football-specific entities and time-based contextualisation.
Prompt Engineering:
Development of precise prompts for relevant and accurate answers.
Results and Benefits
Powerful RAG system:
The developed system combines semantic, keyword and time-based search methods and provides relevant, time-sensitive answers.
Real-time functionality:
By optimising the retrieval and generation processes, a high level of efficiency and accuracy has been achieved that is suitable for real-time applications.
Flexibility and scalability:
The modular architecture allows easy customisation to other sports or dataintensive use cases.
LangchainHuggingface TransformersFAISSSpacyNLTKOpen-Source Large
Language Models (LLMs)GitPerceptivenessstakeholder managementself-organisationability to innovateKeyword SearchRAGNamed Entity Recognition (NER)Prompt EngineeringAugmented
SBERTRe-Ranking
South Westphalia University of Applied Sciences, undisclosed Cooperation Company
3 months
2023-04 - 2023-06
Analysis of Job Advertisements
Data ScientistPythonScrapyBeautifulSoup...
Data Scientist
As part of this university project, a system was developed to analyse job advertisements from various job portals. The aim was to gain valuable insights into the most frequently required skills in certain occupational groups and to analyse the salary structures of different sectors and companies. These analyses should help to facilitate targeted further training measures and create transparency about salary structures.
The core of the project involved collecting data using a web crawler and processing and analysing this data using modern NLP methods and language models.
My Contribution to Success
Web crawler development:
Automated collection and normalisation of relevant data such as skills, salaries and industry information using Scrapy and BeautifulSoup.
Data analysis:
Cleansing and transformation of data with Pandas; extraction of key terms with SpaCy, NLTK and with the help of language models.
Visualising results:
Creating reports and visualisations to present findings and possible applications.
Results and Benefits
Targeted further training:
Analysis revealed in-demand skills and qualification gaps.
Transparency in salaries:
Sound salary benchmarks of occupations across industries and companies.
Scalability:
Flexible system for additional data sources and specific analyses.
PythonScrapyBeautifulSoupGitPandasSpaCyNLTKHuggingface
TransformersLangChainData extractiondata analysistext processingnatural language processing
South Westphalia University of Applied Sciences
4 months
2021-11 - 2022-02
FDA-compliant Software for Machine Communication
Software developerC#Gitcryptography...
Software developer
The aim of the project was to develop a software solution that complies with the FDA's 21 CFR Part 11 guidelines and enables communication between pharmaceutical machines and third-party systems such as MES.
The software not only enabled secure and efficient integration, but also laid the foundation for a new, high-revenue business area for the machine manufacturer.
My Contribution to Success
Software development and architecture:
Development of a modular, scalable and secure software solution in C#, including communication interfaces for MES and third-party systems as well as implementation of cryptography for data integrity.
FDA compliance:
Implementation of the requirements in accordance with 21 CFR Part 11 with audit trails, electronic signatures and audit-proof traceability.
Requirements management:
Coordination with stakeholders to record and validate regulatory and technical requirements and translate them into a suitable software solution.
Results and Benefits
New sales potential:
The software solution led to the development of a new business area that generates significant sales.
FDA compliance:
Successful development of legally compliant software that fulfils the strict requirements of the pharmaceutical industry.
Efficient integration:
The solution enables secure and efficient communication between machines and third-party systems such as MES and ERP systems.
C#GitcryptographySelf-organisationstakeholder managementproblem-solving skillsolving skills
Methods: R
undisclosed mechanical engineering company for the pharmaceutical industry
Aus- und Weiterbildung
Aus- und Weiterbildung
2 years 7 months
2022-10 - 2025-04
Data Science (while working)
Master of Science, University of Applied Science Southwestphalia
Master of Science
University of Applied Science Southwestphalia
Thesis about GraphRAG in the regulatory environment of the decommissioning of nuclear facilities for rights, guidelines and technical instructions.
3 years
2017-09 - 2020-08
Electronic Engineering
Bachelor of Science, University of Applied Science Osnabrück
Bachelor of Science
University of Applied Science Osnabrück
Specialising in energy and automation technology.
Thesis about data mining and analysis of machine and process data
Certificates:
ERP4Students: SAP BW/4HANA ? Data flow and BusinessObjects reporting
ERP4Stundents: SAP Analytics Cloud ? Analysis, Planning and Integration
Microsoft Certified: Azure AI Fundamentals (AI ? 900)
Kompetenzen
Kompetenzen
Top-Skills
Data ScientistMachine LearningDeep LearningLarge Language ModelsAgentic AIRAGKnowledge GraphGraph DatabaseNatural Language ProcessingDocument ProcessingWissensdatenbankRetrieval Augmented GenerationLLMRESTBeratungStrategiePythonSQLNeo4jMongoDBOPC UA
Produkte / Standards / Erfahrungen / Methoden
Attributes:
Result orientated
Reliable
Strong communication
About me:
I am a freelance data scientist specialising in the development of intelligent data and knowledge systems - from classic natural language processing to modern solutions with large language models (LLMs) and retrieval augmented generation (RAG/GraphRAG).
I support companies in making information accessible in a structured way, optimising processes based on data and enabling well-founded decisions.
In projects, I have developed innovative technologies for funding projects worth millions, supported companies in setting up modern AI solutions and translated complex challenges into comprehensible, data-driven solutions. My way of working is characterised by ambition, creativity and a clear focus on measurable results.
In addition to my freelance work, I work as a data scientist in the GAIA-Lab, an application-oriented research project for the development of trustworthy AI systems in safety-critical fields of application.
There I contribute my expertise in NLP, GraphRAG and Large Language Models.
Skills & Experiences:
Senior Level:
Python
Langchain
Huggingface-Transformers
SentencesTransformers
REST
OPC-UA
Scarpy
RAG, Large Language Models (LLM)
Natural Language Processing (NLP)
Pandas
Requirements management
GraphRAG
Junior Level:
C#
C++
C
SQL
PyTorch
scikit-learn
Azure
AWS
SQL
Neo4J
Branchen
Branchen
Pharmaceutical industry
Nuclear technology
Energy industry
Automotive industry
Furniture industry
Metal processing for furniture
Mechanical engineering for the pharmaceutical industry
Electrical industry
Vertrauen Sie auf Randstad
Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung