Generative AI Engineer with 5+ years specialising in NLP and AI solutions.
Aktualisiert am 17.10.2024
Profil
Freiberufler / Selbstständiger
Remote-Arbeit
Verfügbar ab: 17.10.2024
Verfügbar zu: 100%
davon vor Ort: 15%
Generative AI
Natural Language Processing (NLP)
Large Language Models (LLM)
Artificial Intelligence (AI)
Retrieval-Augmented Generation (RAG)
Amazon Web Services (AWS)
Python
SQL
Data Scientist
Data Analyst
English
Fluent
Portuguese
Fluent
Spanish
Fluent
Catalan
Fortgeschritten

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

8 months
2023-11 - 2024-06

Multi-modal Retrieval-Augmented Generation (RAG) system

Generative AI Engineer Generative AI Natural Language Processing (NLP) Large Language Model (LLM) ...
Generative AI Engineer

This client had a bottleneck on producing high-quality and up-to-date research for their website. The existing manual process was too resource-intensive and error-prone to scale to the necessary volume of content.


ACTION

I built a multi-modal Retrieval-Augmented Generation (RAG) system for automated production of research, with direct website content injection. I also integrated 5+ data sources, including web search and an in-house content farm, and periodic reflection mechanisms, using a modular Large Language Model (LLM) framework with fine-grained oversight.


OUTCOME

Was able to ramp-up research production to an unbounded scale, while maintaining cost-effectiveness of the process. Also ensured continual content relevance and iterative accuracy improvement.

Generative AI Natural Language Processing (NLP) Large Language Model (LLM) Retrieval-Augmented Generation (RAG) Python
BIG Ai
Phuket, Thailand
1 year 8 months
2022-01 - 2023-08

High-performance Optical Character Recognition (OCR) pipeline

NLP Data Scientist Optical Character Recognition (OCR) AWS Azure ...
NLP Data Scientist

The client required an industry-grade OCR tool capable of handling a large volume and wide variety of internal documents in real time. The data set included 1M+ multilingual documents across 30+ languages and 20+ formats.


ACTION

Architect and deploy AWS/Azure multi-cloud solution, while managing a cross-functional team of Data Scientists and MLOps engineers.


OUTCOME

The solution was not only able to accommodate all of the client's requirements, but also provide a robust monitoring system that recorded all file entries and respective statuses across the pipeline, for fine-grained oversight over the process.

Optical Character Recognition (OCR) AWS Azure Data Scientist MLOps Cloud Architect
Siemens-Energy
Barcelona, Spain
1 year 1 month
2022-01 - 2023-01

Topic Modelling and Search Engine tools

NLP Data Scientist Research and Development (R&D) Topic Modelling Search Engine ...
NLP Data Scientist

The Research and Development (R&D) had trouble organising and visualising large volumes of research papers, due to their verbose nature. They also struggled to understand how research papers related to each other on a topic level, beyond their references and citations.


ACTION

Develop an innovative NLP solution, using Topic Modelling and Search Engine techniques to plot research papers onto a user-friendly 2D graph (zoom out), while also allowing for users to search and find particular information (zoom in), within the large dataset.


OUTCOME

Visualisation of 10k+ papers in an interactive plot which clusters research papers by topic, allowing for an unprecedented eagles-eye view on the data and accelerating discovery of research.

Research and Development (R&D) Topic Modelling Search Engine Data Visualisation
Siemens-Energy
Barcelona, Spain
1 year 6 months
2021-08 - 2023-01

Scalable Named Entity Recognition (NER) microservice

NLP Data Scientist Named Entity Recognition (NER) Microservice Modular Design ...
NLP Data Scientist

The client's team had a bottleneck on manually extracting critical information from large volumes of documents.


ACTION

Create a NER microservice for performing real-time data extraction, via detecting patterns on those documents, and optimising for the most important templates in the dataset.


OUTCOME

Reduced manual document search time by 100+ hours annually. Service was also implemented with a validation step to ensure reliability, and modular design for cross-departmental re-usability.

Named Entity Recognition (NER) Microservice Modular Design Agile
Siemens-Energy
Barcelona, Spain
1 year 1 month
2021-01 - 2022-01

In-house Applicant Tracking System (ATS)

NLP Data Scientist Human Resources (HR) NLP-powered analytics
NLP Data Scientist

The Human Resources (HR) team saw that standard Applicant Tracking Systems (ATS) lacked customisation, and their effectiveness fell short on use-cases where metrics needed tailored for the specific technical offers.


ACTION

Built an in-house ATS tool with NLP-powered analytics, that can accept any number of resumes and recruiter requirements, and display multiple metrics tailored to the specific technical offer.


OUTCOME

An interactive dashboard with matching results, 40% more customisable than standard tools, for streamlining recruitment processes and providing critical insights for hiring decisions.

Human Resources (HR) NLP-powered analytics
Siemens-Energy
Barcelona, Spain
1 year 1 month
2020-01 - 2021-01

Peer-reviewed Natural Language Processing (NLP) research

NLP Data Science Researcher Machine Learning (ML) Natural Language Processing (NLP) Python ...
NLP Data Science Researcher

In the aviation sector, human factors are the primary cause of safety incidents. Intelligent prediction systems, which are capable of evaluating human state and managing risk, have been developed over the years to identify and prevent human factors. However, the lack of large useful labelled data has often been a drawback to the development of these systems.


ACTION

Present a methodology to identify and classify human factor categories from aviation incident reports, introducing a novel classification framework, and pioneering methods linking Machine Learning (ML) to aviation safety.


OUTCOME

The best predictive models achieved a Micro F1 score of 0.900, 0.779, and 0.875, for each level of the taxonomic framework, proving that favourable predicting performances can be achieved for the classification of human factors based on text data. The published research also influenced subsequent academic work, driving innovations in minimising aviation incidents through advanced human factors analysis.

Machine Learning (ML) Natural Language Processing (NLP) Python Data Scientist
Instituto Superior Técnico
Lisbon, Portugal

Aus- und Weiterbildung

Aus- und Weiterbildung

1 year 1 month
2019-09 - 2020-09

European study abroad program

Ghent University
Ghent University
  • Scholarship from Erasmus Grants
2 years 1 month
2018-09 - 2020-09

Intelligent Systems

Master's Degree in Intelligent Systems, Instituto Superior Técnico
Master's Degree in Intelligent Systems
Instituto Superior Técnico

Kompetenzen

Kompetenzen

Top-Skills

Generative AI Natural Language Processing (NLP) Large Language Models (LLM) Artificial Intelligence (AI) Retrieval-Augmented Generation (RAG) Amazon Web Services (AWS) Python SQL Data Scientist Data Analyst

Produkte / Standards / Erfahrungen / Methoden

Profile:

He is a seasoned NLP Data Scientist with a history of developing AI solutions spanning both academic research and industry applications. He has proven proficiency in architecting and implementing high-performance systems, while collaborating with cross-functional teams. Tomás excels at aligning technological innovations with business objectives. He continuously expands his knowledge to stay current with AI advancements.


SKILLS

  • Generative AI
  • Natural Language Processing
  • Large Language Models
  • Retrieval-Augmented
  • Generation (RAG)
  • Amazon Web Services


Work Experience

2023 - 2024

Role: Generative AI Engineer

Customer: BIG Ai


Tasks:

  • Built a multi-modal Retrieval-Augmented Generation (RAG) system with 5+ data sources, including web search, and periodic reflection to ensure continual content relevance and improve accuracy by 20%.
  • Led company-wide Artificial Intelligence (AI) strategy, defining product portfolio and 12-month roadmap.


2021 - 2023

Role: NLP Data Scientist

Customer: Siemens Energy


Tasks:

  • Led Optical Character Recognition (OCR) pipeline, for processing 1M+ multilingual documents in 30+ languages and 20+ formats. Architected AWS/Azure multi-cloud solution.
  • Deployed scalable and re-usable Named Entity Recognition (NER) micro-service for real-time data extraction, reducing manual search time by 100+ hours annually.
  • Developed Topic Modelling and Search Engine tools for Research and Development (R&D), enabling visualization of 10k+ papers and accelerating discovery of research.
  • Built in-house Applicant Tracking System (ATS) with NLP-powered analytics, 40% more customizable than standard tools, providing critical insights for hiring decisions.
  • Supervised a Natural Language Query (NLQ) thesis project, using state-of-the-art Graph Database interfaces


2020 - 2021

Role: NLP Data Science Researcher

Customer: Instituto Superior Técnico


Tasks:

  • Published peer-reviewed research on Natural Language Processing (NLP), pioneering methods linking Machine Learning (ML) to aviation safety.
  • Influenced subsequent academic work, driving innovations in minimising aviation incidents

Programmiersprachen

Generative AI
Experte
Natural Language Processing (NLP)
Experte
Large Language Models (LLM)
Experte
Retrieval-Augmented Generation (RAG)
Experte
Machine Learning (ML)
Experte
Python
Experte
Amazon Web Services (AWS)
Experte
Data Science
Experte
Data Analytics
Fortgeschritten
Agile
Fortgeschritten
SQL
Fortgeschritten
Data Visualisation
Fortgeschritten
Client Relations
Fortgeschritten
Azure
Basics
Google Cloud Computing (GCP)
Basics
Docker
Basics
CI/CD
Basics

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

8 months
2023-11 - 2024-06

Multi-modal Retrieval-Augmented Generation (RAG) system

Generative AI Engineer Generative AI Natural Language Processing (NLP) Large Language Model (LLM) ...
Generative AI Engineer

This client had a bottleneck on producing high-quality and up-to-date research for their website. The existing manual process was too resource-intensive and error-prone to scale to the necessary volume of content.


ACTION

I built a multi-modal Retrieval-Augmented Generation (RAG) system for automated production of research, with direct website content injection. I also integrated 5+ data sources, including web search and an in-house content farm, and periodic reflection mechanisms, using a modular Large Language Model (LLM) framework with fine-grained oversight.


OUTCOME

Was able to ramp-up research production to an unbounded scale, while maintaining cost-effectiveness of the process. Also ensured continual content relevance and iterative accuracy improvement.

Generative AI Natural Language Processing (NLP) Large Language Model (LLM) Retrieval-Augmented Generation (RAG) Python
BIG Ai
Phuket, Thailand
1 year 8 months
2022-01 - 2023-08

High-performance Optical Character Recognition (OCR) pipeline

NLP Data Scientist Optical Character Recognition (OCR) AWS Azure ...
NLP Data Scientist

The client required an industry-grade OCR tool capable of handling a large volume and wide variety of internal documents in real time. The data set included 1M+ multilingual documents across 30+ languages and 20+ formats.


ACTION

Architect and deploy AWS/Azure multi-cloud solution, while managing a cross-functional team of Data Scientists and MLOps engineers.


OUTCOME

The solution was not only able to accommodate all of the client's requirements, but also provide a robust monitoring system that recorded all file entries and respective statuses across the pipeline, for fine-grained oversight over the process.

Optical Character Recognition (OCR) AWS Azure Data Scientist MLOps Cloud Architect
Siemens-Energy
Barcelona, Spain
1 year 1 month
2022-01 - 2023-01

Topic Modelling and Search Engine tools

NLP Data Scientist Research and Development (R&D) Topic Modelling Search Engine ...
NLP Data Scientist

The Research and Development (R&D) had trouble organising and visualising large volumes of research papers, due to their verbose nature. They also struggled to understand how research papers related to each other on a topic level, beyond their references and citations.


ACTION

Develop an innovative NLP solution, using Topic Modelling and Search Engine techniques to plot research papers onto a user-friendly 2D graph (zoom out), while also allowing for users to search and find particular information (zoom in), within the large dataset.


OUTCOME

Visualisation of 10k+ papers in an interactive plot which clusters research papers by topic, allowing for an unprecedented eagles-eye view on the data and accelerating discovery of research.

Research and Development (R&D) Topic Modelling Search Engine Data Visualisation
Siemens-Energy
Barcelona, Spain
1 year 6 months
2021-08 - 2023-01

Scalable Named Entity Recognition (NER) microservice

NLP Data Scientist Named Entity Recognition (NER) Microservice Modular Design ...
NLP Data Scientist

The client's team had a bottleneck on manually extracting critical information from large volumes of documents.


ACTION

Create a NER microservice for performing real-time data extraction, via detecting patterns on those documents, and optimising for the most important templates in the dataset.


OUTCOME

Reduced manual document search time by 100+ hours annually. Service was also implemented with a validation step to ensure reliability, and modular design for cross-departmental re-usability.

Named Entity Recognition (NER) Microservice Modular Design Agile
Siemens-Energy
Barcelona, Spain
1 year 1 month
2021-01 - 2022-01

In-house Applicant Tracking System (ATS)

NLP Data Scientist Human Resources (HR) NLP-powered analytics
NLP Data Scientist

The Human Resources (HR) team saw that standard Applicant Tracking Systems (ATS) lacked customisation, and their effectiveness fell short on use-cases where metrics needed tailored for the specific technical offers.


ACTION

Built an in-house ATS tool with NLP-powered analytics, that can accept any number of resumes and recruiter requirements, and display multiple metrics tailored to the specific technical offer.


OUTCOME

An interactive dashboard with matching results, 40% more customisable than standard tools, for streamlining recruitment processes and providing critical insights for hiring decisions.

Human Resources (HR) NLP-powered analytics
Siemens-Energy
Barcelona, Spain
1 year 1 month
2020-01 - 2021-01

Peer-reviewed Natural Language Processing (NLP) research

NLP Data Science Researcher Machine Learning (ML) Natural Language Processing (NLP) Python ...
NLP Data Science Researcher

In the aviation sector, human factors are the primary cause of safety incidents. Intelligent prediction systems, which are capable of evaluating human state and managing risk, have been developed over the years to identify and prevent human factors. However, the lack of large useful labelled data has often been a drawback to the development of these systems.


ACTION

Present a methodology to identify and classify human factor categories from aviation incident reports, introducing a novel classification framework, and pioneering methods linking Machine Learning (ML) to aviation safety.


OUTCOME

The best predictive models achieved a Micro F1 score of 0.900, 0.779, and 0.875, for each level of the taxonomic framework, proving that favourable predicting performances can be achieved for the classification of human factors based on text data. The published research also influenced subsequent academic work, driving innovations in minimising aviation incidents through advanced human factors analysis.

Machine Learning (ML) Natural Language Processing (NLP) Python Data Scientist
Instituto Superior Técnico
Lisbon, Portugal

Aus- und Weiterbildung

Aus- und Weiterbildung

1 year 1 month
2019-09 - 2020-09

European study abroad program

Ghent University
Ghent University
  • Scholarship from Erasmus Grants
2 years 1 month
2018-09 - 2020-09

Intelligent Systems

Master's Degree in Intelligent Systems, Instituto Superior Técnico
Master's Degree in Intelligent Systems
Instituto Superior Técnico

Kompetenzen

Kompetenzen

Top-Skills

Generative AI Natural Language Processing (NLP) Large Language Models (LLM) Artificial Intelligence (AI) Retrieval-Augmented Generation (RAG) Amazon Web Services (AWS) Python SQL Data Scientist Data Analyst

Produkte / Standards / Erfahrungen / Methoden

Profile:

He is a seasoned NLP Data Scientist with a history of developing AI solutions spanning both academic research and industry applications. He has proven proficiency in architecting and implementing high-performance systems, while collaborating with cross-functional teams. Tomás excels at aligning technological innovations with business objectives. He continuously expands his knowledge to stay current with AI advancements.


SKILLS

  • Generative AI
  • Natural Language Processing
  • Large Language Models
  • Retrieval-Augmented
  • Generation (RAG)
  • Amazon Web Services


Work Experience

2023 - 2024

Role: Generative AI Engineer

Customer: BIG Ai


Tasks:

  • Built a multi-modal Retrieval-Augmented Generation (RAG) system with 5+ data sources, including web search, and periodic reflection to ensure continual content relevance and improve accuracy by 20%.
  • Led company-wide Artificial Intelligence (AI) strategy, defining product portfolio and 12-month roadmap.


2021 - 2023

Role: NLP Data Scientist

Customer: Siemens Energy


Tasks:

  • Led Optical Character Recognition (OCR) pipeline, for processing 1M+ multilingual documents in 30+ languages and 20+ formats. Architected AWS/Azure multi-cloud solution.
  • Deployed scalable and re-usable Named Entity Recognition (NER) micro-service for real-time data extraction, reducing manual search time by 100+ hours annually.
  • Developed Topic Modelling and Search Engine tools for Research and Development (R&D), enabling visualization of 10k+ papers and accelerating discovery of research.
  • Built in-house Applicant Tracking System (ATS) with NLP-powered analytics, 40% more customizable than standard tools, providing critical insights for hiring decisions.
  • Supervised a Natural Language Query (NLQ) thesis project, using state-of-the-art Graph Database interfaces


2020 - 2021

Role: NLP Data Science Researcher

Customer: Instituto Superior Técnico


Tasks:

  • Published peer-reviewed research on Natural Language Processing (NLP), pioneering methods linking Machine Learning (ML) to aviation safety.
  • Influenced subsequent academic work, driving innovations in minimising aviation incidents

Programmiersprachen

Generative AI
Experte
Natural Language Processing (NLP)
Experte
Large Language Models (LLM)
Experte
Retrieval-Augmented Generation (RAG)
Experte
Machine Learning (ML)
Experte
Python
Experte
Amazon Web Services (AWS)
Experte
Data Science
Experte
Data Analytics
Fortgeschritten
Agile
Fortgeschritten
SQL
Fortgeschritten
Data Visualisation
Fortgeschritten
Client Relations
Fortgeschritten
Azure
Basics
Google Cloud Computing (GCP)
Basics
Docker
Basics
CI/CD
Basics

Vertrauen Sie auf Randstad

Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung

Fragen?

Rufen Sie uns an +49 89 500316-300 oder schreiben Sie uns:

Das Freelancer-Portal

Direktester geht's nicht! Ganz einfach Freelancer finden und direkt Kontakt aufnehmen.