Pierre-Alexandre
Broux

Work Experience

Asyora

Aug 2023 - Jun 2025

Full Stack Developer

Situation: Entrepreneurial initiative to launch a SaaS solution aimed at reducing the time spent in traditional meetings.

Task: Design an asynchronous meeting platform incorporating best practices to optimize time management.

Actions:

Complete solution development with Django (Python), Htmx and Tailwind CSS.
Deployment and management of infrastructure on Amazon Web Service (AWS) and DigitalOcean.
PostgreSQL database management.
Application of DevOps best practices such as CI/CD and testing (unit, integration and system tests).
Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Significant reduction in the number and duration of meetings, with a significant improvement in user productivity.

Stack: Python, Django, Amazon Web Service, Docker, Pytest, Tailwind CSS, Htmx, DigitalOcean, Git, Github Actions CI/CD, PostgreSQL, JavaScript.

Groupe Rhapsodie

Oct 2021 - Aug 2024

Machine Learning Engineer

Situation: The company wanted to create an artificial intelligence (AI) laboratory dedicated to the development of specialized solutions for recognizing emotions from text (NLP) and speech.

Task: Designing a robust, scalable infrastructure aligned with the company's vision, while implementing state-of-the-art machine learning models for specific applications.

Actions:

Setting up an infrastructure using MLOps best practices.
Data collection, transformation and annotation.
Development and deployment of deep learning models (PyTorch).
Integration and adaptation of large language models (LLM).
Prototyping intelligent agents with LangGraph et LangChain.
Managing and optimizing computing resources.
Supervision of collaborators to ensure consistency of developments. Algorithm documentation.

Result: Setting up an operational laboratory with optimized MLOps workflow, enabling rapid deployment of AI products.

Stack: Python, Hugging Face, OpenAI API, LangChain, LangGraph, ZenML, PyTorch, Gradio, MLflow, Qdrant, Git, Github Actions CI/CD, Amazon Web Service (AWS), NumPy, Pandas, Pytest, Seaborn, Asyncio, Docker, Django, FastAPI, Tailwind CSS, Htmx, JavaScript.

DataValue Consulting (DVC)

Feb 2021 - Aug 2021

Machine Learning Engineer

Situation: The company wanted to develop artificial intelligence (AI) products specialized in natural language processing (NLP) for its customers.

Task: Implementing and adapting state-of-the-art machine learning models for specific applications, while supervising projects and collaborators.

Actions:

Design and development of Python algorithms for Named Entity Recognition (NER), exploiting Large Language Models (LLMs) such as BERT and RoBERTa.
Supervision of collaborators, ensuring their development and the quality of deliverables.
Advising on the choice and implementation of a high-performance infrastructure tailored to project needs.

Result: Delivery of AI solutions tailored to business needs.

Stack: Python, PyTorch, PyTorch Lightning, Docker, Git, Jupyter, Seaborn, NumPy, Pandas.

Laboratoire informatique du Mans (LIUM)

Dec 2018 - Feb 2021

Machine Learning Engineer

Situation: The laboratory wanted to analyze historical French archives to extract usable information.

Task: Designing artificial intelligence (AI) algorithms to automate the analysis of ancient documents.

Actions:

Development in Python of algorithms integrating optical character recognition (OCR), automatic transcription (ASR) and named entity recognition (NER).
Implementation of a pipeline integrating different technologies (OCR, ASR, NER) to automate complete archive processing.
Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Delivery of tools for better understanding and use of period documents, facilitating historical research and the enhancement of archives.

Stack: Python, PyTorch, Keras, Tensorflow, SLURM, Git, Scikit-learn, NumPy, Jupyter.

Institut national de l’audiovisuel (INA)

Dec 2015 - Dec 2018

Machine Learning Engineer

Situation: The company wanted to help documentalists in their work of annotating audiovisual data, where the growing volume of archives requires automation solutions.

Task: Developing artificial intelligence (AI) diarization algorithms to automate speaker annotation.

Actions:

Development in Python of diarization algorithms taking human intervention into account.
Active contribution to two Open-Source Python projects (S4D and SIDEKIT) specializing in speaker recognition and diarization.
Development of an evaluation metric to accurately determine the performance of human-assisted diarization AI algorithms.
Publishing results in the form of scientific articles and present them at international conferences.
Working in an international context in English, collaborating effectively with non-French-speaking colleagues and partners.
Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Delivery of a tool to improve the annotation speed of audiovisual data, facilitating the work of documentalists.

Stack: Python, Jupyter, Git, SLURM.

Skills

Artificial intelligence (AI)
Generative AI (GenAI)
Large langage model (LLM)
Retrieval augmented generation (RAG)
Natural language processing (NLP)
Intelligent agent
MLOps
DevOps
Web development
Agile/Scrum methodology

Projects

KeePassXC Raycast extension

May 2023 - Current

An open-source TypeScript, React and Node.js extension for Raycast dedicated to KeePassXC and used by over 2,500 users.

S4D

Sep 2016 - Sep 2020

An open-source Python machine learning toolkit dedicated to speaker diarization.

Technologies

Language

C, CSS, HTML, Java, JavaScript, JSON, LaTeX, Nix, Perl, Python, Ruby, Sass, SQL, Typescript, XML, YAML

Database

Qdrant, MongoDB, MySQL, PostgreSQL

Library & framework

Comet ML, Django, FastAPI, Gradio, Htmx, Hugging Face, Keras, LangChain, LangGraph, MLflow, NumPy, Ollama, OpenAI API, Opik, Pandas, Pydantic, Pytest, Pytorch, Pytorch Lightning, React, Scikit- Learn, Seaborn, Tailwind CSS, Tavily, TensorFlow, Unsloth, vLLM, ZenML

Cloud

Amazon Web Services (AWS), DigitalOcean

Other

Docker, Figma, Git, GitHub Actions CI/CD, Kubernetes, Jupyter, Linear, Node.js, pre-commit, Slurm, Terraform

Certifications

Data science & machine learning

Data cleaning, Data visualization, Deep learning, Feature engineering, Machine learning, Natural language processing (NLP), Pandas

Web development

CSS, JavaScript, REST API, Search Engine Optimization (SEO)

Security

RGPD workshop, SecNumacadémie

Other

Docker, GitHub Actions CI/CD, Kubernetes

Education

Le Mans University

2015 - 2020

Doctor of Philosophy - PhD

Le Mans University

2013 - 2015

Master's degree

Languages

English

Full professional proficiency

French

Native proficiency

Work Experience

.css-1wafmg3{box-sizing:border-box;margin:0;min-width:0;color:inherit;-webkit-text-decoration:inherit;text-decoration:inherit;font-size:inherit;font-weight:inherit;}.css-1wafmg3:hover{opacity:0.7;}Asyora

Groupe Rhapsodie

DataValue Consulting (DVC)

Laboratoire informatique du Mans (LIUM)

Institut national de l’audiovisuel (INA)

Skills

Projects

KeePassXC Raycast extension

S4D

Technologies

Certifications

Education

Le Mans University

Le Mans University

Languages

Asyora