Made with
Standard Resume
Learn more

Pierre-Alexandre
Broux

Machine Learning Engineer | Paris, France | contact@pabroux.com | +33659590641 | linkedin.com/in/pabroux | pabroux.com

Work Experience

Asyora

Aug 2023 - Jun 2025
Full Stack Developer

Situation: Entrepreneurial initiative to launch a SaaS solution aimed at reducing the time spent in traditional meetings.

Task: Design an asynchronous meeting platform incorporating best practices to optimize time management.

Actions:

  • Complete solution development with Django (Python), Htmx and Tailwind CSS.
  • Deployment and management of infrastructure on Amazon Web Service (AWS) and DigitalOcean.
  • PostgreSQL database management.
  • Application of DevOps best practices such as CI/CD and testing (unit, integration and system tests).
  • Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Significant reduction in the number and duration of meetings, with a significant improvement in user productivity.

Stack: Python, Django, Amazon Web Service, Docker, Pytest, Tailwind CSS, Htmx, DigitalOcean, Git, Github Actions CI/CD, PostgreSQL, JavaScript.

Groupe Rhapsodie

Oct 2021 - Aug 2024
Machine Learning Engineer

Situation: The company wanted to create an artificial intelligence (AI) laboratory dedicated to the development of specialized solutions for recognizing emotions from text (NLP) and speech.

Task: Designing a robust, scalable infrastructure aligned with the company's vision, while implementing state-of-the-art machine learning models for specific applications.

Actions:

  • Setting up an infrastructure using MLOps best practices.
  • Data collection, transformation and annotation.
  • Development and deployment of deep learning models (PyTorch).
  • Integration and adaptation of large language models (LLM).
  • Prototyping intelligent agents with LangGraph et LangChain.
  • Managing and optimizing computing resources.
  • Supervision of collaborators to ensure consistency of developments. Algorithm documentation.

Result: Setting up an operational laboratory with optimized MLOps workflow, enabling rapid deployment of AI products.

Stack: Python, Hugging Face, OpenAI API, LangChain, LangGraph, ZenML, PyTorch, Gradio, MLflow, Qdrant, Git, Github Actions CI/CD, Amazon Web Service (AWS), NumPy, Pandas, Pytest, Seaborn, Asyncio, Docker, Django, FastAPI, Tailwind CSS, Htmx, JavaScript.

DataValue Consulting (DVC)

Feb 2021 - Aug 2021
Machine Learning Engineer

Situation: The company wanted to develop artificial intelligence (AI) products specialized in natural language processing (NLP) for its customers.

Task: Implementing and adapting state-of-the-art machine learning models for specific applications, while supervising projects and collaborators.

Actions:

  • Design and development of Python algorithms for Named Entity Recognition (NER), exploiting Large Language Models (LLMs) such as BERT and RoBERTa.
  • Supervision of collaborators, ensuring their development and the quality of deliverables.
  • Advising on the choice and implementation of a high-performance infrastructure tailored to project needs.

Result: Delivery of AI solutions tailored to business needs.

Stack: Python, PyTorch, PyTorch Lightning, Docker, Git, Jupyter, Seaborn, NumPy, Pandas.

Machine Learning Engineer

Situation: The laboratory wanted to analyze historical French archives to extract usable information.

Task: Designing artificial intelligence (AI) algorithms to automate the analysis of ancient documents.

Actions:

  • Development in Python of algorithms integrating optical character recognition (OCR), automatic transcription (ASR) and named entity recognition (NER).
  • Implementation of a pipeline integrating different technologies (OCR, ASR, NER) to automate complete archive processing.
  • Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Delivery of tools for better understanding and use of period documents, facilitating historical research and the enhancement of archives.

Stack: Python, PyTorch, Keras, Tensorflow, SLURM, Git, Scikit-learn, NumPy, Jupyter.

Machine Learning Engineer

Situation: The company wanted to help documentalists in their work of annotating audiovisual data, where the growing volume of archives requires automation solutions.

Task: Developing artificial intelligence (AI) diarization algorithms to automate speaker annotation.

Actions:

  • Development in Python of diarization algorithms taking human intervention into account.
  • Active contribution to two Open-Source Python projects (S4D and SIDEKIT) specializing in speaker recognition and diarization.
  • Development of an evaluation metric to accurately determine the performance of human-assisted diarization AI algorithms.
  • Publishing results in the form of scientific articles and present them at international conferences.
  • Working in an international context in English, collaborating effectively with non-French-speaking colleagues and partners.
  • Documenting algorithm designs to facilitate knowledge sharing and ensure reproducibility.

Result: Delivery of a tool to improve the annotation speed of audiovisual data, facilitating the work of documentalists.

Stack: Python, Jupyter, Git, SLURM.

Skills

  • Artificial intelligence (AI)
  • Generative AI (GenAI)
  • Large langage model (LLM)
  • Retrieval augmented generation (RAG)
  • Natural language processing (NLP)
  • Intelligent agent
  • MLOps
  • DevOps
  • Web development
  • Agile/Scrum methodology

Projects

KeePassXC Raycast extension

May 2023 - Current

An open-source TypeScript, React and Node.js extension for Raycast dedicated to KeePassXC and used by over 2,500 users.

S4D

Sep 2016 - Sep 2020

An open-source Python machine learning toolkit dedicated to speaker diarization.

Technologies

Language
C, CSS, HTML, Java, JavaScript, JSON, LaTeX, Nix, Perl, Python, Ruby, Sass, SQL, Typescript, XML, YAML
Database
Qdrant, MongoDB, MySQL, PostgreSQL
Library & framework
Comet ML, Django, FastAPI, Gradio, Htmx, Hugging Face, Keras, LangChain, LangGraph, MLflow, NumPy, Ollama, OpenAI API, Opik, Pandas, Pydantic, Pytest, Pytorch, Pytorch Lightning, React, Scikit- Learn, Seaborn, Tailwind CSS, Tavily, TensorFlow, Unsloth, vLLM, ZenML
Cloud
Amazon Web Services (AWS), DigitalOcean
Other
Docker, Figma, Git, GitHub Actions CI/CD, Kubernetes, Jupyter, Linear, Node.js, pre-commit, Slurm, Terraform

Certifications

Data science & machine learning
Data cleaning, Data visualization, Deep learning, Feature engineering, Machine learning, Natural language processing (NLP), Pandas
Web development
CSS, JavaScript, REST API, Search Engine Optimization (SEO)
Security
RGPD workshop, SecNumacadémie
Other
Docker, GitHub Actions CI/CD, Kubernetes

Education

Le Mans University

2015 - 2020
Doctor of Philosophy - PhD

Le Mans University

2013 - 2015
Master's degree

Languages

English
Full professional proficiency
French
Native proficiency