~/projects/portfolio portfolio.md

Hi, I am Michaël Romagné

Plouneour

Machine Learning Engineer with 5 years of experience helping companies bring ML projects to production through robust MLOps practices.

My Skills

Data Science

ML, NLP, GenAI, PyTorch, AWS Bedrock, HuggingFace, ONNX

MLOps

W&B Weave, Skypilot, DVC, BentoML, ClearML, Mlflow

kube

DevOps

Docker, Gitlab CI, Kubernetes, Helm, Argo CD, Terraform

AWS

Lambda, Step Functions, Batch, EKS, ECS, Sagemaker, S3, Bedrock

Software Dev

Python expert, Django, Celery, FastAPI, uv

streamlit

Data Viz

Streamlit, Grafana, Kibana, Tableau

Data Engineering

Temporal, Dagster, Airflow, Hadoop, Spark, Snowflake

My work

Who I am

I'm a Machine Learning Engineer with a strong mathematical background. My path to Data Science began when I realized the vast potential of applying mathematical concepts to real business challenges.

I sharpened my skills at Ubisoft, where I learned to manage end-to-end ML projects, diving deep into Infrastructure, Data Engineering and MLOps.

Then at GitGuardian, I fine-tuned Transformer models for code security and built the MLOps stacks of the company from scratch.

After these experiences, I wanted to try the freelance path to continue growing in ownership and expand my skills. I joined Sanofi to work on GenAI challenges: building scalable Unstructured Data parsing pipelines, benchmarking Vision-Language Models, and making unstructured knowledge accessible at scale.

In my free time, I'm passionate about Football and data analytics. I built , a platform helping football fans discover and scout players using advanced analytics.

If you want to have a talk, reach out on or at .

Michou

Sanofi - MLOps Engineer | GenAI, LLMOps

Dec 2024 - Present

- Development of an Unstructured Data Pipeline (OCR+VLM with Docling, AWS Textract, Bedrock VLMs, metadata generation with LLMs, chunking, vectorization), processing millions of PDFs for Sanofi teams.
- Developed an internal benchmarking framework with DVC & Weave to compare open-source OCR libraries and VLMs for unstructured data extraction.
- Define MLOps best practices for GenAI teams at Sanofi.
- Develop AI agents for employee companion use case with LangGraph.

GitGuardian - MLE

Oct 2023 - Dec 2024

- Built the MLOps stack from scratch: GitLab CI, SkyPilot, DVC pipelines, Dagster jobs, Streamlit, ONNX Runtime, BentoML, Helm, ArgoCD.
- Fine-tuned and integrated NLP models (CodeBERTa) into the Secrets Detection Engine, reducing false positives by 5x (Django, Celery, Kubernetes, AWS).
- Developed PoCs on automatic remediation for leaked secrets (OpenAI API, AST parsers and code formatting).

Ubisoft - MLE

Feb 2021 - Oct 2023

- End-to-end Fraud Detection project in e-commerce transactions (Ubi Connect and Steam).
Led Research tasks (Feature Engineering, Semi-supervised learning) and put in place MLOps best practices (DVC, remote jobs on K8s, ClearML, model inference on AWS).
The project led to 5% of net sales savings, about 4 millions euros per year, compared to the previous fraud detection product.

- Time Series forecasting on Acquisition, Retention, Monetization and Ubisoft servers vCPU usage. Trained and deployed Generalized Additive Models to improve forecasts.

projects

Football Analytics Machine Learning

The Scouting Arena

Football analytics platform helping fans discover and scout new players.

Personal Project | Ongoing

MLOps NLP Machine Learning

GitGuardian's MLOps Open-Source Stack

The stack we implemented for NLP use cases at GitGuardian.

Duration: 10 min | November 28th 2024

MLOps NLP Machine Learning

A gentle journey to LLMops : Zilliz AOC

Come aboard to discover emerging Open Source MLOps tools and build the best ML stack!

Duration: 13 min | January 3rd 2024

Fraud Detection Machine Learning MLOps

ML for Fraud Detection

Recipe to limit fraud for online merchants.

Duration: 10 min | Sept 30, 2023