Philipp Guldimann

Logo

Data & ML Systems Engineer — LLM evaluation, RAG, and data infrastructure, from zero to production.

View My GitHub Profile

Philipp Guldimann

Data & ML Systems Engineer · Zürich, Switzerland

GitHub · LinkedIn · Email

I build evaluation and data infrastructure for AI products — from zero to production. At a Zürich AI startup I built LLM evaluation infrastructure (23 evaluators, 10+ models assessed) that cut QA cycles from two days to four hours, and I’m now building multi-jurisdiction legal AI pipelines that process roughly one million documents across three countries. I hold an MSc in Machine Intelligence from ETH Zürich, and I care about pragmatic, scalable systems that ship and prove their value with metrics.

Technical focus

LLM evaluation · RAG systems · Data pipelines · MLOps · TypeScript / Python · AWS / Azure


Experience

Data Engineer — Omnilex

Feb 2026 – Present · Zürich, Switzerland

Machine Learning Engineer — LatticeFlow AI

Jan 2025 – Nov 2025 · Zürich, Switzerland


Research & Publications

COMPL-AI — A Benchmarking Framework for Evaluating LLM Compliance with the EU AI Act

Research Intern, Secure Reliable Intelligence Lab, ETH Zürich (Oct 2023 – Mar 2024)

Core contributor to COMPL-AI, evaluating 10+ models across 20 benchmarks spanning capabilities, cybersecurity, privacy, and bias/fairness. Worked on benchmark design, evaluation pipelines, and model integration via Hugging Face Transformers.

Read the paper (arXiv:2410.07959) · Code on GitHub


Education


Technical Skills


Earlier Projects

Computational Intelligence Lab — Text Classification (ETH, 2023)

Report (PDF)

Bachelor Thesis — Detecting Disinformation on Twitter

Thesis (PDF)