About

Co-Founder & Research Scientist — Black Forest Labs

2024 — Present

San Francisco / Freiburg · $300M Series B at $3.25B valuation

Multi-modal research: representation learning and omni-model architectures across image, video, audio, and text.
Project lead on Kontext: in-context image editing via sequence concatenation on a 12B FLUX transformer. Built by a core R&D team of 2 people in ~2 months. Introduced KontextBench (1,026 pairs, 5 task categories).
Core contributor to FLUX.1: a 12B MM-DiT with rectified flow matching, 3D RoPE, and guidance distillation, shipped across dev/schnell/pro in four months.
Core contributor to FLUX.2: native multi-reference and editing capabilities built into pretraining, with massively scaled paired training data.
Creator of FLUX Ultra (ultra-high-quality generation) and contributor to Klein (efficient small-scale inference).
Recent published work: Self-Flow, a self-supervised flow matching framework with 2.8× faster convergence.

Applied Model Lead — Stability AI

2022 — 2024

Los Angeles / Remote · Promoted from ML Engineer to Lead in 8 months

Lead author of SDXL — dual text encoders (CLIP ViT-L + OpenCLIP ViT-bigG), multi-aspect training with micro-conditioning, two-stage refinement pipeline. ICLR 2024 Spotlight. 4,000+ citations.
Created CosXL: cosine-scheduled noise scaling and v-prediction for improved color accuracy and quality.
Created Control-LoRAs: depth, canny, and pose conditioning as lightweight adapters.
Contributed to Stable Video Diffusion and SD3 / Scaling Rectified Flow Transformers (ICML 2024 Best Paper).

Self-Directed ML Research

2021 — 2022

Los Angeles, CA

Taught himself deep learning and generative modeling from scratch — diffusion models, UNet architectures, training methodology — through hands-on implementation, tinkering, and open-source mentorship. No formal degree or coursework.

Technical Roles — VFX, Media Production, Software

2015 — 2021

Los Angeles, CA

Seven years of technical problem-solving across complex production environments. Built the rapid-learning instincts and quality intuition that later translated unusually well to ML research.

Builds generative models that actually ship.