ML in production.
Trained, deployed, observed — shipped.
Machine Learning Engineer focused on MLOps and DevOps for AI — building scalable pipelines, cloud infrastructure, and production-grade LLM / multimodal systems. Research on the side.
Six years.
Four chapters.
- —Developing and deploying comprehensive MLOps pipelines for PPTX→video generation at scale.
- —Building multimodal stack: speech synthesis, avatar lip-sync, slide intelligence, RAG.
- —Owning end-to-end model lifecycle from training through canary deploys to production.
- —Architected CDC pipelines (MySQL · Debezium · Kafka · Zookeeper) feeding live ML services.
- —Shipped SmartRemarks — an intelligent text-analysis system for internal operations.
- —Delivered a high-accuracy OCR service for TIN-certificate validation (>99% precision).
- —Built and maintained scalable ML microservices infrastructure.
- —Led design, development and delivery of 13 AI/ML products across conversational AI, NLP, CV and FinTech.
- —Architected LLM-powered enterprise knowledge bots using advanced RAG techniques.
- —Configured and managed scalable AWS infra (EC2, ECS, S3) and CI/CD via GitHub Actions.
- —Partnered with international clients to design and deploy custom ML solutions.
- —Delivered 30+ projects across computer vision, NLP and predictive analytics.
- —Built production systems for image classification, real-time object detection and ASR.
Selected papers,
research on the side.
Google Scholar ↗The graph I think in.
Research up top, engineering in the middle, cloud underneath. The fun is in the edges.
What I'm
shipping now.
Three active workstreams — agentic systems, generative UGC pipelines, and workflow automation. All in production, all instrumented with evals and cost telemetry.
From prompt to
lip-sync video.
A user uploads a PDF, PPTX, or types a prompt. A RAG-backed content agent parses the input, rewrites and structures slide content, then hands off to a TTS engine for voiceover. An avatar renderer synchronises lip movement and gesture, and the final video is encoded and delivered — all on a managed GCP pipeline with quality gates at every stage.
How I ship
AI in 2026.
Rather ask than read?
There's a small model for that.
Let's talk.
Research, production, or both.
Always happy to talk ML, research, or interesting engineering problems. Based in Köln — working across DE & EU.
// send a message

