← All case studies

Data engineering for ML-driven demand forecasting at Zalando

Built the data infrastructure behind machine-learning demand forecasting and price optimization across 500K+ products.

Industry
E-commerce, fashion
Company size
Enterprise, 16,000+ employees
Year
2015 to 2017
Role
Senior Data Engineer
Stack
AWS, PostgreSQL, Exasol, Python, Docker
500K+
Products (SKUs) in scope
100+ TB
Processed per day at Zalando

The challenge

Zalando's demand prediction and price optimization ran on heuristics that needed constant manual tuning. The data science teams wanted to move to modern machine learning, but they lacked a scalable, reliable data foundation to build on.

What I did

I owned the data engineering side: designing, building, and testing the data infrastructure that several data science teams depended on. That meant dependable pipelines, efficient processing of large-scale transactional and behavioral data across 500K+ products, and a foundation that ML models could be trained and deployed on without friction.

Outcomes

  • Gave ML teams a high-performance foundation for model development and deployment.
  • Helped replace manual heuristics with machine-learning-driven pricing.
  • Supported Zalando's shift toward dynamic, AI-driven pricing at scale.
  • Reduced repetitive manual work for the wholesale teams.

Facing something similar?

Book a free, no obligation consultation and let's talk through your situation.

Book a free consultation