Paweł Żak
Senior Data Engineer • Tech Lead • Founder
I design, build, and scale data platforms end-to-end. I’ve led teams delivering fraud detection, real-time baggage tracking, and TB-scale analytics — while staying hands-on with the code. I partner directly with customers to co-design solutions and accelerate delivery through reusable patterns. I also launched MyElib, a cross-platform SaaS for knowledge retention.
Entrepreneurial Experience
MyElib — Live SaaS for Knowledge Retention
- Cross-platform SaaS that syncs e-reader highlights and generates context-aware LLM-powered spaced-repetition flashcards.
- Designed and implemented end-to-end: architecture, deployment, billing, landing page, and marketing using Go, SQLite, Astro/React/TypeScript, Tauri, Stripe, OneSignal, Resend, Cloudflare, AWS.
- 30 early adopters syncing 5k+ highlights across Web, Mac, Windows, iOS, and Android monthly — validating product demand.
Selected Professional Experience
Addepto — Big Data & AI Consulting
Gdańsk (Remote)
Tech Lead, Data Engineering
Feb 2025 – Present
- Led 11 engineers across two flagship projects; owned architecture, reviews, and mentoring while shipping critical components.
- Built a EU government fraud-detection platform (~15s latency) on Databricks (Spark Streaming, Delta Live Tables, Redpanda), now used by 100+ analysts to investigate and monitor fraud data.
- Drove Data Mesh adoption for a global aviation provider; individually built first data product — real-time global scale baggage tracking (Event Hub, Spark Streaming).
- Member of a 10-person leadership team; shaped hiring, delivery, client relationships, and technical strategy.
Senior Data Engineering Consultant
Feb 2024 – Feb 2025
- Rolled out enterprise-ready Databricks Terraform templates (Unity Catalog, dbt, Spark, Terraform, Data Contracts) for an aviation provider.
- Created a Nix-based CI/CD framework adopted in ~40% of projects, improving delivery speed and onboarding.
Data Engineering Consultant
Apr 2023 – Feb 2024
- Migrated on-prem SQL Server → GCP BigQuery + dbt + Cloud Functions with Terraform, cutting warehouse costs ~70% and maintaining >99.9% uptime.
- Developed MLOps pipelines enabling user-driven augmentation of 3D environments.
Amazon
Luxembourg (Remote)
Data Engineer
Jul 2021 – Mar 2023
- Built TB-scale pipelines in AWS Redshift, dbt, Airflow for the Global RME org, reducing reporting latency from 24h to 15min and contributing to multimillion-dollar savings.
- Owned worldwide adoption of Cost-per-Asset and Lost Productivity Hours KPIs in alignment with VP-level stakeholders.
Business Intelligence Engineer
Feb 2021 – Jul 2021
- Built serverless pipelines (Glue, Lambda) powering near-real-time QuickSight dashboards.
Education
Gdańsk University of Technology
BEng, Data Engineering
Technologies
Deep / frequent
Python
Apache Spark
SQL & dbt
Databricks
Kafka/Redpanda/Event Hub
Azure
Terraform
Golang
CI/CD
Docker
Proficient
AWS
Redshift
Kubernetes
Airflow
BigQuery
GCP
Nix
Used on specific projects
Dagster
ClickHouse
TypeScript/React/Astro
SvelteKit
Rust (Tauri)
Interests
Traveling
Reading
Podcasts
Entrepreneurship
Snowboarding
Kitesurfing
Tennis
Squash
Cooking