PORTFOLIO / 2025

Jakub Wujec

ML Engineer specializing in Machine Learning, Python development and building applications.
Expert in building and deploying ML solutions with modern frameworks. I use Neovim btw.

Currently at Roche
Warsaw, Poland
CURRENTLY
ML Engineer
@ Roche
Mar 2025 — Present
FOCUS
Machine Learning, Data Science, Python, Neovim, Docker, Vibe Coding Frontend, AWS, LLM, MLOps, Data Engineering, Backend Development, RAG, SQL

Work

2025-Present

ML Engineer

Roche

Building NPS prediction models. Using LLMs to enhance user experience. Working with AWS infrastructure to deploy and scale ML models.

Machine Learning, AWS, Python, Data Science
2023-Present

Head of AI & Co-founder

MockIT

Architected AI-powered technical interview platform with real-time voice processing. Built conversation engine with multi-model AI integration, WebSocket audio streaming, and intelligent assessment system.

Python, FastAPI, AI, WebSockets, LLM
2023-2025

Data Scientist

PKO BP

Developed ETL pipelines with PySpark and BigQuery that process 100M+ records daily. Built a Champion/Challenger framework for automated ML pipeline optimization and CRM models for user behavior prediction.

Python, PySpark, BigQuery, MLOps
2022

Machine Learning Engineer

SonarHome

Upgraded automatic flat valuation model with new features and optimized hyperparameters. Implemented DBSCAN clustering for geospatial anomaly detection with Dash visualization dashboard.

Python, DBSCAN, Dash, Geospatial
2021

Data Scientist

Perfect Data

Designed and built recommendation engine from scratch based on user choices and personal values. Created automated data pipelines for text processing from crawlers with MongoDB storage.

Python, MongoDB, NLP, Scrapy

Education

MA Data Science

University of Warsaw
Grade: 5.0/5.0

Undertaken Coursework:

Introduction to Data Science, Machine Learning 1: Classification Methods, Machine Learning 2: Deep Learning & Neural Networks, Python and SQL, Big Data Analytics, Text Mining and Social Media Mining, Unsupervised Learning, Statistics and Exploratory Data Analysis, Webscraping and Social Media Scraping

BS Computer Science and Econometrics

University of Warsaw
Grade: 4.5/5.0

Undertaken Coursework:

Linear Algebra, Mathematical Analysis I & II, Probability Calculus, Mathematical Statistics I & II, Econometrics, Time Series Analysis, Computer Programming, Machine Learning in Python

Technical Skills

Machine Learning

Supervised Models, Unsupervised Models, Neural Networks, XAI, Time Series, NLP, Transformers, Computer Vision, Deep Learning

Python & Development

Python, Pandas, NumPy, Scikit-learn, TensorFlow, PyTorch, PySpark, FastAPI, Django, Pytest, Vibe Coding Frontend, SQLAlchemy

ML Infrastructure & DevOps

Docker, MLflow, Airflow, DVC, Kubernetes, Git, GitHub Actions

Cloud & Data

AWS (S3, EC2, SES), GCP (BigQuery, Dataproc), SQL, MongoDB, Statistics

Let's Connect

Always interested in discussing machine learning innovations, research collaborations, and opportunities in ML and Python development.

© 2025 Jakub Wujec. All rights reserved.
Inspired by Felix Macaspac template design