01.
About
3+ years building & shipping
I'm a Montréal based Data Engineer who works at the intersection of data infrastructure and analytics — building ETL/ELT pipelines that ingest reliably, dimensional models that scale, and Power BI dashboards that turn raw federal data into operational decisions.
Most recently at IRCC, I engineered Azure data pipelines that ingested 1TB+ of structured and unstructured data into ADLS, cut storage costs by 25%, and powered 12+ downstream Power BI reports for 2,500+ federal users — while enforcing PII compliance across 40+ datasets.
Before that, at SEG Products, I built Python / REST API ingestion across 5+ source systems, redesigned a product catalog star schema for 60% faster queries, and led 3 end-to-end SQL Server warehouse migrations with near-zero downtime.
02.
Experience
3 roles · 2022 — 2025
Sep 2024 — Apr 2025
Azure Data Engineer · IRCC ↗
Immigration, Refugees and Citizenship Canada — Montréal
1TB+ingested to ADLS
−25%storage cost
2,500+users supported
12+Power BI reports
- Engineered ETL/ELT pipelines using Azure Data Factory, Python, and Azure Functions to ingest and transform 1TB+ of structured and unstructured data into Azure Data Lake Storage (ADLS), reducing storage costs by 25%.
- Designed Power BI dashboards with modeled operational datasets to track data metrics and KPIs across 2,500+ users.
- Built Python data validation pipelines across 40+ federal datasets, enforcing PII compliance and governance standards.
- Built Azure DevOps / Terraform CI/CD for data pipelines across dev / staging / prod, cutting deploys from 2 hours to 15 minutes.
- Designed dimensional data models for 20+ federal assets, powering 12+ downstream Power BI reports.
- Profiled datasets in Python / Pandas, surfacing pipeline bottlenecks and quality gaps that drove 4 infrastructure decisions.
Azure Data FactoryADLSAzure FunctionsPythonPandasPower BITerraformAzure DevOpsCI/CDStar Schema
Jan — Aug 2023
Data Engineer · SEG Products ↗
Data platform & analytics — Montréal
3d → 1hrelease time
−40%manual workload
+60%query performance
95%+errors caught
- Built Azure DevOps CI/CD for pipeline deployment and schema migration, cutting releases from 3 days to 1 hour.
- Built 8+ Power BI dashboards on SQL Server data models, replacing weekly manual sales / ops reporting.
- Automated Python / REST API ingestion across 5+ source systems, cutting manual data workload by 40%.
- Executed 3 end-to-end SQL Server warehouse migrations with ETL and rollback, achieving minimal downtime.
- Redesigned product catalog star schema ERDs, improving query performance 60% and analytics reliability.
- Developed Python validation and anomaly flagging scripts, catching 95%+ of ingestion errors pre-reporting.
SQL ServerT-SQLPythonPandasPower BIAzure DevOpsREST APIsStar SchemaETL
2022 — 2023
Engineering Intern · OVE Decors ↗
Manufacturing & design — Laval
500+shipments / week tracked
- Contributed to plugin development and UI enhancements with JavaScript / HTML / CSS.
- Supported CRM improvements by optimizing migration scripts in C# + SQL.
- Built automated workflows for catalog updates using Azure Logic Apps.
- Built a smart access tracking system handling 500+ shipments weekly.
JavaScriptHTML/CSSC#SQLLogic Apps
03.
Projects
selected work
SEG · Internal
RFID-Tracked Sample Management ↗
−56%retrieval time
Engineered a Python-automated tracking system with RFID integration to classify and log test samples — reducing manual data entry and improving retrieval time by 56%.
PythonPandasRFIDAutomation
Personal
AI Invoice-to-Statements Web App ↗
End-to-end application leveraging LLM pipelines and NLP to parse heterogeneous invoice formats, extract structured fields, and generate standardized financial statements automatically.
LLMsNLPPythonREST APIs
Personal · ETL
Canadian Household Purchasing Power Pipeline ↗
110 yrsof income data
2,112+household records
ETL pipeline harmonizing 110 years of Canadian income data across 4 sources — integrating 2,112+ household records with group-based imputation for 1,708 missing values and Z-score outlier detection.
PythonSQLPandasAWSETL
Personal
Arcade Cabinet Linux Kiosk ↗
Built a dedicated Linux kiosk system for an arcade cabinet — automated boot-to-launch with Bash & system scripts, optimized input, display, and audio for low-latency gameplay.
LinuxBashsystemd
University
Facial Expression Recognition ↗
Real-time facial expression classifier using convolutional neural networks in PyTorch — data augmentation, K-fold cross-validation, bias evaluation, and latency-aware inference optimization for accurate real-time predictions.
PyTorchCNNsOpenCVPython
04.
Skills
core stack
Languages
PythonSQLPySparkBashC#JavaScriptTypeScriptGit
Azure Data
Data Factory (ADF)Data Lake Storage (ADLS)Synapse AnalyticsAzure FunctionsLogic AppsDatabricks
Data Engineering
ETL / ELT PipelinesApache AirflowData ModelingData WarehousingStar SchemaDimensional Design
Databases
SQL Server (SSMS)PostgreSQLMySQLT-SQLQuery Optimization
DevOps & IaC
TerraformDockerKubernetesCI/CD (YAML)PowerShell
Practices
Agile / ScrumSecure Data Handling (PII)Model EvaluationDisaster Recovery
05.
Education
background
2021 — 2026
B.Eng, Software Engineering · Concordia University
Montréal — Member of the Institute for Co-operative Education
2019 — 2021
Health Sciences College Diploma · Vanier College
Montréal — Honors Program