Location
PolandRate
Years of experience
12+About
I am an experienced Data Engineer with a strong background in building robust data processing frameworks and solutions. Currently, I work at Deutsche Börse, where I am instrumental in designing and developing a new generic function-based framework for data processing using Databricks, SQL, Azure Data Factory, and Azure Functions. My role involves creating fully parameterized ADF pipelines, developing PySpark scripts, and managing the overall process through Azure functions. Previously, I worked at Clouds On Mars on retail and FMCG projects, where I built frameworks utilizing Databricks and PySpark. I implemented Great Expectations for data validation and created comprehensive ETL processes. My work at RB involved optimizing data integration frameworks and developing solutions to process incremental data for machine learning. I hold a post-diploma degree in Big Data from the Warsaw University of Technology and a Master’s degree in Psychology from John Paul II Catholic University of Lublin. My technical skills include advanced knowledge of Python, PySpark, Databricks, Azure Data Factory, and DAX, complemented by intermediate expertise in other Azure resources and MS SQL. I am proficient in English and have a proven track record of delivering high-quality data engineering solutions.Tech Stack
Data Engineering, Azure, Databricks, DAX Studio, Firebase, JavaScript, Microsoft Azure, MySQL, Power BI, PySpark, Python, SQLExperience
- Designed and developed a new generic function-based framework for data processing at Deutsche Börse, leveraging Databricks, SQL, Azure Data Factory, and Azure Functions.
- Created fully parameterized ADF pipelines, PySpark scripts, and SQL stored procedures to manage and log data processing activities.
- Applied Great Expectations for data validation and developed comprehensive ETL processes.
- Optimized data integration frameworks and developed solutions to process incremental data for machine learning at RB.
- Developed automated, event-based solutions for producing Power BI reports within Azure stack at Nielsen, and supported performance improvements for other Power BI projects.
- Conducted end-to-end learning and development programs and created HR reports using Power BI in previous HR roles.
Employment history
• Designed and developed a new generic function-based framework for data processing using Databricks, SQL, Azure Data Factory, and Azure Functions.
• Created fully parameterized ADF pipelines and configuration databases to drive entire flow and logging logic.
• Developed PySpark scripts including generic functions, business custom functions, and master notebook execution functions.
• Implemented Azure functions to trigger and manage data processing workflows.
• Developed SQL stored procedures for logging activities and maintaining data integrity.
• Supervised and mentored team members to enhance their development skills.
• Designed and built new generic frameworks for processing data in retail and FMCG projects using Databricks and PySpark.
• Created PySpark classes, including Databricks Autoloader generic class, incremental loads class, and ETL classes.
• Implemented Great Expectations for data validation and quality assurance.
• Designed logging and configuration layers to streamline data workflows.
• Developed and managed Databricks workflows for efficient data processing.
• Collaborated with cross-functional teams to meet project requirements and deadlines.
• Provided technical support and troubleshooting for data processing issues.
• Rewrote solutions to make data processing frameworks more generic and scalable.
• Created frameworks using Databricks and Data Factory to process incremental data for external vendors and machine learning applications.
• Optimized old measures and data models in Azure Analysis Services (AAS), reducing data size significantly.
• Incorporated new data sources into the main data integration framework.
• Developed new components and enhanced existing ones for data integration processes.
• Conducted performance tuning and optimization of data processing workflows.
• Developed automated, event-based solutions for producing Power BI reports using Azure stack (Azure Data Factory, Logic Apps, SQL Service).
• Created new Power BI reports and supported performance improvements for existing ones.
• Designed and implemented data pipelines to integrate various data sources.
• Collaborated with stakeholders to gather requirements and deliver data solutions.
• Ensured data accuracy and consistency across all reports.
• Conducted end-to-end learning and development programs for employees.
• Built and analyzed HR reports using Power BI to support decision-making.
• Designed and delivered soft skill training programs.
• Developed and implemented HR policies and procedures.
• Collaborated with HR teams to improve employee engagement and retention.