Location
Poland
Rate
Years of experience
6
About
As an Airflow Consultant, I have built a strong career in data engineering and software development. Currently, I work as a Data Engineer for the New York Mets, a position I've held since November 2021. Before this role, I was a Data Warehouse Developer at Lingaro from January 2018 to October 2021, where I honed my skills in database management and data warehousing. Additionally, I worked as a Software Engineer at Asseco Business Solutions from June 2019 to February 2020, which further solidified my technical expertise. I hold a Bachelor's degree in Information Technology from Maria Curie-Sklodowska University, completed in 2016. My technical skill set includes proficiency in PySpark, Apache Airflow, Python, PostgreSQL, SQL, and Java. I am fluent in English (C1 level) and a native speaker of Polish. My diverse experiences and technical skills have prepared me to excel in complex data engineering projects and contribute effectively to any team.
Tech Stack
Airflow, Java, MySQL, Python, Spark, SQL
Experience
- Developing and Managing Data Pipelines: Design, implement, and maintain complex data pipelines using Apache Airflow to ensure efficient and reliable data processing.
- Data Warehousing: Building and optimizing data warehouse solutions to facilitate efficient data storage, retrieval, and analysis.
- Software Development: Developing and deploying software solutions using Python, SQL, and Java to support data engineering projects.
- Database Management: Managing and optimizing databases, including PostgreSQL, to ensure high performance and data integrity.
- Technical Support and Troubleshooting: Providing technical support and troubleshooting for data pipeline and database issues to ensure smooth operations.
- Collaboration and Communication: Working closely with cross-functional teams, including data scientists and analysts, to understand their data needs and deliver solutions that meet business requirements.
Employment history
Main Responsibilities:
– Develop, implement, and maintain data pipelines using Apache Airflow to ensure efficient data flow and processing.
– Automate data ingestion, transformation, and loading processes to support various business needs.
– Integrate data from multiple sources, ensuring data consistency and integrity.
– Maintain and optimize databases, including PostgreSQL, to support large-scale data operations.
– Monitor data pipeline performance, identifying and resolving bottlenecks.
– Optimize SQL queries and database performance to enhance data processing speed and efficiency.
– Implement data quality checks and validation processes to ensure the accuracy and reliability of data.
– Establish and enforce data governance policies and procedures.
– Work closely with data scientists, analysts, and other stakeholders to understand their data needs and provide tailored solutions.
– Collaborate with IT and software development teams to integrate data solutions into existing systems and applications.
– Provide technical support for data-related issues, troubleshooting and resolving problems promptly.
– Develop and maintain documentation for data pipelines, processes, and system configurations.
– Ensure data security and compliance with organizational policies and relevant regulations.
– Implement and maintain data access controls and encryption methods to protect sensitive information.
– Stay updated with the latest industry trends and technologies in data engineering.
– Continuously seek opportunities to improve data pipeline efficiency and reliability through innovative solutions and best practices.
– Develop and implement scalable data warehouse solutions to meet business requirements.
– Design schemas, including star and snowflake schemas, to optimize data storage and retrieval.
– Develop Extract, Transform, Load (ETL) processes to integrate data from various sources into the data warehouse.
– Automate ETL workflows to ensure timely and accurate data updates.
– Create and maintain data models, including conceptual, logical, and physical models.
– Design and optimize database structures to support efficient querying and data analysis.
– Monitor data warehouse performance and identify areas for optimization.
– Optimize SQL queries and database indexes to enhance data processing speed and efficiency.
– Implement data validation and quality checks to ensure the accuracy and reliability of data.
– Develop and enforce data governance policies and procedures.
– Work closely with business analysts, data scientists, and other stakeholders to understand their data needs and provide tailored solutions.
– Collaborate with IT and software development teams to integrate data warehouse solutions into existing systems.
– Develop and maintain comprehensive documentation for data warehouse architecture, ETL processes, and data models.
– Ensure documentation is up-to-date and accessible to relevant team members.
– Design, develop, and maintain software applications using various programming languages such as Python, Java, and SQL.
– Write clean, scalable, and efficient code to meet project requirements and business needs.
– Integrate software applications with existing systems and third-party services.
– Develop APIs and middleware to facilitate seamless communication between different software components.
– Collaborate with stakeholders to gather and analyze software requirements.
– Translate business needs into technical specifications and project plans.
– Develop and execute unit tests, integration tests, and system tests to ensure software quality and reliability.
– Debug and resolve software defects and issues promptly.
– Design and manage databases, including schema design, indexing, and query optimization.
– Ensure data integrity and security within the software applications.
– Monitor and optimize software performance, identifying and addressing bottlenecks.
– Implement best practices for code optimization and efficient resource usage.
– Create and maintain comprehensive documentation for software design, development processes, and user manuals.
– Provide regular progress reports and updates to project managers and stakeholders.
– Work closely with other developers, designers, and IT professionals to ensure successful project delivery.
– Participate in code reviews, pair programming, and knowledge-sharing sessions.
– Provide technical support for software applications, troubleshooting and resolving issues reported by users.
– Perform regular maintenance and updates to keep software applications up-to-date and secure.