Location
PolandRate
$24
/ per hour
Years of experience
5+About
I hold an MSc in Data Science and a BSc in Mathematics with a specialization in Data Analysis from the University of Wroclaw. My educational background has provided me with a strong foundation in programming, cloud services, and data analysis, which I have applied in various roles at McKinsey & Company. As a Data Scientist, I have developed advanced analytics solutions, including a contract analysis system using LangChain and GPT models, and built ML pipelines with Azure ML and AWS SageMaker. My experience also includes creating streamlit apps for prototype discussions and preprocessing big data using PySpark. Prior to my current role, I gained valuable experience as an Advanced Analytics Intern at McKinsey, where I built RShiny apps and developed a category spend analysis package. Additionally, during my internship at SENSDX, I contributed to diagnostic testing technologies by building web applications for data collection and analysis. My extracurricular activities, including being a board member of the Students Scientific Circle of Applied Mathematics and participating in workshops like Women in Trading & Technology, have further enhanced my skills in data analysis, communication, and teamwork.Tech Stack
Data Science, AWS, Azure, Github, Machine Learning, Pandas, PySpark, Python, R, SQL, TensorflowExperience
- Created a contract analysis system that includes automated summary generation, entity extraction, clause identification, and a chatbot using LangChain and GPT models.
- Applied ML and statistical methods in Python for spend data categorization, utilizing models like fastText and Hugging Face.
- Developed ML pipelines and real-time/batch endpoints using Azure ML and AWS SageMaker.
- Designed streamlit applications to discuss prototypes with Subject Matter Experts (SMEs), facilitating effective communication and project iteration.
- Used PySpark for preprocessing large datasets, ensuring data readiness for analysis.
- Developed RShiny apps for cost engineering experts and faster picture classification and data collection in previous roles.
- Customized PowerBI dashboards for clients, enhancing data visualization and decision-making processes.
Employment history
Data Scientist II, McKinsey & Company
Jan 2024 - Present
- Worked in a multinational team focused on spend analytics and performance measurement in procurement.
- Created a contract analysis solution with automated summary generation, entity extraction, clause identification, and a chatbot using LangChain and GPT models.
- Applied ML and statistical methods in Python for spend data categorization, using fastText and Hugging Face models.
- Developed ML pipelines and real-time/batch endpoints with Azure ML and AWS SageMaker.
- Designed streamlit apps to facilitate prototype discussions with Subject Matter Experts (SMEs).
- Preprocessed big data using PySpark to ensure data readiness for analysis.
Data Scientist I, McKinsey & Company
Jul 2022 - Jan 2024
- Contributed to spend analytics and performance measurement projects in procurement.
- Utilized Python and statistical methods for data categorization and analysis.
- Developed and deployed ML models using cloud services such as Azure ML and AWS SageMaker.
- Collaborated with cross-functional teams to enhance data-driven decision-making.
Junior Data Scientist, McKinsey & Company
Oct 2021 - Jul 2022
- Supported data science projects by building ML models and analyzing large datasets.
- Created streamlit applications for internal and external stakeholder engagement.
- Conducted data preprocessing and feature engineering using PySpark and Python.
- Assisted in the development of contract analysis tools using NLP models.
Advanced Analytics Intern, McKinsey & Company
Sep 2020 - Sep 2021
- Built an RShiny app for cost engineering experts to create cost formulas for potential savings.
- Developed an R package for comprehensive category spend analysis using statistical methods.
- Performed data cleaning, preprocessing, and quality checks using Alteryx and Python.
- Customized PowerBI dashboards to meet specific client needs, enhancing data visualization.
Data Analyst Intern, SENSDX
Jul 2019
- Worked on diagnostic testing technologies for accurate disease diagnosis at home.
- Developed web applications with RShiny for faster picture classification and data collection.
- Created interactive maps of flu infection trends since 2015 using RShiny.
- Collaborated with the development team to enhance the diagnostic testing platform.
Education history
University of Wroclaw, Poland
2019-2021
MSc in Data Science
University of Wroclaw, Poland
2016 - 2019
BSc in Mathematics
We’ve helped 83 clients with IT recruitment and software development.
Read about a few of them below...