Hello World! My name is

Mrinal Walia.

I love exploring DATA as it is the new SCIENCE!

I am a Junior Data Scientist with 3.5+ years of proactive industry experience and a strong commitment to ongoing learning and innovative ideation.

Presently, I have recently completed my Master’s in Applied Computing with a specialization in Artificial Intelligence from the University of Windsor in Fall 2023. Concurrently, I am engaged in a Data Scientist Co-op position at Loblaws Companies Limited from January to August 2023. In this role, I contribute to the optimization of MLOps pipelines and conduct analysis on competitor datasets to extract valuable insights for the business.

ABOUT ME

I am thrilled to express my interest in the Data Scientist role with my data analysis & analytics skills and AI experience. I am excited to contribute innovative solutions to iconic brands.

At my most recent employer at Loblaws Companies Limited, I achieved milestones like enhanced backtesting leading to a $35M revenue boost, revamped PARROT pipeline, competitor analysis, and a 30% newsletter engagement increase with a proven results-driven approach.

Prior to my Master’s, I worked as a Junior Data Scientist for 2.4 years at TESSACT where I enhanced ML model’s product metrics by 1.35%, developed end-to-end cloud data solutions using Python, GCP, and SQL, and collaborated with senior Data Scientists, building scalable deployment scripts.

I am an active open-source contributor who loves to optimize machine learning models and make valuable predictions out of data. I also share my thoughts and knowledge with others by writing Articles on Data Science, Machine Learning, GenerativeAI, LLMs, Open-Source, and more.

I have been awarded as OpenMined Featured Contributor, multiple times winner for Analytics Vidhya Blogathons, and the top GitHub user in India in the year 2020 by making 3300+ contributions to 300+ repositories.

Here is a list of the technologies that I'm familiar with!
  • Python, R, Java
  • PyTorch, Tensorflow, Neural Networks
  • Data Analysis & Visualizations
  • Machine Learning Modelling
  • Airflow, PowerBI, AWS
  • Spark, Hadoop, & GCP
  • SQL, MySQL, PostgreSQL
  • Git, Jira, MS Tools

PROFESSIONAL EXPERIENCE

  • Developed and executed an enhanced backtesting methodology for wholesale store groups, resulting in an additional $35M in yearly revenue (based on yearly revenue in 2022).
  • Pioneered the revamp of rating metrics for the backtesting pipeline of Promo Optimization Tool, introducing more interpretable and comprehensive metrics to evaluate model performance.
  • Performed data analysis to identify competitors to Loblaws Divisions based on price differences for various products and integrated the feature to the front-end tool for Business to access the competition pricings.
  • Spearheaded the development of a comprehensive newsletter team, resulting in a 30% increase in engagement.
  • Executed privacy-preserving machine learning and federated learning algorithms from PySyft library on sensitive and real private data.
  • Built a one-click end-to-end testing CI/CD pipeline using GitHub actions.
  • Created comprehensive Jupyter Notebooks guides and tutorials (Pull Request: #6633, #6669, #6759, https://github.com/OpenMined/PySyft/pulls/abhiwalia15) to facilitate knowledge sharing and collaboration within the open-source data science community.
  • Improved RMSE and LogLoss metric of a machine learning model for a FinTech project by 1.35% by tactfully framing business problems and communicating challenges and results to key partners.
  • Implemented end-to-end data solutions using cloud platforms, applying advanced analytical methods to manipulate and analyze large, complex datasets for actionable insights.
  • Collaborated with cross-functional teams, including senior Data Scientists, to develop and implement scalable deployments scripts using SQL query processing methods in production environments for CI/CD.
  • Designed and integrated various ETL processes using cutting-edge technologies such as AWS S3, Single Store, Lambda, Kafka, Cloud Watch, EC2, and API Gateway for efficient data processing and management.
  • Consolidated data scraping strategies, encompassing dynamic scraping and API extraction, to prepare datasets.
  • Developed and administered a robust Data Quality Validation Framework, establishing rules to ensure comprehensive data quality checks and processes.
  • Collected, analyzed, processed, and modelled data (5M+) to create actionable plans for ongoing projects in given timeframe.
  • Performed automated data labelling and annotations to multiple classes of data for training machine learning models.
  • Implemented and tested new machine learning models by generating quality assurance plan and test specifications.
  • Analyzed and designed both data and technical requirements with respective team lead and other team members.
  • Planned timely meetings to communicate efficiently with team lead and interns on project progress.
  • Executed back-end pipeline development of ML models (AWS + Spark) based on client’s requirements.
  • Programmed an image classification pipeline for detecting face nebulizer masks with over 98% accuracy.
  • Performed automated data labelling and annotations to multiple classes, saving time for model selection and preprocessing
  • Implemented and tested machine learning model by curating quality assurance plan and test specifications
  • Worked on 3 projects to build creative technological solutions that solved real business problems by taking actions
  • Completed the deliverables well-before timelines and showed outstanding interpersonal and time-management skills

EDUCATION

Jan 2022 - Sep 2023
Master of Applied Computing (Artificial Intelligence Stream)
University of Windsor, Windsor, Ontario
  • During my coursework and specialization in AI, I actively engaged in projects focused on database design, data mining, and advanced data analysis techniques.
  • Completed 2 full-stack projects by possessing a strong foundation in SQL, Python, data visualization, and data management techniques.
Aug 2017 - Jul 2021
Bachelor of Engineering in Computer Science & Engineering
Dayananda Sagar College of Engineering, Bangalore, India

Published one papers in the MATJOURNALS Conference on Artificial Intelligence.

  • Presented the research, “Recovering Human 3D Model from Monocular 2D Images for Detecting Postures Deformities”, 2021; awarded ‘Best Paper Presentation’ in the Image Processing category.
  • Elaborated on deep learning algorithms like BlazePose, DeepPose, OpenPose, etc. used for pose estimation.

Extracurricular Activities

  • Organized and led workshops on Data Analytics, sharing practical insights with peers in the Computer Science program.
  • Initiated a Data Science Club, fostering a collaborative environment for hands-on projects and knowledge exchange.
  • Participated in hackathons, leveraging data-driven approaches to tackle real-world challenges and showcase problem-solving skills.

ACADEMIC PROJECTS

Stocks Analysis & Prediction
Python R StreamLit LSTM RestAPIs NoSQL TensorFlow Git
Stocks Analysis & Prediction
Designed and coded an interactive dashboard for predicting stocks prices (98% accuracy) for next 30 days.
Learn Python for DS and ML
Pandas Jupyter Notebook NumPy Plotly Python
Learn Python for DS and ML
Developed an extensive Python program covering data visualization, NumPy, pandas, web scraping, and machine learning with SciKit Learn.
Facial Emotion Recognition
Python TensorFlow RestAPI NLP MongoDB OpenCV
Facial Emotion Recognition
Achieved 94% accuracy by designing a neural network architecture to recognize 7 types of emotions in real-time.
IPL Team Prediction
Python TensorFlow Seaborn Plotly cufflinks MS Excel
IPL Team Prediction
Developed an extensive Python program to find the best performing players in cricket sports busing the dataset and technologies like data visualization, NumPy, pandas, web scraping, and machine learning with SciKit Learn.
COVID-19 Cases Prediction
Python Machine Learning Data Visualization Visualizations Dashboards Tensorflow.js OpenCV
COVID-19 Cases Prediction
Conducted time series analysis on COVID-19 data, identifying trends in affected cases, deaths, and recoveries. Created data visualizations and dashboards to share valuable insights with the data science department.
Detecting Postures Deformities
Pose Estimation Deep Learning Python MatPlotLib Seaborn OpenCV
Detecting Postures Deformities
Presented at the MATJOURNALS, 2021; awarded ‘Best Paper Presentation’ in the Image Processing category. Elaborated on deep learning algorithms like BlazePose, DeepPose, OpenPose, etc. used for pose estimation.

GET IN TOUCH

My inbox is always open 📥 Whether you have a question or just want to say hello 👋 I’ll try my best to get back to you! Feel free to mail 💌 me about any relevant job updates or interesting project idea.