amirulshafiq98/README.md

📊 Amirul Shafiq - Data & Automation Engineer Portfolio

Hi, I’m Amirul 👋

I’m an automation-focused data professional who enjoys building end-to-end data workflows, from extraction and cleaning to validation and delivery. My experience spans consumer insights, supply chain, market research, and education operations.

I got serious about data during my research internship in Chiang Mai, Thailand, where I worked on optimising honey storage conditions for local beekeepers. The data didn’t give easy answers at first, and that experience taught me the value of experimentation, debugging, and persistence. It also sparked my interest in building reliable, reusable data workflows and improving data quality.

Here you will find my projects in Python, SQL and dashboarding (Tableau/Power BI).

resume linkedin


🛠 Skills

PostgreSQL Salesforce Power BI Tableau Python Microsoft Office Google Workspace

📘 Table of Contents

📌 Highlighted Projects

Organisation Data Processor


Repository: Data Processor App

Objective: Automate annual cleaning + transformation workflows for non-technical users

Tech: tkinter, pandas, regex, pathlib

Outcome: Reduced processing time from 5 hours → 30 minutes
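
As a rough illustration of the kind of cleaning step a tool like this can automate, here is a minimal sketch using only `re` and `pathlib`. It is not the repository's actual code: the function names `clean_org_name` and `output_path_for`, the cleaning rules, and the `_cleaned` suffix are all hypothetical.

```python
import re
from pathlib import Path


def clean_org_name(raw: str) -> str:
    """Collapse repeated whitespace and trim stray punctuation (hypothetical rules)."""
    text = re.sub(r"\s+", " ", raw).strip()
    # Remove leading/trailing separators often left behind by manual data entry.
    return text.strip(" .,;:-")


def output_path_for(source: Path, suffix: str = "_cleaned") -> Path:
    """Build an output path next to the source file, keeping its extension."""
    return source.with_name(f"{source.stem}{suffix}{source.suffix}")


if __name__ == "__main__":
    print(clean_org_name("  acme   pte ltd.  "))
    print(output_path_for(Path("data/orgs_2024.csv")))
```

Keeping each rule in a small pure function makes it easy to test the cleaning logic separately from any GUI wrapper.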


University Allocation Based on WEF Report (2025)

Histogram

Repository: University Allocation Optimisation

Objective: Allocate students to majors using realistic demand curves + constraints

Description: Designed an Integer Linear Programming (ILP) model using PuLP, factoring in costs, major demand, and budget constraints. Simulated multiple demand curve options (exponential, sigmoid, power law) to model student preferences.

Tech: PuLP, matplotlib, numpy, pandas

Outcome: Produced interpretable allocation plans useful for education planning
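
To give a feel for the three demand curve shapes mentioned above, here is a minimal sketch in plain Python. It assumes majors are ranked by popularity, with rank 0 the most popular. The function names, default parameters, and the normalisation step are illustrative assumptions rather than the repository's implementation, and the ILP itself is not shown.

```python
import math


def exponential_demand(rank: int, rate: float = 0.5) -> float:
    """Demand decays by a constant factor per rank step."""
    return math.exp(-rate * rank)


def sigmoid_demand(rank: int, midpoint: float = 3.0, steepness: float = 1.0) -> float:
    """Demand stays high for top ranks, then drops off around the midpoint."""
    return 1.0 / (1.0 + math.exp(steepness * (rank - midpoint)))


def power_law_demand(rank: int, exponent: float = 1.5) -> float:
    """Heavy-tailed demand: a few majors dominate, the rest tail off slowly."""
    return (rank + 1) ** -exponent


def normalise(weights: list[float]) -> list[float]:
    """Scale raw weights so they sum to 1 and can be read as demand shares."""
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    ranks = range(5)
    for curve in (exponential_demand, sigmoid_demand, power_law_demand):
        shares = normalise([curve(r) for r in ranks])
        print(curve.__name__, [round(s, 3) for s in shares])
```

Shares like these could then be used as demand coefficients or targets in an allocation model, depending on how the constraints are set up.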


Mochi Ice-Cream Sensory Analysis

Boxplot

Repository: Mochi Ice cream

Objective: Build an analysis-ready pipeline to identify best-performing Mochi formulation

Methods: Principal Component Analysis (PCA), K-Means Clustering, Data Visualization, Descriptive Statistics

Tech: PostgreSQL, Scikit-learn, Pandas, Plotly, dbt, SQLAlchemy

Outcome: Clean, normalised dataset powering PCA + clustering analytics


More Projects

Python

SQL

Visualisation (Tableau and Power BI)


🎓 Education

Bachelor (Tech) of Food Technology with Honours
Singapore Institute of Technology (SIT)

Relevant Modules: Applied Data Science, Design of Experiments (DOE), Sensory Science, Consumer Insights, Python Programming, Multivariate and Descriptive Statistics, Optimisation Modelling (ILP/LP/RSM/Centroid Design)

📜 Certifications

Microsoft Excel Professional.pdf

Tableau Intelligence Analyst.pdf

Google Business Intelligence.pdf

Google Cloud Analytics.pdf

Google Advanced Data Analytics.pdf

Pinned

  1. Work-Stuff Public

    Collection of files I have created for people to use and automate their workflows. I try to make every project here with the philosophy of 'it just works'.

    Python

  2. Uni_Allocation Public

    After reading the WEF report on jobs that can be affected by AI, I decided to simulate this demand with university courses to better allocate students to meet this demand.

    Python

  3. HR_Attrition Public

    This project presents a comprehensive, full-stack solution designed to accurately understand and predict employee attrition, mimicking a real-world data science pipeline. Leveraging modern machine …

    Jupyter Notebook

  4. mochi_icecream Public

    A project I did for my FYP, where we compared 5 different formulations to determine which one was best suited for scaling up production.

    Jupyter Notebook

  5. olist-sales Public

    Olist is a Brazilian e-commerce site that sells all sorts of things. I took this dataset from Kaggle, did EDA using SQL, and then visualised the results in Power BI.

  6. honey Public

    Honey project I did during my internship at Chiang Mai University to predict the best conditions for storage after 6 weeks.

    Python