Future Imperfect by HTML5 UP

USING K-MEANS, HIERARCHICAL CLUSTERING, AND DBSCAN FOR MARKET SEGMENTATION IN INSURANCE: A CASE STUDY WITH UNSUPERVISED LEARNING AND DIMENSIONALITY REDUCTION TECHNIQUES

UNSUPERVISED MACHINE LEARNING PROJECT

May 16, 2023 Qichun (Jackson) Yu

The project is developing unsupervised machine-learning models to group customers into segments for the purpose of giving insurance product recommendations. Customers are divided into subgroups based on some types of similar characteristics. The dataset includes summary information on 18 behavioural variables from the 8,950 active credit cardholders. Behaviours include how a customer spends and pays over time. The notebook explores different unsupervised algorithms such as k-means, hierarchical clustering, and DBSCAN for an insurance company to divide customers into groups to optimize marketing campaigns for insurance products. Standardization is used to rescale data to have a mean of 0 and a standard deviation of 1. PCA and TSNE methods are used for dimensionality reduction and visualization. After comparing with the silhouette score and visualized plots, the optimal model is the k-means method with a k value of three that is trained with PCA scaled data. There are small groups of people who have similar behaviours on purchasing, cash advances, credit limits and so on. The K-means clustering method helps identify the group that has similar features. After the segmentation, an insurance company will provide insurance product recommendations based on their characteristics.

Building a Deep Learning Model to Solve Sudoku: A Case Study with CNNs

DEEP LEARNING PROJECT

May 25, 2023 Qichun (Jackson) Yu

Sudoku is a number puzzle game that requires you to fill in digits 1 to 9. The game requires digits 1 to 9 to appear exactly once in each row, column and each of the nine 3x3 subgrids. The project experiment with different neural networks such as CNN. The data have been divided by 9 and subtracted by 0.5 to achieve zero mean-centred data. The CNN model that includes 9 convolution layers with 512 kernels works best with 95% of training accuracy. The study found that an increase in the number of epochs, number of layers, and number of neurons per layer can help improve the accuracy of the neural network model. Moreover, the dropout layer and maxpooling can help prevent overfitting. Adding strides of 3 x 3 is useful but requires large computing power. The main objective of this project is to build a deep learning model for a mobile app company that can analyze the grid of Sudoku to be filled, solve the Sudoku problem, and fill the grid. The convolution neural networks (CNN) is good at extracting features from the dataset and can be used to solve a sudoku game successfully.

Jackson's Blog

Using K-Means, Hierarchical Clustering, and DBSCAN for Market Segmentation in Insurance: A Case Study with Unsupervised Learning and Dimensionality Reduction Techniques

Building a Deep Learning Model to Solve Sudoku: A Case Study with CNNs

USING K-MEANS, HIERARCHICAL CLUSTERING, AND DBSCAN FOR MARKET SEGMENTATION IN INSURANCE: A CASE STUDY WITH UNSUPERVISED LEARNING AND DIMENSIONALITY REDUCTION TECHNIQUES

Building a Deep Learning Model to Solve Sudoku: A Case Study with CNNs

Jackson's Blog

Market Segmentation In Insurance

Solving Sudoku

About Me