Project Overview

This project collected a demographic information of customer data as well as store, product, and promotion data. SQL database was used to store and query the data. Raw data was divdided into multiple tables and integrated according to column properties. To discover customer segment distributions, the factors of media cost and store sales, Tableau public was used to analyze and visualize the results. On the other hand, random forest classification, deep learning, random forest regressor and multiple linear regression model were applied to explore the correlation between member card, occupation, cost, store sales and other features, and make predictions. Finally, a webpage library was built to summarize and display our team work.


Guiding Questions:

How many stores there are? In which cities and states these stores are? Do costumer demographics link to membership card tiers? Is the cost of sales promotions linked to customers demographics? What is media cost, by state, by store type, by food department? Which are top media promotions, the top unit of sales? What are the sales, by department, store, food category?



About FoodMart


Data Source


Database

SQL

PostgreSQL       AWS


Model

Python       Jupyter       VS Code


Visualization


About Us

GitHub       Slack       Zoom


Summary


🔵 1. Membership Card tiers are strongly linked with customer demographics. With deep learning model, we can predict Membership Card with accuracy of 0.970. With random forest model, prediction accuracy can be up to 0.982. Advertising on Membership Card tiers should be targeted via segmentation.

🔵 2. Media Cost can also be predicted by customer demographics with random forest regressor (RMSE: 0.637). Owners & Investors should pay close attention to the makeup of their customers when projecting Sales Promotion budgets.

🔵 3. For media type, Daily Paper advertisements are strongly correlated to store sales, thus newspaper promotions are highly recommended.

🔵 4. Supermarkets have the most of customers & spend more money on advertisements (highest Media Cost by store type). Weekend Markdown is the most effective way to stimulate customers to shop in Supermarkets & Supermarket also invests the most money in their Weekend Markdown promotions.

🔵 5. Store characteristics (Square Feet, Store Cost, Coffee Bar, Video Store, etc.) are strong predictors of Store Sales. Investor should expect to invest in a grocery store’s physical features, amenities if they want to maximize store sales.