Template Credit: Adapted from a template made available by Dr. Jason Brownlee of Machine Learning Mastery.
SUMMARY: This project aims to construct a predictive model using a TensorFlow convolutional neural network (CNN) and document the end-to-end steps using a template. The Chest X-Ray Pneumonia dataset is a binary classification situation where we attempt to predict one of the two possible outcomes.
INTRODUCTION: The dataset contains chest X-ray images selected from retrospective cohorts of pediatric patients of one to five years old from Guangzhou Women and Children’s Medical Center, Guangzhou. The image collection is organized into three folders (train, test, val) and contains subfolders for each image category (Pneumonia/Normal). There are 5,863 X-Ray images with various display resolutions in this collection.
From iteration Take1, we trained a simple three-layer CNN model and used the model’s performance as the baseline.
From iteration Take2, we used the same three-layer CNN model from Take1 and applied it to the same photos with a higher resolution (from 640×480 to 1024×768).
In this Take3 iteration, we will construct a VGG-16 CNN model and compare the result with the baseline model.
ANALYSIS: From iteration Take1, the baseline model’s performance achieved an accuracy score of 100% after 20 epochs using the validation dataset. However, the final model processed the test dataset with an accuracy measurement of only 72.91%.
From iteration Take2, the model’s performance achieved an accuracy score of 62.50% after ten epochs using the validation dataset. However, the final model processed the test dataset with an encouraging accuracy measurement of 83.33%.
In this Take3 iteration, the performance of the VGG-16 model achieved an accuracy score of 87.50% after ten epochs using the validation dataset. However, the final model processed the test dataset with an encouraging accuracy measurement of 76.44%.
CONCLUSION: In this iteration, the TensorFlow CNN model appeared to be suitable for modeling this dataset, but we need to experiment with the TensorFlow model to improve its accuracy.
Dataset Used: Chest X-Ray Images (Pneumonia) Dataset
Dataset ML Model: Binary image classification with numerical attributes
One potential source of performance benchmarks: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia
The HTML formatted report can be found here on GitHub.