Abstract

Medical image classification is crucial for improving diagnosis and treatment, especially where manual analysis is time-consuming and less precise than Computer-Aided Diagnosis (CAD) systems. Our study focuses on classifying microscopic fungi images, addressing the significant yet frequently overlooked health threat posed by fungal infections. We evaluated the effectiveness of transfer learning with four pre-trained models (VGG16, ResNet50, EfficientNetb0, and ViT16), using metrics such as balanced accuracy, the Matthews correlation coefficient (MCC), F1-score, and confusion matrices. We also compared the deep learning models with traditional machine learning algorithms such as Logistic Regression, Naive Bayes, and Random Forest. To better estimate performance on unseen data, we applied cross-validation to the deep learning models. Notably, EfficientNetb0 outperformed all other models, achieving a balanced accuracy of 0.9 when trained on the augmented dataset. Additionally, we employed Grad-CAM for model explainability and visualized the attention mechanism of ViT. These findings underscore the potential of deep learning models in medical image classification and their role in addressing critical healthcare challenges.
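For readers who want to reproduce the evaluation metrics named above, the following is a minimal sketch using scikit-learn. The helper name `summarize_predictions`, the toy labels, and the choice of macro-averaged F1 are assumptions made for illustration; the notebooks may compute these metrics differently.

```python
import numpy as np
from sklearn.metrics import (
    balanced_accuracy_score,
    confusion_matrix,
    f1_score,
    matthews_corrcoef,
)


def summarize_predictions(y_true, y_pred):
    """Collect the metrics named in the abstract (hypothetical helper)."""
    return {
        # Mean of per-class recall, robust to class imbalance.
        "balanced_accuracy": balanced_accuracy_score(y_true, y_pred),
        # Matthews correlation coefficient, generalized to multiple classes.
        "mcc": matthews_corrcoef(y_true, y_pred),
        # Macro averaging is an assumption; the report may average differently.
        "macro_f1": f1_score(y_true, y_pred, average="macro"),
        # Rows are true classes, columns are predicted classes.
        "confusion_matrix": confusion_matrix(y_true, y_pred),
    }


# Toy labels for three hypothetical classes (not data from the study).
y_true = np.array([0, 0, 0, 0, 1, 1, 2, 2])
y_pred = np.array([0, 0, 0, 1, 1, 1, 2, 0])

report = summarize_predictions(y_true, y_pred)
for name, value in report.items():
    print(name, value, sep="\n")
```

Balanced accuracy averages the recall of each class, so a model cannot score well simply by favoring the most frequent class.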

Important Results

A complete 25-page report of our study is available in Project_Report.pdf. For brevity, we showcase only the most important figures and findings here.

Performance of classic machine learning algorithms on Histogram of Oriented Gradients (HOG) features extracted from images using OpenCV:

Performance of the VGG16, EfficientNetb0, ResNet50 and Vision Transformer pre-trained deep learning architectures, with weight fine-tuning:

Visualization of Gradients on example microscopy images per deep learning architecture:
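For context on how such visualizations can be produced, here is a minimal Grad-CAM sketch in PyTorch. The `TinyCNN` model, the `grad_cam` function, and the random input are stand-ins invented for the example; the study applied Grad-CAM to the pre-trained architectures, and its implementation may differ.

```python
import torch
import torch.nn.functional as F
from torch import nn


class TinyCNN(nn.Module):
    """Small stand-in network used only to exercise the sketch."""

    def __init__(self, num_classes=3):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(8, 16, 3, padding=1), nn.ReLU(),
        )
        self.head = nn.Linear(16, num_classes)

    def forward(self, x):
        x = self.features(x)
        return self.head(x.mean(dim=(2, 3)))  # global average pooling


def grad_cam(model, target_layer, image, class_idx=None):
    """Return a heatmap in [0, 1] with the spatial size of a single input image."""
    activations, gradients = [], []

    def forward_hook(module, inputs, output):
        activations.append(output)
        # Capture the gradient of the class score w.r.t. this feature map.
        output.register_hook(gradients.append)

    handle = target_layer.register_forward_hook(forward_hook)
    try:
        model.eval()
        logits = model(image)
        if class_idx is None:
            class_idx = int(logits.argmax(dim=1))
        model.zero_grad()
        logits[0, class_idx].backward()
    finally:
        handle.remove()

    acts = activations[0].detach()
    grads = gradients[0].detach()
    # Channel weights are the spatially averaged gradients.
    weights = grads.mean(dim=(2, 3), keepdim=True)
    cam = F.relu((weights * acts).sum(dim=1, keepdim=True))
    cam = F.interpolate(cam, size=image.shape[-2:], mode="bilinear",
                        align_corners=False)
    cam = cam - cam.min()
    cam = cam / (cam.max() + 1e-8)
    return cam[0, 0]


torch.manual_seed(0)
model = TinyCNN()
image = torch.randn(1, 3, 32, 32)  # random stand-in for a microscopy image
heatmap = grad_cam(model, model.features[-1], image)
print(heatmap.shape)
```

The heatmap can be overlaid on the input image to show which regions contributed most to the predicted class.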

Final Results:

The DL models' generalization ability on unseen data from the same distribution is far superior: their scores are more than 40% better than those of the best ML classifier.

Another important result of this study is that EfficientNetb0 outperformed all other models when trained on both the original and the augmented dataset, despite having fewer parameters than the other models. This indicates that the multi-objective neural architecture search performed by its authors can be a promising method for the architectural design of neural networks.

File descriptions

Other

Project_Report.pdf is the 25-page report of our study

Notebooks

classic-ml.ipynb contains code for preprocessing and classifying the images utilizing classical ML algorithms
data_exploration.ipynb contains code for the EDA of the dataset
fungi-efficientnet-classification_final.ipynb contains code for the transfer learning of EfficientNetb0
visiontransformer.ipynb contains code for the transfer learning of Vision Transformer Base 16
fungivgg16classification.ipynb contains code for the transfer learning of VGG16
fungi-resnet-classification_final contains code for the transfer learning of ResNet50

Scripts

engine.py contains functions used for training PyTorch models
helper-functions.py contains functions mostly used to visualize the results of classification and training tasks.

KyriakosPsa/Transfer-Learning-Vision
