1. Introduction

aad

Advances in Alzheimer's Disease

2169-2459 2169-2467

Scientific Research Publishing

10.4236/aad.2024.134007

aad-141412

Articles

Biomedical Life Sciences, Medicine Healthcare

Deep Learning for Neuroimaging-Based Brain Disorder Detection: Advancements and Future Perspectives

Samuel

Ocen

Michaelina Almaz

Yohannis

Lawrence

Muchemi

aDepartment of Computer Science and Informatics, University of Nairobi, Nairobi, Kenya

31 12 2024

13 04 95 116 1, December 2024 28, December 2024 28, December 2024

2014

This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/

This review focuses on the recent advancements in neuroimaging enabled by deep learning techniques, specifically highlighting their applications in brain disorder detection and diagnosis. The integration of convolutional neural networks (CNNs) and recurrent neural networks (RNNs) has significantly improved feature extraction, pattern recognition, and predictive modeling, leading to enhanced accuracy, sensitivity, and specificity in diagnosing Alzheimer’s, Parkinson’s, schizophrenia, and brain tumors across MRI, fMRI, and PET scans. Despite these advancements, current challenges persist, including limitations in interpretability, data scarcity, and ethical concerns. To address these issues, future perspectives involve leveraging transfer learning, federated learning, and multimodal data sources. This review aims to provide a comprehensive overview of the current state of deep learning in neuroimaging, highlight the existing challenges, and discuss potential solutions for future innovation. By addressing the current limitations and exploring innovative techniques, deep learning can unlock new possibilities in neuroimaging, ultimately leading to improved diagnosis, treatment, and patient outcomes.

Deep Learning Brain Disorder Detection Neuroimaging Data

1. Introduction

In recent years, an increase of Brain disorders has significantly impacted the lives of many people worldwide [1] . With the latest advancements in medical imaging technologies, diagnosis has been simplified which has resulted into a non-invasive assessment of the brain’s structure and function [2] . Despite the advancement in medical imaging technologies like Magnetic Resonance Imaging (MRI) and Computed Tomography (CT), early diagnosis and precision remain a big challenge [3] . The over reliance to the knowledge and experience of radiologists and neurological specialists affects accuracy and reliability of the diagnosis and as such affects the critical role played by the human expertise [4] .

The [4] [5] has highlighted that accurate and timely detection of Alzheimer’s Disease amongst several other brain disorders like epilepsy, and others, remains a critical challenge amongst the medical community. Early diagnosis of these disorders is essential for initiating targeted interventions, monitoring disease progression, and improving patient outcomes. Traditional neuroimaging analysis methods often rely on manually engineered features and heuristic algorithms, which are limited in their ability to capture subtle and non-linear patterns inherent in neuroimaging data [4] .

The advancement of deep learning techniques has brought a dynamic change in the field of neuroimaging analysis, bringing significant progress in the early detection and diagnosis of various brain disorders. With the exponential growth in the Neuroimaging datasets, which include amongst others functional magnetic resonance imaging (fMRI), positron emission tomography (PET), and diffusion tensor imaging (DTI), provide crucial insights in studies related to the brain’s normal performance [5] . Deep learning models have demonstrated remarkable capabilities in extracting intricate patterns and representations from these complex data, contributing to the understanding and diagnosis of brain disorders.

Deep learning methods have shown promise in automating the analysis of neuroimaging data, offering data centric informed decision making to feature extraction, training, and detection of abnormalities in the brain tissues [5] [6] . Several Architectures have been adopted for brain disorder detection which include amongst others Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Transformer-based architectures and they have exhibited excellent performance compared to the normal machine learning methods [6] [7] .

The process of training deep learning models wouldn’t be possible if the data collection was poorly done. With the advancements of MRI machines, neurodegenerative studies have been simplified [7] [8] whose impact as made early detection of brain disorders possible to even the new bones [9] .

This paper presents a comprehensive analysis of current deep learning models applied to brain disorder detection using neuroimaging data. We aim to explore the various methodologies employed in the literature and provide valuable insights into the strengths and limitations of these approaches by critically examining various literatures for the past 10 years as well as identifying opportunities for further improvements and proposing potential avenues for future research. To answer this, the following research questions have guided this study.

1) What are the current advancements and limitations of deep learning techniques in neuroimaging-based brain disorder detection?

2) How do current challenges, such as interpretability, data scarcity, and ethical concerns, impact the development and deployment of deep learning models in neuroimaging?

3) What are the potential solutions and future perspectives for addressing current challenges and advancing deep learning in neuroimaging-based brain disorder detection?

This paper follows a systematic literature approach where Section 2.0 gives a background and discusses the significance, as well us outlining the guiding research questions of the study, Section 3.0 looks at deep learning and its applications in relation to brain disorder detection with subsections of deep learning for brain disorder detection, pre-training, and transfer learning as applied in deep learning, the concept of interpretability and explainability a major challenge in deep learning. Section 4.0 explores preprocessing and feature extraction reviewing through skull stripping, image registration and bias correction with the individual applications and algorithms used in each case. Section 5.0 looks at the considerations for preprocessing and feature extraction and finally a discussion and conclusion section.

This study contributes to the growing body of applied deep learning research where techniques to neuroimaging data and brain disorders are gaining an increased attention in detection. By comprehensively evaluating the current landscape of deep learning models, we aim to foster advancements in this critical area of neuroscience and ultimately aid in the early diagnosis and management of brain disorders.

2. Background on Brain Disorders and Their Significance

Neurological conditions encompass a broad spectrum of disorders that significantly influence the regular functioning and growth of brain cells. These conditions have profound implications for individuals, families, and societies at large. Among these, Alzheimer’s disease (AD) stands out, presenting a particularly pressing public health concern due to its widespread prevalence, profound impact on cognitive function, and the absence of a definitive treatment [8] [9] . AD, characterized by a progressive degeneration of brain function, leads to manifestations like memory loss, cognitive decline, and behavioral alterations. This imposes a substantial burden on individuals, caregivers, and healthcare systems on a global scale. Brain disorders are pervasive and globally affect millions of people. The World Health Organization (WHO) has highlighted that neurological disorders contribute to a substantial proportion of the global disease burden, accounting for approximately 13% of all deaths worldwide [8] . The effects of brain disorders significantly affect the lives of individuals which in the end hampers their day-to-day activities and individuals’ participation in society.

With the challenges that this comes with, it’s prudent that these brain disorders need to be accurately detected in a timely manner for an effective intervention, treatment, and progressive monitoring of these diseases to be done. Relying on traditional diagnostic methods which rely on clinical examination where specialistic doctors/personnel physically examine the brain scan MRI or PET images. These methods end up being time wasting and more reliant on the expert professionals whose decisions may not be consistent and are prone to errors.

The recent emergence of deep learning a sub field of artificial intelligence is a big promise in the analysis of complex medical data which will improve on the accuracy of diagnosis [9] - [11] .

These deep learning models with their unique capabilities to study raw data and draw insights form them have changed the space of medical image analysis. With its capabilities to accurately detect tumors and lesions [9] with high performance deep learning models like CNN [12] , show a great future to the more complex discoveries in the medical image diagnosis.

Using large datasets of brain MRI and CT scans, deep learning models have demonstrated capabilities of learning making it be able to identify underlying variations in different scans [10] to make decision that exposes some image to have a given complication at an early stage [11] which helps to improve on the treatment of such conditions at an early stage.

Outside the brain tumors and other associated cancerous growth detection in the brain, a research by [13] has deployed deep learning specifically when analyzing EEG Signals in detecting conditions such as epilepsy and other associated sleep disorders and has showed a significantly higher level of performance.

In relation to Alzheimer’s disease and dementia, very key biomarkers have been identified using feature engineering techniques as a result from the analysis of multiple brain scan images [14] with variations which can be distinguished from the normal brain functionality, AD can be accurately detected at various levels of progression.

Overall, deep learning’s contributions extend to predictive models that estimate disease progression and treatment outcomes based on patient data [15] . This assists medical professionals in devising personalized treatment plans for patients with brain disorders.

The potential of deep learning also extends to drug development for brain disorders by analyzing biological data, predicting drug interactions, and identifying potential therapeutic targets [16] .

3. Applied Deep Learning and Its Application in Brain Disorder Detection 3.1. Deep Learning for Brain Disorder Detection

With the promise of deep learning in the analysis of brain imagery data significantly aiding in the detection, diagnosis, and potential solution to the treatment of brain disorders all relying on the power of deep learning, researchers, and clinicians with a great intent in enhancing the accuracy, efficiency, and objectivity of brain disorder detection [17] . Deep learning models excel at processing large volumes of brain scan image data, extracting meaningful features that a human eye may not ordinarily be able to identify.

Various architectures have been deployed to detect brain disorder, each with its own advantages and suitability for specific tasks. [18] deployed Convolutional Neural Networks (CNNs) in analyzing neurological imagery data in form of magnetic resonance imaging (MRI) and or positron emission tomography (PET) to uniquely detect brain disorders. Great success has been attained when capturing spatial patterns and has demonstrated impressive performance while detecting and segmenting brain tumors [19] . For brain disorders manifesting in 3D volumetric data, such as 3D MRI scans, 3D CNNs are used to learn spatial features and patterns, aiding in the detection of abnormalities [20] .

On the contrary, Recurrent Neural Networks (RNNs) have shown their effectiveness at modeling temporal dependencies, making them suitable for tasks that involve sequential data, such as time series analysis of brain signals [21] . With its ability in capturing temporal dependencies in brain signals, RNN’s have been applied for epilepsy detection and prediction of neurological disorders [22] . LSTMs are a type of RNN that can capture long-range dependencies in sequential data and have been used in brain disorder detection tasks like in the detection of Alzheimer’s disease (AD) using time-series datasets [23] .

Attention Mechanisms and Transformer models have shown relevance in capturing unique features from neuroimaging data and have been applied to tasks like detecting lesion and classification of diseases [24] [25] . They help in focusing on specific regions of brain images that are most relevant for detection, reducing the influence of irrelevant information and improving model accuracy [26] .

Autoencoders have been employed for feature extraction and dimensional reduction in brain imaging data, aiding in the identification of relevant features and anomalies [27] .

3.2. Pretraining and Transfer Learning in Deep Learning

Using large scale datasets like such as ImageNet in pretraining deep learning models and then fine-tuning them for specific brain disorder detection tasks has become a common practice [28] . This approach, known as transfer learning, leverages the learned features from pretrained models, which can accelerate training and improve performance, especially when the labeled data is limited [29] [30] . By adapting pretrained models to brain imaging data, transfer learning facilitates the development of robust and accurate deep learning models for brain disorder detection.

Whereas transfer learning helps in scenarios of data paucity, other techniques like data augmentation have been adopted where data is artificially expanded on the training dataset to increase its diversity. Data Augmentation has found a vital role in brain disorder detection as it has been used in cases where data may be imbalanced and as such has limited labeled data. Common data augmentation techniques include rotations, translations, flips, and elastic deformations [31] [32] . These techniques help to enhance the generalization ability of deep learning models and improve their performance on brain imaging datasets.

3.3. Interpretability and Explainability in Deep Learning for Brain Disorder Detection

Whereas deep learning models are known to have issues with Interpretability and explainability, it’s important to understand the processes that originate when deep learning models are making decisions if clinicians’ trust must be obtained to facilitate a smooth clinical adoption. Various methods, such as saliency maps, gradient-based attribution, and attention mechanisms, have been proposed to interpret and explain deep learning models predictive decisions.

4. Preprocessing and Feature Extraction 4.1. Preprocessing

Preprocessing is a core step that initiates data analysis and machine learning. It transforms raw data into information which can be used in developing predictive algorithms based on extracted features from these datasets. If a systematic process is followed to clean, transform, and select relevant feature domains from this data, hidden patterns in this data can be explored for an informative decision-making process. Various image preprocessing and feature extraction techniques and methods will be explored during this study. Several preprocessing techniques have been adopted and continue to play a critical role in accurately detecting other brain disorders. This study explores the common techniques used in preprocessing and feature extraction.

Skull stripping, also known as brain extraction, is a fundamental neuroimaging data preprocessing stage. It involves separating brain tissue from non-brain tissues for a given MRI scans. Non-brain tissue can be the skull, scalp, extracranial tissues, and other brain tissues with intent of isolating and focusing on the brain region of interest. Accurate skull stripping is crucial for subsequent analyses, such as brain segmentation, registration, and volumetric measurements.

Amongst the common skull stripping techniques used are Brain Extraction Tool (BET) which is a widely used and popular algorithm for skull stripping in neuroimaging. Being part of the FSL software suite, BET estimates brain tissue boundary and models out the actual brain tissue excluding other non-brain regions. [33] describes the BET as one that employs a principle of deformable surface modelling on an iterative deformed 3D surface mesh to properly fit the brain edges.

Other applications which perform the same role include FreeSurfer, a software used for structural MRI analysis. [34] describes FreeSurfer software as one that uses both intensity thresholding and surface-based modelling to separate non-brain tissue in each image generating a resultant high quality brain mask. Other notable applications like ROBEX (Robust Brain Extraction) [35] , ANTs herein as Advanced Normalization Tools [36] have found a wide applicability in skull stripping.

The act of image registration is a technique of wrapping some images to align their features in a perfect reference plane. This is usually done to ensure spatial correspondence amongst the images being preprocessed for group level analysis and comparisons to be done. Several methods have been used. Among the popular ones are FSL (FMRIB Software Library) FLIRT which according to [37] , FSL uses a transformed model to align a given image to a central reference plane for structural and functional neuroimages.

ANTs (herein as Advanced Normalization Tool) performs both skull stripping and image registrations tasks with a reach library with embedded algorithms like symmetric diffeomorphic registration. Because of its highly advanced image registration capabilities, ANT has found itself widely taken as an image normalization tool of choice in image registration [38] . Other notable applications like SPM (Statistical Parametric Mapping) which according to [39] is used for analyzing brain imaging, Elastix being a command line based software contains a wide range of transformation techniques like rigid, affine, and non-rigid enabling maximum customization of its features thus making it a dynamic tool for image registration [40] and finally ITK (Insight Segmentation and Registration Toolkit) an open source software package has found itself being widely adopted for medical image analysis by most of the research communities. This versatile nature according to [41] makes ITK a software package of choice in research.

The data collection process is prone to bring bias which can affect the overall quality of the data. The bias field correction technique is deployed to eliminate varying biases in the datasets specifically neuroimaging datasets, these inconsistences often corrected by shading off artifacts originate because of the imaging techniques used which can be biased by the pre-imaging clinical examinations miss directing the image acquisition process. Several methods have been deployed to correct these biases which include amongst others N3 (Non-parametric Non-uniform Normalization) Algorithm which according to use for bias correction in cases where the field is represented as a smooth spatial function. According to [42] , this bias correction technique has found a huge adoption rate in circumstances where images have features which may have inconsistencies that make them not clear for analysis. Other normalization techniques used in this area include algorithms like Polynomial Approximation with Linear Combinations of Exponentials (PALM) and non-parametric methods with specific algorithms like the GradWarp algorithm. Depending on the imaging data used and the study objectives, researchers can opt for a specific bias technique.

Intensity normalization is a preprocessing technique used to scale the image intensities of medical images, such as neuroimaging data from MRI (Magnetic Resonance Imaging) scans. The purpose of intensity normalization is to bring the image intensities into a standardized range, making the images comparable across different subjects or imaging sessions. This is particularly important in neuroimaging studies, where consistent intensity scaling is essential for accurate and reliable quantitative analyses.

To normalize images in such a manner that their mean and standard deviation give a zero, normalization technique of z-score has been widely adopted. [43] explains that as opposed to other normalization techniques like mini max and percentile normalization techniques, z-score transforms image pixels intensities to achieve a z-score with a mean and standard deviation of zero which method may become a method of choice depending on the characteristics an image possesses, and the requirements of a given research. Othe techniques like Motion Correction, Spatial Smoothing, Voxel-based Morphometry (VBM), Standard Space Normalization, me Series Preprocessing (fMRI) amongst others have been widely chosen for processing neuroimaging datasets. A summary of the key references has been outlined in Table 1 .

Table 1 <xref ref-type="bibr" rid="scirp.141412-"></xref>Table 1. Summary of feature extraction techniques.

Method	Summary	References
Principle Component Analysis (PCA)	PCA is a dimensional reduction technique used to identify key features from a dataset, achieving significant performance improvements in various studies cited.	[44] - [48]
Linear Discriminant Analysis (LDA)	LDA a dimensional reduction technique often used for classification problems was combined with PCA for preprocessing ECG data enhancing classification performance. Neural Networks were integrated for further improvements.	[47]
Convolutional Neural Networks (CNN)	CNNs are used in computer vision to automatically learn hierarchical features from images, with activations of intermediate layers serving as meaningful features for various tasks. These methods are part of the broader set of feature extraction techniques, with the choice depending on the data type and specific analysis of the model tasks.	[48] – [51]

4.2. Feature Extraction Methods

Several studies have deployed various feature extraction methods in classifying brain tissue.

PCA is a technique in dimensionality reduction that identifies key features (principal components) from a dataset and narrows down the data onto a lower-dimensional space while preserving as much variance as possible.

Studies [44] - [48] have deployed PCA feature extraction techniques and have achieved a significant performance.

LDA like PCA is a technique in dimensionality reduction, but it is often used in the context of classification problems. It targets projections with maximum separation amongst different class bounds.

[47] combined PCA and LDA to preprocess electroencephalography (EEG) data for brain activity classification. Neural Networks were added into PCA and LDA techniques for improved performance of the classification.

4.3. Convolutional Neural Networks (CNN) Features

Table 2 <xref ref-type="bibr" rid="scirp.141412-"></xref>Table 2. Shows various categories of feature extraction with corresponding details.

Category

Technique

Description

Mathematical

Representation

Convolutional

Layers

Convolution

Apply filters to scan data in spatial and

temporal dimensions.

∗(x) = ∑(w × x) + b

*(x) = ∑(w × x) + b

∗(x)=∑(w × x) + b

Activation Functions (ReLU)

Introduce non-linearity.

f(x) = max (0, x)

Activation Functions

(Sigmoid)

Output probabilities between 0 and 1.

f(x) = 11+e−x

f(x) = 1/[1 + e^{−x}]

f(x) = 1 + e^{−x}

Activation Functions (Tanh)

Scale outputs between −1 and 1.

f(x) = 2/(1 + e − 2x)−1

f(x) = 2/(1 + e^{−2x}) – 1

f(x) = 2/(1 + e − 2x) − 1

Batch Normalization

Stabilize training by normalizing layer

outputs.

BN(x) = γx−μσ + β

BN(x) =γ{x − μ}/{σ + β}

BN(x) = γσx − μ + β

Pooling

Down sample feature maps while

retaining essential information.

P(x) = max(x)

Recurrent Layers

RNN’s

Handle sequential data by maintaining an

internal state.

H_t = f (ht − 1, x_t)

h_t = f (h_{t − 1}, x_t)

h_t = f (ht − 1, x_t)

LSTM

Control information flow using gates and memory cells.

C_t = ft⋅ct – 1 + it⋅gt

c_t = f_t . c_{t − 1} + i_t . g_t

c_t = ft⋅ct − 1 + it⋅g_t

Data Augmentation

Random Rotation

Simulate orientation variations by

rotating images.

x' = R(x)

Flipping

Simulate reflections by flipping images

horizontally or vertically.

x' = F(x)

Scaling

Simulate different resolutions by

resizing images.

x' = S(x)

Normalization

Intensity Normalization

Standardize pixel intensities to

minimize lighting effects.

x' = x − μσ

x' = {x −μ}/σ

x' = σx − μ

Spatial Normalization

Ensure consistent image sizes and aspect

ratios.

x' = S(x)

In computer vision, CNNs are often used to automatically learn hierarchical features from images. The activations of intermediate layers can serve as meaningful features for various tasks. The pivotal role CNN has shown in computer vision [48] with variants like MobileNet [49] and EfficientNet [50] and AlexNet [51] have led the adoption of CNN in image classification and objection detection tasks as well as in segmentation tasks.

CNNs are composed of multiple convolutional layers, pooling layers, and fully connected layers. The convolutional layers apply filters to the input data, scanning the data in both spatial and temporal dimensions.

Table 1 shows the references that discuss the various studies which have used various feature extraction techniques while Table 2 shows a detailed summary of the various feature extraction methods.

4.4. Other Feature Extraction Techniques

The following techniques have been widely adopted for feature extraction and a great success has been realized.

Wavelet transformation as applied in both signal and image processing, decomposes data into different frequency components, which frequency components will act as features for numerous analysis tasks.

Wavelet transformation has been applied in cancer detection [52] , image compression [53] , EEG Signal Analysis [54] , audio signal processing [55] and fault detection in machinery [56] and a great success has been achieved.

HOG is a feature extraction method for object detection and image recognition. It computes the distribution of gradient orientations in an image, which is used to capture the shape and texture of objects.

HOG has been deployed with a great performance attained in various aspects of object detection and recognition [57] pedestrian detection [58] , face detection [59] , traffic sign recognition [60] , human action recognition [61] , and hand gesture recognition [62] .

LBP is used in image analysis for texture classification. It encodes the relationship between the intensity values of a pixel and its neighbors, capturing texture information.

LBP has found applications in Face recognition [63] , texture classification [64] , object detection [65] , face analysis [66] , texture segmentation [67] and image retrieval [68] achieving an immense performance in specific tasks.

5. Considerations for Preprocessing and Feature Extraction

Preprocessing and feature extraction are pivotal stages in reading text data for machine learning and natural language processing (NLP) tasks. In preprocessing, the aim is to cleanse the text, rendering it more amenable to analysis. This process usually encompasses text cleaning, where superfluous special characters, symbols, and formatting are expunged to distill the text to its essence. For instance, when working with web data, it’s essential to manage HTML tags effectively [69] . Tokenization, the act of breaking text into discrete tokens like words or sub words, establishes a structured foundation for analysis. This choice between word-level and subworld-level tokenization hinges on the task and data characteristics at hand.

Lowercasing is a common practice to ensure consistency in text representation. Nevertheless, instances where capitalization bears significance, such as named entities, merit careful consideration. The culling of stop words—common words like “and” or “the” that hold little semantic value—can also enhance data quality. Although, in some cases, retaining stop words might be pertinent to the task’s objectives. Stemming and lemmatization are methods employed to whittle down words to their base or root forms. This not only reduces vocabulary size but also harmonizes variants of words.

Managing contractions is another facet to contemplate. While converting contractions to their expanded forms (“don’t” to “do not”) may be advantageous, it hinges on the language and context of analysis. Noise reduction, involving the excision of extraneous elements like URLs or email addresses, contributes to cleaner data. Moreover, deciding how to handle numbers—replacing them, normalizing them, or retaining them—depends on their relevance to the analysis [69] .

Turning to feature extraction, Bag-of-Words (BoW) constructs a representation by tabulating word frequencies. Augmenting Bow with n-grams—successive sequences of words—can amplify contextual understanding. Term Frequency-Inverse Document Frequency (TF-IDF) bestows words with weights based on their document frequency and corpus-wide importance, an asset for delineating word significance [70] . Word embeddings, such as those hewn from pre-trained models like Word2Vec and GloVe, map words to vectors in a continuous space, encapsulating semantic relationships.

Alternatively, contextual embeddings, stemming from models like BERT and GPT, furnish embeddings that are contingent on a word’s context within a sentence or document, allowing for nuanced comprehension. Scaling numerical features ensures comparability among them, potentially heightening model performance. Meanwhile, dimensionality reduction techniques like Principal Component Analysis (PCA) or t-SNE can be instrumental in managing high-dimensional feature spaces. The incorporation of domain-specific features, like linguistic attributes or external data, can also elevate model efficacy [71] .

For longer documents, segmentation or summarization may be judicious, preserving salient information without inundating the model. These considerations underscore the intricate interplay between preprocessing and feature extraction, both of which must align seamlessly with the specific goals of the NLP task at hand. Rigorous experimentation is pivotal in ascertaining the optimal approaches that harmonize with the dataset’s characteristics and desired outcomes.

6. Training and Evaluation

Training and evaluation are pivotal phases in model development. It covers aspects of a predictive model development and the mechanism of assessing its performance. [72] emphasis that these phases are central across various domains and tasks, ranging from image classification to natural language processing.

6.1. Training

Model training is a process by which a portion of data is used to make the model learn from. The model learns patterns and relationships from a labeled dataset to make decisions (predictions) when exposed to new unseen datasets. The dataset is usually divided into two portions of either 80% training and 20% validation datasets or 75%: 20% respectively depending on the volume of available data. During the model building phase, the model is made to learn from the training dataset for purposes of learning while the validation set is used in the process known as hyperparameter tuning.

[71] introduced AlexNet, a DNN model having a network of five convolutional layers and three fully connected layers. AlexNet demonstrated great depth as compared to past models with a combination of over 60 million parameters. It makes use of ImageNet dataset which consists of 1,000 categories of 1.2 million high-res images, marking a substantial dataset improvement compared to past studies. Th key performance measure used is image classification accuracy which reduced the top-5 error rate to 15.3%. The AlexNet’s model success gave rise to a wide scale adoption of deep learning in computer vision, with a great possibility of extracting hierarchical features from raw images. The paper’s adoption of parallel processing, regularization techniques, and model visualization gave the researcher great strength, and its wide scale adoption came when AlexNet model was made Open source. During model training, the following steps are involved.

Clean and preprocess the dataset by applying the necessary transformations, such as text preprocessing or image normalization, to ensure consistent and well-formatted data. With data preparation being a foundational step in data science for purposes of ensuring quality and usability datasets, pandas’ official documentation by [71] offers a detailed guide in data preparation using python. This discussion is extended further by the authors distinguished works in python for data analysis. [73] in their publication on the art of data cleaning gives a detailed look at strategies to uncover challenges related to data quality.

Represent the data in a suitable format for the chosen algorithm. For text data, this could involve vectorization using techniques like TF-IDF or word embeddings. In image data, feature extraction might involve resizing, cropping, and color normalization. As much as feature engineering is an independent process in machine learning, it borrows a lot of aspects in relation to data preprocessing. [74] in the publication, “feature engineering for machine learning” dives to how features engineering gives strength into areas of data preparation.

Choose an appropriate algorithm or architecture based on the nature of the problem. For instance, CNNs which are often used for image-related tasks, RNNs used for sequence data tasks like textual datasets amongst other models depending on the nature of the available data. For simpler tasks, linear regression models can be adopted as discussed by [74] and for complex tasks like that used in ImageNet classification, deep Convolutional Neural Networks as discussed by [75] can be adopted.

Training dataset is fed into the chosen model and to minimize a chosen loss function, model parameters are iteratively being updated. The optimization process typically involves backpropagation and gradient descent with a notable stochastic Gradient Descent Algorithm [76] as used in the training of mostly Neural Network (NN) models amongst other models. During model training, it’s essential to select an appropriate loss function to determine the variation between the actual and predicted values. [77] discussed a popular loss function known as log-loss function (logistic regression). For regression tests, the Mean Square Error (MSE) loss function has been used to quantify the variation between the actual numerical value and the predicted value.

Hyperparameters such as learning rate, batch size, and network architecture are adjusted to optimize the model’s performance on the validation set.

Bayesian Optimization techniques have been widely adopted for hyperparameter tunning processes [78] . [79] introduces the grid search optimization technique where a range of values have been specified for each hyperparameter test and at each instance, the model is independently validated.

6.2. Evaluation

Model evaluation phase examines the model’s performance on new dataset to gauge how effective the model is in solving the desired goal and as well on its generalization ability. The evaluation process includes:

In model training, datasets are split into training and test datasets normally with either 75% to 25% or 80% to 20% proportions depending on the volume of the available data. A portion of the dataset (distinct from the training and validation datasets) is what is reserved as a test dataset. This data is used to simulate how the model will perform on real-world, unseen examples. [78] looks at data being split into the training dataset which seeks to make the model learn based on the features exposed to it, validation set examines model to make early assessments on model performance in one way fine tuning the hyperparameters.

Figure 1 Figure 1. Model training and evaluation process.

Model Evaluation is a process of testing how well a trained model can accurately be able to perform its set task. Various metrics are used for model evaluation, and they depend on the nature of the problem being addressed. For classification tasks, metrics might include accuracy, precision, recall, F1-score, and ROC-AUC used specifically in cases where the dataset is imbalanced. [80] proposed using precision and recall while for regression tasks, metrics like Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) might be used with the intent of gauging the quality of the prediction. All these metrics are deployed with intent of identifying how well the model performs in real-world scenarios.

Model evaluation is used with a core aim of detecting overfitting where cross-validation techniques have been adopted as a means of reducing overfitting as discussed by [78] . It’s important to choose the right evaluation techniques depending on the data being used. For cases of MRI Brain Imagery for studies aimed towards the detection of Alzheimer’s Disease, recall can be used as opposed to accuracy given the false negatives and missed positives associated which are always associated with medical diagnosis.

The entire process of model training from the dataset creation to the final stage of trained model evaluation is shown on Figure 1 .

6.3. Visualizations

Visuals have been used to create a deeper understanding on the relationship between various features used in model training. Visualizations like confusion matrices, ROC curves, and precision-recall curves, provide a more comprehensive understanding of the model’s behavior.

In both evaluation and interpretation, iterative processes might be necessary. Models can be fine-tuned, hyperparameters adjusted, and preprocessing techniques optimized to achieve the best possible performance. Regular validation of models on new and diverse datasets helps ensure robustness and reliability in various scenarios. [81] discusses key visualization techniques used for model selection. [80] discusses advanced visualization techniques which are used for visualizing high dimensional data in lower dimensional spaces with methods such as Principal Component Analysis (PCA) and t-Distributed Stochastic Neighbor Embedding (t-SNE).

7. Discussions

Deep learning models have shown tremendous promise in various clinical applications, including medical imaging analysis, disease diagnosis, personalized medicine, and clinical decision support systems. These models have achieved high accuracy in detecting breast cancer, diagnosing diabetic retinopathy, predicting cardiovascular disease risk, and personalizing cancer treatment plans. Additionally, they have been used to predict patient outcomes, such as mortality and readmission, and provide healthcare professionals with real-time, data-driven insights. However, despite their promise, deep learning models in clinical applications face several challenges. Clinical data is often noisy, heterogeneous, and limited, while regulatory concerns surround data privacy, security, and compliance. Deep learning models require thorough clinical validation, interpretability, and integration with existing workflows to ensure seamless adoption and utilization. Furthermore, addressing bias and variability, ensuring explainability and transparency, and mitigating cybersecurity concerns are crucial to the successful implementation of deep learning models in clinical applications.

8. Conclusions

In conclusion, the field of analyzing current deep learning models for brain disorder detection through neuroimaging data holds immense promise and potential. The methods and insights garnered from these models have the capacity to revolutionize the diagnosis and understanding of various neurological disorders to which Alzheimer’s Disease falls. Very complex patterns are being extracted from neuroimaging datasets using deep learning architectures like CNN’s, RNN’s and transformer-based models.

Amalgamating diverse neuroimaging modalities, including fMRI, PET, DTI, offers a comprehensive view of brain activity, connectivity, and structure. This multidimensional approach enables the models to capture nuanced relationships and anomalies associated with neurological disorders, ranging from Alzheimer’s disease and schizophrenia to epilepsy and autism spectrum disorders.

Furthermore, the integration of transfer learning and pre-trained embeddings has expedited the training process and enhanced model generalization. These techniques enable the models to capitalize on knowledge acquired from large-scale datasets, facilitating the adaptation of learned features to specific brain disorder detection tasks. Moreover, the interpretability of deep learning models is gaining prominence, with efforts to elucidate the rationale behind the decisions made by these complex architectures. This not only engenders trust but also enriches the medical community’s understanding of the disorders themselves.

Nonetheless, challenges persist, including the scarcity of labeled neuroimaging data, the need for robust validation strategies, and concerns about model transparency in clinical decision-making. As the field advances, collaboration between researchers, clinicians, and domain experts is paramount to ensure the ethical and responsible deployment of the image detection models for use in medical practice.

Constant exploration of deep learning models for brain disorder detection marks an exciting juncture in the realm of medical diagnostics. The convergence of technological innovation, sophisticated algorithms, and rich neuroimaging data has the potential to reshape how we comprehend, diagnose, and treat neurological disorders, thereby improving the lives of countless individuals. As this journey unfolds, continued research and vigilance will be essential to harness the full potential of these models while navigating the intricate ethical and practical considerations that come with their integration into the medical landscape.

In conclusion, the exploration of current deep learning models for brain disorder detection using neuroimaging data represents a frontier of immense promise and potential. The methodologies and insights derived from these models hold the power to revolutionize the diagnosis and comprehension of a spectrum of neurological disorders, including Alzheimer’s Disease. These models extract highly complex patterns from neuroimaging datasets, employing architectures such as CNNs, RNNs, and transformer-based models.

The integration of various neuroimaging modalities, encompassing fMRI, PET, and DTI, provides a comprehensive perspective on brain activity, connectivity, and structure. This multidimensional approach empowers the models to capture subtle relationships and anomalies associated with a diverse array of neurological disorders, ranging from Alzheimer’s disease and schizophrenia to epilepsy and autism spectrum disorders.

Additionally, the incorporation of transfer learning and pre-trained embeddings has expedited the training process and bolstered model generalization. These techniques enable the models to improve insights gleaned from expansive datasets, facilitating the adaptation of learned features for specific brain disorder detection tasks. Furthermore, the pursuit of interpretability in deep learning models is gaining traction, with concerted efforts to illuminate the reasoning behind the decisions made by these intricate architectures. This not only fosters trust but also enriches the medical community’s understanding of the disorders themselves.

Nevertheless, challenges persist, including the scarcity of labeled neuroimaging data, the imperative for robust validation strategies, and concerns regarding model transparency in clinical decision-making. As the field advances, collaborative efforts among researchers, clinicians, and domain experts are imperative to ensure the ethical and responsible deployment of image detection models in medical practice.

The ongoing exploration of deep learning models for brain disorder detection signifies an exciting juncture in the realm of medical diagnostics. The confluence of technological innovation, sophisticated algorithms, and rich neuroimaging data has the potential to reshape how we perceive, diagnose, and treat neurological disorders, ultimately enhancing the lives of countless individuals. As this journey continues, sustained research and vigilance will be paramount to harness the full potential of these models while navigating the intricate ethical and practical considerations inherent in their integration into the medical landscape.

Data Availability Statement

The data used to support the findings of this study are publicly available from open-source publications collected from google scholar and other publication databases. All papers used are cited and are added in the reference section.

References 1

World Health Organization (2019) Global Action Plan on Physical Activity 2018-2030: More Active People for A Healthier World.

Muller, A.E., Hafstad, E.V., Himmels, J.P.W., Smedslund, G., Flottorp, S., Stensland, S.Ø., et al. (2020) The Mental Health Impact of the COVID-19 Pandemic on Healthcare Workers, and Interventions to Help Them: A Rapid Systematic Review. Psychiatry Research, 293, Article 113441. >https://Doi.Org/10.1016/J.Psychres.2020.113441

Baig, M.A., Klein, J.P. and Mechtler, L.L. (2016) Imaging of Brain Tumors. CONTINUUM: Lifelong Learning in Neurology, 22, 1529-1552. >https://doi.org/10.1212/con.0000000000000388

Larson, D.B. and Shear, T.D. (2016) Radiologist as Consultant: A History of Interpreting Images and the Practice of Medicine. Radiology, 280, 622-628.

Arbabshirani, M.R., Plis, S., Sui, J. and Calhoun, V.D. (2017) Single Subject Prediction of Brain Disorders in Neuroimaging: Promises and Pitfalls. NeuroImage, 145, 137-165. >https://Doi.Org/10.1016/J.Neuroimage.2016.02.079

Le Bihan, D. (2014) Diffusion MRI: What Water Tells Us about the Brain. EMBO Molecular Medicine, 6, 569-573.

Esteva, A., Robicquet, A., Ramsundar, B., Kuleshov, V., DePristo, M., Chou, K., et al. (2019) A Guide to Deep Learning in Healthcare. Nature Medicine, 25, 24-29. >https://Doi.Org/10.1038/S41591-018-0316-Z

World Health Organization (2021) Global Health Estimates 2020: Deaths by Cause, Age, Sex, by Country and by Region, 2000-2019. World Health Organization. >https://Www.Who.Int/Data/Gho/Data/Themes/Mortality-And-Global-Health-Estimates/Ghe-Leading-Causes-Of-Death

Gordillo, N., Montseny, E. and Sobrevilla, P. (2013) State of the Art Survey on MRI Brain Tumor Segmentation. Magnetic Resonance Imaging, 31, 1426-1438. >https://doi.org/10.1016/j.mri.2013.05.002

Akkus, Z., Galimzianova, A., Hoogi, A., Rubin, D.L. and Erickson, B.J. (2017) Deep Learning for Brain MRI Segmentation: State of the Art and Future Directions. Journal of Digital Imaging, 30, 449-459. >https://Doi.Org/10.1007/S10278-017-9983-4

Cheng, J.-Z., Ni, D., Chou, Y.-H., Qin, J., Tiu, C.-M., Chang, Y.-C., et al. (2016) Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans. Scientific Reports, 6, Article No. 24454. >https://Doi.Org/10.1038/Srep24454

Razzak, M.I., Naz, S. and Zaib, A. (2018) Deep Learning for Medical Image Processing: Overview, Challenges and the Future. In: Dey, N., Ashour, A., Borra, S., Eds., Classification in BioApps. Lecture Notes in Computational Vision and Biomechanics, Springer, Cham, 323-350. >https://doi.org/10.1007/978-3-319-65981-7_12

Amin, H.U., Yusoff, M.Z. and Ahmad, R.F. (2020) A Novel Approach Based on Wavelet Analysis and Arithmetic Coding for Automated Detection and Diagnosis of Epileptic Seizure in EEG Signals Using Machine Learning Techniques. Biomedical Signal Processing and Control, 56, 101707. >https://doi.org/10.1016/j.bspc.2019.101707

Sarraf, S. and Tofighi, G. (2016) Deep Learning-Based Pipeline to Recognize Alzheimer’s Disease Using FMRI Data. Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), San Francisco, 6-7 December 2016, 18-26.

Korolev, S., Safiullin, A., Belyaev, M. and Dodonova, Y. (2017) Deep Learning for Predicting Disease Progression in Alzheimer’s Disease: A Model Comparison Study. PLOS ONE, 12, E0171527.

Aliper, A., Plis, S., Artemov, A., Ulloa, A., Mamoshina, P. and Zhavoronkov, A. (2016) Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data. Molecular Pharmaceutics, 13, 2524-2530. >https://Doi.Org/10.1021/Acs.Molpharmaceut.6b00248

Zhang, J., Chang, S., Zhan, X., Zheng, W. and Yang, R. (2017) Deep Learning in the Detection of Brain Disorders: A Survey. IEEE Access, 5, 15103-15121.

Litjens, G., Kooi, T., Bejnordi, B.E., Setio, A.A.A., Ciompi, F., Ghafoorian, M., et al. (2017) A Survey on Deep Learning in Medical Image Analysis. Medical Image Analysis, 42, 60-88. >https://Doi.Org/10.1016/J.Media.2017.07.005

Havaei, M., Davy, A., Warde-Farley, D., Biard, A., Courville, A., Bengio, Y., et al. (2017) Brain Tumor Segmentation with Deep Neural Networks. Medical Image Analysis, 35, 18-31. >https://Doi.Org/10.1016/J.Media.2016.05.004

Khvostikov, A., Aderghal, K., Benois-Pineau, J., Krylov, A. and Catheline, G. (2018) 3D CNN-Based Classification Using sMRI and MD-DTI Images for Alzheimer Disease Studies. arXiv preprint arXiv: 1801.05968.

Pham, V.H., Kim, J. and Kwon, H.J. (2020) A Survey on Deep Learning in Medical Signal Analysis: A Focus on Epilepsy Detection. Computer Methods and Programs in Biomedicine, 187, Article 104961.

Acharya, U.R., Oh, S.L., Hagiwara, Y., Tan, J.H. and Adeli, H. (2018) Deep Convolutional Neural Network for the Automated Detection and Diagnosis of Seizure Using EEG Signals. Computers in Biology and Medicine, 100, 270-278. >https://Doi.Org/10.1016/J.Compbiomed.2017.09.017

Suk, H.I. and Shen, D. (2016) Deep Learning-Based Feature Representation for AD/MCI Classification. In: Mori, K., Sakuma, I., Sato, Y., Barillot, C. and Navab, N., Eds., Medical Image Computing and Computer-Assisted Intervention—MICCAI 2013, Springer, 583-590. >https://doi.org/10.1007/978-3-642-40763-5_72

Ghafoorian, M., Karssemeijer, N., Heskes, T. and Bergers, E. (2017) Location-Sensitive Deep Convolutional Neural Networks for Segmentation of White Matter Hyperintensities. Scientific Reports, 7, Article No. 5110.

Chen, H., Zhang, Y., Zhang, W., Lin, G., Gao, Y. and Wang, S. (2020) Dual-Stage Attention-Based Recurrent Neural Network for Time-Series Prediction of Alzheimer’s Disease. Neurocomputing, 384, 154-164.

Chen, C., Qin, C., Qiu, H.Q., Tarroni, G., Duan, J., Bai, W.J. and Rueckert, D. (2018) Deep Learning for Cardiac Image Segmentation: A Review. Frontiers in Cardiovascular Medicine, 7, Article 25.

Chen, L., Wu, Y., DSouza, A.M., Abidin, A.Z., Wismüller, A. and Xu, C. (2018) MRI Tumor Segmentation with Densely Connected 3D CNN. Medical Imaging 2018: Image Processing, 10574, 357-364.

Bengio, Y., Courville, A. and Vincent, P. (2013) Representation Learning: A Review and New Perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 1798-1828. >https://doi.org/10.1109/TPAMI.2013.50

Zhou, D., Bousquet, O., Lal, T.N., Weston, J. and Schölkopf, B. (2008) Learning with Local and Global Consistency. In: Thrun, S., Saul, L. and Schölkopf, B., Eds., Advances in Neural Information Processing Systems, MIT Press, 321-328.

Liu, W., Wang, Z., Liu, X. and Zeng, N. (2014) Transfer Learning for Brain-Computer Interfaces: A Euclidean Space Data Alignment Approach. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 22, 625-637.

Alzheimer’s Association (2021) 2021 Alzheimer’s Disease Facts and Figures. Alzheimer’s&Dementia, 17, 327-406. >https://doi.org/10.1002/alz.12328

Kamnitsas, K., Ledig, C., Newcombe, V.F.J., Simpson, J.P., Kane, A.D., Menon, D.K., Rueckert, D. and Glocker, B. (2017) Efficient Multi-Scale 3D CNN with Fully Connected CRF for Accurate Brain Lesion Segmentation. Medical Image Analysis, 36, 61-78. >https://doi.org/10.1016/j.media.2016.10.004

Smith, S.M. (2002) Fast Robust Automated Brain Extraction. Human Brain Mapping, 17, 143-155. >https://doi.org/10.1002/hbm.10062

Dale, A.M., Fischl, B. and Sereno, M.I. (1999) Cortical Surface-Based Analysis. I. Segmentation and Surface Reconstruction. NeuroImage, 9, 179-194. >https://doi.org/10.1006/nimg.1998.0395

Iglesias, J.E., Liu, C.Y., Thompson, P.M. and Tu, Z. (2011) Robust Brain Extraction Across Datasets and Comparison with Publicly Available Methods. IEEE Transactions on Medical Imaging, 30, 1617-1634. >https://doi.org/10.1109/TMI.2011.2138152

Mok, T.C.W. and Chung, A.C.S. (2020) Fast Symmetric Diffeomorphic Image Registration with Convolutional Neural Networks. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 4644-4653. >https://doi.org/10.1109/cvpr42600.2020.00470

Jenkinson, M., Bannister, P., Brady, M. and Smith, S. (2002) Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images. NeuroImage, 17, 825-841. >https://doi.org/10.1006/nimg.2002.1132

Avants, B.B., Epstein, C.L., Grossman, M. and Gee, J.C. (2008) Symmetric Diffeomorphic Image Registration with Cross-Correlation: Evaluating Automated Labeling of Elderly and Neurodegenerative Brain. Medical Image Analysis, 12, 26-41. >https://doi.org/10.1016/j.media.2007.06.004

Ashburner, J. and Friston, K.J. (2005) Unified Segmentation. NeuroImage, 26, 839-851. >https://doi.org/10.1016/j.neuroimage.2005.02.018

Klein, S., Staring, M., Murphy, K., Viergever, M.A. and Pluim, J.P. (2010) Elastix: A Toolbox for Intensity-Based Medical Image Registration. IEEE Transactions on Medical Imaging, 29, 196-205. >https://doi.org/10.1109/TMI.2009.2035616

Yoo, T.S. (2002) Insight into Images: Principles and Practice for Segmentation, Registration, and Image Analysis. CRC Press.

Sled, J.G., Zijdenbos, A.P. and Evans, A.C. (1998) A Nonparametric Method for Automatic Correction of Intensity Nonuniformity in MRI Data. IEEE Transactions on Medical Imaging, 17, 87-97. >https://doi.org/10.1109/42.668698

Jovicich, J., Czanner, S., Han, X., Salat, D., Van Der Kouwe, A., Quinn, B., et al. (2006) MRI-Derived Measurements of Human Subcortical, Ventricular and Intracranial Brain Volumes: Reliability Effects of Scan Sessions, Acquisition Sequences, Data Analyses, Scanner Upgrade, Scanner Vendors and Field Strengths. NeuroImage, 30, 827-841.

Nandi, D., Ashour, A.S., Samanta, S., Chakraborty, S., Salem, M.A.M. and Dey, N. (2015) Principal Component Analysis in Medical Image Processing: A Study. International Journal of Image Mining, 1, 65. >https://doi.org/10.1504/ijim.2015.070024

Stein, J.L., Medland, S.E., Vasquez, A.A., Hibar, D.P., Senstad, R.E., Winkler, A.M., et al. (2012) Identification of Common Variants Associated with Human Hippocampal and Intracranial Volumes. Nature Genetics, 44, 552-561. >https://doi.org/10.1038/ng.2250

Yeung, K.Y. and Ruzzo, W.L. (2001) Principal Component Analysis for Clustering Gene Expression Data. Bioinformatics, 17, 763-774. >https://doi.org/10.1093/bioinformatics/17.9.763

Hyvärinen, A. and Oja, E. (2000) Independent Component Analysis: Algorithms and Applications. Neural Networks, 13, 411-430. >https://doi.org/10.1016/s0893-6080(00)00026-5

Tan, P., Wang, X. and Wang, Y. (2020) Dimensionality Reduction in Evolutionary Algorithms-Based Feature Selection for Motor Imagery Brain-Computer Interface. Swarm and Evolutionary Computation, 52, 100597. >https://doi.org/10.1016/j.swevo.2019.100597

Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2017) Imagenet Classification with Deep Convolutional Neural Networks. Communications of the ACM, 60, 84-90. >https://doi.org/10.1145/3065386

Andrew, G. and Menglong, Z. (2017) Efficient Convolutional Neural Networks for Mobile Vision Applications. Mobilenets, 10, 151.

Tan, M. and Le, Q.V. (2019) Efficientnet: Rethinking Model Scaling for Convolutional Neural Networks. Proceedings of the 36th International Conference on Machine Learning, 97, 6105-6114.

Bennet, J., Arul Ganaprakasam, C. and Arputharaj, K. (2014) A Discrete Wavelet Based Feature Extraction and Hybrid Classification Technique for Microarray Data Analysis. The Scientific World Journal, 2014, 1-9. >https://doi.org/10.1155/2014/195470

Zhang, Y. and Zhang, Y.Q. (2019) Image Compression Using Deep Neural Networks: A Survey. IEEE Access, 7, 149292-149308.

Thakur, A., Kumar, A., Srivastava, A. and Sharma, P. (2019) A Novel Approach for Detection and Classification of EEG Signals Using Wavelet Transform and Deep Learning. Journal of Ambient Intelligence and Humanized Computing, 10, 987-1001.

Rivers, T.J., Benesty, J. and Chen, J. (2019) Signal Enhancement and Noise Reduction in Speech Processing Using Sparse Signal Representation. Springer.

Zhang, Y., Kuo, C.C.J. and Chen, P.C.Y. (2006) Fault Detection and Classification in Chemical Processes Using Ensemble of Decision Trees. Computers&Chemical Engineering, 30, 465-474.

Dalal, N. and Triggs, B. (2005) Histograms of Oriented Gradients for Human Detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), 1, 886-893. >https://doi.org/10.1109/CVPR.2005.177

Dalal, N. and Triggs, B. (2006) Human Detection Using Oriented Histograms of Flow and Appearance. In: Leonardis, A., Bischof, H. and Pinz, A., Eds., Computer Vision—ECCV 2006, Springer, 428-441. >https://doi.org/10.1007/11744047_33

Kasiviswanathan, H., Ramanathan, N. and Chellappa, R. (2011) A Robust Framework for Face Recognition. EURASIP Journal on Image and Video Processing, 1, 54.

Sivaraman, S. and Trivedi, M.M. (2012) Pedestrian Detection in Thermal Infrared Imagery Using Texture and Shape Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34, 1704-1716.

Laptev, I. (2008) On Space-Time Interest Points. International Journal of Computer Vision, 64, 107-123. >https://doi.org/10.1007/s11263-005-1838-7

Ojala, T., Pietikainen, M. and Maenpaa, T. (1994) Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24, 971-987. >https://doi.org/10.1109/TPAMI.2002.1017623

Heikkilä, M., Ahonen, T., Pietikäinen, M. and Schmid, C. (2009) Description of Interest Regions with Local Binary Patterns. Pattern Recognition, 42, 425-436. >https://doi.org/10.1016/j.patcog.2008.08.014

Jenicka, S. and Suruliandi, A. (2014) Fuzzy Texture Model and Support Vector Machine Hybridization for Land Cover Classification of Remotely Sensed Images. Journal of Applied Remote Sensing, 8, 083540. >https://doi.org/10.1117/1.jrs.8.083540

Aguero, H.P. (2022) Review of the Current Technologies and Applications of Digital Image Processing. Journal of Biomedical and Sustainable Healthcare Applications, 2, 148-158. >https://doi.org/10.53759/0088/jbsha202202016

Ahonen, T., Hadid, A. and Pietikäinen, M. (2006) Face Recognition with Local Binary Patterns. In: Pajdla, T. and Matas, J., Eds., Computer Vision—ECCV 2004, Springer, 469-481. >https://doi.org/10.1007/978-3-540-24670-1_36

Zhao, G., Ahonen, T. and Pietikainen, M. (2005) Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29, 915-928. >https://doi.org/10.1109/TPAMI.2007.1110

Song, K., Yan, Y., Chen, W. and Zhang, X. (2013) Research and Perspective on Local Binary Pattern. Acta Automatica Sinica, 39, 730-744. >https://doi.org/10.1016/s1874-1029(13)60051-8

Johnson, M. (2015) Text Analysis for the Social Sciences: Methods for Drawing Statistical Inferences from Texts and Transcripts. University of Chicago Press.

Jones, K.S. (2012) A Statistical Interpretation of Term Specificity and Its Application in Retrieval. Journal of Documentation, 28, 11-21. >https://doi.org/10.1108/eb026526

Krizhevsky, A., Sutskever, I. and Hinton, G.E. (2012) ImageNet Classification with Deep Convolutional Neural Networks. In: Pereira, F., Burges, C.J., Bottou, L. and, Weinberger, K.Q., Eds., Advances in Neural Information Processing Systems, Association for Computing Machinery, 84-90.

McKinney, W. (2011) Pandas: A Foundational Python Library for Data Analysis and Statistics. Python for High Performance and Scientific Computing, 1-9.

Dasu, T. and Johnson, T. (2003) Exploratory Data Mining and Data Cleaning. John Wiley&Sons. >https://doi.org/10.1002/0471448354

Khder, M. (2021) Web Scraping or Web Crawling: State of Art, Techniques, Approaches and Application. International Journal of Advances in Soft Computing and Its Applications, 13, 145-168. >https://doi.org/10.15849/ijasca.211128.11

Williams, M. (2019) Feature Engineering and Selection: A Practical Approach for Predictive Models. CRC Press.

Zheng, A. (2018) Feature Engineering for Machine Learning: Principles and Techniques for Data Scientists. O’Reilly Media.

Chakraborty, T. and Kumar, U. (2022) Loss Function. In: Daya Sagar, B.S., Cheng, Q., McKinley, J., Agterberg, F., Eds., Encyclopedia of Mathematical Geosciences. Encyclopedia of Earth Sciences Series. Springer, Cham, 1-6. >https://doi.org/10.1007/978-3-030-26050-7_187-1

Bottou, L. (2010) Large-Scale Machine Learning with Stochastic Gradient Descent. In: Lechevallier, Y. and Saporta, G., Eds., Proceedings of COMPSTAT’2010, Physica-Verlag HD, 177-186. >https://doi.org/10.1007/978-3-7908-2604-3_16

Bergstra, J. and Bengio, Y. (2012) Random Search for Hyper-Parameter Optimization. Journal of Machine Learning Research, 13, 281-305.

Du, L.X., Roy, S., Wang, P., Li, Z.G., Qiu, X.T., Zhang, Y., et al. (2024) Unveiling the Future: Advancements in MRI Imaging for Neurodegenerative Disorders. Ageing Research Reviews, 95, Article 102230. >https://doi.org/10.1016/j.arr.2024.102230

Wang, J., Wang, J., Wang, S. and Zhang, Y. (2023) Deep Learning in Pediatric Neuroimaging. Displays, 80, Article 102583. >https://doi.org/10.1016/j.displa.2023.102583