Introduction

Malaria is a life-threatening illness transmitted by female Anopheles mosquitoes that inject plasmodium parasites with one nasty bite. In most cases, plasmodium parasites target healthy red blood cells (RBC) around 1 or 2 weeks after their emergence in the human body [1]. This bacterial infection is hazardous to kids, persons with impaired immune systems, pregnant women, and the elderly at risk [2]. Falciparum malaria is particularly dangerous to pregnant women as it increases stillbirth, maternal death, miscarriage, and newborn [3]. The World Health Organization reported roughly 241 million malaria suspects and 627,000 fatalities in 2021. The African continent is perhaps the most afflicted, accounting for 95% of 90% of deaths and 80% of child disability caused by acute malaria [4, 5].

RBCs were microscopically examined in a thick, thin blood smear frequently used to identify malaria [6]. The thick and thin-smear test aids in identifying the density of parasites in a person's body and malaria species, respectively [7, 8]. Expert pathologists manually analyze blood smear films to get a microscopic diagnosis which is time-consuming, laborious, and unreliable [9]. Malaria patients are usually seen in emerging countries, where diagnostic lab facilities and tools are unavailable. In addition, a global shortage of trained professionals significantly impacts the healthcare systems of developing countries [10]. Therefore, a computer-based screening tool is essential for speedy and reliable malaria analysis [11].

Artificial intelligence (AI) and machine learning (ML) have aided in the development of malaria diagnostic methods that are effective and precise in processing large amounts of parasite-contaminated RBC samples [12,13,14]. Computer-based software will support clinicians in diagnosis and therapy, facilitating established lab practices [15,16,17]. However, this convention performed poorly on massive data and was incapable of learning complex patterns. Therefore, deep learning (DL) algorithms emerged and inspired researchers’ interest in dealing with enormous amounts of data and learning complicated patterns [18, 19]. They have significant growth for medical imaging infection diagnosis [20,21,22]. Malaria parasite analysis is crucial for diagnosing infected cells. In this regard, a deep CNN-based identification helps quickly and accurately analyze malaria parasite images. Several CNN-based classification frameworks and experimental models have extensively been employed on the NIH malaria dataset to improve detection [23, 24]. CNN extracts deep features automatically by avoiding the time-consuming hand-crafted feature extraction and reduces computational power [23, 25].

To our knowledge, this study is the first to introduce a new deep hybrid framework comprised of novel Deep residual and spatial blocks CNN and ensemble learning to analyze malaria parasite-afflicted patients accurately and efficiently. The proposed deep Boosted-BR-STM exploits the channel Squeezing and Boosting (SB) technique with a novel Split Transform Merge (STM) block. Moreover, the developed STM block of deep Boosted-BR-STM uses the concept of homogenous and heterogeneous. The significant contributions are as follows:

  1. 1.

    A new Deep Boosted and Ensemble Learning (DBEL) framework is proposed comprising a new residual learning based Boosted-BR-STM CNN and ensemble learning for detecting RBCs infected with the plasmodium falciparum using blood smear images. The dataset is initially enhanced and reduced in dimension using a discrete wavelet transform (DWT) to improve computational complexity.

  2. 2.

    The proposed deep Boosted-BR-STM exploits the channel SB technique with a novel STM block. Moreover, the developed STM block uses the concept of homogenous and heterogeneous.

  3. 3.

    The innovative SB concept is carefully integrated into the new STM block at abstract, medium, and conclusion levels effectively capturing the diverse pattern of homogeneous, heterogeneous, contrast, and textural variations of the parasitic cell. SB notion is utilized by merging reduced prominent channels with TL-based extracted additional feature maps to improve Boosted-BR-STM performances. In addition, to improve the learning capacity of Boosted-BR-STM and promote a more diverse representation of features, residual learning-based feature map boosting is achieved at the final stage through TL.

  4. 4.

    The proposed framework grants the inherent properties of diverse prominent and boosted channels to the discriminative feature level and fed to the ensemble of ML classifiers, improving the capability of discrimination and generalization of ensemble learning. The ML classifiers' ensemble effectively reduces feature dimension and improves diverse decision space. Ultimately, early detection reduces the possibility of permanent disabilities.

  5. 5.

    The proposed hybrid DBEL framework performance is compared to existing techniques utilized in NIH-malaria original and enhanced datasets.

The rest of the manuscript is arranged in the following ways. “Related work” and “Malaria parasite detection scheme”, the remainder of the article discusses related work and the proposed malaria detection framework. Materials and implementation details are provided in “Experimental configuration”. The performance analysis and outcomes are presented in “Results and discussion”. Finally, the article's conclusion is in “Conclusion”.

Related work

Several ML techniques were automatically determined in stained RBC samples employing methods over time [26]. ML algorithms such as Extreme-Learning Machine methods, Global White Balance, Logistic Regression, SVM, Adaptive Non-linear Grayscale, and KNN have enhanced classification performance [27, 28]. The malaria parasite-infected features like RBCs, color, size, form, and statistical data were extracted and provided to ML algorithms [29, 30]. Moreover, the deep belief network technique was used to classify the parasites, which yielded a 90.21% accuracy [31]. The approaches mentioned above, on the other hand, indicated prediction accuracy in the range of 84–94%. Several researchers used deep CNN-based detection that automatically and quickly identifies cells infected with the malaria parasite, achieving a 91.50% detection rate [31,32,33]. Medical datasets with labels are generally limited in size and thus unsuitable for practical application. Therefore, TL has been utilized in a limited labeled malaria dataset to forecast malaria parasites in RBC [32]. TL-based existing CNNs have been employed to improve and have attained accuracies of 75–94% [33,34,35]. Furthermore, R-CNN for parasite identification has been employed with considerable performance [36].

Previous studies employed VGG16 to identify malaria using the common NIH dataset and achieved 95.96% accuracy [37]. The balanced NIH dataset included 27,556 images and was resized to 224 by 224 pixels. A sequential-tailored CNN was obtained with a 95.90% F1 score, 94.7% sensitivity, and 92.70% accuracy [25]. The existing ResNet50 yielded a 95.40% accuracy using the malaria dataset [38]. Moreover, the custom-made CNN model demonstrated 96.82% accuracy, 96.33% using five convolutional and pooling layers, and 96.82% F1-score [23]. Eventually, a hybrid platform that systematically reduces structural and empirical risk has been recently adopted and achieved a sensitivity of 93.44% and an accuracy of 93.13% [25, 32]. In another study, a modified Capsule Network (CapsNet) hybrid screening algorithm has been reported for automated identification and pixel label based classification (segmentation) of malaria parasite-infected RBCs, achieving an accuracy of 98.70% [39]. Moreover, modified YOLOV3 and YOLOV4 models have been employed on a publicly available malaria dataset and achieved an accuracy of 95.46% and 96.14%, respectively [40]. A recent study employed YOLOv5 and DarKnet-53 Plasmodium Falciparum life stages detection and achieved an accuracy of 95.20% and 96.02%, respectively [41]. However, certain limitations to the prior work are: previous CNN models have been used to detect parasite images designed specifically for natural samples. In contrast, the medical images have distinct patterns and textures of the parasite malaria infection, limiting the model's performance. The prior research primarily focused on the accuracy of the validation dataset. Mostly, the previous studies are evaluated using accuracy; however, the results of a single-measure evaluation of the detecting system could be more reliable. Therefore, the detection rate and F-score are essential indicators of parasite detection. Moreover, testing evaluation measures on a vast, previously unseen test dataset are necessary for determining the detection model’s robustness. Furthermore, several models have been evaluated on a limited dataset of images of malaria parasites.

Malaria parasite detection scheme

The proposed DBEL framework and subsequent prompt effective treatment in sufficient time can detect permanent disabilities from the malaria parasite. The microscope analyzes malaria parasites from blood films stained with various chemical stains [42]. Various scanners, chemical stains, concentrations, and methods can cause color differences. Therefore, analyzing parasite and artifact patterns becomes challenging and causes substantial margin errors. These challenges necessitate designing a new deep CNN and ensemble learning framework to capture complicated patterns and identify malaria infectious microscopic thin smear samples. The developed framework comprised a new Boosted-BR-STM, ensemble learning, and contemporary customized CNNs to screen parasite-infected cells from healthy images. The proposed malaria detection scheme utilizes three experimental arrangements: (1) the proposed DBEL, (2) the proposed DBML framework, and (3) the evaluation of existing CNNs. These existing CNNs learned from scratch and fine-tuned using TL on the malaria NIH dataset. Moreover, these CNNs are evaluated as (i) The deep features are extracted from existing CNNs and provided to the ML classifier (ii) The Softmax-based evaluations. Additionally, the DWT and data augmentation as pre-processing techniques for improving the detection CNNs performance to screen stain impurities or artifacts from parasite images and analyze different parasitic patterns. The overall workflow is arranged in Fig. 1.

Fig. 1
figure 1

The flow diagram of the developed malaria parasite detection scheme

Data enhancement

Discrete wavelet transformations (DWT) effectively produce salient feature maps and reduce noise [43]. DWT divides the image into four sub-parts: high, low, and diagonal-resolution channels. The next level of DWT, the low and diagonal-resolution channels, is stored and altered to provide the necessary DWT coefficients for parasite detection. Moreover, a 75% dimension reduction at each level minimizes computational complexities. The low- and high-resolution images are combined and reconstructed using IDWT to produce highly informative (low-resolution and diagonal features) enhanced images [44]. The original and improved DWT data are shown in Fig. 2.

Fig. 2
figure 2

Parasite and non-parasite (healthy, stain artifact) samples from the NIH malaria dataset are shown in panels (a, c). While (b, d) displays their DWT-enhanced examples, respectively

Data augmentation

Data augmentation is widely applied to boost the robustness of the model and lessen the risk of overfitting. Data instances are increased in runtime (on-the-fly augmentation) during the model training using different data augmentation techniques [45]. This work uses rotation, reflection, and shearing, as shown in Table 1. Moreover, scaling and translation parameters are kept by default [1 1] and translation [± 5], respectively. Data augmentation strategies are employed to learn different variations and improve the model’’s generalization, especially, shearing learn distortions, real-world variations, and occlusions.

Table 1 Data augmentation detail

The proposed deep boosted and ensemble learning framework

This research proposes a new deep DBEL framework that is comprised of the developed Boosted-BR-STM BRNet and ensemble techniques for malaria parasite detection. Moreover, the developed Boosted-BR-STM penultimate layers are used for deep feature extraction. The workflow of the malaria parasite detection framework is illustrated in Fig. 3.

Fig. 3
figure 3

The proposed malaria parasite detection framework comprised the developed boosted-BR-STM and ensemble of classifiers

The proposed channel Boosted-BR-STM CNN

This study introduces a novel Boosted-BR-STM CNN for the detection of parasitic malaria in patients. The CNN incorporates a unique STM-based convolution block to analyze both homogenous and boundary patterns in parasitic regions. Additionally, we employ novel SB concepts within STM to produce condensed and diverse feature maps. The channel SB technique is applied at the initial, intermediate, and final stages to capture textural differences between parasites and typical artifacts [46], as illustrated in Fig. 3.

The Boosted-BR-STM effectively learns abstract and target features by organizing three STM blocks with a consistent structure. Each block consists of four dilated-convolutional units, employing max and average pooling to explore border determination, regional consistency, and textural variation [47, 48], as shown in the Eqs. (13). This systematic implementation aids in efficiently exploring border or edge determination, regional homogenous, and textural variation of parasitic and artifact images.

$${{\text{x}}}_{k,l}= \sum_{i=1}^{{\text{m}}}\sum_{j=1}^{{\text{n}}}{{\text{x}}}_{k+i-1,l+n-1}{{\text{f}}}_{i,j},$$
(1)
$${{{\text{x}}}^{{\text{max}}}}_{k,l}= {{\text{max}}}_{i=1,\dots ,w,j=1,\dots ,w}{{\text{x}}}_{k+i-1,l+j-1},$$
(2)
$${{{\text{x}}}^{{\text{avg}}}}_{k,l}=\frac{1}{{{\text{w}}}^{2} } \sum_{i=1}^{w}\sum_{j=1}^{w}{{\text{x}}}_{k+i-1,l+j-1}.$$
(3)

In the equation, ‘x’ represents the input feature map, size is shown with the symbols ‘k’ x ‘l’ and ‘I’ x ‘j’, and ‘f’ stands for the filter (1) in Eqs. (23). As indicated in the equation, every one of the four convolutional blocks (B, C, D, and E) uses channel SB differently to learn distinct parasitic feature sets (4). While learning blocks D and E from scratch, TL builds additional channels in blocks B, C, M, and N to provide different feature maps. Each STM convolutional block has 32, 64, 128, and 128, 256, 512 channel dimensions when squeezed and boosted, correspondingly [49]. Finally, a systematic approach has been adopted, which involves the stacking of TL-based residual learning-based M and N blocks, culminating in their concatenation at the final stage to effectively learn diverse feature spaces. Three residual blocks are sequentially arranged to facilitate the acquisition of diverse features, wherein the number of channels progressively escalates from 32 to 256.

TL's core role is to gain information from the trained source-domain and solve issues in the target-domain while pursuing a high level of performance. Block A also uses region smoothing techniques to minimize the distortion and outlier acquired while capturing the input images [50]. The boosted channel is handled in block F to lessen connection intensity and obtain ideal attributes.

$${\mathbf{x}}_{{\text{Boosted}}\,=\,}b\left({\mathbf{x}}_{{\text{B}}}|| {\mathbf{x}}_{{\text{C}}}{||\mathbf{x}}_{{\text{D}}}{||\mathbf{x}}_{{\text{E}}}\right),$$
(4)
$$ {\mathbf{x}}_{{DBF}} = \sum\limits_{a}^{A} {\sum\limits_{b}^{B} {v_{a} {\mathbf{x}}_{{{\text{Boosted}}}} } ,} $$
(5)
$$\upsigma \left(\mathbf{x}\right)=\frac{{{\text{e}}}^{{\mathbf{x}}_{{\text{i}}}}}{\sum_{{\text{i}}=1}^{{\text{c}}}{{\text{e}}}^{{\mathbf{x}}_{{\text{c}}}}}.$$
(6)

The feature-maps of blocks D and \({\text{E}}\) are denoted in Eq. (4) by the variables \({x}_{D}\) and \({x}_{E}\), respectively. Likewise, blocks B and C are based on additional channels created using TL and are shown as \({x}_{B}\) and \({x}_{C}\), respectively. b(.) represents the boosting operation b(.). Additionally, the developed CNN employs fully connected layers with dropout layers to collect and preserve target-specific features and minimize overfitting. \({v}_{a}\) serve as an example of the number of neurons in Eq. (5). Lastly, Eq. (6), where c stands for the number of classes, represents softmax, an activation function.


Significance of using auxiliary channels and Squeezing–Boosting (SB) Ideas

The hybrid framework’s representational capacity is enriched by introducing multiple additional channels into the advanced deep Boosted-BR-STM through ensemble learning. The Squeezing–Boosting (SB) ideas employed in the proposed deep CNNs, initially squeezed channels to get salient and informative feature space. Then, combined with each STM block at abstract, mid, and high levels to boost and achieve a diverse feature map using TL-based auxiliary channels. Concatenating the prominent and notable information from various deep CNNs using distinct channels enhances the malaria infection depiction. Moreover, feature-map enhancement is achieved by incorporating auxiliary channels from pre-trained residual models. These supplementary channels are subsequently employed in conjunction with the proposed CNN at the final stage. The SB-based deep CNN effectively learns intricate local and global patterns, enabling the discrimination of textural variations between parasitic and healthy samples.

Ensemble learning

We employ deep boosted feature-maps and classifier ensemble strategies in the proposed hybrid learning. The essence of this boosted hybrid learning lies in enhancing the feature space used for combining ML classifiers. Ensemble classifiers, guided by a voting strategy, amalgamate decisions from various classifiers. This hybrid learning strategy harnesses the strengths of the developed deep Boosted-BR-STM and the discriminating power of ML classifiers by isolating key characteristics. We extract deep feature spaces from the final layers of boosted deep CNNs and feed them to competitive ML classifiers, including SVM [48], MLP [51], and AdaBoostM1 [52]. The activation functions are represented in \({f}_{SVM}\left(.\right)\), \({f}_{MLP}\left(.\right)\), \({f}_{Adaboost}\left(.\right)\) as shown in Eqs. (710).

The proposed DBEL framework extracts deep boosted channels from the proposed Boosted-BR-STM to achieve diverse channels and provides to ensemble classifier. The ensemble method combines the outputs of individual models, leveraging their diverse strengths. The ensemble aggregates predictions from multiple models, reducing the risk of overfitting and enhancing the generalization ability of the proposed framework. Moreover, the DBEL benefits from the generalized detection model and provides deep rich information feature space and ensemble learning [53]. The deep feature maps are generated, the 2nd last fully connected (FC) layer of developed Boosted-BR-STM and customized CNNs and fed to the ML classifier. Ultimately, integrating deep boosting and ensemble classifiers improves the DBEL framework generalization ability. In Eq. (10), \({f}_{Ensemble}(.)\) an ensemble of ML classifiers makes the final decision from the boosted feature-maps.

$${{\text{y}}}_{MLP}= {f}_{MLP}\left({\mathbf{x}}_{DBF}\right),$$
(7)
$${{\text{y}}}_{SVM}= {f}_{SVM}\left({\mathbf{x}}_{DBF}\right),$$
(8)
$${{\text{y}}}_{{\text{Adaboost}}}= {f}_{{\text{Adaboost}}}\left({\mathbf{x}}_{DBF}\right),$$
(9)
$${{\text{y}}}_{{\text{Final}}}= { f}_{{\text{Ensemble}}}{ (f}_{MLP}\left({\mathbf{x}}_{DBF}\right),{ f}_{SVM}\left({\mathbf{x}}_{DBF}\right),{ f}_{{\text{Adaboost}}}\left({\mathbf{x}}_{DBF}\right) ).$$
(10)

Significance of hybrid learning

The training of the proposed deep Boosted-BR-STM and customized CNNs may sometimes cause overfitting. Therefore, the proposed hybrid framework learns the effective discrimination features and improves generalization. Additionally, SVM [48], MLP [51], and AdaBoostM1 [52] represent three distinct ML classifiers that minimize the structural risk. Deep CNNs cause empirical risk by reducing training error using the optimal hyper-parameter selection [48]. The proposed framework significantly reduces both training and test errors and thus improves the generalization. Moreover, ensemble learning aims to increase performance and promotes integrating numerous feature spaces into a single rich information feature vector [54].

Utilization of customized CNNs

Several current CNNs like VGG, ResNet, GoogleNet, DenseNet, and Inception have been adapted to classify parasite malaria images for comparative analysis [23, 25, 32, 35, 37, 38]. The existing CNN’s abstract and final classification layers are customized using additional layers according to the input and target-class dimension of the dataset. CNN models were primarily backpropagation strategies learned from scratch where initial weights were randomly selected. The convolutional layers’ initial weights were borrowed from pre-train models using TL to improve the model convergence. In this regard, we used TL to adopt effective model parameters derived from the modified prior CNNs designed to precisely capture the target-domain specific parasite characteristics mostly on the malaria dataset employing improved filter weights acquired from ImageNet [55].

Experimental configuration

Dataset

The NIH dataset separated parasitic cells from a thin blood smear slide for research on monitoring and diagnosis [25, 56]. The dataset includes samples of falciparum patients collected from the Mahidol Oxford Tropical Medicine Center and Bangladesh’s Chittagong Medical College [57]. The institutes mentioned above are proficient slide readers who differentiate between parasitic images and healthy individuals. Plasmodium is present mainly in positive (parasitic) specimens, while the artifact effect is seen in negative (non-parasitic or healthy) samples with staining and contaminants. The malaria dataset distribution for experimental setup is shown in Table 2 and Fig. 4, pictorially depicting the parasite and non-parasitic or healthy RBC samples.

Table 2 Benchmarked NIH malaria detail
Fig. 4
figure 4

Panel (a, b) DWT enhanced parasitic and normal samples, respectively

Model training

The training and testing parts of the dataset are divided into a 70:30% ratio. Additionally, cross-validation is executed in the training phase, which is partitioned 80:20 into training and validation sets during model training. The training set includes the data validation set when optimum parameters are picked via hold-out cross-validation. Table 3 lists the selected optimal parameters in detail. The MATLAB-2022a tool was used to create the modified CNNs. The tests were carried out on an NVIDIA-GeForce GTX-T-Dell PC with 32 GB RAM and CUDA support. CNNs required nearly 12–24 h during training or 1–2 h for each epoch.

Table 3 Hold-out based optimal hyper-parameter selection

Performance evaluation

Standard performance criteria are used to evaluate the efficiency of the detection CNNs and the developed hybrid learning framework. The detection measures, including Accuracy, Precision, Sensitivity, Specificity, and F-score, are used as optimization metrics for evaluating the technique's performance. These measurements, along with a mathematical explanation and abbreviation, are explained in Eqs. 11.

$${\text{Accuracy}}=\frac{\text{Classified Correctly}}{\text{Total Samples}}\times 100,$$
(11)
$${\text{Precison}}=\frac{\text{Classified Parasitic}}{\text{Classified Parasitic}+\text{ Incorrectly Classified Parasitic}},$$
(12)
$${\text{Sensitivity}}=\frac{\text{Classified Parasitic}}{\text{Total Parasitic Samples}},$$
(13)
$${\text{Specificity}}=\frac{\text{Classified Healty}}{\text{Total Healty Individuals}},$$
(14)
$${\text{F}}-{\text{Score}}=2\frac{\left({\text{Sen}}\times \mathrm{ Pre}\right)}{{\text{Sen}}+{\text{Pre}}},$$
(15)

Results and discussion

This study uses the original and DWT-improved datasets to establish a novel detection DBEL framework system for malaria and distinguish parasite malaria sufferers from healthy individuals. Moreover, a novel Boosted-BR-STM is developed for testing the usefulness of boosting and boundary-region based STM to forecast affected parasite cells in RBC-thin-smears microscopy samples, and its effectiveness is contrasted with modified existing CNNs. Furthermore, the existing CNNs are upgraded, deployed in the TL, and trained from scratch. Table 4 assesses the developed malaria detection employing defined performance metrics.

Table 4 Performance of Boosted-BR-STM and current CNNs on unseen dataset

Performance analysis of detection results

Enhanced dataset evaluation

Impurities, staining, and noise artifacts in the original malaria dataset generated a striking likeness between the normal and parasite samples. In this regard, an enhancement process is essential to remove stains/impurities and impulsive noise from blood smear images. In this case, DWT reduces noise effects, stain impurity, and computing time, attaining streamlined, improved feature channels. Moreover, DWT coefficients preserve important low-resolution and diagonal features that aid in differentiating malaria-infected RBCs from healthy samples. Employing CNN techniques on improved DWT image outperformed the standard dataset in terms of Accuracy (1.86–3.1%), F-Score (1.8–2.9%), Sensitivity (0.60–2.2%), Precision (2.40–5.50%), Specificity (2.6–6.1%), as shown in Table 4 and Fig. 5. The minimum, average, and maximum performance for each parameter is demonstrated in Fig. 5. Moreover, DWT is frequently employed as an enhancement approach for data cleaning, significant feature-map generation, and improved classification framework performance.

Fig. 5
figure 5

The proposed Boosted-BR-STM CNN and DBEL framework performance gain over the existing detection. Moreover, an enhanced DWT samples considerably improved performance

The proposed Boosted-BR-STM

The developed Boosted-BR-STM improved the generalizability of the proposed DBEL scheme compared to early techniques and yielded better results on a DWT-enhanced test dataset. Table 5 and Fig. 5 show a significant enhancement in the developed Boosted-BR-STM to forecast plasmodium falciparum-infected patients. The Boosted-BR-STM exploits homogenous and boundary-driven parasite patterns in the STM block, boosting through STM and residual learning, and TL to improve the Sensitivity and F1-score. Region homogenous and boundary characteristics aid in learning the distorted parasitic samples. Moreover, TL-based generated feature maps and SB can capture subtle contrast and texture variation across artifact and parasite samples.

Table 5 Performance comparison of Boosted-BR-STM and current CNNs on unseen dataset

The data augmentation strategies employed in the training portion yielded performance improvement and model generalization. Moreover, Accuracy and Precision are attained by decreasing false positives, which reduces the medical staff's workload significantly. The tables demonstrate that the Boosted-BR-STM fared better than reported approaches using the NIH dataset. The Boosted-BR-STM performs better than other methods; Accuracy values range from (1.16–7.44%), F-score (1.2–7.5%), and Sensitivity (2.5–4.8%) (Fig. 5). The developed Boosted-BR-STM lower False Negative (FN = 71) and improved sensitivity relative to the finest, most reputable TL-based DenseNet CNN (FN = 131) on unseen DWT enhanced dataset, as demonstrated in Tables 5 and 6.

Table 6 TL-based existing CNNs performance analysis

The proposed hybrid learning

The malaria parasite samples are diagnosed using a new deep-boosted feature-map and ensemble learning (DBEL). In this regard, three competing ML classifiers are ensembled with feature vectors of the proposed Boosted-BR-STM that contribute as feature extractors. Moreover, the malaria parasitic images are detected by extracting the deep features from existing CNNs and providing them to ML classifiers in the DHML scheme. The significance of exploiting deep features is identified and compared with Softmax-based evaluation. The DHML using TL-based fine-tuned existing CNNs scheme outperforms Softmax-based evaluation in terms of Accuracy (0.47–1.7%), F-Score (0.50–1.6%), Precision (0.50–3.6%), Specificity (0.5–4%), as illustrated in Table 7.

Table 7 Performance of TL-based current CNNs deep feature and ML

The proposed DBEL framework

The hybrid learning approach evaluates the performance of deep-boosted feature maps and ensemble learning. The proposed DBEL outperformed the customized CNNs by providing a deep-boosted feature map to the majority voting-based ensembled classifiers. Combining boosted deep feature maps and utilizing three classifiers creates a hybrid of diverse feature spaces and ensemble learning. Boosting learning aims to increase performance and promotes integrating numerous feature spaces into a single rich information feature vector. Moreover, the ensemble classifiers’ improve the proposed framework's differentiation capacity by merging deep feature maps to construct the boosted and diverse feature space.

The individual models within the ensemble like SVM, MLP, and AdaBoostM1 are chosen because they have distinct characteristics and strengths. SVM learns optimal decision boundaries, MLP captures complex non-linear relationships, and AdaBoostM1 enhances the classification of challenging instances. The selection of these classifiers is based on their inherent ability to minimize structural risk, and this contributes to improved performance. The proposed DBEL framework outperformed the existing techniques for classifying malaria parasite samples in terms of Accuracy (1.72–5.65%), F-Score (1.70–5.4%), Precision (1.60–8%), Specificity (1.5–8.6%), as shown in Table 8 and Fig. 5. Finally, the proposed DBEL framework further reduced the FN (34), as contrasted to the best and most reputable DenseNet-201. However, a few samples were missed due to a similarity between malaria-infected and healthy people owing to impurity, stained, or noise anomalies in non-parasitic instances, as seen in Fig. 6. In the summarized way, we have introduced several innovative concepts to systematically enhance the framework’s performance and generalization.

Table 8 Performance analysis of the proposed frameworks
Fig. 6
figure 6

The enhanced stain artifacts and parasitic samples are shown in panels (a) and (b). The parasite images have a high degree of resemblance to artifacts, resulting in a high rate of miss detection


The proposed Deep Boosted-BR-STM CNN novel concepts

  • Boundary and regional feature extraction and STM Block integration

  • Optimal feature map extraction and concatenation of diverse maps to achieve boosted channels through TL-based auxiliary channel generation

  • Channel boosting through TL-based auxiliary channel generation


Deep Hybrid and Ensemble Learning

  • Extracting deep features from the proposed Boosted-BR-STM CNN and utilizing ML Classifiers

  • Exploiting proposed deep features and Ensemble of ML Classifiers for the final malaria parasite detection.

Customized CNNs

TL is an essential DL technique to perform satisfactorily instead of training CNN from scratch. TL adopted custom CNNs utilizing DWT-enriched data outperformed learning from scratch, as shown in Table 6. The TL-based fine-tuned existing CNNs perform better than those learned from scratch and existing techniques: accuracy (0.36–1.6%), F1-Score (0.30–1.4%), sensitivity (0.30–1.9%), precision (0.40–3.5%), and specificity (0.5–4%). Tables 5, 6 and 8 show the comparative study with customized deep-CNN models, along with Fig. 5. TL enhanced the CNN learning convergence through pattern initialization thus, lowering the miss-classification rate while maintaining precision. This demonstrates TL’s ability to analyze medical image fields, especially malaria detection.

Feature space visualization

The distinct class samples enhanced the ability of model learning and robustness. Therefore, the principal components analysis (PCA) aims to depict the significant discriminative features. The top-3 (PC1 vs. PC2 and PC1 vs. PC3) distinct components of the test samples are displayed in Fig. 8, with distinctive coloring for every feature point to distinguish parasite-containing samples from non-parasitic ones [58, 59]. The feature space is examined to better comprehend the proposed Boosted-STM decision-making BR’s behavior in contrast to the existing DenseNet-201 model.

The developed Boosted-BR-STM and DBEL-generated feature space depict the discrimination among parasite and healthy samples, as illustrated in Fig. 7. The Boosted-BR-STM has the highest evaluating effectiveness, according to the 2D PC plots. The current CNN DenseNet-201 has discovered that feature spaces are cluttered and inaccurately unable to distinguish between two classes using the original data. Additionally, compared to DenseNet-201 trained from scratch using improved DWT data, TL-based DenseNet-201 better indicates the instance class.

Fig. 7
figure 7figure 7

The visualization for the generated of PC1, PC2, and PC3 distinct features for the proposed DBEL, Boosted-BR-STM, and DenseNet on DWT-enriched datasets

Graphical analysis

The Sensitivity and Precision metrics for assessing the models are critical for malaria parasite detection. Therefore, Precision–Recall (PR) and Receiver–Operating Curve (ROC) are drawn for the developed DBEL framework, Boosted-BR-STM, and customized CNNs on test data. These graphs depict the capability of the classifier to differentiate across various potential ranges. The detection cut-off of the best positive class classifier can be considerably accessed via ROC and PR curves; results are shown in Fig. 8 [60]. The Boosted-BR-STM reduced false-negative or miss-classified parasitic compared to existing CNNs on an enhanced dataset with a PR AUC = 99.20% [61]. Moreover, the developed DBEL framework maintains minimal false-positives by detecting malarial infections with strong sensitivity due to its high ROC-AUC value.

Fig. 8
figure 8

The developed Boosted-BR-STM with TL-based fine-tuned CNN’s ROC and PR curves on original and DWT enriched data

Conclusion

Early detection of malaria can be treated properly and quickly enough to prevent irregular disability. The proposed hybrid DBEL framework stack deep boosted features of the developed residual Boosted-BR-STM CNN and ensemble learning to detect RBC thin smears microscope images of malaria patients. The proposed DBEL framework benefits from DWT enhancements, data augmentation, incorporating TL, inherent discriminative deep features from the developed Boosted-BR-STM, and ensemble learning to improve detection performance. Moreover, the developed Boosted-BR-STM employs TL for the diverse rich information feature-maps generation, residual learning, and SB ideas in the STM block, enhancing the ability to learn homogeneity, and parasitic structural patterns. Moreover, residual learning systematically extracts features, starting with basic image-level features to more intricate texture-based differences. This novel approach allows for the acquisition of boosted features at varying levels of granularity. The proposed framework achieved an Accuracy of 98.50%, an AUC of 0.996, an F-score of 0.985, and a Sensitivity of 0.992. Furthermore, the proposed Boosted-BR-STM CNN and framework excelled in existing techniques, experimentally in accurately detecting malaria-infected patients. We have assessed our proposed framework’s runtime compared to manual analysis, gauging its efficiency and time-saving potential. It is worth noting that our system's testing time for each parasitic sample averaged 5–7 s. In the future, the proposed DBEL is intended to help healthcare practitioners by developing an application to automatically screen, identify, and stage parasitic malaria patients for clinical examination, as well as impurity, stained, or noise anomalies in non-parasitic instances. Moreover, the developed framework can be employed in diagnosing monkeypox, brain tumors, lung cancer, and breast cancer using medical images.