D2PAM: Epileptic seizures prediction using adversarial deep dual patch attention mechanism

2023-12-01ArfatAhmadKhanRakeshKumarMadendranUsharaniThirunavukkarasuMuhammadFaheem

CAAI Transactions on Intelligence Technology 2023年3期

Arfat Ahmad Khan| Rakesh Kumar Madendran| Usharani Thirunavukkarasu |Muhammad Faheem

1Department of Computer Science, College of Computing,Khon Kaen University,Khon Kaen,Thailand

2Department of Computer Science and Engineering,Rajalakshmi Engineering College, Chennai, India

3Department of Biomedical Engineering, Saveetha School of Engineering,Saveetha Institute of Medical and Technical Sciences(Deemed to be University),Chennai, India

4School of Technology and Innovations,University of Vaasa, Vaasa, Finland

Abstract Epilepsy is considered as a serious brain disorder in which patients frequently experience seizures.The seizures are defined as the unexpected electrical changes in brain neural activity, which leads to unconsciousness.Existing researches made an intense effort for predicting the epileptic seizures using brain signal data.However,they faced difficulty in obtaining the patients' characteristics because the model's distribution turned to fake predictions, affecting the model's reliability.In addition, the existing prediction models have severe issues,such as overfitting and false positive rates.To overcome these existing issues, we propose a deep learning approach known as Deep dual-patch attention mechanism (D2PAM) for classifying the pre-ictal signals of people with Epilepsy based on the brain signals.Deep neural network is integrated with D2PAM, and it lowers the effect of differences between patients to predict ES.The multi-network design enhances the trained model's generalisability and stability efficiently.Also,the proposed model for processing the brain signal is designed to transform the signals into data blocks,which is appropriate for pre-ictal classification.The earlier warning of epilepsy with the proposed model obtains the auxiliary diagnosis.The data of real patients for the experiments provides the improved accuracy by D2PAM approximation compared to the existing techniques.To be more distinctive, the authors have analysed the performance of their work with five patients,and the accuracy comes out to be 95%,97%,99%,99%,and 99%respectively.Overall,the numerical results unveil that the proposed work outperforms the existing models.

K E Y W O R D S artificial intelligence techniques, classification, learning (artificial intelligence)

1 | INTRODUCTION

A general neurological disorder is Epilepsy, and it poses considerable effects on the economy and society[1]along with having different underlying causes [2].There is a neuronal activity which is based on abnormal and excessive as a result of Epileptic seizure in the brain cortex, and it can be confirmed using the EEG(scalp electroencephalogram)[3],EEG(stereoelectroencephalography) [4], or ECoG (electrocorticography)[5].Interestingly, studies are increased to use of the brain's signal data for many years to predict the seizure by the scalp EEG or the detection in earlier stages [1, 6, 7].A crucial requirement motivates the effort for giving the medical experts and patients the reliable warning when the time between the onset of disabling symptoms in patients for the intervention on time and starting the measured evolution of ictal in the brain signal to change the evolution of seizure potentially[6].This study provides insights into understanding the mentioned techniques for the propagation and initiation of the seizure [1].

Early detection or the prediction of reliable seizures is difficult in a computational way, despite the increase in the enhancement of the algorithms and devices over the decades.Primarily, there is great susceptibility to different sources of bioelectric noise,difficulties of considerable posing in the brain activities interpretation and analysis[8]during the recordings of the scalp EEG that are used daily.There is a long-established EEG approach to obtaining electro-physiological signals from brain region,and the activities are recorded with less noise over scalp EEG.Moreover, the EEG signals vary from patient to patient,which can be large due to the implantation of the EEG electrodes depending on the zone of epileptogenic that differs between the patients.The broad distribution of demographic and different kinds of epilepsy patients raises the high variance from patient to patient.Secondly, the open question is the process of detecting the early seizure.Although the trained neurologist finds the spikes of pre-ictal having 60 min before the seizure onset,there are no existence of any proper definition for inter and pre-ictal periods[9].The time window is determined based on the capability to perceive the brain signal changes and the seizure onset region.Thirdly,a restricted number of patients are having EEG and recorded seizures compared to the datasets of EEG, which is scaled that includes hundreds of patients in one research [2].It is complicated to mobilise the previous analysis based on EEG for classifying the epileptic pre-ictal signal[10,11].

The electrophysiological signals are captured directly by the approach of long-established called EEG through the implanted electrodes in a deep way using the surgery from the brain to the tissue of the brain[4].The electrical activities are recorded,having considerable information and low noise over the scalp EEG.There are 15 years of seizure history obtained by the patient.The doctor put a few electrodes in the doubtful region to detect the lesion of Epilepsy.The electrode of EEG is inserted into the brain of humans, which has many contacts which are recorded that is 7–15 contacts having 3 mm centre-tocentre distance in a typical way with every shaft of the electrode such asAtoJ.The recordings of the EEG signal have greater amplitudes of 50–1500 µv and give alterations over the broad frequency range of up to 10 kHz[12].

Deep learning (DL) approaches have been highly successful in image classification tasks like medical imaging in recent years.Consider an example that the CNNs as convolutional neural networks are checked for having the great capacity for learning a higher level of features from the data of MRI and enhance the performance of the diagnosis of brain disease having the many author's efforts [13].Moreover, the pre-defined ROIs are having the diagnosis of ES in a manual way by having the experience from experts for most of the previous methods of DL for implementing the diagnosis models based on CNNs that turn to inadequate significance of every difference with the help of similar space of template and that is not included the complete atrophy features related to the disease that is distributed in the complete brain.However,some methods of DL obtain the particular output for the locations of pathological that neglects the interpretability problems in the practices of medicine because of the characteristics of the black box for the neural networks.Furthermore,recent works on epileptic seizure prediction utilised enhanced DL algorithms which tend to overcome the class imbalance problems by extracting the geometric and handcrafted features[14, 15].Widely, General Adversarial Networks and their enhanced versions were adopted; however, they are limited with stability issues [16, 17].Some regions have structural changes in MRI scans that have great correlative features of pathological because brain atrophy happens locally in a general way.On the other hand, the remaining regions have less essential data for the distinction.Hence, the important difficulty for the diagnosis of MRI based on DL for improving the discriminative feature's identification has(i)informative microstructures having the regions locally,and(ii)essential regions in the image globally in a relative way.

1.1 | Research motivation

Accurate detection and prediction of brain seizures was a major concern in the state-of-the-art works.However, they inaugurate many researches works on brain seizure prediction using machine learning and DL algorithms, but they failed to attain the desirable accuracy.Some of the major research problems are listed in this section, which motivated us to propose a better research work.The research motivations are:

• High False Positive Rates: Almost many of the existing works extracts many features includes low level features,high level features etc., However, they were lacked with performing proper feature engineering (i.e.not analyse/discriminate the features properly), and it leads to discrepancies during prediction; therefore, causing higher false positive rates.

• Poor Classifiers: The existing works lack with adopting proper classifier for brain seizure detection/prediction.For instance, many of the existing works use conventional machine learning algorithms, such as Support Vector Machine(SVM), regression tress, KNN etc., However, the adoption of conventional machine learning algorithms leads to underfitting,interoperability,and they do not withstand with larger datasets causing class imbalance issues.

1.2 | Research contribution

F I G U R E 1 Architecture model

D2PAM(Deep dual-patch attention mechanism) is anticipated to mention the difficulties mentioned earlier in diagnosing ES by finding the pathological locations in a discriminative way.More particularly, there are three important components inD2PAM:PAMas the attention module of multi-instance learning pooling, attention-aware global classifier, and Patch-Nets mentioned in Figure 1.D2PAMcan learn the structural features in a discriminative way from many patches of MRI locally that are distributed in the brain even though the Patch-Nets have the blocks of spatial attention.Thus, the features are provided with various weights to combine to the representation of global features via the pooling attention for the complete information related to brain structure depending on the global classifier to construct for the diagnosis of ES.The suggested technique is evaluated in the proposed method on two datasets from the public dataset, and the results from the experiments on many tasks of ES-oriented classification,such as prediction of MCI conversion and classification of ES, to implement the proposed methods ofD2PAMperform well than many modern methodologies in terms of generalisation and accuracy.Apart from the existing techniques, the proposed model gives the contributions that need to be summarised as below:

1)D2PAMis the Deep dual-patch attention mechanism suggested to enhance the ES diagnosis performance that can automatically obtain the structural features from the scans of sMRI globally and locally and maintain the decisions of classifications related to ES in the unified structure.

2) The designing of the Patch-Nets having blocks of spatial attention in every patch needs to extract the features in a discriminative way and improve the abnormal local features to change the micro-structures generated using brain atrophy.

3)PAMOperation such as patch attention mechanism is suggested for every patch with relative contribution to obtain the various representation of the weighted feature globally for the complete brain structure.

1.3 | Research organisation

The work is drafted as: Section 2 provides explanation of diverse approaches.In Section 3,the methodology is discussed with numerical results in Section 4.Section 5 belongs to the conclusions.

2 | RELATED WORKS

Some tools are modelled to characterise the seizure pattern from the brain signals.Moreover,many tools are concentrated on the data of scalp EEG.The characteristics of seizures are extracted by Oh et al.[13] from the newborn EEG using the representation of EEG signals with the help of Beta distribution in the domain of time and frequency.Singh et al.[18] designed a technique that depends on the decomposition of singular values used for dynamically searching the unique phases of humans'epileptic seizures.The similarities are identified in epileptic seizures.Spatio-temporal dynamics by Faust et al.[19]depends on EEG.The degree of time-varying non-linear correlation is evaluated by this method among the scalp electrodes,which is the onset and when spreading the seizure.A technique to detect the multivariate seizure is developed by Usman et al.[20] using the representation of the signals of EEG having the seizures of epileptic via the data of EEG analysis,and the methodology is the standard technique of machine learning to find the seizures from the data of EEG[21].The use of simultaneous EEG scalp in patients is accessed by Duan et al.[22]having focal Epilepsy,which undergoes the evaluation of intracranial EEG after the presurgical testing in a detailed way.The transition to seizure from normal,identified by Zhang et al.[23],is not an immediate phenomenon.Apart from this, there is a procedure with resilience progressive loss of the neural network.A generalised retrospective based on CNN,and a model of seizure classification based on the patient is used by Usman et al.[24]with the help of data from scalp EEG and intracranial information.

There is no chosen method directly for analysing the data of EEG even though the previous tasks provided the seizure patterns of EEG in a detailed way due to(i)the fixed position of the EEG channel.On the other hand, the electrodes of EEG are implanted in the anato-electro-clinical research has a detailed analysis for the zone of epileptogenic potentially of each patient[25],(ii)the EEG has a sampling frequency which is lesser than EEG[26],(iii)When using the trained model for the new patients, the parameter has adjusted in a manual way that is unavoidable even now [27].Some researchers are tried to find the patterns of seizure in the data of EEG.The collaborative application has a rich and dedicated interface of intuitive graphics that are provided for the analysis to record the brain with ECoG,EEG,and EEG called Brainstorm[28].The MNE tool [29] is used to visualise, explore, and analyse the data on neurophysiology.A tool is used by Yildirim et al.[30] to predict the timing of seizures and find the areas of epileptic using the data of EEG.The classification tool of epileptic seizures is for analysing the spectrum of power from the signals of EEG as stated by Baratloo et al.[31].Moreover,these tools have restricted applicability because of the high complexity of data [32].These drawbacks need to be resolved using the proposed model (Tables 1 and 2).

3 | MODEL DESIGN

To handle the issues encountered in the existing approaches,this work proposes a novelD2PAMto predict the appropriate features for ES prediction.The proposed model contains three diverse modules:patch network,pooling and attention module with global classifier.The patch level pattern analysis with attention blocks helps to learn the discrimination among the features from various patches.The patch-level features are provided with various weights and merged with the global feature representation from the entire structure based on the global classifier for ES prediction.The performance of the proposed model is compared with online available dataset and the outcomes are related to prediction process.The proposed model outperforms various existing approaches in terms of generalisation and performance.The proposedD2PAMis used in the proposed system with complete architecture and important components, and the loss function depends on the learning of multi-instance and the mechanisms' attention.Lastly,the implementation details are provided in the proposed model.

T A B L E 1 Summary of existing works in epilepsy.

T A B L E 2 Algorithms used for comparing the performanceparameters with the proposed deep dual-patch attention mechanism.

The suggested model ofD2PAMthat is presented in Figure 1 has four important stages that have instance selection to compose a bag X that is Patch Location Proposals,and the instance-level features have transformationfthat is Patch-Net,the transformed instances have the combination φ that is described attention pooling, the combined bag level features are used for the classificationgsuch as Attention-Aware Global Classifier presented.The positive classification has the probabilityΘis presented asΘ(X) =g∅f(X).

3.1 | Patch analysis

The initial selection of patches from the scans is used by Patch location proposals as the input for the proposed system.A theoretical methodology is proposed using the patch extraction to concern the set of comparisons on the patches rather than the features based on voxelwise.The image of MR is divided uniformly in the proposed system into many patches(fixed size)asW×W×W,with no intersection for simplifying the calculationsandeliminating the relevantdata.Moreover,the partitioned patches are not relevant to the atrophy,which is abnormal and is created by ES.A considerable difference between the control and experimental groups is found in the test method.The patch locations in the proposed experiment have a more considerable difference between the group of ES in the brain's abnormal atrophy regions.Then,thet-tests are used in the proposed system for sorting all the patches information.The voxel-wise features have the average to compute in the proposed system in one patch,like the feature related to the patch level.Comparing the two groups of features based on patch level is grouped in patch location accordingly to the patients with ES and general controls using thet-test in the training set.Thep-value is obtained in the proposed system at the patch location that presents the location information.The computation of all thep-values is done on all the normalised locations using theand forming the map ofp-value to cover the complete brain image of MR.In addition, the smallp- values have the locations to consider roughly for greater discrimination.The number of patches is chosen in the proposed system based on the map ofp-value at the locations in one image having the smallp- values for composing the bag,such asX={I1,I2,···,Ik}here,the number of chosen patches is presented bykandIi∈RW*W*Win the proposed model as the input.

3.2 | Spatial patch network model

The Patch-Net structure, shown in Figure 1, has the spatial attention block.The Patch-Net has two tasks such as (i)learning the spatial attention-aware to represent the patch-level feature, and (ii) the effect score is the output to indicate the capability to trigger the label of the bag.The blocks of spatial attention are utilised to enhance the feature for the discriminative parts in the patches having a fixed size.The proposedD2PAMmethodology has all the Patch-Nets with similar architecture.

1) Patching Net:More abstract representations of features are learned using PatchNet, alike the former part as the backbone from original patches and lowering the feature map's size.It has four layers of 3D convolutional,and the patches have max pooling to choose the input patches size.The primary convolutional layer consists of the size of the kernel as 4×4×4.There are three layers of convolutional with a similar filter size as 3 × 3 × 3.The filter size is 2 × 2 × 2 during down sampling.The total channels(conv1 to 4)are 32, 64, 128and128 where the layers'training is done in the unit stride, having the feature maps of non-zero padding.ReLU(rectified linear unit)and Batch normalisation (BN)activations follow every convolutional layer.The two branching modules are extended by Patch-Net depending on the output of feature maps from conv4.This work presents one of the spatial blocks of attention to learn the patch-level representation of spatial attention-aware that isC×w×w×was size here, the number of channels is presented asC, and the feature maps'size is presented asw×w×w.Another module has the global pooling, sigmoid function, and the fully connected layer (FCL) concentrated on generating the affect score that is utilised for finding the pathological locations in a discriminative manner.The Patch-Nets give the output from the local patch-level features to handle the three-dimensional shape to learn the spatial relationship between patches and also for the best combination of patches rather than creating one-dimensional feature vectors in many previous transforms based on instance level.

2) Spatial channel and maximal pooling: The spatial attention block is designed in the proposed system to embed into the proposed Patch-Net that is motivated by the proposed module of spatial attention for choosing the feature extraction related to local structure from the image patches of 3D.The block of spatial attention has the architecture presented in Figure 1.There are two various pooling with the axis of the channel that is average channel pooling and max channel pooling which are chosen for creating the two maps of feature in the average features and max features name accordingly.Thus,the concatenation of the two maps of features is done by having the 2 ×w×w×wsize as the input for the consecutive convolutional layer that is padding: 1 to maintain the feature maps size and stride: 1,size of the kernel as 3×3×3.The convolutional layer has the output that is concerned like the map of spatial attention like the similar size as the maps of feature from conv4,(Aspatial∈Rw×w×w) having the score of attention at every location to be restricted to the range (0 →1) via the sigmoid layer.The spatial varying of various parts in the patch is described by the map of spatial attention that gives the feature representations which is multiplied elementwise in the conv4 output having the aspatial attention map computed to represent the local spatial attentionaware in a structural way created ultimately.Thus, the suggested block of spatial attention is explained in the proposed system having different formulae.The conv4 has the output denoted asF= {F1,F2, · · ·,FC}, here, the number of channels is presented asCandFi∈Rw*w*w.The expression of max pooling with the axis of the channel is presented below:

Here, the sigmoid activation is presented as σ, the convolutional layer weights W, and the concatenation is presented as [;].The representation of the patch-level spatial-attentionaware featureFis given below:

Here, element-wise multiplication is represented as ⊗.

3.3 | Pooling

Here,W0andW1are used like the CL in the proposed system.Then, the CL share the parameters in feature descriptor processing with layers convolutional to process the descriptor of the max feature.The effectiveness score learned from every feature of intrapatch is concerned with evaluating the contribution of every patch rather than two maps of patch attention learned from the relationships of interpatch.All the Patch-Nets give the affect scores to create the effect vector asa={a1,…,aC};here,total patches isC.The vector extension is done as a similar size to the maps of patch attention.Then,three maps of attention are combined to theApatchattention map comprehensively using the summation of element-wise activated using theσsigmoid function later.

Lastly, the existing representations globally are multiplied,having the maps of patch attention for producing the Global as feature representation of global attention aware.

Here, the multiplication of tensor is presented as ⊗.The proposed attention of pooling is not considering all the features of the map rather than depending on the most discriminant patch,yet providing various weights to every patch rather than merging all the patches in an equal way that is apart from the traditional average pooling and max pooling.Hence,attention pooling emphasises the feature representation to enlighten noise interference for the critical patches.On the other hand,the connection remains between the patches of key and unwanted patches for eliminating the loss of the relevant features, potentially enhancing the classification performance and lowering the rate of misdiagnosis for particular subjects.More particularly, the pathological locations are identified using the patch attention map that is a reference.

3.4 | Global network layer

Figure 1 shows the global classifier with attention awareness to continue representing the Global as bag-level, considering higher correlations between the patches and generating the final diagnosis.The CL give the supreme higher level extraction of feature abilities on the inter-patch features for DL compared with direct, FCL for exploring the correlation between the features.Then, two layers of the convolutional network are used for learning the feature representation based on attention awareness in front of the global classifier from the pooling to extract more information related to structure between features and patches with channels.There are two CL with 64 and 128 filters,each having a 2×2×2 similar size and unit stride that follows using the BN(batch normalisation)and ReLU activations (rectified linear unit).Thus, the downsampling of the feature maps toF∈R64*1*1*1is chosen by the adaptive 3D average pooling.Thus, the representation of the feature is flattened like the consecutive two fully connected layers as the input having 2 and 32 units for creating two scores that are normalised using the softmax function to present the positive and negative probability accordingly.The attention mechanism is done for learning the integral feature representation for the complete structural information of the brain in the scans of MRI depends on the various output of weighted feature from the attention pooling and the classification of output gives the outcome to predict the MCI conversion or the classification of ES.

3.5 | Loss function

The label of image level is concerned with the peculiar guidance utilised in the backpropagation to update the proposed weights of networkWsince the labels of image level are provided only during the ambiguous patch-level labels.The loss function is to train the model with cross-entropy loss as it is defined as follows:

Here, the number of images is represented asN, and the probability of exact prediction forXnis represented asP(Yn|Xn;W).The training losses are backpropagated as the end-toend network to the Patch-Nets andPAMpooling from the global classifier to assist in updating the network parameters having the optimisation algorithm such as Adam.The proposed network learns the map finally asXtoYto reduce the loss function.

3.6 | Model execution

The activation of batch normalisation is used in the proposed system to alleviate the over-fitting problem after CL.Patches are made for weight sharing and to reduce the trained parameters, particularly during the higher requirement of the cohort patches.Image patches extract patch from various anatomical structures that efficiently augment the diversity to train the data.

The proposed method's performance and generalisability are determined using an EEG dataset.More particularly, the samples are divided in the proposed system from the dataset to train and test with samples as 80% to use for training the model.On the other hand, balance samples of 20% are taken for the test dataset.The hyper-parameters are selected,and the models are trained on the training dataset with 5-fold CV.Thus,the model is trained to have hyper-parameters which are optimised to test to perform the test dataset.The proposed system is determined as the independent dataset to check the generalisability and powerfulness of the proposed model.

Thep- value map is computed in the training stage to cover the complete image using the set of comparisons on the training test, that is, 4 subsets for every round in 5-fold for initialising the locations of the input patch.Thus, the patches are extracted to feed from the chosen regions in the images for the relevant patch accordingly.Adam optimiser is used to train the suggestedD2PAMfor 100 epochs that has ～ 6 h to determine the balance validation.Hyper-parameters such as the number of channels and rate of learning is 0.001, the size of the batch is 10, the size of the patch is 25 × 25 × 25, and the number of patches is 60 to select using the mean performance validation over all the folds.The patches are extracted at a similar location utilised in the training phase from the unseen image for the diagnosis of ES to the trained network in the test stage depending on the pre-processed image that considers～ 0.25.

4 | EXPERIMENTAL RESULTS

The evaluation of the anticipated method is validated using certain standard classification approaches.The proposedD2PAMmodel is evaluated with various existing approaches.

4.1 | Dataset

The significance of the anticipated model is examined using the EEG signals for seizure and healthy signal classification.Here,the dataset is acquired from online resources, that is, Bonn University,whichis a publicand opensource dataset withvarious groups,that is,AtoE.Every group is sensed in various conditions and recorded in diverse conditions of the subjects.DatasetAandBare determined to be healthy and recorded during awake conditions.Then,CandDare monitored during inter-ictal period andEgroup is known as ictal period.The recorded signals with 23s duration use a 10–30 electrode system with a 173.6 Hz sampling rate.Every group in the dataset is composed of single EEG channel epochs with a 4097 sampling rate.

4.2 | Metrics

The measurements are evaluated, such as precision, accuracy,sensitivity as recall,false alarm rate(FAR),area under the curve(AUC) and f1-score are presented below.

Here, the total true pre-seizure as pre-ictal segments of EEG is presented by TP,FN presents the total false no-seizure segments ofD2PAM,and TN presents the total true normal as no-seizure for the segments ofD2PAM, and FP presents the total false pre-ictal segments ofD2PAM.The model's performance is evaluated using the F1 score in the unbalanced data.FAR is utilised frequently for measuring Epilepsy misreporting using the software in the clinical epilepsy diagnosis.Under the ROC curve, the area definition for AUC is presented.

4.3 | Experimental setup

Here,leave-one-out is used to check the model's performance and five samples are chosen,which has no-seizure EEG data and pre-ictal EEG data is testing data and the balanced data is training data.The conventional method like SVM[31],Diffusion Convolutional Neural Network (D-CNN) D-CNN[33], Deep Convolutional Neural Network (DP-CNN) [33], CNN-based voting [32], Partial Interpretable Estimators (PIES) [34],Transformer[35],Long Short Term Memory(LSTM)[36],EDANN with LSTM[37]and ED-ANN with transformer[38]are used for comparison,as shown in Table 2.Data pre-processing and various data input are employed for all the methodologies.The data has the hidden representation for every window with size of 1s.The model provides the result from prediction for every window with a size of 1 s.The classifiers like DP-CNN and CNN-voting shows window data representation when the size of the window as 1s.EEG seizure based adversarial learning known as PIES(Patient-Independent Epileptic Seizure)is used for classification where PIES has the fixed input of data length.The researchers report that the experiment's setting is used in the proposed system is to lower the input data length to 1 s.On the other hand,experimental outcomes are compared to replace the domain categorydiscriminator whichusessimilar CNN encoder.Four CL exist in the encoder,four ReLu regularisation functions,four max-pooling layers,and the dropout layer as 0.5.Lastly,the hidden vector has a dimension of 32.A similar domain discriminator of BiLSTM is used by both models with 64 hidden units and 2 bidirectional LSTM layers.The important discrimination among LSTM andD2PAMis considered.The label classifier and similar encoder module are there.The proposed system uses a similar structure to the EDANNTransformer for classification purpose.Also,data pre-processing is used by all of the method where 0.0005 is set as the rate of learningαof theD2PAMmodel.TheD2PAMmodel is implemented on the framework of Pytorch and to deploy having the Intel i9 CPU.The model is trained to complete by using the 14 h on average.

4.4 | Experimental outcomes

The proposedD2PAMsystem has the performance, and the benchmark model is presented in Table 3 for the 5 test patients on the data.Tables 3-5 present the examination results that the domain-based adversarial model classification method is suggested to obtain better outcomes.The methodology depends on theD2PAMobtaining better outcomes in terms of F1-score on all the patients in a specific manner.The proposed system performs well than the model of benchmark using the approximate 4.5% concerning the F1 score presented in Tables 4 and 5.It is possible because of the EEG signal nature.The characteristics of Epilepsy for a few people are very obvious for the simple model to perform better due to the signal of EEG, including the characteristics of a non-stationary one.The model ofD2PAMperforms well than the model of baseline using the approximation of 9%concerning the F1 score based on Tables 3-5,which provides the concept of using the adversarial learning of domain that lower the interference of every information efficiently.On the other hand,the table values gives the method of PIESD that depends on adversarial learning to perform as same as the common method based on CNN.Tables 3-5 give the comparison of the proposed model with the various existing approaches.The comparison is made among the proposed and the existing SVM,D-CNN,DP-CNN,CNNbased voting,PIES,LSTM,transformer,ED-ANN with LSTM and ED-ANN with transformer.

The proposedD2PAMmodel concerning FAR is stable,having less FAR that is essential in the practical applications of clinics.The above tables present the data examination with the proposed model having a bad index of FAR of 3%,which shows better performance.TheD2PAMmodel is used to compare the performance having the various methods of channel order on the five patients to check the reordering effect of the segment rows of EEG in the pre-processing of data.The default channel order of EEG uses the Random as the sorting of EEG default order based on the ID channel.TheD2PAMhas the performance to show from the results of experiments on the data after reordering the channel is 6%best over the average to prove the efficiency of the pre-processing of data.

The accuracy is compared in the proposed system for the data having various lengths since the proposed system utilises the data of variable length to train.The process of varying the model's accuracy usingD2PAMis presented in Figures 2–11,with various lengths of fragment sizes.If the length of the fragment is lower than 5 s,the accuracy rate is increased rapidly with the length of the fragment.The proposed model has the accuracy increased slowly if the size of fragment length is 5–10,like the increases in the length of the fragment.Also, the proposed model has the accuracy unchanged in the basic form if the length of the fragment is greater than 10s when length is higher.The proposedD2PAMsystem needs a particular size fragment length for obtaining suitable signals.The proposedD2PAMmodel obtains the amount of information that lowers the length of the fragment to increase the wrong prediction for the likelihood.Moreover,the noise is increased in the data for too long fragments,which lowers the accuracy.The optimum recognition for the fragment length shown by the experiments is approximated to 10 s.Physicians need time in the clinic to predict the characteristic signals of Epilepsy is approximate 10 s, that is,consistently having the results from the experiments.

A considerable influence on the accuracy is obtained by using the parameter γ in the experiment.The parameter γ influences 0–0.5 and is explored in the proposed system on the model ofD2PAMfor the patient-1EEG data.Figure 2 observes that the model's accuracy increases, increasingγifγis lesser than 0.3.The model's accuracy reached the highest value,approximately 95% and increased byγ, and the model's accuracy decreases gradually ifγis 0.3.The domain discriminator contribution for the complete model is less relative, and the system learns the incorrect data easily regarding the patient that turns to the less performance of the model ifγis less.The interference of the domain discriminator is done by having the label classifier learn ifγis higher than 0.3 and the model's accuracy is lowered.The discriminator of the domain is essential for the model ofD2PAMfor approximate functioning.The ablation experiment is performed in the proposed system for further research on the domain discriminator influence and the experiment's label classifier, where the discriminator of the domain defines to remove the domain discriminator and add the domain discriminator using discriminator.The Transformer and LSTM are used in to compare the results of the F1-score as the label classifier.Tables 3-6 give the higher impact by the domain discriminatorfrom the experiment on the F1-score.More specifically, theD2PAMmodel has a more pronounced impact.

T A B L E 3 Performance evaluation using data on patient 1.

T A B L E 4 Performance evaluation using data on patient 2.

T A B L E 5 Performance evaluation using data on patient 3.

Various existing approaches are widely adopted for analysing the neurological disorder which is the reflection of brain variations.Moreover, owing to the local atrophy, some brain regions show structural variations which are extremely correlative with pathological features.However, the primary challenge is to enhance the prediction of discriminative features.To address these issues, the proposedD2PAMmodel is adopted for earlier prediction of ES.The proposed model uses patch network, attention model for analysing the global features for the entire brain information.Based on this, a global classifier model is proposed for ES prediction.In terms of practical implementation, neurophysiologists can find vital patters with the help of the proposed methodology, which leads to the suitable,accurate,and timely treatment of patients.

F I G U R E 2 Comparison of performance metrics with Patient data 1.

F I G U R E 3 Comparison of performance metrics with Patient data 2.

F I G U R E 4 Comparison of performance metrics with Patient data 3.

F I G U R E 5 Comparison of performance metrics with Patient data 4.

F I G U R E 7 False alarm rate with patient 1.

F I G U R E 8 False alarm rate with patient 2.

F I G U R E 9 False alarm rate with patient 3.

4.5 | Discussions

F I G U R E 1 0 False alarm rate with patient 4.

F I G U R E 1 1 False alarm rate with patient 5.

The accuracy of the proposed model for sample 1 is 95%,precision is 90%, recall is 92%, F1-score is 90%, AUC is 98%and FAR is 0.01.The accuracy of the proposed model for sample 2 is 99%, precision is 95%, recall is 96%, F1-score is 96%, AUC is 99% and FAR is 0.0117.The accuracy of the proposed model for sample 3 is 99%,precision is 98%,recall is 99%, F1-score is 98%, AUC is 99% and FAR is 0.01.The accuracy of the proposed model for sample 4 is 99%,precision is 96%,recall is 99%,F1-score is 96%,AUC is 99%and FAR is 0.01.The accuracy of the proposed model for sample 5 is 95%,precision is 85%, recall is 99%, F1-score is 96%, AUC is 98%and FAR is 0.01.Based on the analysis with the provided samples, it is proven that the model gives better prediction outcomes than the other approaches.

Figures 2–6 emphasise the comparison of accuracy, precision,recall,F1-score,AUC and FAR results of five patients.Inthat, our proposed workD2PAMachieves higher results than other existing classifiers in all the five patients' data.The numerical comparisons for the five are provided in Tables 3–7.Specifically, the graphical results on FAR are shown from Figures 7–11.In that inference also, the proposed work achieves lesser FAR than the existing models.

T A B L E 6 Performance evaluation using data on patient 4.

T A B L E 7 Performance evaluation using data on patient 5.

Based on the analysis with various observations, it is proven that the model attains competing outcome in ES prediction.When compared to other approaches, the proposed model shows better performance.The reason behind this is that the proposed model deals with the correlation among the features better than the existing approaches.The proposed model tries to extract the essential features which are more difficult in the existing approaches.The proposed model gives better result to project the efficiency of the model for predicting the pathological condition.At last,the proposed model outperforms the existing approaches by demonstrating the finest feature extraction for ES prediction by balancing the contribution of the proposed model.

However, the model attains superior performance in ES prediction; there are still come limitations that influence the generalisation ability of the proposed model.The input patch size is fixed.With the fixed size, feature extraction is completely complex and also the network construction is highly complex.It is essential to merge the patch location generator for prediction of patches for ES prediction.The detection network is optimised as an E2E network model.

5 | CONCLUSION

A Deep dual-patch attention mechanismD2PAMis suggested in the proposed system for diagnosing computer-aided ES with three important components.They are (i) the discriminative features are extracted from the local patches using the blocks of spatial attention having PatchNets, (ii) the relative contribution of every patch is balanced by using the attention pooling operation, and (iii) the decisions for the ES oriented diagnosis is made by using the attention-aware global classifier depends on the merged representation of feature for the complete structure of the brain.The suggested method ofD2PAMis determined from the dataset in many diagnosis tasks of ES.The proposed system identifies the discriminative pathological locations in the scans of sMRI and obtains the best performance diagnosis over many modern methodologies from the results of experiments.The accuracy of the proposed model for sample 1 is 95%,for sample 2 is 99%,for sample 3 is 99%,for sample 4 is 99%,for sample 5 is 95%respectively.

ACKNOWLEDGEMENT

We would like to acknowledge the Department of Computer Science, College of Computing, Khon Kaen University for its support in conducting this study.Also,the authors are thankful to Chitapong Wechtaisong for its support and fruitful guidance.

CONFLICT OF INTEREST STATEMENT

The authors declare no conflicts of interest.

DATA AVAILABILITY STATEMENT

The data will be available upon request to the corresponding author.

ORCID

Arfat Ahmad Khanhttps://orcid.org/0000-0003-0918-8874

Rakesh Kumar Madendranhttps://orcid.org/0000-0002-5059-7269

CAAI Transactions on Intelligence Technology

2023年3期