A New Mixed Clustering-Based Method to Analyze the Gait of Children with Cerebral Palsy

2021-12-15JingHuLingZhangJieLiandQirunWang

Computers Materials&Continua 2021年2期

Jing Hu,Ling Zhang,Jie Liand Qirun Wang

1Department of Rehabilitation Medicine, Children’s Hospital of Nanjing Medical University, Nanjing, 210008,China

3Shenyang Jing’an Mental Health Hospital, Shenyang, 110163,China

2Liaoning Children’s Hospital, Shenyang, 110085,China

4Department of Physics and Astronomy, University College London, London, 0044,UK

Abstract:Cerebral palsy is a group of persistent central movement and postural developmental disorders, and restricted activity syndromes.This syndrome is caused by non-progressive brain damage to the developing fetus or infants.Cerebral palsy assessment can determine whether the brain is behind or abnormal.If it exists,early intervention and rehabilitation can be carried out as soon as possible to restore brain function to the greatest extent.The direct external manifestation of cerebral palsy is abnormal gait.Accurately determining the muscle strengthrelated reasons that cause this abnormal gait is the primary problem for treatment.In this paper, clustering methods were used to compare and analyze the differences between the abnormal and normal gait parameters of children with and without cerebral palsy.Since the collected data contains overlapping data that may be mutated,while the centroids are also different,the expected result is stratified.To solve this problem, a mixed clustering method is proposed that combines the advantages of K-means and hierarchical clustering, meaning that each set of data shows a similar trend to specific parameters.Experiment results show that this method can detect cerebral palsy through the difference between the abnormal gait of children with cerebral palsy and that of normal children.

Keywords: Cerebral palys;gait analysis;children;mixed clustering

1 Introduction

Cerebral palsy refers to a syndrome caused by non-progressive brain damage and developmental defects from the beginning of conception to infancy, which manifests primarily in the form of dyskinesia and abnormal posture.Seventy per cent of those with the syndrome are children with spastic cerebral palsy,which brings a heavy burden to their families and society.Accordingly, the question of how to improve these children’s symptoms, enhance their quality of life, and reduce the burden on families and society is an important research topic [1,2].

Due to the abnormal muscle strength and muscle tension, the appearance of pathological reflexes, the persistence of primitive reflexes and the presence of secondary contractures and deformities, children with cerebral palsy often exhibit a specific pathological gait (“scissors gait” and “dance gait”) while walking,leading to walking dysfunction, which has an adverse impact on their walking during their lifetime.Gait analysis plays a highly important role in the clinical diagnosis and rehabilitation treatment of children with cerebral palsy [3,4].Through gait analysis, the key joints and muscles involved in dyskinesia can be determined, the relationship between walking disorders and the movement of the trunk and upper limbs can be understood, and the potential influence of walking aids on gait can also be determined.Such analysis thus provides a theoretical basis for clinical diagnosis, treatment, and the production of orthopedic rehabilitation braces [5-7].

Different gait performance in children with cerebral palsy is caused by differences in muscle strength and compensatory effects after disease.Under the premise of not considering the compensatory effect in a short time after disease, accurately determining the cause of the muscle strength differences that lead to abnormal gait is the premise of solving the problem.While preoperative experimentation would be a better solution,many experiments in medicine cannot be conducted directly on the human body.This has led to the study of corresponding mechanical model simulation experiments:That is, establishing a model similar to the research object or process, then indirectly studying the object or process through model simulation [8-10].The dynamic simulation analysis of walking movement, with the help of an optical capture system, is conducted through inverse analysis of the kinematics data of each part in order to calculate the muscle group contraction mode, after which the forward dynamics analysis method is used to calculate the abnormal muscle strength [11].This method can not only analyze the relationship between gait characteristics and muscle strength changes, but can also be used to explore the effect of muscle strength changes on gait changes, as well as analyze the influence of muscle strength adjustment and elevation on gait performance;thus,it represents a good preoperative virtual experiment program[12-14].

Compared with traditional gait analysis,dynamic simulation analysis has a number of advantages in the field of biomechanics [15-17].Firstly, high data utilization not only pays attention to gait data and joint movement, but also enables the movement of each part to be analyzed through the model.Secondly, the simulation calculations of the model can be used to accurately calculate parameters such as joint force and muscle strength during walking.Thirdly, this form of gait research enables the reasons for gait abnormality to be identified, and also facilitates analysis of gait improvement after adjustment of muscle strength to determine the reasons behind it.Moreover, the analysis method is efficient and its overall credibility is high.

Accordingly,this paper will use the clustering method to study the causes of abnormal muscle strength leading to hemiplegic gait and analyze the influence of abnormal muscle strength adjustment on the gait characteristics of children with cerebral palsy.Moreover, this paper comprehensively analyzes the differences between the parameters of abnormal gait and normal gait of children with cerebral palsy,studies the abnormal gait movement and dynamic performance, and explores the relationship between abnormal gait muscle strength and gait characteristics.

2 Related Works

2.1 Clustering Methods

Clustering methods have been gradually developed alongside developments in the fields of statistics,computer science and artificial intelligence.One of the key advantages of clustering methods compared to other machine learning methods is their ability to group processes simply and quickly [18-20].Generally speaking, clustering algorithms can be classified into partition-based, density-based, grid-based and constraint-based methods.In the current era of big data, the increasing amount of data and the growing diversity of data forms has led to the clustering algorithm being more widely used.At the same time,these developments have resulted in higher requirements for the clustering algorithm itself [21].

Using the threshold value of 1012 bytes of effective data,clustering algorithms can be divided into two categories:namely,small data clustering and big data clustering.Small data clustering mainly embodies the basic concept behind the clustering algorithm,while big data clustering embodies the concept along with the architecture and other aspects.Moreover, in terms of the specific implementation algorithm of the bottom layer of clustering, there is no essential difference between the two; in other words, the specific algorithm used to implement big data clustering still uses small data clustering technology.

The traditional algorithms based on partition, density and grid can be classified as partition-based clustering methods.According to the differences in data objects and their clustering forms, the small data clustering algorithms can be divided into traditional and intelligent clustering methods.Traditional clustering methods can in turn be sub-divided into two categories:partition clustering and hierarchical clustering [22].For their part, intelligent clustering methods can be sub-divided into five categories:artificial neural network clustering, kernel-based clustering, sequential data clustering, complex network clustering and intelligent search clustering.Finally, big data clustering methods can be divided into three categories:parallel clustering,distributed clustering and high-dimensional clustering.

The K-means algorithm is a kind of unsupervised learning method, as well as a clustering algorithm based on division.Generally speaking, Euclidean distance is used as an index to measure the similarity between data objects, where this distance and similarity are inversely proportional [23,24].The core idea behind the K-means algorithm involves randomly selecting k initial clustering centersCi(1 ≤i≤k) from the dataset.The Euclidean distance between the remaining data objects and the cluster centerCiis then calculated and the cluster center nearest to the target data object is identified, after which the data object is assigned to the cluster corresponding to the cluster center [25,26].In the next step,the average value of the data objects in each cluster is calculated as the new cluster center.This process is repeated iteratively until the cluster center no longer changes or the maximum number of iterations is reached.For its part,hierarchical clustering is a kind of clustering algorithm with a tree structure; as the name suggests, it decomposes the clustering data hierarchically.Depending on the decomposition method used, these algorithms can be divided into two categories, i.e., agglomerative hierarchical clustering algorithms and split hierarchical clustering algorithms[27].

Following the introduction of the support vector machine, kernel-based methods have become increasingly important in the machine learning and pattern recognition fields [28,29].The kernel clustering method maps sample points from input space to high-dimensional space through kernel function.This type of non-linear mapping can make the non-linear separable data set become linearly separable in the high-dimensional feature space.Accordingly, the linear method is used to conduct clustering in the high-dimensional space, which greatly improves both the performance and scalability of non-linear clustering [30,31].As the complexity of this kernel clustering method is very high, it is necessary to use Mecer theory to transform the kernel.

Deep learning is also a kind of artificial neural network.As the field of deep learning has rapidly developed, research into artificial neural networks has also reached a higher level.There are three basic conditions that must be met for an artificial neural network clustering method to be implemented [32-34].Firstly, in order to ensure that each input sample has different outputs in the network to the greatest extent possible, each unit must be the same except for some randomly distributed parameters.Secondly,each element must have limited strength.Thirdly, there must be some competitive mechanisms among elements.After the network model has been built, the artificial neural network learns the knowledge from the input data, then uses this to adjust the weight vector and the threshold value of the neuron until the output error of the network reaches the expected end of the training[35].

2.2 Gait Analysis of Children with Cerebral Palsy

At present,there is a large body of research focusing on the gait analysis of adults with hemiplegia after stroke,and comparatively less focusing on children with cerebral palsy.In China,a three-dimensional gait analysis system is used to quantitatively analyze the three-dimensional kinematics of hemiplegia gait.Gait performance is evaluated by analyzing differences in lower limb joint mobility and gait time parameters.The researchers of reference 20 analyzed the difference between hemiplegic and normal gaits with reference to the front and rear tilt of the pelvis,lateral movement of the body’s center of gravity and vertical movement range.The authors of reference 21 evaluated the changes in lower limb joint motion among hemiplegic patients by comparing the differences between hemiplegic gait and normal gait in the sagittal, horizontal and coronal planes[36].

In terms of the muscle strength diagnosis of hemiplegic gait, taking advantage of the introduction of clinical electromyography (EMG), the authors of reference 22 used surface electromyography (SEMG) to investigate changes in the tibialis anterior and posterior muscle groups during free walking for hemiplegic and normal gaits.These authors determined that the main causes of hemiplegic gait were abnormal muscle tone and motor control caused by central lesions.In the experiment,the gastrocnemius and tibialis anterior muscles on the side affected by hemiplegia exhibited a synchronous contraction mode.The gastrocnemius on the affected side was overactive, while the electromyographic activity of the tibialis anterior muscle was decreased.This study laid the foundation for the use of SEMG in clinical gait muscle force diagnosis.Furthermore, the authors of reference 23 collected gait data by using the threedimensional capture system to establish a gait model of hemiplegia for simulation analysis, then compared and analyzed the changes in joint angle and muscle strength before and during electrical stimulation [37-40].

The effectiveness of a multi-channel electrical stimulator for hemiplegia gait rehabilitation was verified by the improvements in joint mobility and muscle strength observed during electrical stimulation.These results also qualitatively proved the correlation between the improvement of muscle strength and gait improvement.

In hemiplegic gait rehabilitation research, the researchers in reference 24 investigated the strength of muscle spasm after stroke, on the basis of normal gait.The method used involved adjusting the muscle length and speed of feedback, with the forward dynamics simulation showing a strength increase after the change of the gait characteristics.The results reveal that when the gastrocnemius, soleus muscle, and rectus femoris muscle are overexcited, the muscle length and feedback speed increase, leading to a hemiplegic gait.The researchers in reference 25 further studied the rehabilitation training of hemiplegic gait patients and determined that the ankle joint flexor muscle plays a vital role in hemiplegic rehabilitation training.By means of treadmill fast walking and fast functional electrical stimulation training for patients with mild paralysis, the activation degree of the metatarsal flexors was changed or the muscle strength of metatarsal flexors increased in order to verify the correlation between the parameters of clinical gait movement and the muscle strength simulation results[41-44].

3 Mixed Clustering Methods

When conducting a comparative analysis of the gait of children with cerebral palsy,a method that can analyze the difference mutation is needed,as this will allow doctors to adjust the treatment measures more quickly.In this section,we propose a new mixed clustering method to analyze the abnormal gait of children with cerebral palsy; this approach combines multiple clustering methods, specifically K-means and hierarchical clustering.

3.1 Initialization

Firstly,the partition clustering algorithm is utilized to create an initial partition for a particular dataset.An iterative relocation algorithm is then used to improve the clustering by moving the samples between categories.Finally, the clustering criterion is applied to end the movement and determine the quality of the results.In general,the criterion applied is the following square error criterion:

whereSnrepresents the square error,qdenotes the sample points,andmrefers to the average value of each cluster.We cluster the collected data based on the above model, then find the similarity value of each sequence relative to the abnormal gait of children with cerebral palsy.The present study focuses on the differences in gait between normal children and those with cerebral palsy.Moreover, when analyzing the abnormal gait of children with cerebral palsy, time is also an important feature; therefore, time is used as the basis of the x-axis in the present paper.The value of the x-axis is the distance between the two.

In Eqs.(1)and(2),xnis the value of the time distance.By calculatingxminuspand then multiplying it by 0.1,xncan be obtained(all values are multiplied by 0.1 to normalize the data,which makes the clustering results more accurate).We edit the distance using a similar function and subsequently obtain the shortest distance.The similarity method is utilized in Eq.(3).

whenaj=bi,the distance betweenajandbiis 0.Ifaj/=bj,we utilize the equation in the second line;i.e., we insert, delete, and replace them as needed, then compare the results.Thus, each change adds a distance unit betweenajandbi.

3.2 Clustering Process

The data series collected from the children with cerebral palsy were compared with the gait of normal children to calculate the approximate value.The data analysis process used to find the similarity values is of an unstructured type.During the data analysis,sequence cutting is not based on a single circle of spirals but is instead based on the length of time.Therefore, the loop process used is also based on the size of the indeterminate loop in the unstructured pattern.

The clustering process is as follows.First, the data sequence is separated from the database and then clustered by the clustering method.The input data is divided into two types, namely the cerebral palsy(x) data and isolated normal children(y) data.Once the data is entered, the system will determine that the data sequence is considered to be the minimum value.

The K-means method and fuzzy clustering method are combined,as in Eqs.(4-6).Here,xrepresents the x value of all nodesx=x1,...,xn,kdenotes the number of clusters,andnis the number of nodes(k≤n).Each μiis a constant variable.Moreover,wijis used to find the degree of difference, while μiis used to maximize the square of the deviation.The functions of these variables are similar; therefore, thewijcan be combined with the μivariable.

A common method used to select the initial class center can be described as follows.The clustering algorithm is used many times.Each time it is used, a group of different random initial cluster centers is selected, after which the optimal cluster results are selected from the cluster result set.The initial cluster center corresponding to the cluster results is considered to be the optimal initial cluster center.

In this paper,however,clustering centers are initialized based on the peak clustering method.The basic idea is to find the points with higher sample density and identify them as the clustering center by constructing a density function,then realize the clustering of samples by calculating the distance between the samples and the clustering center.The first step of the algorithm involves constructing a grid in data space.The grid density is closely related to the accuracy and speed of clustering:Specifically, the more finely the grid is divided, the larger the candidate set of the cluster center is, the more computation is needed, and the slower the clustering process.The second step is to construct the peak function, which is the core of the peak clustering algorithm.Through the peak function, the denser points in the data space can be calculated.In general, the Gauss density function is used as a peak function:

in which δ is a constant.When the data around the candidate centerviincreases,the peak height of the point is higher,while the probability of the point being selected as the cluster center is also higher.By calculating the peak function,the distribution map of the peak function can be obtained.The cluster center will be selected in the third step.Firstly,the point with the largest peak function is selected as the first cluster center from the candidate set of cluster centers, after which the next cluster center is found by successively removing the points with the largest peak function and the points around these points.Moreover, it can be seen from the process of peak clustering that the clustering centers of the peak clustering results are distributed in the areas with the highest sample density, while the number of clusters does not need to be specified in advance.

4 Mixed Clustering for Analyzing Abnormal Gait in Children with Cerebral Palsy

4.1 Gait Cycle Division

Reasonable gait division is especially important to gait analysis.Regarding gait division, when considered broadly, the whole gait cycle can be divided into two parts—namely, the support phase and the swing phase—the support of the research side.From a more fine-grained perspective, reference can be made to Wang Jinsong and the California RLA gait division method, which performs gait cycle division using gait joint position and contact time.In conjunction with the need for a phased analysis of walking exercises,reference is also made to the RLA partitioning method.

The function of the supporting phase is to maintain posture stability and generate forward propulsion.This phase includes four stages:

1.Initial Contact:The heel or other parts of the foot contact the ground on the opposite side to the toe that is off the ground;

2.Mid Stance:The opposite side of the toe is adjacent to the foot(the trunk is directly above the support leg);

3.Terminal Stance:The first landing of the foot on the opposite side of the heel;

4.Pre-Swing:The opposite side of the heel is first landed to the support leg toe off the ground.

The function of the oscillating phase is to allow the body to continue to move forward and prepare for repositioning the limb.This phase includes three stages:

1.Initial Swing:Support the leg toe off the ground to the adjacent position of the opposite foot;

2.Mid Swing:The adjacent position of the contralateral foot to the vertical surface of the tibia;

3.Terminal Swing:The vertical ground position of the tibia reaches the foot again to touch the ground.

4.2 Semantic Similarity Measurement

The clustering process has been described in detail in the previous section.As the collected gait data is discrete, the similarity measurement method of discrete variables is adopted.In this paper, the similarity measure method of the binary variable is used.All binary variables among samples take the total number of different numerical values;the corresponding similarity measurement formula is as follows:

Here,wis the matching coefficient,indicating the degree of difference when the variable takes different values.Generally speaking, the value of the parameter is either 1, 2 or 1/2.In particular, whenwis 1, the distance is called the Hamming Distance.

4.3 Differences in Gait between Children with Cerebral Palsy and Normal Children

The abnormality of gait in children with cerebral palsy is most intuitively reflected in the gait movement parameters.Based on the analysis of pace, step width, gait cycle, and other indicators, the typical gait of children with cerebral palsy can be obtained.Due to the dyskinesia of the affected side, the patient will actively use bias or contralateral compensation to compensate for or eliminate the damage while walking.Although these measures can allow the patient to walk independently, the wrong gait characteristics will lead to further deterioration of muscle activity, and ultimately, loss of walking ability.

In the gait data,indicators such as pace,step size,gait cycle and support phase ratio are used to analyze the main manifestations of abnormal gait in children with cerebral palsy,as well as to judge the asymmetry,while the body center of gravity displacement reflects the performance of the upper body swaying state and the lap gait in the gait walking.

The comparison of gait parameters can provide the most intuitive understanding of abnormal gait performance.Tab.1 compares the abnormal and normal gaits of children with and without cerebral palsy after analysis and comparison by the previous method.

From the comparison of abnormal and normal gait data,it is clear that the differences between the step size,stride length,stride speed,support time,and step width of the affected leg and normal standard value are significant.It can be seen from this data that the step size and pace of the gait in children with cerebral palsy are much smaller than is the case for normal gait:the step size is only 1/5th of the normal gait,the pace is less than 1/10th of the normal gait, and the gait cycle time is greater overall than the normal gait.Moreover,combined with the support phase and the swing comparison case, the total support phase ratio of the affected limb is greater than the normal gait, the double support phase is longer, and the single support is significantly smaller than is the case for normal gait.

Table 1:Gait comparison of children with cerebral palsy and normal children

The contrast of the step width reflects the large lateral displacement of the gait in children with cerebral palsy.The step width of the children with cerebral palsy is about 1.5 times that of the normal gait,which is a significant outer lap gait.The cerebral palsy patient uses an enlarged step width to increase the support area of the body in order to stabilize the trunk,while the difference in the bearing capacity of the two limbs and the shift of the center of gravity combine to make the patient's trunk sway when the support side changes.Combined with the step size, affected by the movement of the joint during walking, the pace is related to the balance stability of the lower limbs and muscle strength; when compared with normal gait data, these can reflect the problem of limited joint angle and related muscle weakness.On the whole, the abnormal gait of the children with cerebral palsy was characterized by smaller step speed and length, as well as a longer step width and gait cycle,which reduced the length of limb support.

4.4 Center of Gravity Difference

Through three-dimensional kinematic analysis of children with cerebral palsy, along with quantitative analysis of the displacement motion characteristics of the center of gravity of the hemiplegic gait and normal gait in the vertical and lateral directions, the figure below presents a comparison of the eccentric gait obtained from the analysis method outlined in the previous section and the normal gait center shift displacement curve in Fig.1.

In the walking exercise,the center of gravity of the human body is kept in front of the humerus in the pelvis for a long time,while the change of the center of gravity in the anteroposterior direction can reflect the motion velocity characteristics of the subject.Because the pace of children with cerebral palsy is also much smaller than that of normal children, the contrast between the anteroposterior direction and the pace is the same, while the change in the center of gravity in the vertical and lateral directions reflects the overall characteristics of the gait movement.Usually, the center of gravity shift for a normal walking child involves a vertical displacement of 3-4 cm; by contrast, the center of gravity shift for a child with cerebral palsy is only 1-2 cm,and the overall movement of the patient in the vertical direction is small.

Moreover, the lateral displacement of the center of gravity of the child with cerebral palsy is about 11 cm, more than twice the normal range.The child presents a swinging motion of the upper body.The lateral movement determines the degree of the center of gravity shift.The child increases the support area by increasing the step width to in turn increase the stability of the support phase,while the lateral displacement of the center of gravity increases the exercise energy required.

5 Conclusion

This paper proposes a simple and effective clustering method, which combines K-means and hierarchical clustering and thus contains the advantages of these two methods.Moreover, the proposed method is utilized to find the differences between the abnormal gait of children with cerebral palsy and the gait of normal children.Similar to the K-means method, the abnormal trend of the abnormal gait can be found by using the proposed mixed clustering method to analyze the abnormal direction of the movement.Accordingly, using an approach similar to hierarchical clustering, we can determine the phase difference of the child’s movement.

Funding Statement:J.Li gratefully acknowledge the financial support by the NSFC (61772280),http://www.nsfc.gov.cn.

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.

Computers Materials&Continua

2021年2期