APP下载

An Optimal Scheme for WSN Based on Compressed Sensing

2022-08-24FirasIbrahimAlZobiAhmadAliAlZubiKulakovYuriiAbdullahAlharbiJazemMutaredAlanaziandSamiSmadi

Computers Materials&Continua 2022年7期

Firas Ibrahim AlZobi, Ahmad Ali AlZubi, Kulakov Yurii, Abdullah Alharbi,Jazem Mutared Alanaziand Sami Smadi

1Department of Information Systems and Networks, Faculty of Information Technology, The World Islamic Sciences &Education University, Jordan

2Computer Science Department, Community College, King Saud University, Saudi Arabia

3Department of Computer Engineering, National Technical University of Ukraine“Igor Sikorsky Kyiv Polytechnic Institute, Ukraine

Abstract: Wireless sensor networks (WSNs) is one of the renowned ad hoc network technology that has vast varieties of applications such as in computer networks, bio-medical engineering, agriculture, industry and many more.It has been used in the internet-of-things (IoTs) applications.A method for data collecting utilizing hybrid compressive sensing (CS) is developed in order to reduce the quantity of data transmission in the clustered sensor network and balance the network load.Candidate cluster head nodes are chosen first from each temporary cluster that is closest to the cluster centroid of the nodes, and then the cluster heads are selected in order based on the distance between the determined cluster head node and the undetermined candidate cluster head node.Then, each ordinary node joins the cluster that is nearest to it.The greedy CS is used to compress data transmission for nodes whose data transmission volume is greater than the threshold in a data transmission tree with the Sink node as the root node and linking all cluster head nodes.The simulation results demonstrate that when the compression ratio is set to ten,the data transfer volume is reduced by a factor of ten.When compared to clustering and SPT without CS, it is reduced by 75% and 65%, respectively.When compared to SPT with Hybrid CS and Clustering with hybrid CS, it is reduced by 35% and 20%, respectively.Clustering and SPT without CS are compared in terms of node data transfer volume standard deviation.SPT with Hybrid CS and clustering with Hybrid CS were both reduced by 62% and 80%, respectively.When compared to SPT with hybrid CS and clustering with hybrid CS, the latter two were reduced by 41% and 19%, respectively.

Keywords: Compressed sensing; computer networks; sensor networks; ad hoc networks

1 Introduction

A wireless sensor network (WSN) is usually composed of several wireless sensor nodes deployed in the monitoring area.Each sensor node perceives environmental information, processes it and transmits it to the sink node in a wireless multi-hop manner [1].Wireless sensor nodes are usually deployed in unattended field areas or complex industrial control sites, so it is extremely inconvenient to replace the battery.At the same time, the computing, storage and energy resources of the wireless sensor node are extremely limited [2,3].Therefore, the monitoring data collected by the sensor node is compressed before proceeding and transmission can effectively extend the survival period of wireless sensor networks.At present, data compression and reconstruction techniques in wireless sensor networks have become one of the core issues in the field of research [4,5].

Wireless sensor network is generally composed of a large number of energy-constrained sensor nodes and several base stations [6].The WSN nodes can be deployed to industrial automation products, traffic control, electronic warfare, traffic automation, electronic health and network control for scene monitoring [2].However, with the continuous development of WSN, the bandwidth and data volume required for signal acquisition have increased geometrically, so it is necessary to establish a new mechanism to reduce energy consumption, cost, delay, communication volume and the amount of information bits tested.When a large-scale WSN is designed and deployed to cover a geographical area, the data observations of neighboring nodes have temporal and spatial correlation.The clustered network structure can effectively eliminate data redundancy, reduce data communication volume and communication distance, and has strong scalability, Load balancing, strong robustness, etc.In WSN,the communication between nodes is the main reason for node energy loss [7,8].WSNhas the situation that some nodes die prematurely due to the unbalanced data flow, which guarantees transmission under the premise of data quality, how to reduce the energy consumption of data transmission and balance the load of network nodes is of great significance to extend the life cycle of the network [9].

In recent years, the development of compressive sensing (CS) [10-14] technology has brought revolutionary breakthroughs to wireless sensor network data collection technology [15].CS can be much lower than Nyquist sampling The collection of data information is completed under the condition of speed, thereby reducing the amount of network data transmission, reducing network energy consumption, and prolonging the life cycle of the network [16].Assume that the network is composed of 1 Sink node andNsensing nodes.Let x be 1 withNvector elements, collection of data representing a network, each data element corresponding to a sensor node collected.The x thinning process, x =ψs, where ψ is anN×Ntransform group, s is the coefficient vector.If the coefficient vector s only containsknon-zero elements(k≪N),then x is said to bekin the ψ field-sparse whenkis small,in accordance with compressive sensing, only the transmission signal vector x ofMobservations composed of the vector y object to sink node can be transmitting information, y =φx,φ is aM×Nrandom matrix of(M≪N).After the Sink node collects the measured value y, the original data signal x can be reconstructed by solving anl1-norm optimization problem or a heuristic algorithm such as OMP [17-19].

In the tree-based data collection method, leaf nodes far away from the sink node need to transmit fewer data packets, and nodes closer to the sink node need to transmit more data packets.However,after using CS to process the node data, each node needs to transmitMdata packets, that is to say,M×Ndata packets need to be transmitted in a network ofNnodes, which is still a very large value.Literature [20-22] proposed hybrid compressed sensing method, in which the leaf node far away from the sink node transmits the original data, and the node closer to the sink node uses compressed sensing technology for data compression.The above methods use CS in the routing tree,and the clustered structure is compared with the tree structure in terms of robustness and network load balancing and other advantages [23-25].The robustness is stronger because even if the nodes in the cluster die unexpectedly due to physical factors, it has little impact on the network topology.Balanced load is due to clustering number of nodes in the network can be balanced, which can alleviate the bottleneck effect in the tree network.However, the above methods ignore the geographical location of the network and the distribution of nodes.Research shows that in sensor networks, the collection algorithm is designed according to the distribution of nodes which can reduce the amount of data transmission [26].Reference [27] proposes a method for data collection using hybrid CS technology in a clustered sensor network, where CS is not used for data transfer in clusters, and CS is used between clusters.The amount of data transmission is effectively reduced by minimizing the transmission consumption in the cluster.However, this method does not consider the huge load of nodes close to the cluster head and nodes with large connectivity in a large-scale network.The network load is unbalanced and ignored and the impact of the number of inter-cluster transmission hops on the data transmission volume is negligible.Reference [28] proposed a clustered data fusion algorithm based on non-uniform division, which divides the network into non-uniform grids.The farther away from the base station, the larger the grid.A node with the largest remaining energy is selected as the cluster head, and the node selects the cluster head according to the signal attenuation strength of the broadcast message of the cluster head.Simulation experiment results show that the algorithm can save energy and balance energy consumption, and significantly extend the life cycle of the network.Reference [29] proposed a hierarchical data fusion algorithm that combines wavelet support vector machine (SVM) and evidence theory.The algorithm uses wavelet transform to perform data-level denoising processing on the original signal, and establishes a SVM multi-classifier model, use feature parameters as input vectors to perform feature-level fusion of leak detection at ordinary nodes.Using improved evidence combination rules, the decision-level evidence combination is performed at the convergence node to obtain the final decision of the pipeline network state.Simulation results show that this method can effectively improve the accuracy of leak source location detection and reduce detection errors.Reference [30] studied the selection of the modulation method of wireless signal transmission in order to improve the efficiency of wireless network data transmission and reduce energy consumption, and proposed a low-power encoding method at the physical layer.Reference[31] proposed a low-energy signal sampling method in wireless networks to increase the lifetime of sampling nodes.Reference [32] uses Fourier transform, discrete cosine transform and wavelet transform to establish the signal sparse base according to the characteristics of the time sequence or space sequence of the wireless sensor network, generates the sparse representation data of the signal,and then samples the sparse data.This can greatly reduce the sampling time and space consumption.These methods use the spatial correlation of the detected data to compress and encode the data, but they cannot effectively deal with the abnormal event data, and the computational complexity is high.The compressed sensing theory proposed in recent years provides a new data acquisition method for wireless sensor networks [33-35].According to the theory of compressed sensing, a sparse signal can be accurately reconstructed with a few sampling times, and the sampling can be done by linear projection of the detection data.In this way, sensor nodes can complete data collection in a compressed manner without additional computational overhead.For the wireless sensor network, although it has the characteristics of convenient construction, strong adaptability, and high transmission efficiency,there are some limitations in some aspects, such as energy supply, sensor life cycle, delay, bandwidth,signal distortion, and transmission cost [36-43].

In a clustered network, because the cluster head nodes need to converge and forward a large amount of data, reducing the number of topology hops and the amount of data used for inter-cluster transmission has an important impact on extending the life cycle of the network.

The main contributions of this paper are as follows:

•This paper considers the determined cluster head distance between the node and the undetermined candidate cluster head node, and the candidate cluster head node with the shortest distance from the parent cluster head node is selected in turn as the child cluster head node,thereby optimizing the cluster head selection mechanism in the network and sacrificing the transmission volume in a small number of clusters premise of quickly reducing the amount of transmission between the clusters.

•Greedily construct a data transmission tree with the sink node as the root node and connecting all cluster head nodes, and transfer the data to the sink node.

•The data transmission amount in the cluster is higher than the threshold nodes and used for inter-cluster transmission use CS compressed data transmission to achieve the purpose of balancing network load and reducing data transmission.

Simulation experiments show that it is combined with a variety of tree-based data collection algorithms and clustered hybrid CS data collection algorithms, the proposed algorithm can effectively reduce the amount of network data transmission.Compared with traditional tree-based data collection algorithms, reference [27] and clustered networks that do not use CS technology for data collection can effectively balance the network load.

2 Proposed Model and Analysis

2.1 Model Establishment and Examples

Assumptions: 1) All nodes in the network are independently and uniformly distributed in the sensing area; 2) All nodes in the network have the same initial energy and transmission rate; 3) All nodes in the network can pass GPS or other positioning technologies [28,29] to perceive the location information.

First, the cluster head node is obtained through the cluster head election algorithm, and then each ordinary node chooses to join the cluster head node closest to its position, and divides the network into several clusters.Each ordinary node passes the data to the cluster head node through the shortest path algorithm to obtain the cluster internal data collection model.Each cluster head node compresses the data within the cluster, and then transmits the data to the sink node through the routing tree constructed by greedy selection to obtain the inter-cluster data collection model.Fig.1 is an example of clustered data collection based on hybrid CS method.The cluster head node performs compressed sensing sampling on the data in the cluster according to the measurement matrix φijgenerated by the pseudo-random number generator of each nodevjin the cluster.Thei-th cluster head nodeCHiin the network passes the generated sub-matrix φHimeasures the original data xHicollected in the cluster to obtain the observation value φHiof xHi.The cluster head nodeCHiobtainsMobservation value signals after compression measurement, andMis determined by the node numberNand the sparsity of the original data are determined.The data transmission tree constructed by each cluster head node through greedy selection transmits data to the sink node.

Taking Fig.1 as an example, suppose the network is divided into five clusters.The cluster head nodesCH1,CH2,CH3,CH4,CH5are used to transmit data through a data transmission tree constructed by greedy selection with the Sink node as the root node.The original data x is constructed from the data collected in five clusters, and the original signal matrix x = [xH1xH2xH3xH4xH5]Tis constructed, and the measurement matrix φ= [φH1φH2φH3φH4φH5]Tis generated by pseudo-random numbers.

The measured valueyreceived by the sink node is the sum of the data of all cluster head nodes sampled by compressed sensing, as shown in Eq.(1).In each transmission, the cluster head node fuses its own data with the data of the sub-cluster head node, and then pass the observations to the Sink node through the greedy routing tree.The Sink node can reconstruct the original signal x based on the received observationyand the measurement matrix φ.

Figure 1: Illustration of proposed node distribution with CS deployment

2.2 Theoretical Analysis and Proof

The clustered hybrid CS data collection method in this paper includes two aspects of data transmission: the nodes in the cluster pass the data to the cluster head node through the shortest path algorithm, and the data is passed to the sink node through the greedy routing transmission tree between the clusters.Nodes whose data transmission volume is higher than the threshold and nodes used for inter-cluster transmission use CS to compress data transmission.The communication volume of any node in the network is the sum of its own communication volume and the communication volume of all its child nodes.The node’s data transmission jumps the more the number, the data of this node will be transmitted multiple times in the parent nodes at all levels, increasing the communication volume of the network.Therefore, in order to save the energy of the nodes in the network, the data collection method designed needs to reduce the transmission in the network as much as possible, that is, reduce the total number of hops of data packet transmission as much as possible.

2.2.1 The Number of Clusters of the Network

In the clustered hybrid CS data collection method, the number of divided clusters in the network has an important impact on the network transmission volume.When the number of clusters in the network is too small, the transmission volume within the cluster will increase sharply.When the number of clusters in the network is too large for a long time, the number of cluster head nodes will increase, and the amount of transmission between clusters is bound to increase.Therefore, there is an optimal number of clusters in the clustered hybrid CS data collection method to minimize the network transmission volume.The optimal number of clusters in the network is expressed as:

Among them,N*cis the optimal number of nodes in each cluster,λ is the node distribution density,Sis the number of small grids in the side length of the cluster after the network clustering is completed,andais the node distribution divided by the node communication radius as a parameter side length of the grid area,Mis the number of observations required to reconstruct the signal, that is to say, the optimal number of clusters in a network ofNnodes is.

2.2.2 Intra-Cluster Transmission of the Network

As shown in Fig.2, the network is divided intoa×asmall grid area, the small grid is inscribed in a circle with the transmission radius of the node as a parameter.Among thema=,ris the transmission radius of the node, so that the same grid nodes in the area can communicate with each other,Sis the number of small grids in the side length of the cluster after the network clustering is completed.

Figure 2: Intra-cluster transmission

The cluster head node in the network is located in the center of the network and close to the sink node, so that the average number of hops for the nodes in the cluster to transmit data to the cluster head node is the smallest.Nodes in the same grid as the cluster head node need to pass 1 hop data is transmitted to the cluster head node.The node of thehlayer needs to passhhops at most to pass the data to the cluster head node, and at leasth- 1 hops to pass the data to the cluster head node.The number of grids in thehlayer is 8(h- 1),h≥2.Assuming that the distribution density of nodes in the network is λ, the number of nodes in a single grid is λa2.The number of data transmission hops for a single cluster node to transfer data to the cluster head node is

due to

Therefore, the number of hops for a sensor node in a single cluster to transmit data to the cluster head node is

Therefore, the upper bound of the total number of hops in the network’s cluster communication transmission is

In the same way, it can be seen that the lower bound of the total number of hops in the network’s cluster communication transmission is

2.2.3 Inter-Cluster Transmission of the Network

As shown in Fig.3, the network is divided into a cluster structure ofSa×Saby the clustering algorithm.Assuming that the cluster head node is located at the edge of the grid area where the cluster centroid is located, the distribution density of nodes in the network is λ, then each cluster has λS2A2nodes, the network is divided intoN/λS2A2clusters.

Figure 3: Inter-cluster transmission

When the cluster head node for sensing the data is sampled, the data is transmitted to the Sink node is greedy routing tree.Each cluster head node after transmitting the compressed data to their parent measured from the nearest cluster head node Sink nodes subject tojump transfer to the Sink node.Therefore, the number of hops between the clusters of the network is

The theoretical value of the total number of transmission hops in the network is the sum of the number of transmission hops within a cluster and the number of transmission hops between clusters.The upper bound of the total number of network transmission hops is

The lower bound of the total number of hops in network transmission is

3 Algorithm Design

For a given network,G= (V,E), wherein,Vis the Sink node andNset of sensor nodes.When the distance ofVnodeiandjis within the transmission radius of the node, then edgesei,jbetween two nodes, andEis the set of all edges.The main steps of the proposed algorithm is mentioned in Algorithm 1.

Algorithm 1: CH election 1: According to the obtained optimal value c of the number of network clusters, randomly select c nodes in the network as temporary cluster heads 2: Each ordinary node in the network chooses to join the temporary cluster head closest to itself, and calculates the centroid position of each temporary cluster in the network from the node position in the temporary cluster, (xCH1 cg,yCH1cg), (xCH2cg,yCH2cg),...,(xCHccg,yCHccg), where xCHicg =ni∑k=1 xk/niis the current temporary cluster average value of the horizontal coordinates of the node, yCHi cg =ni∑k=1 yk/ni,is the average value of the vertical coordinates in the current temporary cluster.The node with the shortest distance to the centroid of each temporary cluster is selected as the new temporary cluster head 3: Repeat step 2 until the network topology no longer changes, and get c centroid positions 4: For each temporary cluster, select tcandidatenodes closest to the centroid position as candidate cluster heads 5: Calculate the distance between the candidate cluster head and the sink node in each temporary cluster, select the candidate cluster head node closest to the sink node in the temporary cluster as the cluster head, and determine the first cluster 6: Calculate the distance between the candidate cluster head node in the remaining temporary clusters and the previous cluster head node, and select the candidate cluster head node closest to the previous cluster head node in the remaining temporary clusters as the cluster head 7: Repeat step 6 to get c cluster head nodes.

After the network selectsccluster heads by, other nodes choose to join the cluster head node closest to itself, and the network is divided intocclusters.The ordinary nodes in the cluster only need to pass the data to the cluster head node roughly located in the center of the cluster and the cluster head node needs to pass the collected data to the sink node through multiple hops.The selectedccluster head nodes are all close to the cluster centroid.Algorithm 1 does not increase the intra-cluster transmission as much as possible, so that the distance between the adjacent cluster head nodes becomes smaller and the cluster head node is as close to the sink node as possible, which can effectively reduce the number of data transmission hops in the network.

Greedy routing tree construction algorithm: WithUrepresents a set consisting of all cluster head node Sink nodes obtained by the algorithm 1,Uinrepresents a cluster-head node has been added to the set of routing tree, theU-Uinis not added to the tree in the cluster-head node.

Algorithm 2: Greedy routing tree construction 1: Initialization: Uin= VSink 2: Calculate the distance between each node in the set U - Uinand each node in the set Uin 3: Select the node with the shortest distance to add to the routing tree 4: Update the set Uin 5: Repeat steps 2 and 3 until the Uin= U

Algorithm 2 constructs a data transmission tree with the Sink node as the root node through greedy selection.Each sub-cluster head node only needs to pass the data to the nearest parent cluster head node, and the number of transmission hops between any two cluster heads.Both are the smallest,which ensures that the Sink node can reconstruct the original signal x according to the measurement matrix φ after receiving the observation y.

4 Simulation Results

4.1 Simulation Configuration

This paper uses MATLAB simulation tool to simulate and compare the proposed algorithm with other data collection schemes.Clustering without CS has the same clustering structure as the algorithm in this paper but does not use CS for data compression.Short path sink node in tree (SPT) without CS collects the data of the sensing node through the shortest path tree constructed.The sink node in SPT with Hybrid CS collects the data of the sensing node through the shortest path tree constructed, and the number of child nodes in the tree (including itself) is greater than the observed value threshold.The node uses CS to compress the data.The optimal tree with hybrid CS proposed in [22] uses hybrid CS to construct the minimum spanning tree and the shortest path tree for the nodes, and then greedily choose to join the data transmission nodes of the tree construct a transmission tree with the least energy consumption.Clustering with Hybrid CS is a clustered hybrid CS data collection scheme proposed in[27].The nodes in the cluster transmit data to the cluster head node through the shortest path [31].The cluster head node after CS sampling the data, the observation value is passed to the sink node by constructing a backbone tree connecting all cluster head nodes and sink nodes [31-35].

Network parameter setting: the network area is set to 20×10, the coordinate of the sink node is(0, 0), the number of nodes in the networkNis 400~1200, the node density λ of the network is 2~6,and the transmission radius of the node isrsettingmeans, the candidate cluster head nodetcandidatesettoN/40,ρ=N/Mis the compression ratio and it values were set to 5 and 10, which ensure adequate observations can accurately reconstruct the original signal .The simulation results are the average of 20 random topology networks.

Four indicators are measured in the simulation:

1) The total amount of data transmitted.The amount of data received by the sink node in one collection cycle.

2) Transmission volume reduction ratio.The transmission volume reduction ratio of the algorithm in this paper compared to other algorithms.The transmission volume reduction ratio compared to clustering without CS is the difference in the data transmission volume in the same cycle between clustering without CS and the algorithm in this paper divided by clustering transmission volume of without CS in 1 cycle.

3) In-cluster and inter-cluster load balancing evaluation index.The average deviation of the number of members in the cluster and the average deviation of the average distance from the cluster head to other member nodes.The inter-cluster load balancing evaluation index,the difference between adjacent cluster heads average distance and the average deviation of the distance between adjacent cluster heads.

4) The standard deviation of the transmission volume of each node in one collection cycle in the network and the average standard deviation of the transmission volume of the nodes in the cluster.

4.2 Simulation Results

Fig.4a is the simulation result of the data transmission volume of the algorithm in this paper and the other five data collection algorithms under the condition of a compression ratio of 5.Fig.4b is the comparison of the data transmission volume of each data collection algorithm when the compression ratio is 10.

Figure 4: Comparison of total data transmission with increasing number of nodes

As shown in Fig.4, the proposed technique considerably decreases the transmission volume when compared to Clustering without CS and SPT without CS.The suggested approach outperforms SPT with Hybrid CS in terms of data transfer volume.This is because in the clustering structure, the sensing node only needs to transmit the data to the cluster head node roughly located in the center of the cluster, and the sensing node in SPT with Hybrid CS transfers data to the parent node that is close to the sink node, which will greatly increase the number of transmission hops and increase the amount of data transmission in the network.The proposed algorithm is more effective than clustering with hybrid CS.The amount of data transmission is reduced because it determines the cluster heads in turn based on the distance from the determined cluster head node to the undetermined candidate cluster head node, and reduces the number of data transmission hops while increasing the amount of intra-cluster transmission as little as possible and reduce the amount of inter-cluster transmission.For nodes whose data transmission amount is higher than the threshold and nodes used for intercluster transmission,use CS to compress data transmission to reduce the amount of data transmission.The proposed algorithm and the optimal tree scheme proposed in [22] are compared.The network transmission volume of the hybrid CS is slightly reduced, but the network based on the clustering structure has better fault tolerance.In the tree-based network structure, when some nodes in the tree are exhausted and die, the network topology changes will occur, causing the transmission tree to no longer be efficient.

4.3 Transmission Reduction Ratio Analysis

Fig.5a shows the simulation results of the data transmission reduction ratio between the proposedand the other four data collection algorithms under the condition of a compression ratio of 5.Fig.5b shows the compression ratio between the proposed and the other four data collection algorithms.The simulation result of the data transmission volume reduction ratio under the condition of 10.It can be seen from Fig.5b that when the compression ratio is 10, the proposed algorithm reduces the transmission volume of clustering without CS by about 75%, and reduces the transmission volume by about 65% compared with SPT without CS.Compared with SPT with Hybrid CS, the transmission volume is reduced by about 35%.Compared with the Clustering with Hybrid CS proposed in [27], the transmission volume is reduced by about 20%.Obviously the reduction ratio does not decrease with the increase of the number of nodes, which shows that the proposed algorithm has better scalability.It can be seen from Fig.5a that when the compression ratio is 5, compared to the case of Clustering without CS and SPT without CS with a compression ratio of 10, the transmission volume reduction ratio is only reduced by about 5% to 10%.This shows that the proposed algorithm can effectively reduce the transmission volume of the network even in the case of a large number of sampled signals.The reduction ratio of transmission volume is reduced by 2% to 5% when compared to the case where the compression ratio of SPT with Hybrid CS and Clustering with Hybrid CS is 10.Because the suggested approach focuses on lowering the transmission volume across clusters, this is the case.As the compression ratio is reduced, the total network transmission volume is reduced as well, resulting in a lower transmission volume reduction ratio when compared to SPT with Hybrid CS and Clustering with Hybrid CS.

4.4 Load Balancing Analysis

This section compares and analyzes the network load balance between the proposed and other algorithms.Fig.6 shows the standard deviation of the transmission volume of each node between the proposed and other five data collection algorithms under the condition of a compression ratio of 10.

Clustering without CS and SPT without CS is not used to compress and measure nodes whose transmission volume exceeds the compressed sensing threshold, the transmission volume of nodes in the backbone tree is very large, while the transmission volume of leaf nodes is very small, so the transmission volume between nodes is low.The discrepancy is significant, indicating that the load is unbalanced.As a result, the cluster network outperforms the tree network in terms of load balancing.The tree architecture is adopted by SPT with Hybrid CS and Optimal Tree with Hybrid CS, resulting in leaf nodes and fusion node transmission volume varying substantially.The suggested algorithm is better balanced than Clustering with Hybrid CS.This is because nodes in the cluster with a data transmission volume greater than the threshold, as well as nodes used for inter-cluster transmission,use CS to compress data transmission.

Figure 5: Comparison of reduction ratio of the proposed and existing algorithms

Figure 6: Comparison of standard deviation of the nodes

The balance of intra-cluster communication consumption is one of the important factors that affect load balancing.Intra-cluster communication consumption is proportional to the number of member nodes in the cluster and the distance between the cluster head and its members, so the proposed algorithm is verified for load balancing within the cluster.The performancemainly considers the average deviation of the number of members in the cluster Intra-memdeviand the average deviation of the average distance from the cluster head to its member nodes.The specific results of Intra-disdeviare shown in Tab.1.

The Intra-memaverin Tab.1 represents the average number of member nodes in the cluster, and Intra-disaverrepresents the average distance from the cluster head to its member nodes.The two average variables mainly reflect the average load of communication consumption within the cluster, the smaller the average load, the smaller the network energy consumption.The deviation variable mainly reflects the degree of difference between different clusters, the smaller the deviation, the smaller the difference,and the more balanced the load.From Tab.1, the proposed algorithm is compared with reference [27]algorithm.The load balance index deviation and average difference within the cluster are slightly worse than reference [27].This is because the proposed algorithm mainly considers the application of CS to minimize the transmission between clusters.

Table 1: Performance analysis of proposed and existing algorithms

Tab.2 is the average standard deviation Std-intraaverof the transmission volume of the nodes in the cluster, reflecting the average difference of the transmission volume of the nodes in the cluster.The smaller the average standard deviation, the more balanced the load of the nodes in the cluster.

Table 2: Comparison of the standard deviation of the proposed and existing algorithms

It can be seen from Tab.2 that the average standard deviation of the transmission volume of the nodes in the cluster of the proposed algorithm is lower than that of reference [27] method under 400~1200 nodes, which proves that the proposed algorithm performs the calculation on the nodes whose data transmission volume in the cluster is higher than the threshold.Compressed measurement reduces the difference in the amount of transmission between the nodes in the cluster and balances the load in the cluster.

Communication consumption balance between clusters is another important factor to measure load balance.The average distance and average deviation between adjacent cluster heads are tested.The specific results are shown in Tab.3.The average distance between adjacent cluster heads Interdisaverreflects the communication consumption between cluster heads, while the average distance deviation between adjacent cluster heads Inter-disdevireflects the degree of difference in communication consumption.From Tab.3,wwe can see that, the proposed algorithm has lower value than reference [27]in average distance and average deviation.This is because the selection of cluster head nodes considers the distance between the determined cluster head and the undetermined candidate cluster head node,and the distance from the parent cluster head is selected in turn.The shortest candidate cluster head node is the sub-cluster head node.When the inter-cluster transmission tree is greedily constructed,it can effectively reduce the transmission distance between the clusters, reduce the communication consumption, and balance the load.

Table 3: Comparison of load balance of the proposed and existing algorithms

Fig.7 compared the network lifetime of the proposed and existing algorithms under increading number of CS measurements.As can be seen from Fig.7, the network lifetime of all the schemes is decreasing with increasing number of CS measurements.However, the network lifetime of the proposed algorithm is better than the existing algorithms.This makes the proposed algorithm superior over the existing schemes because it provides more robustness and long-time performance.

Figure 7:Comparison of the network lifetime of the proposed and existing algorithms under increasing number of CS measurements

5 Discussion

This paper takes into account the determined cluster head distance between the node and the undetermined candidate cluster head node, and the candidate cluster head node with the shortest distance from the parent cluster head node is chosen as the child cluster head node, thereby optimizing the cluster head selection mechanism in the network and sacrificing transmission volume in a small number of clusters on the premise of quickly reducing the amount of transmission between the node and the undetermined candidate cluster head node.Clustering with hybrid CS is less successful than the suggested technique.The amount of data transmission is reduced because the cluster heads are determined in turn based on the distance between the determined cluster head node and the undetermined candidate cluster head node, and the number of data transmission hops are reduced while intra-cluster transmission is increased as little as possible and inter-cluster transmission is reduced.Use CS to compress data transfer for nodes whose data transmission amount exceeds the threshold and nodes utilized for inter-cluster transmission to lower the amount of data transmission.The proposed algorithm has more effective results than existing algorithms which makes it useful for WSN data transmission.

6 Conclusion

In a clustered sensor network, this research provides a data gathering strategy based on hybrid CS.The cluster head selection mechanism in the network has been optimised in order to lower the quantity of data transmission in the network and balance the network load.1) Determine that the first cluster head is the candidate cluster closest to the sink head node.The candidate cluster head node with the lowest distance from the parent cluster head node is picked as the child cluster head node, and the network is partitioned into clusters, based on the distance between the determined cluster head node and the indeterminate candidate cluster head node; 2) Build a data transmission tree with the Sink node as the root node and connect all cluster head nodes as quickly as possible.; 3) The quantity of data transmission in the cluster is larger than the threshold, and the nodes utilized for inter-cluster transmission employ the CS to compress the data transmission to fulfil the goal of balancing network load and lowering data transmission.The suggested algorithm outperforms clustering without CS,SPT without CS, and SPT with Hybrid CS, according to the experimental data.Clustering using Hybrid CS can significantly cut data transmission and improve network load balancing performance.Other critical parameters such as energy efficiency and latency will be used in future study to evaluate the effectiveness of the proposed method.

Acknowledgement:The authors extend their appreciation to King Saud University for funding this work and would like to thank the editors and reviewers for their review and recommendations.

Funding Statement:This work was supported by the Researchers Supporting Project (No.RSP-2021/395), King Saud University, Riyadh, Saudi Arabia.

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.