APP下载

Tactical intention recognition of aerial target based on XGBoost decision tree

2018-05-09WANGLeiLIShizhong

WANG Lei, LI Shi-zhong

(College of Mechatronic Engineering, North University of China, Taiyuan 030051, China)

0 Introduction

In modern war, predicting enemy target intention accurately, assessing the current situation in time and providing a reliable basis to take countermeasures at the next moment, play an important role in control of the war situation. In the past, conflict analysis, markov analysis[1], Bayesian network technology[2]etc were presented. In 2017, Cao S Y proposes an improved method of target intent recognition for high-dimensional data similarity[3].

The process of target intention recognition mainly includes the following steps: ① collecting data from the sensors, such as altitude, speed, distance etc; ② processing the data and extracting the feature; ③ analyzing the target intent according to the recognition mechanism; ④ obtaining the results.

In the military environment, the target intention recognition is based on the relevant data, such as weapon equipment, operational knowledge manual, military expert’s experience, map and meteorological data etc. However, it is difficult to obtain and process these data to meet the requirements. In order to construct an intention recognition model successfully, it is necessary to go on a great deal of theoretical researches and engineering practices.

The difficulty of target tactical intention recognition is to identify intention precisely in the situation based on misinformation[4]. In order to solve this problem, a recognition method based on XGBoost (eXtreme Gradient Boosting) algorithm[5]is presented in this paper. As a parallel boosting decision tree, XGBoost has the advantages of fast running speed, good effect and processing large scale data. XGBoost decision tree is compared with method in the Ref.[3]. And the result shows that the accuracy of target true intention is greatly improved by using the XGBoost method.

1 Fundamental of XGBoost decision tree

XGBoost is an implementation of the Gradient Boosting machine algorithm, and it can do parallel computation, which makes the recognition real-time. The traditional gradient boosting decison tree (GBDT) algorithm only uses first order derivative information. When training then-th tree,it needs to use the remnants of formern-1 trees. Therefore, it is difficult to achieve distributed computation. In this paper, square loss function is adopted, and the regular term is added to the loss function to obtain the optimal solution, which is useful for balancing the loss function, decreasing the complexity of the mode and avoiding overfitting.

1.1 Defining objective function of XGBoost

The objective function is defined as

Ω(ft)+const,

(1)

where

(2)

where const is constant;γis number of leaf nodes;wjis weight of nodes;λis parameter of the L2 regularization term that controls the weight of the model complexity.

F(t)=

Ω(ft)+const,

(3)

1.2 Defining complexity of decision tree

Definingft(x)=ωq(x),ω∈RT,q∶Rd→{1,2,…,T}, substituting Eq.(2) into Eq.(3) and removing the constant item,the following result is obtained as

(4)

whereωis the leaf weight of tree;qis the structure of tree. The instance set in leafjis defined asIj={i|q(xi)=j} andIj∈I.

(5)

Since the derivative ofF(t)with respect toωjequals to 0, the general result is

(6)

Substituting Eq.(6) into Eq.(5),Fis obtained as

(7)

The smaller the value ofFis, the better the structure of the tree is.

1.3 Building XGBoost decision tree

For every leaf node, the greedy method is used to add a segmentation to split the leaf node into two child nodes from the depth 0 of the decision tree to depth n. After adding the segmentation point, the function is considered as

(8)

There are 6 kinds of features including altitude, speed, distance, relative heading angle and the situation of the search radar and attack radar. In this paper, the target intention is classified as attack, penetration, scout and retreat.

For each feature, the nodes are sorted according to the features values. Then each leaf node is split. For every split step, all the segmentation schemes are traversed to find the optimal segmentation point. The pseudo-code is shown as follows:

Input:I, instance set of current node

Input:m, feature dimension

Gain←0

fork=1 tomdo

GL←0,HL←0

forjin sorted (I, byxjk) do

GL←GL+gj,HL←HL+hj

GR←G-GL,HR←H-HL

end

end

Output: Split with max score

The tree is considered as a function which maps the features to intent. Then the probability of intent is calculated by using logistic regression method.

1.4 Data standardization

Due to the inconformity of the data’s units, it is necessary to standardize the data to 0-1. The method adopted in this paper of Min-Max is

(9)

where maxAis the maximum value, while minAis the minimum value.

1.5 Dempster-Shafer combination rule

It is possible to provide evidences for multiple sources for a given frame of discernment. It is necessary that all sources should be independent. To combine belief functions with Dempster-Shafer, the basic probability assignment is used. The combination rule’s numerator is as

(10)

The whole process is shown in Fig.1.

Fig.1 Whole process of intention recognition

2 Experiment and results

Part of the data is assumed in Table 1, and the data is collected at different time by the sensor.

In the experiment, the objective intention space set by expert and its standard values[7]including height, speed, distance, relative heading angle, situation of the search radar and the attack radar are adopted to classify data of the Table 1 into four kinds of intentions including attack, penetration, retreat and scout.

Table 1 Data collected by sensor

Since the standard value has a continuous or discrete range, each row in Table 1 is classified according to its range respectively. The classification result is shown in Table 2.

Table 2 Classification results

The decision tree is constructed by the method introduced in section 1.3 and maps the example data to target intent. Then the probability of enemy target intent is calculated by logistic regression. Based on the python package provided by XGBoost, this paper uses the interface of XGBClassifier to build the XGBoost decision tree. And the whole process is programmed with python. The first column is considered as label data, and other columns as train data. Before the data of Table 2 used as the input data, the first column data needs to be quantified. Therefore, intentions of attack, penetration, scout, retreat are encoded as 0, 1, 2 and 3, respectively. And other column data is standardized according to section 1.4. Part of the codes are listed as follows. Num_class represents classified categories, while max_depth is the maximum depth of tree, and other parameters are default.

#load the train data, label data and target data;

train=table 2's data except for the first column

label=the first column data of table 2

target=example data in Ref.[7]

#use interface XGBClassifier to build the XGBoost decision tree;

XGBoost=XGBClassifier(

num_class=4,

max_depth=6,

min_child_weight=1,

gamma=0,

objective=′multi:softprob′)

#train the XGBoost decision tree;

XGBoost.fit(train, label)

#predict the probability of target intent;

yprob=XGBoost.predict(target).reshape(target.shape[0], 4)

The calculated results (yprob) are shown in Table 3. Andt0,t1,t2,t3represent arbitrary time series, as in Ref.[7].

Table 3 Probability distributions

According to Dempster-Shafer rule of combination, the intent distribution probability fromt0tot3in Table 3 is sequentially synthesized firstly. Then the sequential intention result is obtained in Table 4. And the maximal probability represents the real target intent at current time. According to the comparison with Table 3 in Ref.[3], it can be found that the true intent recognized by the presented method is the same as method in Ref.[3] fromt0tot3.

Table 4 Target sequence intention probability

To test the performance of XGBoost, the comparisons with method in Ref.[3] are shown in Figs.2 and 3. Fig.2 presents the target 01’s intent probability, and intent probabilities of target 02 and 03 are shown in Fig.3. The full line and dotted line represent XGBoost and Ref.[3]’s method, respectively.

It can be observed that the trend of intent is consistent. Fig.2 shows that target 01’s scout intent trend rises from timet0tot3, while others decrease. The probability of scout is 0.823 5 att3, and it is higher than 0.525 1 of method inRef.[3]. As to Fig.3, it is obvious that the probabilities of penetration increases fromt0, while the probabilities of other three intentions decrease. However, the probability of attack intention increases and reaches 0.577 4 fromt1tot3, which is higher than 0.384 6 of method in Ref.[3].

Fig.2 Intention probability of target 01

Fig.3 Intention probabilities of targets 02 and 03

For further comparison, Tables 5 and 6 are listed. The second columns of the Tables 5 and 6 represent the target’s real intention probabilities calculated by XGBoost, and the third column is the Ref.[3]’s method. It is obvious to know that the accuracies have increased 29.48% and 19.28% att3, respectively.

Table5Comparisonoftarget01’srealintentionprobability

TimeXGBoostRef.[3]’smethodImprovedaccuracy(%)t00.31520.30071.45t0-t10.42540.34088.46t0-t1-t20.65020.428222.20t0-t1-t2-t30.82350.525129.48

Table6Comparisonofrealintentionprobabilitiesoftargets02and03

TimeXGBoostRef.[3]’smethodImprovedaccuracy(%)t00.31520.30071.45t0-t10.50050.354514.60t0-t1-t20.39600.34405.20t0-t1-t2-t30.57740.384619.28

3 Conclusion

In this paper, a method based on XGBoost to predict the target intent is presented. To test the performance of this method, comparisons with the method presented in Ref.[3] are carried out based on the same data in Ref.[7]. Since the target intent has been recognized in Ref.[7], the goals of this paper and Ref.[3] are to improve the accuracy under the condition of ensuring true recognition. And the experiment result shows that our method performs better than method in Ref.[3] in accuracy of recognizing intent, which implies that XGBoost method can provide valuable solution to evaluate the situation of modern war.

[1] Yuan Z J, Xu Z G, Deng S H. The model for predication of fighting intenting under sequential game. System Engineering Theory and Practice, 1997, 7(7): 72-78.

[2] Li W S, Wang S B. Situation assessment based on bayesian networks. System Engineering Theory and Practice, 2003, 25(4): 480-483.

[3] Cao S Y, Liu Y A, Xue S. Target tactical intention recognition method of improved high-dimensional data similarity. Transducer and Microsystem Technologies, 2017, 36(5): 25-28.

[4] Yao Q K, Liu S J, He X Y. Research and prospect of battlefield target operational intention recognition. Journal of Command and Control, 2017, 3(2): 127-131.

[5] Chen T, Guestrin C. Xgboost: A scable tree boosting system. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

[6] Yang F B, Wang X X. D-S evidence theory conflict evidence synthesis method. Beijing: National Defense Industry Press, 2010.

[7] Sun Y L, Bao L. Study on recognition technique of targets’ tactical intentions in sea battlefield based on D-S evidence theory. Ship Electronic Engineering, 2012, 32(5): 48-51.