Deep Spectrum Prediction in High Frequency Communication Based on Temporal-Spectral Residual Network

2018-09-06LingYuJinChenYumingZhangHuajiZhouJiachenSun

China Communications 2018年9期

Ling Yu, Jin Chen,＊, Yuming Zhang, Huaji Zhou, Jiachen Sun

1 College of Communications Engineering, Army Engineering University of PLA, Nanjing 210000, China

2 Science and Technology on Communication Information Security Control Laboratory, Jiaxing 314033, China

Abstract: High frequency (HF) communication is widely spread due to some merits like easy deployment and wide communication coverage. Spectrum prediction is a promising technique to facilitate the working frequency selection and enhance the function of automatic link establishment. Most of the existing spectrum prediction algorithms focus on predicting spectrum values in a slot-by-slot manner and therefore are lack of timeliness.Deep learning based spectrum prediction is developed in this paper by simultaneously predicting multi-slot ahead states of multiple spectrum points within a period of time. Specifically, we first employ supervised learning and construct samples depending on longterm and short-term HF spectrum data. Then,advanced residual units are introduced to build multiple residual network modules to respectively capture characteristics in these data with diverse time scales. Further, convolution neural network fuses the outputs of residual network modules above for temporal-spectral prediction, which is combined with residual network modules to construct the deep temporal-spectral residual network. Experiments have demonstrated that the approach proposed in this paper has a significant advantage over the benchmark schemes.

Keywords: HF communication; deep learning; spectrum prediction; temporal-spectral residual network.

I. INTRODUCTION

1.1 Background and motivation

High frequency (HF) communication whose frequency range is from 3 MHz to 30 MHz is capable of realizing over-the-horizon communication depending on ionosphere reflecting signals. Due to its appealing merits such as fast network reconstruction and strong maneuverability, HF communication is widely used in military, emergency and transoceanic communications [1, 2]. The ionospheric characteristics are influenced by various factors such as time, season, weather, geographical location and solar activity [2, 3], thus available communication frequency window and channel quality also correspondingly vary.Therefore, it is a vital but difficult task to choose the appropriate working frequency in HF communication. Spectrum prediction, one of critical technology, combined with channel selection and automatic link establishment is a promising solution, which is expected to improve the success rate and reliability of link establishment. For instance, spectrum prediction can be used in either HF radio transmitters or receivers to preliminarily select a batch of candidate channels with high quality, which avoids the blindness of frequency allocation and makes preparations for channel sounding.The actual working frequency can be further determined by channel sounding on these predicted candidate channels which can reduce time consumption and then link is automatically established.

For any user in HF communication, both long-term and short-term predictions are of great significance [4, 5]. However, most of prediction algorithms only consider making short-term prediction in a slot-by-slot manner.In this paper, we investigate long-term prediction of HF communication, which is also referred as deep spectrum prediction because the proposed spectrum model makes predictions of longer time span than traditional models and mines inner relationship of spectrum data more deeply than other benchmark schedules.Spectrum data are modeled as an image series and an idea of spectrum data visualization is introduced from computer vision, which is inspired by the work in reference [6]. The application of deep learning in computer vision field has gained wide attention in recent years [7, 8, 9]. Compared to tremendous effort of hand-crafted feature required in machine learning, deep learning can automatically extract and learn the features of samples better,bringing a substantial improvement in performance [10]. Thus, its impressive abilities to extract feature and solve problem motivated us to apply deep learning in temporal-spectral deep prediction. The future characteristics of HF communication are wideband, intelligent and integrated. There is no doubt that the application of emerging deep learning in spectrum prediction can make HF communication more intelligent. Based on observations above,we construct the spectrum images and reasonably design the HF spectrum dataset based on different time scales. The residual neural network and convolution neural network in deep learning are tailored to make deep prediction of HF spectrum points. To our best knowledge, this is the first time to apply deep learning in spectrum prediction from an image perspective.

Deep learning based spectrum prediction is developed in this paper by simultaneously predicting multi-slot ahead states of multiple spectrum points within a period of time.

1.2 Related work

Many studies have been carried out in spectrum prediction which can be found in the survey[11]. These studies mainly apply methods like hidden markov model (HMM) [12], partial periodic pattern [13] and artificial neural network[14] to predict binary series of spectrum occupancy, or make predictions of specific spectrum values based on support vector regression(SVR) [15]. Recently, long short term memory(LSTM) neural network has been applied in spectrum prediction [16]. Majority of existing prediction algorithms belong to short-term prediction and their datasets are constructed by sliding window, where spectrum states in next time slot are predicted by mining historical data within fixed time slots.

Studies of effective prediction algorithm in HF band are relatively few. It is very dif ficult to predict specific spectrum values because HF band is complex and in fluenced by many external factors. Reference [17] improves the accuracy of ionosphere prediction model by introducing the International Reference Ionosphere model and adding a parameter amending module to so-called ITS model. Reference[18] is one of the first studies to apply cognitive radio in HF communication, and make short-term predictions of the sojourn time of a primary user in the band based on HMM model. Reference [19] employs neural network to successfully capture the 24-hour, seasonal and long-term trend in the variability of congestion of HF band for broadcast users based on several years data.

1.3 Contributions

The main contributions of this paper are summarized as follows:

• We propose a spectrum data visualization approach to make deep spectrum prediction from an image inference perspective. Different from the existing spectrum prediction in a slot-by-slot manner, the proposed approach is more efficient which can predict multi-slot ahead spectrum states of multiple spectrum points simultaneously.

• We combine multiple neural network modules to construct the proposed deep temporal-spectral residual network. Each module consisting of residual units is designed to capture the internal relationship of HF spectrum data with different time scales.

• We validate the superiority of the constructed temporal-spectral residual network for deep spectrum prediction. Real-world HF spectrum data are used in experiments and it is observed that prediction performance of the proposed approach is better than the benchmark algorithms.

The remainder of this paper is organized as follows. Section II compares the difference between the proposed model and traditional prediction model, and analyzes the correlation of HF spectrum data in the time and frequency domain. Section III presents the structure of the deep learning neural network and construction of dataset. Section IV shows the experimental results of deep prediction for each HF spectrum point. The last part provides conclusions for this paper.

II. SYSTEM MODEL AND PROBLEM FORMULATION

In HF radio communication, spectrum prediction can effectively reduce the time and energy consumption of frequency selection.Either the prediction of single spectrum point or prediction of multiple spectrum points is based on sliding window in most related studies. As shown in figure 1(a), traditional prediction model learns the inherent relationship from N consecutive column vectorsto predict xtand each column vector represents spectrum data of F spectrum points. Window with fixed length moves forward slot-by-slot over time and states or qualities of spectrum points in different time slots can be predicted. Considering the actual demand and forecast timeliness,the prediction model needs to simultaneously predict the values with the acceptable error within a relatively long period of time. Therefore, we consider deep prediction where spectrum data in a fixed time period are considered as an image. Taking figure 1(b) as an example,spectrum data from the tth slot to the (t+T′)th slot of F spectrum points are reshaped to be an image which is denoted as Xt. The height of the image represents the number of slots while the width of the image represents the number of spectrum points. Then all achieved HF spectrum data can be formed into an image series X ={...,Xt, XT′+t, X2T′+t,....}. Time span T′ can be adjusted depending on practical needs. Likewise, the proposed prediction model learns relationship from the several spectrum images in the previous time periods and predicts the spectrum values of multiple frequency points within multi-slot ahead.

The main differences between traditional and proposed spectrum models are listed as follows:

• Spectrum data within a relatively long time period are modeled as an image in the proposed model. Multiple consecutive images are concatenated into a three-dimensional tensor as one sample. While in traditional prediction model, spectrum data in several continuous time slots are constructed as one sample whose size is two-dimensional.

Fig. 1. The comparison of prediction models.

Fig. 2. The correlations of spectrum data in 20:00～21:00 of 4 July with data in other three time intervals.

• The prediction efficiency of the proposed model is higher than the traditional one. It can simultaneously predict values of multiple spectrum points within a relatively long time period. But traditional model has to carry out several predictions to achieve similar results. The longer time period is,the more times traditional model makes prediction.

Before spectrum data visualization, we first analyze the correlation of these data in the time domain and frequency domain. The raw spectrum data are organized as matrix with size F×T, where x·,t,t ∈ {1,...,T } represents values of all spectrum points in the tth slot and xf,·, f ∈ {1,2,...F} represents evolution trajectories of one spectrum point. The correlation in the time domain and frequency domain is calculated by Pearson’s correlation coefficient[20], as equations (1) and (2) show, respectively:

Taking the matrix of a spectrum image from 20:00 to 21:00 in July 4th, 2007 as an example, figure 2(a), (b) and (c) show correlation coefficient of time domain with itself,data in previous hour and the same period of previous day, respectively. We can see that the correlations in time domain are quite strong and decrease with time gap of two images increases. Figure 2(d), (e) and (f) represent correlation coefficient of frequency domain with itself, data in previous hour and the same period of previous day, respectively. It is observed that there exist some frequency band blocks with similar features such as band from 25MHz to 27.5MHz. To conclude, spectrum data of different spectrum points in different time periods has an indistinct connection. One of the main advantages of deep learning is its ability to automatically extract features and capture inherent relationships in data. The details of the proposed prediction model using deep learning will be shown in next section.

III. DEEP TEMPORAL-SPECTRAL RESIDUAL NETWORK

Figure 3 presents the deep temporal-spectral residual network (DTS-Resnet) for HF deep spectrum prediction. It consists of four modules which are minute scale module, hour scale module, day scale module and fusion module. Residual unit and convolution unit are basic units of these modules.

3.1 Basic units

One basic unit is convolution unit, and its calculation is shown in equation (3):

where W(1)represent convolution kernel,whose elements are adjustable parameters of the convolution unit as well as b(1). ⊗ means convolution of the input tensor with the convolution kernel while f(.) represents activation function. Relu function f( x ) =max(0,x)is one of commonly used activation functions,which can save the training time of deep learning neural network [22].

Another basic unit is residual unit, whose structure is shown in figure 4 [21]. When the input of the residual unit is denoted as X(l), its output X(l+1)is as follows:

Fig. 3. The structure of proposed network.

Fig. 4. The structure of residual unit.

where F(.) is the residual function. It means two repetitive operations of relu activation followed by convolution unit in this paper.The residual unit combines the input with its high-level feature as the output, resulting in better propagation of potential features. It has been demonstrated to be very effective for training super deep neural networks in [23].

3.2 Whole structure of proposed network

As mentioned above, spectrum data in a fixed time period can be constructed as an image with time span as image height and the number of spectrum points as the image width. Suppose that the time span is set to be T′slots and F spectrum points are measured, the spectrum image in time span[t, t+T′) is denoted as a tensor Xt∈ ℝ1×T′×F.HF spectrum data are collected over time, so all spectrum images can be represented as a set X ={...,Xt, XT′+t, X2T′+t,....}. To conclude, spectrum data are constructed as many one-channel image-like tensors according to the given time span and tensors are arranged in chronological order.

For the tth time interval, all image-like tensors previous to that interval are divided into three categories: i) tensors within one hour; ii)tensors within one day and iii) tensors one day ago. When samples are constructed, the periodicity on 24-hour scale and closeness within several hours are both taken into consideration. Part of tensors in each category are fed into corresponding modules of DTS-Resnet.Three modules of different time scales share the same architecture. They all starts from one convolutional layer, then multiple layers of residual unit are stacked and finally one convolutional layer. The fusion module is composed of multiple convolutional layers and achieves prediction results by fusing outputs of three modules. The design of the proposed network is to capture the regulation of HF spectrum points in different time scales by residual learning. The mean square error between predictions and real values is computed to help the adjustment of network parameters.

Here we take day scale module as an example to detail the composition of time modules.At first ldtensors in the third tensor categor ya r e concatenated along with the first axis as one tensor∈ ℝld×T′×F. This tensor is fed into the first convolutional layer and output tensor is. After residual units of L layers and one convolutional layer, the final output tensor is. Similarly, lmtensors in the first tensor categoryare concatenated along with the first axis as one tensor∈ ℝlm×T′×F. The output of minute scale module is denoted as. lhtensors in the second tensor categorycan be transformed to a tensorafter the same operation. When T′is no less than one hour,minute scale module will be deleted and only outputs of the other two modules are fed into the fusion module.

Before tensor fusion, the outputs of three modules should be concatenated along with the first axis. The fusion module is only composed of M convolutional layers without any pooling layer. The reason for this design is to guarantee that the shape of output tensor is consistent with that of input tensor. The final output is the prediction results Xˆtwhile real values in the tth time interval are denoted as Xt. DTS-Resnet is trained by back propagation algorithm to minimize the prediction error, as equation (5) is shown:

where w, b mean trainable paremeters in DTS-Resnet. The process of DTS-Resnet for HF deep prediction is listed in Algorithm 1.

IV. EXPERIMENT EVALUATION

4.1 Data preprocessing

The HF spectrum data in the paper comes from the RWTH Aachen University spectrum measurement campaign [24]1In the original datasets[24], the resolution bandwidth of each individual spectrum band is 200 kHz and the inter-sample time is 1.8 seconds, which results in 48000 samples one day [20].and HF band in this dataset ranges from 20MHz to 30MHz.All data represents power spectral density(PSD) in dBm and is preprocessed using the Min-Max normalization method to scale the data into the range [−1,1]. The timestamps for data collection within one minute are not constant as well as data missing in original dataset of this paper, which put limitations on image-like tensor construction. Therefore,time scale in the paper is only detailed to minutes. Deep spectrum prediction on smaller time scale can be investigated if a more complete and better HF spectrum dataset can be achieved in future. For each spectrum point,data in each minute is about 34 measured values and their average is regarded as the final measured spectrum value in that minute. After finishing the construction of the dataset D,eighty percent of the samples is divided into Dtrainand the remains belong to Dtest.

4.2 Hyper-parameters setting

For three modules with different time scales,the number of residual units is set to 3. The convolutions in the first and last convolutional layer and residual units all have 64 filters whose size is all 3× 3. To keep the shape of input and output tensor of convolution operation consistent, border-mode is used when padding. For the fusion module, three convolutional layers are stacked and the number of filters of size 3× 3 is 64, 32 and 1, respectively. Adam algorithm [25] is employed to find the optimal network parameters because its rate of convergence is faster. In this paper,time span is set to be 60 minutes and therefore the minute scale module is omitted. We predict values of multiple spectrum points within one hour based on four image-like tensors.Two are in previous two hours for hour scale module and the other two are in the same time interval of previous two days for day scale module.

4.3 Baseline schemes

The models used to be the baselines are as follows, the input information of which is all the same:

•SVR: It is developed from support vector machine, usually used in regression and prediction. The kernel employed here is radial basis function, which is non-linear.

•CNN: It is short for convolution neural network, including only multiple convolutional layers. The filters of each convolutional layer except the last layer are 64 and size is 3× 3.

•FC_LSTM: In this neural network, both modules with day scale and hour scale are replaced by fully connected feedforward network. The fusion module is undertaken by multiple LSTM layers.

•DTS-Resnet-nofusion: It is quite similar to the proposed model except that there is no fusion module and the last convolutional layer has one filter of size 3× 3. The output tensors of two modules are merged by element-wise addition.

4.4 Results and discussions

Fig. 5. The RMSE of prediction for each spectrum point.

Fig. 6. The cumulative distribution function of RMSE for all spectrum points.

Figure 5 shows the prediction performance of the proposed model and other benchmark models for each spectrum point. Here the root of mean square error (RMSE) is defined as the evaluation metric. It corresponds to the estimation error of the spectrum of the next time slot whose time scale is minute. It can be seen that the prediction error between any two frequency points is very different, which depends on the regularity of the evolution trajectories of spectrum states. The prediction errors for spectrum points around 27.32MHz are much larger than others, which means those spectrum points are more difficult to make prediction. It is found that weak correlation for spectrum points around 27.32MHz results in poor prediction performance while strong correlation for other spectrum points brings about better performance, as shown in figure 2. The DTS-Resnet model performs better than others on majority of spectrum points other than the range from 24MHz to 26MHz where it is a little worse than CNN.The reason may be the difference of initialization or the spectrum points features. Figure 6 shows the curve of cumulative distribution function obtained from figure 5, which also illustrates that the proposed model has absolute advantage in a statistical sense. For about ninety percent of spectrum points, the RMSE of prediction is less than 0.29. Compared with DTS-Resnet-nofusion model, we can find that the fusion module of DTS-Resnet plays an important role for deep HF spectrum prediction.

V. CONCLUSIONS

The prediction of HF spectrum has always been a tricky but urgent problem because spectrum data in this frequency band have a non-explicit complex relationship. In this paper, we explore an advanced and intelligent deep learning model to predict PSD values in given time period at the same time. The dataset is reasonably constructed based on the implied periodicity and closeness of the HF spectrum points. The proposed model,named DTS-Resnet, combines residual units and convolutional layers to perform different functions. The experimental results demonstrate that DTS-Resnet has better prediction performance than other prediction models.In spite of this, the prediction only considers mining the features from measured spectrum values without involving the factors such as time, geographical location, weather and sunspot activities. This is one research direction of future HF spectrum prediction. Moreover,spectrum prediction is such an example which is proposed for dynamic spectrum access in cognitive radio networks (CRN) and then is applied to HF communication. Just like Koski and Furman proposed in [26], studies on power and energy transfer [27] or spectrum sharing strategy [28, 29, 30] in CRN can also be transferred into HF communication, which is regarded as another research direction.

ACKNOWLEDGMENT

This work was supported in part by the National Natural Science Foundation of China(Grants No. 61501510 and No. 61631020),Natural Science Foundation of Jiangsu Province (Grant No. BK20150717), China Postdoctoral Science Foundation Funded Project (Grant No. 2016M590398 and No.2018T110426), Jiangsu Planned Projects for Postdoctoral Research Funds (Grant No.1501009A), and Natural Science Foundation for Distinguished Young Scholars of Jiangsu Province (Grant No. BK20160034).