APP下载

High-resolution bone microstructure imaging based on ultrasonic frequency-domain full-waveform inversion*

2021-01-21YifangLi李义方QinzhenShi石勤振YingLi李颖XiaojunSong宋小军ChengchengLiu刘成成DeanTa他得安andWeiqiWang王威琪

Chinese Physics B 2021年1期
关键词:李颖补贴园区

Yifang Li(李义方), Qinzhen Shi(石勤振), Ying Li(李颖), Xiaojun Song(宋小军),Chengcheng Liu(刘成成), Dean Ta(他得安),3,‡, and Weiqi Wang(王威琪)

1Department of Electronic Engineering,Fudan University,Shanghai,China

2Human Phenome Institute,Fudan University,Shanghai,China

3Academy for Engineering and Technology,Fudan University,Shanghai,China

Keywords: quantitative imaging,full-waveform inversion,bone microstructure,ultrasonic computed tomography,high resolution

1. Introduction

Bone strength depends on multiple characteristics, such as bone mass, geometrical morphology, and architecture.[1]Bone geometrical morphology and micro-architecture are important features to determine the mechanical properties.[2,3]Osteoporosis is a common bone metabolic disease, typically characterized by the loss of bone mass, reduced thickness,and increased porosity,therefore resulting in an increased risk of fracture.[4]Severe osteoporosis reduces the quality of life,increases the suffering of patients, even endangers patient’s life.[4]Consequently, daily monitoring and early diagnosis of bone diseases are of great significance, especially in aging societies.[5]Bone mineral density (BMD) measured by dual energy x-ray absorptiometry (DXA) is the gold standard for evaluating bone status.[6]However, the comprehensive evaluation of bone based on DXA is still challenging,because it is difficult for DXA to quantify bone microstructure and biomechanical properties.[7–9]High-resolution magnetic resonance imaging (HR-MRI) and high-resolution peripheral quantitative computed tomography(HR-pQCT)are two principal modalities of bone imaging with high spatial resolution,which can provide both geometry and microstructure.[10]However,the clinical applications of HR-MRI and HR-pQCT are limited due to the relatively high costs, long examination time,presence of ionizing radiation,and poor portability.[10]

Ultrasound is a mechanical wave, which is physically suited to probe the elastic and structural properties of bone.Quantitative ultrasound (QUS) techniques are deemed to reflect bone mechanical parameters beyond BMD with the advantages of being inexpensive and non-ionizing.[6]Therefore,QUS based techniques were proposed as alternatives to xray based methods to characterize bone.[11]QUS approaches are mainly divided into two categories. One is for cortical assessment using ultrasound guided wave (UGW)[12–14]or first arrival signal (FAS)[15]propagation in the long cortical bones. The other is for cancellous measurements,utilizing ultrasonic through-transmission[16]and backscatter techniques.[17,18]Recently, QUS capitalized on machine learning[19,20]and deep learning[21,22]for bone or porous media assessment. These studies have yielded promising results to predict material parameters, such as thickness,[23,24]porosity,[2,25]bulk velocities,[22,23]phase velocity,[26]etc.However,these studies can hardly provide entire morphological features of bone and ultrasonic imaging of bone tissue remains a challenging task.

The most common method of medical ultrasound imaging makes use of pulse-echo sonography, which generates backscattered reflection images. This modality is suitable for ultrasonic imaging for soft tissues with small heterogeneity,based on the assumption of uniform sound velocity.[27–30]While for the hard tissue (i.e., bone), the speed of sound in bone differs significantly from that of the surrounding soft tissue.[31,32]The pulse-echo method is difficult to accurately image the irregular bone or multilayer structures without a prior of sound speed distribution.[33,34]Olofsson[35]applied phase shift migration (PSM) to image regularly layered objects. Some studies further utilized ray-tracing technique[32,33]or modified PSM[36–38]to image the irregularly hard materials. However,the sound velocity distribution must be obtained[33,34]or estimated in advance.[32]It is unrealistic for ultrasonic bone imaging with a prior of sound velocity model in the actual scene. Renaud et al.[32]achieved in-vivo imaging the first section of long bone using prior ranges of sound velocity to iteratively estimate velocities layer by layer. Considering the large acoustic impedance on the cortical bone surface and the porous structure in cancellous part,the strong reflections generated from the boundary and ultrasonic wave attenuation induced by multiple diffraction,scattering,anelastic absorption,and elastic mode-conversions lead to a very complex wavefield for bone imaging.[32,33,39,40]The high amplitude signals from the boundaries may overlap and interfere with the relatively weak signals scattered inside the bone structure. These superimposed effects make it difficult to accurately image inside trabeculae, pores, and the second cross-section.[32]Li et al.[41]successfully reconstructed the surface trabeculae of bovine cancellous bone in virtue of ultrasonic backscatter parameters,but cannot present trabecular distribution below surface.

Ultrasonic computed tomography (USCT) is an emerging tool to quantitatively realize parametric imaging using inversion algorithms. The algorithms are computational techniques initially developed to image the interior of the Earth.[42]Tomography techniques are basically divided into three categories. The first one is the travel-time tomography based on ray theory,which uses the first arrival time of ultrasonic waveform to reconstruct the velocity distribution.[43,44]By merely considering refraction, this method is efficient and stable[43]but only valid when the imaging objects are much larger than the wavelength.[45]Its resolution is limited and cannot reconstruct structure below the scale of the diameter in the first Fresnel zone.[39,46]The second method is the diffraction tomography (DT), which takes diffraction and scattering into account with a linearized scattering model such as the Born or Rytov approximations.[47,48]This method is much more time consuming but has higher resolution.[47]The third family is the reconstruction technique called full-waveform inversion(FWI).[45,49,50]FWI attempts to reconstruct the parametric images of the medium by trying to match the measured signals with the results of the full wave equation.[50]FWI is actually the most accurate one in inversion community for allowing higher order diffraction and scattering to be considered in its numerical solver.[51]With a higher cost of computational effort, the FWI provides higher resolution and accuracy than previous two methods that only utilize part of the recorded wavefield.

From travel-time inversion to FWI, the inversion techniques were first used in medical ultrasound imaging for soft tissues.[52–54]In recent years,the inversion methods have been attempted to apply in hard tissue imaging,[39,47]particularly in bone imaging.[11,31,40,55,56]Bone is a heterogeneous, porous,and highly attenuated medium.[55]For using limited information of wavefield, travel-time inversion can only be applied to image macroscopic morphology with relatively low resolution. As the phase transition through the bone tissue is too large, DT based on the first-order Born or Rytov approximation cannot be utilized to tackle the problem with large acoustic impedance misfit.[55]Therefore, Lasaygues et al.[40]proposed a compound USCT to address this issue. Some studies[11,47,57]with iterative approaches based on high-order approximations can also image high-contrast targets. A method called distorted Born diffraction tomography (DBDT)[11]was used to image bone-mimicking phantoms, obtaining faithful geometry and sound speed images.With the recorded full wavefield and the experience of imaging the Earth’s internal structure, FWI is expected to transfer to hard tissue imaging, further providing higher quality images. Bernard et al.[31]gave fairly accurate map of the longitudinal wave speed for cortical bone quantitative imaging in simulation study using time-domain full-waveform inversion(TDFWI).In addition,the approximate density map was also provided to assess the cortical bone. Whereas TDFWI is computationally intensive.

It is understood that the achievable spatial resolution of FWI is approximately half of the wavelength, i.e., λ/2.[39,58]Using megahertz frequency, the FWI can theoretically image pores and trabeculae in bone tissue with the size of millimeter or sub-millimeter.

As FWI possesses unequivocal advantages in hard tissue imaging via considering the full information of wavefield,the frequency-domain full-waveform inversion(FDFWI)method was introduced into bone parametric imaging in this study. Unlike TDFWI based on the iterative fitting of complete time series between modeled and observed data,the FDFWI only depends on the full information of wavefield at some discrete frequencies. This data reduction leads to efficient computation. Besides, FDFWI is more convenient to be directly performed from low to high discrete frequency components, and there is no need for utilizing lowpass filtering,which is a key process for TDFWI to avoid trapping into the local minima.[59,60]To the best of our knowledge, this study is the first report in which the FDFWI method is applied to simultaneously estimate sound velocity and mass density in bone quantitative imaging. The performance of the method is demonstrated by tubular bone phantom, single distal fibula model,and finally with a distal tibia-fibula bone pair model.

The rest of paper is organized as follows. Model assumptions,the theory of FDFWI,and the selected optimization algorithm are stated in Section 2. The numerical simulation setup and the simulated examples are described in Section 3.The results are provided in Section 4. Section 5 gives the discussion,and finally,the conclusions are presented in Section 6.

2. Theory and methods

The objective of this study is to conduct two-dimensional(2-D) parametric imaging of bone cross-sections from ultrasonic signals collected from a ring array transducer. The plane of the ring array probe(i.e., the imaging plane)is orthogonal to the long axis of the bones.

2.1. Problem statement

The tomography parametric imaging is essentially an inverse process to estimate the bone material parameters,such as sound velocity and mass density, which are utilized to quantitatively characterize the bone. The parametric models (i.e.,sound velocity and mass density) are discretized as a M×N grid.

The ultrasound tomography acquisition setup is depicted in Fig. 1. The bone is immersed in water and surrounded by a ring array transducer with n elements. The positions of the elements in the ring array transducer are determined by the location vectors ri(1 ≤i ≤n). Each element in the ring array transmits ultrasonic pulses in sequence, and all elements receive signals at the same time. For FDFWI, the transmitting pulse S(rsi,ω)(1 ≤si≤n)is the excitation source in the frequency domain,and the received complex wavefield at the receiver position rri(1 ≤ri≤n)is described by Dobs(rri,ω).The measurements acquired from all emitter-receiver pairs can be stacked into a n×n dimensional matrix Dobs(ω). Similar to the measured wavefield,the simulated wavefield denoted as Usim(rri,C,ρ,ω)obtained by numerical calculation at the receiver location rri(1 ≤ri≤n)with bone material parameter models (i.e., C(r),ρ(r)) is also stacked into a n×n dimensional matrix Usim(C,ρ,ω). FDFWI is typically formulated as a least square minimization problem,and the cost function satisfies the following relationships:[61]

where H denotes the Hermitian transpose, and e(C,ρ,ω) is the residual vector between the modeled and measured wavefields. It is worth noting that the wavefield is explicitly depicted in terms of frequency ω and parametric models (i.e.,C(r),ρ(r)). The optimization is performed on one discrete frequency component at a time, and the numerical simulated wavefield depends on the parametric models. When the parametric models are iteratively updated in FDFWI,the numerical simulated wavefield is also updated with the new parametric models.

Fig.1. Ultrasound tomography transducer acquisition setup.

2.2. Forward problem

The propagation of ultrasonic wave is controlled by the acoustic wave equation. Forward modeling of ultrasonic propagation in frequency domain can be described by the Helmholtz equation with variant density[62]

To perform forward numerical calculation, the model is discretized on a M×N grid. To balance the accuracy and efficiency of the numerical calculation, the grid size is set to be slightly less than λmin/5.[61]λminis the smallest wavelength and is computed using the sound velocity in water at the maximum frequency component. Similar to the parametric model, the forward wavefield P(r,ω) is sampled as a M×N grid at each frequency ω. The discrete pressure values are reorganized as a M×N dimensional vector P. Similarly, the source S(r,ω) is also vectorized as a M×N dimensional vector S. The vector S is sparse and has only one non-zero value whose index corresponds to the position of the emitter at the grid. With the wavefield and the wave source vectorized, the Helmholtz operator in Eq. (3a) is discretized,assembling a sparse and large matrix H with dimension of MN×MN. Matrix H is actually the numerical approximation of the Helmholtz operator underlying partial differential stencil. The numerical construction of H and the absorbing boundary can refer to the work detailed in Ref. [62]. After discretization, the Helmholtz Eq. (3) can be rewritten in matrix form

where H may be a high-dimensional matrix according to the specific requirement of model discretization. Direct inverse computation of high-dimensional matrix is time-consuming and inefficient. To solve Eq. (4) for a multiple source problem, matrix factorization methods such as LU decomposition are strongly recommended for increasing the computation efficiency.[63]When LU decomposition is used to solve Eq.(4)once for one source, LU constituents of H can be stored to rapidly solve wavefield P for other sources. This is critical to save the computational costs of iterative solutions for inverse problems with multiple sources.

2.3. Inverse problem and the adjoint-state method

Since there are large data quantities and model parameters involved in FDFWI, local optimization methods[42,50]based on gradient descent are generally used to solve the cost function (2). Accurately calculating the gradient of the cost function in iterative process is a key point for the success of inversion[42,61]

where the gradients ∇Cand ∇ρare taken with respect to the sound velocity and mass density, respectively. The gradient formulations in Eq.(5)are matrix-vector products. JCand Jρare the Jacobian matrices of the forward problem that contains the Fr´echet derivatives of the received wavefield with respect to model parameters(i.e.,C,ρ). e(C,ρ,ω)is the residual vector that is the same as that in Eq.(2).

The inversion is conducted step by step from low to high frequencies from a selected set of frequency components. Given the current estimations of the parameter models C(i)(ω),ρ(i)(ω) and the current gradients∇CE(C(i),ρ(i),ω),∇ρE(C(i),ρ(i),ω), the updated parameter models with the normal gradient descent method can be written as

where α is the step size which could be a constant or determined by a line search method.[64]Nevertheless, the gradient computation in such an FWI problem including multiple parameters and high-dimensional matrix is computationally challenging. If the gradients in Eq. (5) are directly calculated, the computations are so intensive. When the model parameter in one grid point is disturbed for differential computation every time,a forward calculation is needed to obtain the recorded wavefield. The forward calculation requires performing as many as the number of model parameters for each source.[50,65]Therefore,a total of n×M×N times of forward calculation is needed to get JCand Jρfor n sources per iteration. When the model parameters are large,the calculation of this part takes a huge amount of time. The dimension of JCand Jρis n2×MN. Obviously, it is too time-consuming for bone quantitative imaging in practical use.

Fortunately,there is an alternative way called the adjoint state method[65]that does not compute the Jacobian matrix explicitly to get the gradient. This method was developed to efficiently compute the gradient. In the numerical community, it is now a well-known method for gradient calculation of a functional with respect to the model parameters. The functional depends on model parameters through state variables, which are solutions of the forward problem.[42]The adjoint state corresponds to the wavefield emitted and backpropagated from the receivers, which is deemed independent of the model parameters.[66]This approach is often implemented resorting to an augmented functional also called associated Lagrangian. It is actually an optimization problem with partial differential equation (PDE) constrained in this study.The augmented functional is defined as

Fig.2. Flow chart of the FDFWI.

where 〈·〉 denotes the scalar product of M×N dimensional complex vectors and rsi(1 ≤si≤n)depicts the position of the source.Usim(rsi,C,ρ,ω)and Dobs(rsi,ω)show the simulated and measured wavefields generated by the source S(rsi,ω)that situated at rsi. Note that Va(rsi,ω) and P(rsi,ω) represent the adjoint and forward wavefield vectors related to the source S(rsi,ω) located at rsi. They are both M×N dimensional vectors. Cℎris the selective matrix onto positions of the receivers for all sources, with the dimension of n×MN.Va(rsi,ω)is the adjoint state and also the Lagrange multiplier per source and per frequency component,which is defined by

2.4. Optimization algorithm

There are many local optimization methods based on gradient descent, for instance, the steepest descent, conjugate gradient,Newton,Gauss–Newton,and quasi-Newton.[67]The convergence speed of the steepest descent is slow, which is not suitable for the large data optimization problem in FWI.The convergence rate of Newton and Gauss-Newton is faster than the conjugate gradient and the steepest descent, leading to more stable inversion process.[67]However, the computation and storage cost of the Hessian or approximate Hessian matrix is so intensive in iterative process (i.e., Xk+1=Xk-H-1∇XkE, where Xkdenotes the model parameter, and H denotes the Hessian matrix). H-1indicates the search direction. It is usually necessary to avoid the direct inverse of such large matrix, especially for the strong nonlinear and large-scale problem in FWI. Therefore, a matrix-free variant of the quasi-Newton method known as limited-memory Broyden–Fletcher–Goldfarb–Shanno algorithm (L-BFGS)[68]is utilized in this paper, allowing constructing an approximation of the inverse Hessian in a recursive way without explicitly forming it. Only a few gradients of the previous nonlinear iterations, typically 3–20 iterations, need to be stored in L-BFGS to compute the approximate Hessian. Here we choose m=20. The L-BFGS method represents a negligible storage and computational cost compared to the conjugategradient algorithm,[50]and a more efficient convergence rate than the steepest descent.[31]The implementation of L-BFGS algorithm is detailed in the literature.[68]

2.5. Regularization

FDFWI for bone imaging is a nonlinear and ill-posed inverse problem. The most common way to alleviate the inverse instability and ill-posedness caused by the high-contrast structures[50,69]and the presence of noises[31]is to use regularization. Tikhonov and total variation (TV) regularization are well known in the inversion process. The former is helpful to restore the continuity and smoothness of the images.[70]Whereas the latter is able to preserve the sharpness of the edges.[71]The bone parametric imaging requires not only to maintain the continuity and smoothness of some areas, such as soft tissue and bone, but also to restore some regions with high sound velocity contrast,for instance sharp boundaries between bone and soft tissue. Therefore,we tried to combine the advantages of the above two regularizations. In this paper,the Sobolev space norm was employed as a regularization term to keep a balance between sharpness and smoothness in imaging.Mathematically,it is depicted by[72]

The Sobolev space norm can easily be added to the cost function with weight λSN, and the gradient of regularization term (13) is added to the gradient of the cost function (11)for model parameter update. The mathematical features of Sobolev space norm can refer to the literature.[72]In brief,it is a natural blend of Tikhonov and TV regularization.[72]

where ε is utilized to avoid being divided by zero when calculating the gradient of the regularization term. As long as the value is small enough,it will have little impact on the restored images. We selected ε =10-7, λSN=10-4, and P=1.5 in this work. The sum of Eqs. (11) and (13) gives the gradient of a single model parameter with the regularization term. In the iterative calculation, the gradients were combined into an M×N dimensional vector to update the model parameters.

(1)人力投资。苏州工业园特别注重人才的培养和引进,不仅引进了诸中科大、中国人民大学等26所国内高等教育院校,还引进了如美国加州伯克利大学、加拿大滑铁卢大学等世界名校资源。此外,园区出台各种优惠政策引进高层次人才,如2017年人才薪酬补贴每人3万元、4万元、5万元不等,共有787名硕士及以上学位的领军、高层次和紧缺骨干人才获得了此项补贴,此举为园区发展提供源源不断的动力。

3. Numerical simulations setup

FDFWI is a nonlinear and ill-posed inverse problem,meaning that an infinite number of models can match the data,particularly when starting inversion with high frequency.[50]Therefore, the initial frequency must be low enough. If not,the so-called cycle-skipping effect will result in the convergence to a local minima.[50]To avoid this effect, the TDFWI proposes consecutive inversion with gradually increasing frequency component using lowpass filtering,as low frequencies are insensitive to cycle-skipping effects.[31]However,the FDFWI provides a more natural workflow for this multiscale approach by performing successive inversions from low to high discrete frequency components.[59,60]

For simplicity, the transmitting sources are ideal point sources in this study. The source waveform is wide-band Ricker wavelet with a dominant frequency fc

The time domain waveforms and amplitude spectrum of the Ricker wavelet pulse are shown in Fig. 3. Two sources with different center frequencies were used to cover the bandwidth we need. For the transmitting source with fc=200 kHz, the utilized bandwidth is from 100 kHz to 400 kHz. For the transmitting source with fc=1.75 MHz,the used bandwidth ranges from 400 kHz to 3.5 MHz. The sound speed and mass density in bone tissue were derived from previous literature,[73]in which the sound speed is 2800 m/s and the mass density is 1800 kg/m3. For the surrounding media(i.e. ,water or soft tissues),the sound velocity and mass density are set to 1500 m/s and 1000 kg/m3. The spatial size(i.e.,grid size)in the numerical model was calculated by the maximum frequency component and the background sound velocity 1500 m/s.128 sources and 128 receivers were utilized to produce and receive the ultrasonic signals.

Fig.3. Ricker wavelet source. (a)Time-domain signal. (b)Amplitude spectrum,with a central frequency fc=200 kHz.

3.1. Tubular bone phantom

In the first numerical model,a simple tubular bone phantom with a thickness of 2 mm was used, situating in the center of the ring array. The grid size Δh was 60 μm and the 18 mm×18 mm model was therefore discretized as 301×301 grids. The diameter of the ring array transducer was 17 mm.In this case, the mass density model was known and only the sound velocity model was estimated. The speed of sound in water(1500 m/s)was utilized as the initial guess of the velocity model. The simulated data was noiseless,so regularization based on Sobolev space norm was closed in this section, as well as in Subsections 3.2 and 3.3. The frequency of inversion was initially set to 100 kHz and then gradually increased to 3.5 MHz with a step of 100 kHz.

3.2. Single distal fibula model

In the second numerical model, a single distal fibula model was utilized. The model was made by binarization of a HR-pQCT image of distal fibula (Figs. 5(a) and 6(a)). The sizes of the discretized model and the circular ring transducer were the same as those in tubular bone phantom. However,in this circumstance, the sound velocity and mass density were simultaneously estimated. The frequency components were set from 100 kHz to 3.5 MHz with an interval of 50 kHz. For low frequencies are essential for the reconstruction of mass density,[50,74]the smaller interval can ensure enough components with relatively low frequency in inversion. Both the initial sound velocity and mass density model were uniform, in which the velocity was set to 1500 m/s and the mass density was 1000 kg/m3(physical parameters in water).

3.3. Distal tibia-fibula pair model

In the third numerical model,we used a distal tibia-fibula pair model. The model was also derived from binarization of a HR-pQCT image (Figs. 9(a) and 10(a)). As presented in Subsections 4.1 and 4.2, since the inversion results of the first two models verified that the FDFWI can achieve a good reconstruction when the frequency reached 2.5 MHz, the frequency set here was selected from 100 kHz to 2.5 MHz with an interval of 50 kHz. The grid size was set to 100 μm and the 60 mm×60 mm model was discretized as 601×601 grids.The diameter of the ring array transducer was 58 mm. This was a challenging scene involving multiple scattering between the two bones. The complex distribution of trabeculae in the distal tibia would introduce significant multiple diffraction,scattering,and reverberation effects. The initial values of velocity and density model were identical with those in the single distal fibula model.

3.4. Robustness against noise

To investigate the robustness of FDFWI against noise,Gaussian random noise was added to the synthetic data generated in the distal tibia-fibula pair model for velocity and density inversion. The reference pressure was defined as the mean amplitude of the received complex signals. The cases with signal-to-noise ratio (SNR) of 30 dB, 10 dB, and 0 dB were respectively performed and discussed. The settings of inversion were consistent with those in the previous Subsection 3.3,except that the Sobolev space norm was utilized. It should be noted that the regularized weight λSNrepresents the ratio between the data fidelity item and the penalty term. λSN=10-4was chosen via the qualitative evaluation of the final inversion results.

4. Results

4.1. Tubular bone phantom

The inversion results with different maximum frequencies are shown in Fig.4. When the inversion frequency gradually increases to 1.5 MHz, both the outer and inner edges of the phantom are recovered.When the frequency reaches 2.5 MHz,the image becomes clearer with fewer artifacts. However, as the frequency ranging from 2.5 MHz to 3.5 MHz, the improvement of image quality is relatively limited. The result shows that the FDFWI has ability to accurately restore the macroscopic morphology. The inversion error of the velocity is small. If the imaging region in the ring array transducer is selected as the region-of-interest(ROI),the root-mean-squareerror(RMSE)of sound velocity in ROI is 149.76 m/s,and the mean relative error in ROI is 6.97%.The RMSE exclusively in bone area is 250.52 m/s,and the mean relative error is 8.95%.

Fig.4. FDFWI results of the tubular bone phantom for velocity model with different maximum frequencies. (a)True velocity model with ring array transducer. (b)–(e)Result obtained with final frequency of 0.5 MHz,1.5 MHz,2.5 MHz,and 3.5 MHz,respectively.

4.2. Single distal fibula model

The inversion results with different maximum frequencies are shown in Figs.5 and 6. When the inversion frequency gradually increases to 1.5 MHz,both the outer and inner edges of the velocity map are recovered accurately. Meanwhile,partial microstructures in the fibula,such as pores and trabeculae,can be clearly depicted.When the frequency reaches 2.5 MHz,the velocity map becomes clearer and some subtler features can be precisely displayed. Similar to the result of tubular bone phantom, as for the frequency ranging from 2.5 MHz to 3.5 MHz, the improvement of the image quality is relatively limited. The result shows that the FDFWI can provide high-resolution images for fibula bone. Both the geometry and microstructure can be clearly and accurately recovered.Due to the simultaneous inversion of two parameters, the inversion task is relatively more complex. The error of sound velocity increases compared with the single speed inversion in the previous section. For the sound velocity, RMSE in ROI is 258.31 m/s, and the mean relative error in ROI is 12.01%. RMSE in the bone area is 408.39 m/s,and the mean relative error is 14.59%. For the mass density,RMSE in ROI is 150.96 kg/m3,and the mean relative error in ROI is 10.78%.RMSE in the bone area is 226.74 kg/m3,and the mean relative error is 12.60%. As presented in Fig. 7, the profile errors of the inversion results with the final frequency from 2.5 MHz to 3.5 MHz are similar, which further proves the limited improvement of the final images with the maximum frequency being higher than 2.5 MHz. Figure 8 shows that the inversion results are in good agreement with the most of structure and edge information extracted from the true model. Although the outer boundary of the density map is reconstructed well, the inaccurate recovery of microstructure in bone area can be observed. The inversion result of sound velocity is better than that of mass density.

Fig.5. FDFWI results of the single distal fibula for velocity model with different maximum frequencies. (a)True velocity model(derived from HR-pQCT image). (b)–(e)Result obtained with final frequency of 0.5 MHz,1.5 MHz,2.5 MHz,and 3.5 MHz,respectively.

Fig.6. FDFWI results of the single distal fibula for density model with different maximum frequencies. (a)True density model(derived from HR-pQCT image). (b)–(e)Result obtained with final frequency of 0.5 MHz,1.5 MHz,2.5 MHz,and 3.5 MHz,respectively.

Fig.7. Comparisons of profiles for velocity and density inversion results at z=9 mm and x=9 mm with those of the true velocity model in Fig.5(a)and the true density model in Fig.6(a)using maximum frequency of 2.5 MHz(a)and 3.5 MHz(b),respectively. The black and red lines are the profiles of the true model and the inversion result,respectively.

Fig. 8. Comparison structure and edge information between inversion results and true models with the maximum frequency 2.5 MHz. (a)Velocity inversion result. (b)Density inversion result. The yellow line represents structure and edge information extracted from the true model.

4.3. Distal tibia-fibula pair model

Fig.9. FDFWI results of the distal tibia-fibula pair for velocity model with maximum frequency 2.5 MHz. (a)True velocity model(derived from HR-pQCT image). (b)Velocity inversion result. (c)Comparison structure and edge information between inversion result and true model.The yellow line represents structure and edge information extracted from the true model.

Fig.10. FDFWI results of the distal tibia-fibula pair for density model with maximum frequency 2.5 MHz. (a)True density model(derived from HR-pQCT image). (b)Density inversion result. (c)Comparison structure and edge information between inversion result and true model.The yellow line represents structure and edge information extracted from the true model.

Fig.11. Comparisons of profiles for inversion velocity and density results with those of the true velocity model in Fig.9(a)and density model in Fig.10(a). (a)Velocity profiles,(b)density profiles at z=20 mm,z=43 mm,x=15 mm,and x=40 mm. The black and red lines are the profiles of the true model and the inversion result,respectively.

4.4. Robustness against noise

As shown in Fig. 12, the sound velocity maps can still be recovered well despite the presence of noise. Even in the case of low SNR(i.e.,0 dB),the geometry of bone and some relatively larger microstructure can be reconstructed, but the trabecular direction is difficult to be identified. The results with noises display the robustness of the method to random Gaussian noise in velocity inversion. Figure 13 indicates that the noisy effect on density inversion is greater than that on velocity inversion,especially for the case with low SNR.

Fig.12.Comparison structure and edge information between velocity inversion result and true model of the distal tibia-fibula pair with different SNRs. (a)No noise. (b)SNR=30 dB.(c)SNR=10 dB.(d)SNR=0 dB.The yellow line represents structure and edge information extracted from the true model.

Fig.13. Comparison structure and edge information between density inversion result and true model of the distal tibia-fibula pair with different SNRs. (a)No noise. (b)SNR=30 dB.(c)SNR=10 dB.(d)SNR=0 dB.The yellow line represents structure and edge information extracted from the true model.

5. Discussion

In practice, ultrasonic waves in cortical bone travel as guided waves. The shear and longitudinal waves are coupled with each other.However,ultrasonic sources and receivers immersed in water do not generate or record shear waves. Meanwhile,elastic conversions are small close to normal incidence,which is the most important part of wavefield for FWI, both in transmission and reflection imaging.[39]The layout of ring array plus a nearly circular imaging plane(cross-section)can ensure relatively slight effects of elastic conversions.[39]From the perspective of accurate model matching,the closer the numerical forward model is to true bone and surrounding tissue,the better the observed wavefield can be approximated. That is,a more complete account of the physics effects during inversion, including absorption, anisotropy, and elasticity in hard tissues, can theoretically provide a more quantitatively accurate model with physical properties,but at a cost of significant computation increasing to acquire the final model. A recent in-vivo FWI study[39]shows that a simple isotropic and acoustic model is adequate for providing well-resolved images for hard tissue(i.e.,the skull)and soft tissue(i.e.,the brain). The problem investigated in this paper is very similar to the issue that has been addressed successfully for brain imaging using FWI.[39]Consequently, the assumptions of the bone and surrounding soft tissue are acoustic and isotropic media.The simulated results show that the simply acoustic model can recover high-resolution image, including hard tissue (i.e., bone) and surrounding soft tissue(i.e.,marrow,muscles,fat,and skin).

The spatial resolution of ultrasonic imaging is the minimum distance between two reflectors that can be distinguished. The axial resolution of the traditional pulse-echo method depends on pulse duration and is equal to half the spatial pulse length.[33]The lateral resolution is determined by the width of the ultrasonic beam.[33,34]The pulse-echo method can achieve good axial resolution at high frequencies using short pulse length. The lateral resolution can be improved capitalizing on synthetic aperture or focused ultrasound.[33,36]However, it is necessary to know the distribution of the velocity model or to estimate the sound velocity prior to bone imaging.[32]Otherwise,the imaged bone will be distorted and displaced due to the large velocity contrast.In addition,the advance estimation of sound velocity further increases the complexity of the algorithm.

where K is the wavenumber, f is the frequency of the incident wave, and θ is the scattering angle, that is the angle between the source and the receiver. Bernard et al.[31]have demonstrated the importance of having a circular sampling of the wavefield in simulation study. It is not possible to reconstruct an accurate and high-resolution image using only transmitted or reflected data. That is the reason why we chose ring array transducers in this paper (i.e., scattering angles covering all directions).

If we compare the inversion results of sound velocity,as shown in Figs. 5 and 9, with binarized images derived from HR-pQCT(i.e.,Figs.5(a)and 9(a)),the FDFWI method can achieve high-resolution bone images with sub-millimeter which are close to those of HR-pQCT.However,the results are only reconstructed with numerical simulation,and the performances of in-vitro and in-vivo experiments need to be further verified in the future work.

The inverted velocity maps are very close to the true models, both in terms of geometry and velocity values, even for the distal tibia-fibula pair model with multiple reflections and diffractions between the two bones.At the same time,the multiple scattering and diffraction effects in trabecular region further pose a significant imaging challenge. Nevertheless, the velocity maps still can be reconstructed well, even with significant noise. In contrast, the density inversion accuracy is far less accurate. In Figs. 6 and 10, the artifacts of density maps in marrow cavities are more than those in the velocity maps. As shown in Fig. 10, the inversion results in part of the bone regions cannot accurately present the true thickness,and are underestimated to some extent. The poor recoveries of boundaries in the region, where the two bones are close to each other,can be observed.There are two main reasons to account for this phenomenon. One is that the multiple reflection and diffraction exist between the boundaries of the two bones.The other is that it is difficult for the receivers to directly obtain the reflected waves generated by the bone boundaries that are adjacent to each other,for the reflected wavefields will be affected by the scattering wavefield induced by the trabeculae and cortical boundaries that are closer to the ring array transducer. The receivers get more information of transmitted wavefield rather than that of the reflected wavefield in this region. Nevertheless, the density inversion is sensitive to the reflection rather than the transmission wavefield.[50]

In geophysics, studies[50,75]have proved that simultaneous inversion of sound velocity and density is difficult. To obtain a relatively good estimation of density,either an extremely low initial frequency[76,77]or multiple times for inversion are required.[77]The sound velocity is sensitive to the data of various scattering angles, but the mass density is only interested in that of small scattering angle.[50]As a result of Eq. (15),only the wavefields sampled from reflections, which contain the high spatial frequency information,are available for mass density reconstruction. Since mass density is an important indicator in clinical bone disease diagnosis, investigating more effective methods for accurate density recovery is our future work.

As shown in the inversion results (i.e., Figs. 4–6), there exist some noises in the marrow cavity, which are similar to the noisy effects in study.[31]Two underlying causes could account for the effects. One is that the frequency-domain information for inversion using several frequency components is still limited rather than utilizing the complete frequencies. On the other hand,the noisy effect could represent an overestimate in some local area for parametric imaging. In a sense,it could be a local minimum that is very close to the global optimal solution. The noisy effect deserves further refinement in the future. Due to the reflection, diffraction, or scattering caused by the circular transducer,overestimates for some grids at the locations of the elements in the ring array can be observed in Fig.6(b)for mass density inversion at initially low frequency components, but the phenomena remarkably improved with increasing frequency until the effects disappeared.

As depicted in Fig. 12, even though the SNR is 0 dB,the geometries of bones can be reconstructed in velocity inversion. By comparing Figs.12(a)and 13(a)with Figs.12(b)and 13(b),the image quality of the noisy case with the SNR of 30 dB is comparable to that of the noiseless one. In the case of 10 dB in Fig. 12(c), the velocity inversion results are still good,and part of trabeculae and pores can be restored. As discussed above, accurate density inversion has been proved to be difficult,whereas figures 13(c)and 13(d)verify that significant noises may further complicate this task. With the SNR of 0 dB,there is much more noise in the density inversion than that in the velocity inversion. Therefore,the results show that the FDFWI is robust to random Gaussian noise in velocity inversion. The robustness comes from that the FDFWI method only approximates the wavefield information in the measured signals by calculating the full wave equation rather than producing noise.Therefore,the presence of noise may slow down the convergence to the true model, but the physical principle of FDFWI leads to its strong immunity to noise and the ability to reconstruct bone images at a significant noise level. This further indicates that the proposed FDFWI method has the potential to adapt to the low SNR experimental environment.

The omni-directional point source in the simulation was different from the experimental transducer element in the size,beam width,and direction,which might lead to the differences of the excited waveforms between the simulation and experiment. The differences between the simulated and experimental sources could further influence the inversion processes. To best match the observed wavefields,the setting of the element size in simulation should be consistent with that in the experiment, and the simulated sources could be defined as the experimental measured excited waveforms of the elements in the ring array. Another kind of the source calibration and estimation techniques can refer to the reference.[61]

In practical application, it is difficult to obtain a transducer with bandwidth from 100 kHz to 2.5 MHz, especially covering the low frequency with sufficient SNR.There are several solutions to address this issue. One is that the results derived from travel-time[54]or adaptive waveform inversion[39]can be used as the initial model for inversion,which is closer to the real model. In this case, the closer the model is to the real model, the higher the initial frequency can be used.The second way is to utilize a wide-band transducer, which now has been available to customize a wide-band transducer with multiple center frequency coupled. The third method is that the inversion can be performed using multiple transducers with the same size and different dominant frequencies. In this study,two sources with different center frequencies were used to cover the bandwidth we need.

Ultrasonic attenuation in bone may be resulted from a number of factors, including boundary reflections, ultrasonic diffraction,scattering,and reverberation effects introduced by porosity and trabeculae, anelastic absorption in the bone and surrounding tissues, and elastic mode-conversions. In this study, the anelastic absorption and elastic mode-conversions were not considered. This model assumption underestimated the attenuation effect in bone tissues, which might degrade the imaging quality of bone structure for experimental cases.Meanwhile, it should be noted that the frequency-dependent attenuation coefficients in different individuals are quite different. It is necessary to dynamically adjust the maximum frequency according to the actual situation. If the attenuation coefficients in the bone and soft tissues are significantly larger than the mean attenuations,[78–80]the maximum frequency should be decreased to a lower value to guarantee sufficient SNR.The decrease of the maximum frequency may influence the resolution and the image quality. As aforementioned, if the efficient computation is available, a more complete consideration of the physics effects for forward modeling is worth further investigation in the future.

To facilitate modeling,the theoretically parametric models (i.e., sound velocity and mass density) were constructed using binary values in this study. The bone and surrounding soft tissue were assumed to be homogeneous media. In clinical applications, both soft tissue and bone may exhibit heterogeneity,which represent as multivalued parameter models.The influence of surrounding soft tissues on QUS based assessment of bone properties is a long-standing problem.[13,81]It is not easy for some quantitative approaches,such as guided wave, backscatter to accurately consider the influence of heterogeneity in soft tissue. However, theoretically, this is not a problem for FDFWI,which iteratively updates the physical parameters in the grids of the simulated model during the optimization process until the best match reaches. The FDFWI method does not need to explicitly differentiate the bone and soft tissue,nor does it require the additional processing of heterogeneity within the same tissue. The properties of bone and surrounding soft tissues can be inverted jointly.The ultrasound propagation through different tissues or different regions of the same tissue will carry specific features,which will be utilized to precisely characterize the model parameters in inversion.Both high-contrast and low-contrast areas can be accurately reconstructed.

Unlike TDFWI based on the iterative matching of complete time series between simulated and observed data, the FDFWI only depends on the frequency-domain wavefields at limited discrete frequencies. This data reduction improves calculation efficiency. In Ref. [31], a model with the dimension of 667×667 took about an hour on a powerful desktop PC (specific conditions were not provided) to obtain the final inversion result,utilizing only 5 frequency bands(12 min/frequency band). For a parametric model with the dimension of 601×601 in this study, it took approximately one and a half hours to get the final results using 49 discrete frequency components (1.84 min/frequency component, core Intel Core CPU i9-9900K@5.10 GHz, DDR 64 GB, without graphic processing unit GPU).Compared to TDFWI,the computation efficiency of FDFWI is obviously improved.For clinical application, FDFWI for three-dimensional (3-D) imaging has the potential to be performed less than 10 min using parallel computation,[39]distributing one source to one GPU in virtue of several GPU-pool based servers to further speed up the inversion process. The FDFWI might meet the realtime requirement for medical imaging with the improvement in computation in the future.

6. Conclusion and perspectives

In this study, USCT based on FDFWI has been applied in 2-D bone quantitative imaging. The simulated results show that the method can achieve high-resolution parametric imaging. Not only macroscopic morphology but also microstructure (i.e., wavelength scaled pores and trabeculae) can be reconstructed clearly and accurately in the velocity map with sub-millimeter resolution. The performance of the velocity map is close to that of HR-pQCT in numerical simulation. Although restoring high-precision density maps using FDFWI has also been proved to be a difficult task, the method can still restore the macroscopic morphology and part of the microstructure in density map.The method has also been demonstrated being robust against Gaussian noise in velocity inversion. Therefore, the proposed FDFWI method has the potential for the diagnosis and daily monitoring of bone disease. It can also be applied to image multilayer and irregular objects with high-contrast impedance in industrial nondestructive testing. Future work will focus on experimental data instead of synthetic data to further demonstrate the effectiveness of the method.

猜你喜欢

李颖补贴园区
An overview of quantum error mitigation formulas
新增200亿元列入耕地地力保护补贴支出
完形填空专练(三)
苏通园区:激荡开放潮 十年再出发
“三清一改”农民能得到哪些补贴?
园区的开放样本
从园区化到国际化
Human body
Stochastic responses of tumor immune system with periodic treatment∗
“二孩补贴”难抵养娃成本