Weighted Self-Adaptive Threshold Wavelets for Interpolation Point Selection Used in Interconnect MOR
2018-03-05XinshengWangandMingyanYu
Xinsheng Wang and Mingyan Yu
(School of Astronautics, Harbin Institute of Technology, Harbin 150001, China)
1 Introduction
The developing trend in VLSI is aggressive scaling in the feature size, in turn, leading to increase the chip size, and then resulting in a growth in chip’s interconnect complexity. Meanwhile, the chip’s operating frequency is also increasing as well. Thus, an important step in the VLSI design is able to simulate and estimate interconnect line delay, cross-talk noise, and other characteristics in a reasonable time. MOR technique has proved to be a very powerful tool in expanding complexity of nano-scale integrated circuits, since it can dramatically decrease the simulation time of complex interconnect system by constructing small dimension models which save the important properties of the original system. The MOR of interconnect modules has been investigated comprehensively in the last score years[1-12].
A lot of methods that are based on Krylov-subspace projection techniques have been adapted to the model order reduction[5-6, 13-17]. These methods are adopted widely because of their iterative property and small computational cost versus balanced truncation approximation method, which is one of singular-value decomposition (SVD) techniques, and are well fit for reducing the complexity of VLSI interconnect networks. But, the classical single point expansion Krylov sub-space techniques can only match the low frequency response characteristics of interconnect system. Future interconnect system is more and more important in high frequency effects, so frequency selective MOR techniques are also more and more fundamentally important. MOR methods based on frequency selection are either using frequency weighted truncation technique or frequency linearized rational interpolation technique[18]. In addition, adaptive order multi-point MOR is provided by Lee[14]. Afterward, Alam[19]proposed arnoldi projection methods based on multi-shifted technique on account of wavelet transform to select the relevant frequency points. However, this method can not accurately obtain appropriate number of expansion points due to providing the man-made threshold, so it may have a long elapsed time or poor fitting degree. Chu[20]also gave a similar model simplification method of the lumped transformer mechanism, which was restricted to small scale interconnect system due to computing the residual complexity at each expansion point. So, all of these techniques are either computationally complex or not given how to choose the threshold.
In this paper, we show a weighted self-adaptive threshold wavelet interpolation point selection method using in Krylov subspace MOR. The method realizes optimmal matching of the original system response by a low order realization in a large frequency range. Thus, we choose the relevant interpolation point by using weighted threshold Harr wavelet transform to detect the dominant frequencies of system response in complex frequency regions. In order to show the advantage of the proposed method, we apply the method to H-clock tree circuit and power grid net circuit, which are modeled as RC and RLC model in VLSI. The simulation results show that the method in the paper gives higher accuracy than Haar wavelet method of man-made threshold.
2 Interconnect Systems’ MOR Principle
2.1 Background of MOR
In analyzing interconnect in VLSI, if we assume that a linear subnetwork contains only lumped elements, and then the modified nodal admittance (MNA)[21]formulation of the subnetwork may be used.
(1)
where,x∈Rnis a vector ofnstate variables including node voltage appended by independent voltage source current, and linear inductor current,C∈Rn×nandG∈Rn×nare constant matrices giving lumped memory and memoryless circuit elements respectively,bandcTdefine the linear maps between inputs and outputs, andu,yare input and output vector respectively. As a matter of fact reality simulations of large scale interconnect circuit, all of above matrices are big and sparse. Due to the simulation cost for a large interconnect system, MOR methods have been constructed to reduce the simulation complexity[22].
2.2 Projection-Based Order Reduction
Consider a single interconnect system such as single-input single-output (SISO) systemA(s) in descriptor form in Eq.(1). In projection-based model order reduction, the primitive state variablesxare approximated by a vectorxr∈Rqof reduced dimensionq≪n:
x≈Vxr
(2)
whereV∈Rn×qis a matrix with full column orthogonal vector. The reduced systemAr(s) approximating the input-output characteristic of the primitive systemA(s) is generally obtained by:
(3)
One possible approach for the computation ofVis the Krylov subspace methods, also known as moment matching method. Given an expansion frequencys0∈C, then we can take the Taylor series expansion of the transform functionH(s) ins0like:
(4)
wheremis0is theith order moment ats0. We define:
A=-(s0C+G)-1C,R=(s0C+G)-1b
(5)
The moments are computed usingmis0=cTAiR. Thus, the Krylov space is shown as:
κq(A,R)=[R,AR,…,Aq-1R]
(6)
The Arnoldi based algorithm can be used to iteratively create an orthonormal basisVq∈Rn×qfrom the successive Krylov sub-spaceκq(A,R). The underlying moment matching mechanism only guarantees the accuracy in a radiusfraround the expansion frequency pointf0. As the approximation orderqincreases,frincreases at the same time; but the relationship betweenfrandqis not so obvious[23]. And because the Arnoldi iteration firstly approximates the relevent poles in the vicinity of the expansion frequency point, it may make the reduced order models that do not match accurately the primitive system’s output response in the interest frequency range. Through multiple point rational Arnoldi iterative method to reduce order of the primitive system at different frequency expansion points, thus the transform function can be approximated accurately based on MOR technique across an interest frequency range. Therefore, the key problem is to reduce the complexity of choosing these interpolation points[24].
3 Self-Adaptive Weighted Threshold Haar Wavelets for Interpolation Point Selection in Rational Arnoldi Method
In mathematics, the simplest wavelet is Haar wavelet . The weakness of the Haar wavelet is not continuous, but this is also the advantage for the analysis of sudden transition signals. In this paper, we also use this property to detect the variation of system transform characteristic. The calculating Haar wavelet transform process includes first initializing one dimension array with the all sampled valuesNand then sweeping all samples at each level based on Haar wavelet basis decomposition[25]. After the above two steps, sharp changes in the data stand out, while smooth parts are approach to zero. In order to screen out more unimportant expansion points in the interest frequency range, we should detect the transform coefficients by a threshold. The threshold is a criterion which makes a distinction between the trivial coefficients parts and the important coefficients including significant components. Threshold is very efficiently used in sparse or near-sparse variations signals where only a small number of coefficients subset can be employed for the selection of shift expansion points[25].
It notes that the threshold is very important in interpolation point’s selection. The poor choices of threshold will result in a long elapsed time or poor fitting degree in MOR. The procedure for generating threshold is listed in Table 1. Then, the interpolating points selected based Haar wavelet are used to reduce order of the primitive system based on rational Arnoldi method. At each interpolation point, the matching moment order is determined by the required error. In this paper, relative error, which is the difference value of transform functions in continuous reduced order procedure, and absolute error are used. We apply the accurate absolute error proposed by Odabasioglu[23]in our method. For simplicity in this paper, we only give the main results shown in Eq.(7), not detailed derivations. The symbol in Eq.(7) is Ref.[23]. All the vectors in every interpolation point compose the orthogonal matrixVwhich is used to achieve finally reduced order system.
(7)
Meanwhile, the weighted self-adaptive threshold Wavelet MOR can also be used multiple-input multiple-output (MIMO) systems based on block Arnoldi technique. The error above mentioned can also extend to MIMO systems.
Table1Self-adaptiveweightedthresholdforHaarwavelettransformbasedMOR
Algorithm:Self⁃AdaptiveWeightedThresholdWaveletMOR(Input:C,G,B,L,i,j,ω,ωintr,error,lrel_abs_err;Output:Vk) /∗C,G,B,L(c)aresystemmatrix; iandjaresampledfrequencynumberN=2jinwidefrequencydo⁃mainandN1=2iininterestfrequencydomain; ωandωintrarewidefrequencyrangeandinterestfrequencyrange; errorisrequirederrorlimit; lrel_abs_errisasigntoidentifyusingrelativeerrororabsoluteerror;∗/ foreachfrequencypoint S[k]=ω[0]+k∗(ω[1]-ω[0])/2j; S1[m]=ωintr[0]+m∗(ωintr[1]-ωintr[0])/2i; endfor //ComputeHaarwavelettransformcoefficient Co=waveidt(S); Co1=waveidt(S1); //Computeweightedthreshold th=average_weighted_non_zeros(Co); th1=average_weighted_non_zeros(Co1); if(th<=th1) threshold=th1; else if(ininterestfrequencyrange) threshold=th1; else threshold=th; endif endif //Selectexpansionpoint compareeachcoefficientai,j(iandjareshiftanddilatedvaria⁃bles) ifai,j>threshold saveni;//importantexpansionpoint endif foreachexpansionpoint if(lrel_abs_err==1) while(relativeerror>error) ArnoldimethodcomputeVi//ithexpansionmatrix endwhile else while(absluteerror>error) ArnoldimethodcomputeVi//ithexpansionmatrix endwhile endif //ComposedeachexpansionpointVi Vk=[V1,…] //BasedonVkMORprocess Subspaceprojectiontoreduceorderoforiginalsystem end
4 Simulation Results
A simple mesh power grid circuit and a clock tree circuit shown in Fig.1 and Fig.2 are studied to prove the efficiency and accuracy of the method proposed in this paper. The horizontal parameters of Fig.1:R=0.05 Ω/mm,L=0.502 5 nH/mm,C=0.155 2 pF/mm and the vertical parameters are:R=0.06 Ω/mm,L=0.548 0 nH/mm,C=0.142 3 pF/mm. Each line is 30 mm long and divided into 20 sections, so the full ordernis equal to 719[26]. The clock tree circuit is shown in Fig.2; the full ordernis 33. The driver resistor is 80 Ω, the other distributed resistor is 60 Ω, distributed capacitance is 1 pF, and the all load capacitances are 1.2 pF[27]. From the frequency response of the two circuits, we generate a set of sampled response points in frequency domain that are used by the weighted threshold Haar wavelet MOR method. We compared the accuracy and efficiency of weighted threshold with man-made threshold across a large range of frequency domain (ω=[0, 1012]). All simulations are done by Matlab on Lenovo Yangtian series personal computer. For clock tree circuit, the run time in frequency range 2.5×1011to 3×1011and 0 to 1012including 220sampling points is 119.526 s and 120.009 s respectively. While the run time in interest frequency range 2.5×1011to 3×1011Hz including 64 sampling points is 0.007 98 s. The following simulation results use absolute error; the results from relative error are consistent.
Fig.1 The power grid mesh circuit
Fig.2 The clock tree circuit
4.1 The Comparison of Weighted Threshold and Mean Threshold
To prove the validity of the weighted threshold method, we verified it on clock tree circuit. Fig.3 shows the fitting degree between weighted and mean threshold reduced order method (uniform interpolation points) compared to original signal in our interested frequency range. From Fig.3, we can observe the superiority and accuracy of weighted threshold method.
Fig. 3 The comparison between the two threshold methods on clock tree circuit
4.2 The Comparison of Self-Adaptive Threshold and Man-Made Threshold
In order to prove the advantage of adaptive threshold method, we analyze the fitting degree between man-made threshold reduced order method and self-adaptive threshold reduced order method compared with original system. The superiority of self-adaptive threshold reduced order method is given in Fig.4 for the clock circuit ofn=33.
While the man-made threshold is oversized in Fig.4(a), the reduced order method by these interpolation points leads a low fitting degree compared with original system due to choosing little expansion point. In undersized threshold case in Fig.4(b), although the reduced order method by these interpolation points has the same fitting degree as the one based on the self-adaptive threshold method, the elapsed time is longer. In the clock tree circuit, the running time for man-made threshold reduced order method is 1.750 113 s; one for self-adaptive threshold is 1.449 630 s. This is because too many expansion points are selected. With the increase of the scale of interconnect system, the superiority is bigger and bigger. In order to prove the method to be applicable to large-scale system, we analyze a power grid system ofn=719. The Fig.5 shows the result, which is the same as clock tree circuit.
Fig.4 The fittering degree comparison of clock tree circuit
Fig. 5 The fitting degree comparison of power grid cricuit
In summary, the self-adaptive threshold can select optimal expansion points. Besides, using the selected expansion points, the reduced system fits better with the original system.
5 Conclusion
In this paper, we propose a weighted self-adaptive threshold wavelet expansion MOR technique which is based on Krylov sub-space methods to construct reduced order system models that are accurate and efficient over an extensive range of frequency. We can dynamically screen out valuable expansion points by wavelet transform and weighted self-adaptive threshold. This method gives an automated method to construct a low order system which is accurate in interest frequency range. The simulation results prove that the method can provide accurate and efficient reduced order model for complex interconnect structures.
[1]Ferranti F, Nakhla M, Antonini G, et al. Parameterized model order reduction of delayed systems using an interpolation approach with amplitude and frequency scaling coefficients. IEEE Workshop on Signal and Power Integrity. Piscataway: IEEE, 2012. 57-60. DOI: 10.1109/SaPIW.2012.6222911.
[2]Ferranti F, Nakhla M S, ANtonini G, et al. Multipoint full-wave model order reduction for delayed PEEC models with large delays. IEEE Transation Electromagnetic Compatibility, 2011, 53(4): 959-967. DOI: 10.1109/TEMC.2011.2154335.
[3]Alam M, Nieuwoudt A, Massoud Y. Frequency selective model order reduction via spectral zero projection. Proceedings of the IEEE ASP-Design Automation Conference. Piscataway: IEEE, 2007. 379-383. DOI: 10.1109/ASPDAC.2007.358015.
[4]Alam M, Nieuwoudt A, Massoud Y. Wavelet-based passivity preserving model order reduction for wideband interconnect characterization. IEEE Symposium Quality Electronicd Design. Piscataway:IEEE, 2007. 432-437. DOI: 10.1109/ISQED.2007.170.
[5]Alam M, Nieuwoudt A, Massoud Y. Frequency selective model order reduction via spectral zero projection. Proceedings of the IEEE Dallas Circuits and Systems Conference. Piscataway: IEEE, 2006. 379-383. DOI: 10.1109/ASPDAC.2007.358015.
[6]Odabasioglu A, Celik M, Pileggi L T. PRIMA: Passive reduced-order interconnect macromodeling algorithm. IEEE Transaction on Computer-Aided Design of Integrated Circuits System, 1998, 17(8): 645-654. DOI: 10.1109/43.712097.
[7]Pillage L T, Rohrer R A. Asymptotic waveform evaluation for timing analysis. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 1990, 9(4): 352-366. DOI: 10.1109/43.45867.
[8]Feldmann P, Freund R. Efficient linear circuit analysis by Pade approximation via the lanczos process. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 1995, 14(5): 639-649. DOI: 10.1109/43.384428.
[9]Antoulas A C. Approximation of Large-scale Dynamical Systems. New York: SAIM, 2005.100-115.DOI: 10.1137/1.9780898718713.
[10]Pinnau R. Model Reduction via Proper Orthogonal Decomposition. Berlin: Springer, 2008. 93-108.
[11]Tan S X-D, He L. Advanced Model Order Reduction Techniques for VLSI Designs. Cambrede: Cambridge University Press, 2007. 60-100.
[12]Li Y T, Bai Z, Su Y, et al. Model order reduction of parameterized interconnect networks via a two-directional Arnoldi process. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2008, 27 (9): 1571-1582. DOI: 10.1109/TCAD.2008.927768.
[13]Silveira L M, Kamon M, Elfadel I, et al. A coordinate transformed Arnoldi algorithm for generating guaranteed stable reduced order models of RLC circuits. Proceedings of the IEEE/ACM International Conference Computer-Aided Design. Piscataway: IEEE, 1996. 288-294. DOI: 10.1109/ICCAD.1996.569710.
[14]Lee H J, Chu C C, Feng W S. Multi-point model reduction of VLSI interconnects using the rational Arnoldi method with adaptive orders. Proceedings of the IEEE Asia Pacific Conference Circuits and Systems.Piscataway: IEEE, 2004. 1009-1012. DOI: 10.1109/APCCAS.2004.1413052.
[15]Pati A, Kumar A, Chandra D. Suboptimal control using model order reduction. Chinese Journal of Engineering, 2014, 37(5): 1-5. DOI: 10.1155/2014/797581.
[16]Lavania S, Nagaria D. Eigen spectrum based moment matching technique for model order reduction. Proceedings of the 39th National System conference. Piscataway: IEEE, 2015.1-5. DOI: 10.1109/NATSYS.2015.7489100.
[17]Wittmuess P, Tarin C, Kech A, et al. Parametric model order reduction via balanced truncation with taylor series representation. IEEE Transaction on Automatic Control, 2016, 61(11): 3438-3451. DOI: 10.1109/TAC.2016.2521361.
[18]Elfadel I M, Ling D D. A block rational Arnoldi algorithm for multi-point passive model-order reduction of multiport RLC networks. Proceedings of the IEEE/ACM International Conference Computer-Aided Design. Piscataway: IEEE, 1997. 66-71. DOI: 10.1109/ICCAD.1997.643368.
[19]Alam M, Nieuwoudt A, Massoud Y. Efficient multi-shifted Arnoldi projection using wavelet transform. Journal of Circuits, Systems and Computers, 2007, 16(5): 699-709. DOI: 10.1142/S0218126607003927.
[20]Chu C C, Lee H J. Applications of multi-point Arnoldi algorithms to linear lumped transformer model simplifications. IEEE Power Engineering Society Summer Meeting. Piscataway: IEEE, 2000. 2406-2411. DOI: 10.1109/PESS.2000.867367.
[21]Ho C W, Ruehli A E, Brennan P. The modified nodal approach to network analysis. IEEE Transactions on Circuits and Systems, 1975, 22(6): 504-509. DOI: 10.1109/TCS.1975.1084079.
[22]Wang J M, Chu C C, Yu Q, et al. On projection-based algorithm for model-order reduction of interconnects. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications, 2002, 49(11):1563-1585. DOI: 10.1109/TCSI.2002.804542.
[23]Odabasioglu A, Celik M, Pileggi L T. Practical considerations for passive reduction of RLC circuits. Proceedings of the IEEE/ACM International Conference on Computer-Aided Design. Piscataway: IEEE, 1999. 214-219. DOI: 10.1109/ICCAD.1999.810652.
[24]Alam M,Nieuwoudt A, Massoud Y. Model order reduction using spline-based dynamic multi-point rational interpolation for passive circuits. Analog Integrated Circuits and Signal Processing, 2007, 50(3): 273-277. DOI: 10.1007/s10470-006-9017-5.
[25]Massoud Y, Alam M, Nieuwoudt A. On the selection of spectral zeros for generating passive reduced order models. Proceedings of the 6th International Workshop on System-on-Chip for Real-Time Applications. Piscataway: IEEE, 2006. 160-164. DOI: 10.1109/IWSOC.2006.348228.
[26]Chu C C, Lee H J, Feng W S, et al. Interconnect model reductions by using the AORA algorithm with considering the adjoint network. Proceedings of the IEEE International Symposium on Circuits and Systems. Piscataway: IEEE, 2005.1278-1281. DOI: 10.1109/ISCAS.2005.1464828.
[27]Xu Q, Mazumder P, Ding L. Novel macromodeling for on-chip RC/RLC interconnects. Proceedings of the IEEE International Symposium on Circuits and Systems. Piscataway: IEEE, 2002.189-192. DOI: 10.1109/ISCAS.2002.1010421.
杂志排行
Journal of Harbin Institute of Technology(New Series)的其它文章
- Review: Natural Polymer Electrolytes for Lithium Ion Batteries
- Active Disturbance Rejection Control for a Multiple-Flexible-Link Manipulator
- A Practical Strategy of Unbalance Identification and Correction for 2-DOF Precision Centrifuges
- Investigation on Noise Control Technology of Ultra-High Expansion Ratio Turbine
- Prediction of Potential Disease-Associated MicroRNAs Based on Hidden Conditional Random Field
- POD Analysis and Low-Dimensional Model Based on POD-Galerkin for Two-Dimensional Rayleigh-Bénard Convection