W.A.Shaikh,S.F.Shah,S.M.Pandhiani and M.A.Solangi
1Department of Mathematics and Statistics,Quaid-e-Awam University of Engineering,Science&Technology,Nawabshah,Sindh,Pakistan
2Department of Basic Sciences&Related Studies,Mehran University of Engineering&Technology,Jamshoro,Sindh,Pakistan
3Department of General Studies,Jubail University College,Al Jubail,Saudi Arabia
ABSTRACT This investigative study is focused on the impact of wavelet on traditional forecasting time-series models,which significantly shows the usage of wavelet algorithms.Wavelet Decomposition(WD)algorithm has been combined with various traditional forecasting time-series models,such as Least Square Support Vector Machine(LSSVM),Artificial Neural Network(ANN)and Multivariate Adaptive Regression Splines(MARS)and their effects are examined in terms of the statistical estimations.The WD has been used as a mathematical application in traditional forecast modelling to collect periodically measured parameters,which has yielded tremendous constructive outcomes.Further,it is observed that the wavelet combined models are classy compared to the various time series models in terms of performance basis.Therefore,combining wavelet forecasting models has yielded much better results.
KEYWORDS Impact;wavelet decomposition;combined;traditional forecasting models;statistical analysis
Due to the predictive importance,researchers have developed various forecasting models.As better environmental forecasting arts can be used to make appropriate management decisions,researchers are continually striving to improve the effectiveness and efficiency of the models.For decades the term wavelet has been used for the exploration of signal processing and geophysics.Therefore,this article looks at the WD combined with various traditional forecast time-series models.For decades the term wavelet has been used for the exploration of signal processing and geophysics.The last decade has shown vast interest in wavelets;it is a subject area that can be appropriate applicable and coalesced in various fields such as applied mathematics,physics,electrical engineering,etc.
Consequently,the WD has significantly impacted various fields,such as image processing,differential equations,statistics,and chemical signal processing[1,2].WD algorithm is used as a mathematical approach for extracting nontrivial and potentially useful information from different data types such as historical data information,re-analysis,local or global climate model simulations,etc.Usually,WD based models are required to analyse the datasets and decomposing them thoroughly.Its hypothesis is to get better re-construct datasets with minimal losses.The wavelet transform technique uses the signal frequency from the time domain to the wavelet domain.Therefore,the newly acquired domain has contained more complex basis functions called wavelets,mother wavelets,or analysing wavelets[3].
Wavelet-based models are a noteworthy edge in de-noising the datasets to develop an efficient model.It has made it easy to analyse streamflow processes on different parameters without eliminating the effects of the time-frequency accompanied by conventional bandpass filters.The WD tool can let on information within the signal in both frequencies,time and scale domains[4,5].The WD application controls the time-frequency or signal scale content and judges the temporal variation spectrum[6].In contrast,the Fourier transforms interpret a quite different perspective that allows estimating the signal frequency but is not suitable to estimate the time-frequency dependence.Therefore,the wavelet transform has its origins in the Fourier transform.Thus,the WD has allowed for tracking the time evolution of processes at various scales in the signal as it has both time and scale measures of localisation.The WD tool signals can be classified as high limited frequency events or a significant number of scale-variable methods because it provides explicit information for rate forecasting classification[7].In a review of the applications of the wavelet transform in hydrological dataset modelling describing the multifaceted information that can be obtained from such an analysis and recognition of seasonality,streamflow trends,and data de-noising.
The performance and accuracy of the traditional time series forecasting models continuously may be improved.Therefore,it can be inspired by the researchers to intend an improved version of the models[8].This study describes the performance impact of the WD as an optimization in the traditional time series models in which the optimal response is continuously exchanged during the simulation and can be approved.The effectiveness of the models has been tested on two different streamflow datasets,including Indus and Chenab Rivers.
To endorse the discussed TS forecast models and forecast the rivers,streamflow of the rate of the rivers have been collected 484 and 550 months,respectively,from two renowned Indus and Chenab Rivers of Pakistan(Figs.1–2).
Figure 1:484 months streamflow rate of Indus River
Figure 2:550 months streamflow rate of Chenab River
The use of wavelet application with the various traditional forecasting models such as LSSVM,ANN and MARS has improved the efficiency of the models and found excellent outcomes.These tractable combined models have been implemented as efficient tools on streamflow datasets to forecast phenomena that provide comprehensive signals information.The developed combined wavelet with AI models implements the following two-step protocol for forecasting activities.
(1)The WD methodology has been used as a preprocessor of input datasets.As a result,it has a time-frequency signal analysis at distinct intervals in the time-domain and considerable detail about input datasets.
(2)After obtaining the input signal by WD,it has been used for further processes as AI input in various traditional forecasting models.
Initially,the forecasting time-series datasets have been decomposed into a sub-time-series{W1,W2,···,Wp,Cp}by the WD algorithm,where,W1,W2,···,WpandCphave been described as detailed time-series and background time-series,respectively.Commonly,these have different roles in the forecasting time-series datasets and each sub-time-series have different behaviour.Therefore,the attribute and influence on the forecasting time-series dataset have different from each other.The developed LSSVM,ANN and MARS forecast models in whichtare the input sub-time-series of the models,and the forecasting time-series datasets att+Ttime have the output of models.WhereTis described as the measurement of forecast time-frequency.Finally,wavelet network models(WNM)were created,in which certain weights have been learned using specific techniques[9].The key object of forecasting time-series datasets to the WNM algorithm is to construct combined models with the WD algorithm.
The algorithm of WD ability to de-noise non-stationary signals into sub-signals at different levels has a suitable resource for improved streamflow elucidation[10].The novel developed forecasting models have been individual capacities that comprised WD and some conventional AI model techniques.The combined wavelet with various AI forecasting models has been an appropriate methodology[11].According to forecasting time series models,the wavelet technique is becoming an ever more effective and essential tool used in models.The primary reason for WD has analysed the time series datasets in the time and frequency domain of the valuable decomposition of the original time series by taking useful information in various frequency levels using wavelet functions.The main advantage of using the WD is its robustness,as it does not include any potentially erroneous assumption or parametric testing procedure.The following WD mathematical structure is defined as a continuous time-seriesx(t),t∈[?∞,∞]:
where definedt,τands∈[0,∞]for time,for time step that iterates the window function and wavelet scale respectively,whereasψ(t)is known as mother wavelet,it can be defined as follows:
Therefore,the continuous wavelet transform(CWT)can be defined as follows[12];
wherem,n∈Zthat control the scale and length of the time-series datasets respectively,s0is stated,fixed dilation step greater than 1,τ0is the location parameter,which must be greater than zero.The parameterss0=2 andτ0=1 are the most common preference.For a discrete time-seriesx(t)that occur attdiscrete time,then WD formation is defined as follows:
where,Wm,nis the wavelet coefficient of the WD at defined scales=2mandτ=2mn,x(t)is discrete time-series(i.e.,t=1,2,...,N?1)andN∈Zto the power of 2(N=2m);nis the time translation parameter that changes in the ranges 0 Decomposition levelsM= 2.6848 andM= 2.7403 estimated whenn= 484 andn= 550 monthly streamflow datasets(Figs.1–2)have been used for Indus and Chenab Rivers,respectively,and approximatelyM= 3 round-off decomposition level has used for both said rivers in this research work.By the hypothesis of Mallat’s,the original discrete time-seriesx(t)decomposed into a series of linearity independent estimation and detail signals by the inverse WD and defined as follows: or inverse WD also in a simple format defined as follows: where,AM(t)is known as the residual term at levelsMandDm(t)(m=1,2,...,M)are the detailed sub-series of information capable of catching specific functions of data interpretation value.The following various forms of transfer functions(TF)have been emerged and vastly used in the model[13]: The human brain’s functioning principle influenced an artificial neural network(ANN)as a forecasting model.Several architectures in the literature are available to forecast the streamflow and many other applications,one of which is the ANN algorithm mostly used.It is comprised of a network system with many interconnected nodes called neurons.The number of layers of an ANN is used to classify it,and layer(s)exists between an input and an output layer.Therefore,a single-layer feed-forward(SLFF)neural network is an architecture with just one layer for establishing connection among the nodes of the input,middle,and output layers.This type of system is characterised as a multi-layer feed-forward(MLFF)neural network built by more than one middle layer[14].ANN has the merits of fault-tolerance,an efficient nonlinear developing capability that provides a well-organised model for streamflow forecasting.Though the model has some drawbacks,such as slow optimization processing,model complexities,and the approximation of the applications could not be overlooked.Therefore,this forecasting architectural model needed to be improved.The WD has been compounded to the ANN model for this purpose.The WANN model has been combined with the strengths of the WD and ANN applications and to achieve nonlinear strong approximation capability.WANN model architectural design has based on multilayer perceptron(MLP).WANN is quite corresponding to that of(1+1/2)layer neural network and has contained the following three layers[15]: Layer-i.input layer,which is introduced to the network and takes one or more than one inputs. Layer-ii.the hidden layer,where data is manipulating procedure with a feed-forward neural network accompanied with orthonormal WD basis by activation functions developed. Layer-iii.output layer,which contains one or more linear combiners and the corresponding estimations are consistent with the given inputs. The training process has acquired the weights of the network connections.The WANN model can have various TF of different nodes in identical or different layers.The TF such as sigmoid,hyperbolic tangent functions are used for hidden layers,and there is no appropriation for the output layer.The WANN model has been successfully used for forecasting estimations.Two key approaches to developing the WANN model technique are described as following: ?The WD technique and the ANN model processing are used separately.Firstly decomposed,the input signal employs various WD basis functions(Eq.(1))with neurons in the hidden layer.Then,the wavelet coefficients have one or more output activities in their input weights have adjusted according to the certain learning algorithm. ?In this case,two structures,WD mathematical and ANN artificial intelligence algorithms have been combined and performed.The transferal and dilation of the WD accomplished weights that have been adjusted according to a certain learning algorithm. Only dyadic dilations and translations of the WD have developed the wavelet basis function whenever the first approach occurs.Therefore,this objective approach of WANN has often been known as a wavenet. The MARS model schemes discoveries to forecasting continuous numeric outcomes.Appropriate,the MARS model scheme has been implemented in two stages containing forward-backward stepwise techniques.The stepwise forward technique has observed a large set of input variables(basis function)with a different knot;though,this stepwise technique might be developing complexity and a multi-layered model[19].Determination of such type of model has weak forecasting presentation.For increasing forecast accuracy,the backward stepwise technique eradicates the pointless variables amongst the before chosen set,which may have fewer effects on the approximation procedure pruned by the MARS.For the projection ofxthe input variable to a novelyoutput variable based on appropriation,named basis functions that defines point of inflection along with the input range[20]: In theseyfunctions,xtreat as input andcchosen is a threshold value is said to be a knot.The function is useful in forward-backward stepwise techniques for each input unknown to classify the position of knots,where the value of the function changes.Theseybasis functions are called Spline functions,which is ac?knotreflected pair.The following description is the standard form of the MARS model[21]. where the output variableyestimated by the MARS model,c0is constant,ciis theithbasis function coefficient determined by minimising the Root Mean Squared Errors(RMSE),andBi(x)is theithbasis function.The optimal MARS model scheme is designated based on the smallest value of the Generalised Cross-Validation(GCV)principle.The GCV is defined as follows[17]: where,yiis the objective of output,f(xi)is the projected output,nis the number of inputs,andC(M)is a penalty,expressed as follows: wheredis the penalty for each basis function consisted in the model,Mis the number of basis functions.Wavelet Multivariate Adaptive Regression Splines(WMARS)model has combined the prospective techniques of WD and MARS for achieving robust nonlinear estimation potentiality.WMARS model binding designed is found on a multi-layer perception(MLP).The WMARS model is used for forecast estimations and has been dependent on an input layer,hidden layer,and output layer[22]. The statistical parameters are used to demonstrate the effectiveness in terms of forecastability of the models assessed by comparing the actual and forecasted values.Usually,the Mean Absolute Error(MAE),the Root Mean Square Error(RMSE),and the correlation coefficients(CC),are used to determine the efficiency of the models and outcomes fitted to the best fit line[23,24] The statistical parameters(21)–(23)have been used to assess the impact of discussed different models.MAE statistics provide an appropriate picture of the actual position in terms of the projected value error,whereas the RMSE statistics are the deviation of the models between the observed and projected values,the least values of these statistics are evaluation criteria for the best model.Similarly,the degree of the linear correlation coefficient is measured by the CC and observed the best flow effect with its high value[25]. This article examines the wavelet impact on traditional forecasting models by fitting input hydrological time-series datasets collected from Indus and Chenab Rivers.The computational code of the conversed forecasting models has been written in the MATLAB application,including the wavelet toolbox. The six(M1–M6)appropriate input data specimens have been prepared and used in the training and testing phases for traditional and combination of WD forecast models as shown in Table 1.M1–M6 and WM1–WM6 input combinations signify the number of variables based on earlier analysis for monthly river streamflow rates in Figs.1–2. Table 1:Model structure with different input combinations for forecasting streamflow The training dataset of the models is described for approximated parameters and the testing dataset has characterised by choosing the best combination model amongst every number of hidden layers considered.A trial-and-error technique has estimated the optimum complexity of conversed models.The statistical approaches,such as the Correlation Coefficient(CC),the Mean Absolute Error(MAE)and the Root Mean Square Error(RMSE),respectively,have estimated the outcomes.The estimates of both streamflow datasets are described in Tables 2–7 regarding accuracy and errors of the forecasting time-series models,such as LSSVM,WLSSVM,ANN,WANN,MARS and WMARS,respectively. In Table 2,the testing phase outcomes of the LSSVM model are observed that the M6 input combination performed well for Indus and Chenab Rivers.Likewise,Table 3 examines that WM2 and WM6 input combinations of the WLSSVM model have been estimated good results for both Indus and Chenab Rivers,respectively.After observing both tables,Table 3 yielded more inspired outcomes using the wavelet transform combined model compared to Table 2. Table 2:Forecast outcomes of LSSVM model Table 3:Forecast outcomes of WLSSVM model Additionally,in Table 4 is noticed thatM2,M5 andM1,M3 input combinations of the ANN model are better for Indus and Chenab Rivers.Similarly,WM5 andWM1 input combinations of the WANN model have given improved outcomes in the analysis of Table 5 for both Indus and Chenab Rivers.Afterward,perceiving both tables,the estimations of Table 5 are more attractive outcomes using the combined model paralleled to Table 4. Table 4:Forecast outcomes of ANN model Table 5:Forecast outcomes of WANN model Furthermore,in Table 6 is analysed thatM3 andM6 input combinations of the MARS model are obtained appropriate results for Indus and Chenab Rivers.As well in Table 7 is observed thatWM4 andWM5 input combinations for the WMARS model have been found improved estimations for both Indus and Chenab Rivers.Therefore,Table 7 valuations are improved due to the usage of the combined model than in Table 6. Table 6:Forecast outcomes of MARS model Table 7:Forecast outcomes of WMARS model Figs.3–4 are described the predicted linear trend line scatter displaying the LSSVM,WLSSVM,ANN,WANN,MARS and WMARS models employing the testing phase and show the performance as relatively to observed data on Indus and Chenab banks.The scatter lines are described in the equation of regression line(y=a+bx)by MATLAB software for each model. Clearly,Fig.3 shows that the forecasting model with wavelet transforms performs remarkably because scatter graphs are more appropriate than traditional models. Figure 3:Linear trend line scatter graphs of predicted and observed for Indus streamflow through testing phase Similarly,Fig.4 shows that forecasting models with wavelet transform have better forecasting scatter graphs than traditional models. The approximations of each model are shown in Table 8,scrutinised by various statistical parameters with small errors and large coefficient values with the evidence that models with wavelet decomposition are more efficient than the traditional models. Table 8:Comparison of forecasting results The CC-values of WD combined models are close to 100% for Indus and Chenab datasets compared to traditional models.Therefore,the WD algorithm combined with traditional models has made a tremendous impact and performs the role of a gadget to deliver improved estimations of both streamflow rivers.Consequently,the combined WLSSVM,WANN,and WMARS forecasting methodologies have been used as the second type models and provide excellent results instead of the first type traditional models LSSVM,ANN and MARS. Figure 4:Linear trend line scatter graphs of predicted and observed for Chenab streamflow through the testing phase It is concluded that the use of wavelet application with the addressed forecast time-series models has improved the efficiency and yielded tremendous results.The traditional forecasting time-series models have been prescribed by utilising the impact of the wavelet algorithm.The significance of wavelet information is to improve the efficiency of the models that determines the appropriate outcomes for time-series models.The performance of the wavelet combined models mapping with their associated resampling outcomes.Filtrations of the streamflow time-series datasets have been interpreted from the WD application and these features have not been observed in traditional models.The nonlinear input combination models have been constructed with the WD application and used as input estimators with traditional models that improve the forecast efficiency of the combined models.Therefore,the WD application has become an efficient and interesting valuable tool to analysed simulations of time-series datasets models in various domains. Thus researchers have a good argument in the future for the extensive usage of the wavelet algorithm to build up the novelty in the model or improve the existing models by suitable transform other than wavelet.The WD algorithm provides the optimum ability to pick the appropriate input and logically improves the output of the traditional forecasting models. Acknowledgement:This study has been reinforced by the Department of Basic Sciences &Related Studies,Mehran University of Engineering &Technology,Jamshoro,Sindh,Pakistan.The authors have gratefully acknowledged the institute for its support and cooperation in the research activity and providing a healthy research environment and facilities. Funding Statement:The authors have not received any financial support for the research,authorship,and publication of this article. Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.3.2 Wavelet Artificial Neural Networks(WANN)
3.3 Wavelet Least-Squares Support Vector Machine(WLSSVM)
3.4 Multivariate Adaptive Regression Splines(MARS)
3.5 Statistical Parameters Assessment
4 Results and Discussion
5 Conclusions
Computer Modeling In Engineering&Sciences2022年3期