Xun-Zhng Pn ,Xi-Rn M ,Li-Ning Wng ,Y-Chen Lu ,b,Ji-Qun Di ,Xing Li
a School of Economics and Management,China University of Petroleum,Beijing 102249,China
b Economics & Technology Research Institute,China National Petroleum Corporation,Beijing 100724,China
c Key Laboratory of Oil and Gas Market Simulation and Price Forecasting,China National Petroleum Corporation,Beijing 100724,China
d Institue of Energy,Peking University,Beijing 100080,China
Keywords:
ABSTRACT
In recent years,with the rapid growth of refining and chemical production capacities,the gap between refined oil production and demand has expanded in China.China's refined oil wholesale market has gradually changed from a seller's market to a buyer's market.The Chinese government is relaxing market access for refined oil products(NDRC,2013).In this context,refined oil wholesale prices have basically achieved marketization in China.Refined oil uses crude oil as the primary raw material,so its prices are closely related to international crude oil prices.Assessing the spillover of international crude oil prices on China's refined oil prices could provide a reference for decision-makers to understand the relationship of China's oil market with the international market.Refined oil prices are also closely related to the interests of oil companies.Forecasting prices could help China's oil industry improve its production and operation management.
Abbreviations:ARCHAutoregressive conditional heteroscedasticity BPBack propagation CCC-GARCH Constant conditional correlational-generalized autoregressive conditional heteroscedasticity DCC-GARCH dynamic conditional correlational-generalized autoregressive conditional heteroscedasticity GARCHGeneralized autoregressive conditional heteroscedasticity KMOKaiser-Meyer-Olkin LSTMLong short-term memory PCAPrincipal component analysis PCA-BPPrincipal component analysis-back propagation VARVector autoregression VAR-BEKK-GARCH Vector autoregression-Baba,Engle,Kraft,and Kroner-generalized autoregressive conditional heteroscedasticity
In developed countries,refined oil prices have been marketized and adjusted flexibly.It has been proposed in the literature that the variation of refined oil prices in developed countries is driven by the variation of international crude oil prices,and there is a longterm equilibrium relationship between the two prices.For example,through a multivariate Johansen test,Asche et al.(2003)found that changes in crude oil prices would affect refined oil product prices in the Northwest European market in the long term.Gjolberg and Johnsen(1999)established a single-equation model with crude oil prices as the exogenous variable and concluded that crude oil prices determined refined oil prices in the United States(U.S.).Borenstein et al.(1997)established an asymmetric error correction model and found that the data from many developed countries supported the asymmetric relationship between crude oil prices and gasoline prices.Borenstein and Shepard(2002)showed that wholesale gasoline prices had a lagged response to the shock of crude oil prices.Chouinard and Perloff(2007)studied the influences of different factors on gasoline retail and wholesale prices in the U.S.and found that the crude oil price variation was the dominant factor driving the gasoline price variation.Compared with developed countries,China's refined oil market has been affected by governmental policies to a deeper degree and for a longer time.Some Chinese researchers used retail price data to explore the linkage of China's refined oil market with the international crude oil market.For example,Mi(2020)established an error correction model to analyze fluctuations of retail prices of refined oil products in China and international crude oil prices.It was found that crude oil prices had a guiding effect on China's refined oil retail prices.By using a cointegration analysis and an asymmetric error correction model,Cui and Li(2015)found that there was a longterm cointegration relationship between international crude oil prices and China's gasoline retail prices,and changes in crude oil prices preceded changes in gasoline retail prices in time.Han et al.(2017)adopted the VAR model and found that the shock of international crude oil prices had a significantly codirectional impact on China's refined oil retail prices.They concluded that the variation of refined oil retail prices in China followed the variation of crude oil prices in the international market,and China passively adjusted retail prices in response to the fluctuation of international prices.
The literature has mostly applied traditional econometrics or machine learning models(Costa et al.,2021)as the main method for forecasting energy prices including crude oil prices.For example,He et al.(2021)used an improved PCA method,Hsu et al.(2016)used a quadratic sinusoidal trend approach,and de Medeiros et al.(2022)used a mixed data sampling model to forecast crude oil prices,respectively.Di Sanzo(2018)proposed that the GARCH model had good analysis and prediction performance for the memory and dynamic switching mechanism of crude oil volatility.Urolagin et al.(2021)processed data using Markovian and Z-Score transformations,and then used the LSTM neural network to predict WTI(West Texas Intermediate)crude oil prices.Cen and Wang(2019)also applied the LSTM network to the WTI and BRENT crude oil price forecasting.Sun and Lei(2021)applied the empirical mode decomposition to time series,and then used the PCA to determine the input of the BP neural network to predict carbon prices.Liu et al.(2021)showed that the BP neural network had excellent performance in forecasting crude oil prices.
For China,almost all existing studies in the literature have focused on retail prices whose frequency is 10 working days.Compared with retail prices of state-owned oil companies,wholesale prices of private enterprises and local refineries are more affected by the market and can better reflect the real supplydemand situation in the market.In China,the trading volume of refined oil products from private enterprises and local refineries has maintained rapid growth in recent years,reaching approximately one-third of the national total in 2020(CNPC,2021).China has been exploring the reform of oil-pricing mechanism(Chen S et al.,2020).On 13 January 2016,the National Development and Reform Commission(NDRC)of China improved its oil-pricing mechanism with the announcement“refined oil prices should be adjusted normally;when they should rise,they will rise,and when they should fall,they will fall”(www.ndrc.gov.cn/xxgk/zcfb/tz/201601/t20160113_963566.html).Since then,oil price adjustments in China have been sped up,and more entities have entered the market.In addition,in practice,Chinese oil companies are not only concerned about the trend of crude oil prices,but also the trend of refined oil prices,both of which are closely related to their benefits.However,as mentioned above,existing studies of price forecasting have mainly focused on crude oil.
This paper could contribute to the literature in the following aspects:
·For the first time,our own-monitored daily-frequency wholesale prices of China's private enterprises and local refineries will be released.The spillover effects of international crude oil prices on China's refined oil prices from these high-frequency data could better reflect price fluctuations and help decision-makers understand the real relationship between China's refined oil market and the international oil market.
·By dividing the original data into two time series,this paper will further compare the changes in spillover before and after the NDRC announcement,which could help decision-makers understand the effects of their improved mechanism and further promote China's oil-pricing mechanism and policy design.
·This paper will also try the PCA-BP neural network model to forecast China's refined oil wholesale prices.Initiating further studies on refined oil price forecasting could help provide additional price information to assist the production and operation management of Chinese oil companies.
The remainder of this paper is organized as follows.Section 2 describes the methods and data.Section 3 conducts the spillover analysis.Section 4 conducts the price forecasting.Lastly,Section 5 provides conclusions and policy suggestions.
In economics,spillover means that the activity of one object has an effect on another object,which may be in a different market and seem unrelated to it.Spillover generally includes mean spillover and volatility spillover.The mean spillover is a first-order moment which reflects the interaction between markets by changes in means.Empirical studies often use the VAR model(Sims,1980)to analyze the mean spillover between markets,and could use methods such as the Granger causality analysis(Granger,1969)and the variance decomposition(Wickens,1998)to further measure the temporal order of changes and factor contributions,respectively.The volatility spillover is a second-order moment which reflects the interaction between markets by changes in variances.Empirical studies often use the GARCH family models(Bollerslev,1986),such as BEKK-GARCH,CCC-GARCH,and DCC-GARCH,to examine the volatility spillover between markets,and could use the Wald Test(Sims,1980a)to verify the volatility spillover.The VAR(p)model is given as Eq.(1),where Rtis a stationary time sequence with a lag order of p,μis the model constant,Фjis the coefficient matrix,and εtis the residual term and normally distributed with a zero mean and constant variance.
There are many nonlinear and random factors in crude oil and refined oil markets(Ouyang et al.,2021).Compared with other multivariate GARCH models,the BEKK-GARCH model(Engle and Kroner,1995),which requires fewer parameters to be estimated and does not mandate a positive definiteness of the covariance matrix,has been widely used in the study of the volatility of energy finance time series(e.g.,Thenmozhi and Maurya,2020;Zankawah and Stewart,2020;Zolfaghari et al.,2020).In particular,BEKKGARCH(1,1)is most commonly used in the literature(Tan et al.,2019;Chen Y et al.,2020;Yu et al.,2020)to avoid the increasing number of estimated parameters.Therefore,this paper will use the daily-frequency price data to run the VAR(p)-BEKK-GARCH(1,1)model,as shown in Eq.(2),to quantify spillover effects of international crude oil(WTI and Brent)prices on China's diesel and gasoline wholesale prices,respectively.GARCH establishes a conditional variance equation for the residual sequence of the mean equation.In Eq.(2),Htrepresents the conditional variancecovariance matrix(hi,tprovides the variance of the i-th market variable and its conditional covariance with other markets on the tth day),A is the ARCH coefficient matrix which reflects the volatility clustering,B is the GARCH coefficient matrix which reflects the volatility persistence,C is the lower triangular constant matrix,and σi,tis the standard normal distribution residual.
The PCA(Pearson,1901)is a commonly used statistical method for linearly transforming correlated variables X=(X1,X2,…,Xp)Tinto uncorrelated composite variables Y=(Y1,Y2,…,Yn)T(n<p;Yi=u1iX1+u2iX2+…+upiXp).Y that satisfies the following conditions is called the principal component of X:1)ui'ui=1;2)there is no correlation between Yiand Yj;3)among X1,X2,…,Xplinear combinations that are not correlated to Y1,Y2,…,Yi-1and satisfy 1),Yihas the largest variance.More information on the PCA can be found in studies such as Abdi and Williams(2010),He et al.(2021),and Lee and Jemain(2021).
The BP neural network(Rumelhart et al.,1986)is a multilayer feedforward neural network trained according to an error back propagation algorithm,which can learn a range of input-output mapping patterns.The basic structure of the BP network is illustrated in Fig.1.Before training,the weights between the input layer and the hidden layer,and between the hidden layer and the output layer,are initialized to random values.The learning of the BP neural network is generally divided into two processes:forward propagation and error back propagation.In the forward propagationprocess,the input information is transmitted to the output layer through the calculation of the hidden layer.If the expected target(e.g.,a preset error value)is not achieved,the error change of the output layer is calculated and transmitted back to the input layer,and the weights between layers are adjusted in the direction of error reduction.The above processes are repeated until the target is finally achieved.More information on the BP neural network can be found in studies such as Hecht-Nielsen(1992),Gupta and Nigam(2010),Sun and Huang(2020).In this paper,the PCA and the BP neural network are combined(the PCA-BP model)to try to forecast refined oil wholesale prices in China.Before running the BP,using the PCA to process raw data could help reduce potential deviations caused by autocorrelation and collinearity,and could also help the network avoid the curse of dimensionalityand improve calculation efficiency.
This paper uses the daily-frequency wholesale prices of 0#diesel and 92# gasoline of private enterprises and local refineries during the period from 3 January 2013 to 31 December 2020 as China's refined oil prices.The data are obtained from the monitoring of the Key Laboratory of Oil and Gas Market Simulation and Price Forecasting of China National Petroleum Corporation(see Appendix;it's easy to find that refined oil wholesale prices fluctuate more greatly than retail prices in China).The daily WTI and BRENT crude oil spot prices during the same period,obtained from the WIND database(www.wind.com.cn),are used as international crude oil prices.WTI crude oil,the light intermediate base crude oil in U.S.West Texas,is the benchmark for global crude oil pricing.BRENT crude oil is the second largest crude oil pricing benchmark in the world and also plays an important role across the international community.The trends of the four price series are overall consistent(Fig.2),which intuitively features a potential linkage between China's refined oil wholesale prices and international crude oil prices.At the beginning of 2020,due to COVID-19 and geopolitics,international crude oil prices plummeted,causing negative WTI prices.This paper converts price data to rate of return data for subsequent spillover analysis.
As shown in Table 1,the standard deviation of the return rate of international crude oil prices is much larger than that of China's refined oil wholesale prices,implying that crude oil prices fluctuate greatly and are susceptible to other factors,which is consistent with the findings of Nonejad(2020).The variation of China's refined oil wholesale prices is comparatively small,which is inseparable from China's national regulations on retail prices.China has imposed restrictions on gasoline and diesel retail prices,setting“ceiling prices”and“floor prices”and“when crude oil prices in the international market linked for domestic refined oil are higher than$130 per barrel,the maximum retail prices of gasoline and diesel will not rise or only rise little;when crude oil prices are lower than$40 per barrel,the maximum retail prices will no longer be adjusted downwards”(www.gov.cn/zwgk/2009-05/08/content_1308543.htm).All time series pass the stationary test at the 1%significance level,indicating that they are stationary.
Fig.1.Schematic diagram of a typical three-layer neural network structure.Notes:there might be several hidden layers in a neural network.
Fig.2.Variations of China's refined oil wholesale prices and international crude oil prices.
Table1 Descriptive statistics.
In this paper,the VAR model is constructed according to a lag order of 8(the optimal lag order according to the likelihood ratio test,final prediction error,Akaike information criterion,Hannan-Quinn information criterion,and Schwartz information criterion).It is found from Table 2(the VAR model passes the stationary test)that China's refined oil(both diesel and gasoline)wholesale prices are greatly affected by its own previous values(lag terms),which is consistent with China's unique oil-pricing formation system.Meanwhile,the results of significance show that China's refined oil wholesale prices are also affected by international crude oil prices.It is preliminarily inferred that international crude oil prices have a mean spillover on wholesale prices of refined oil in China.
Table2 Model estimation results.
The Granger causality analysis is further used to evaluate the temporal order of changes in China's refined oil wholesale prices and international crude oil prices,and the variance decompositionis used to explore the contribution of crude oil price variations to wholesale price variations.Table 3 clearly shows that WTI is the Granger cause of diesel and gasoline at the 1% significance level,and BRENT is the Granger cause of diesel and gasoline at the 5%significance level.This justifies that international crude oil prices are the Granger cause of China's refined oil wholesale prices.Changes in crude oil prices precede and might have a guiding effect on changes in wholesale prices,which is consistent with the findings for retail prices in Mi(2020),and Cui and Li(2015).
Table3 Granger causality test results.
Fig.3 further reflects that China's refined oil market does not respond immediately to external shocks from the international crude oil market(the decomposition result is 0 when the period is 1)and there is a certain time lag.Contributions of WTI and BRENT price variations to China's refined oil wholesale price variations increase gradually over period.For example,the contribution of the WTI(BRENT)price variation to the diesel price variation increases from 0.62%(0.61%)in period 2 to 4.30%(2.83%)in period 10,and increases from 0.47%(0.39%)in period 2 to 4.18%(3.05%)in period 10 for gasoline.Overall,the impact of the WTI price variation on China's refined oil wholesale price variation tends to be slightly greater than that of BRENT.
In Table 4,a21and b21represent the ARCH and GARCH volatility spillover of WTI crude oil prices to China's refined oil wholesale prices,which reflect the clustering and persistence of volatility,respectively.a31and b31correspond to BRENT.It is found that all coefficients except for a31and b21for diesel are significant at the 1%level.This indicates that WTI has a significant ARCH effect on both diesel and gasoline in China,and a significant GARCH effect on gasoline;BRENT has a significant GARCH effect on both diesel and gasoline,and a significant ARCH effect on gasoline.|a21|>|a31|implies that WTI shows a stronger agglomeration shock to China's refined oil market,and|b31|>|b21|implies that BRENT shows a stronger persistent volatility transmission.In summary,international crude oil prices have a significant volatility spillover on China's refined oil wholesale prices.
Table4 Volatility spillover for refined oil wholesale market.
The Wald test further reflects(Table 5)a significant volatility spillover of WTI and BRENT to diesel and gasoline,verifying that the international crude oil market has an important impact on China's refined oil market.
Table5 Wald test results.
In this section,the original data are divided into two time stages by the NDRC's announcement on 13 January 2016 to explore the effects of the improved oil-pricing mechanism on the linkage of China's refined oil market with the international crude oil market.
Fig.3.Variance decomposition results.
The VAR models constructed in the two time stages(model details are not shown here due to the space limitation)verify that there is a relationship between China's refined oil wholesale prices and international crude oil prices within the entire time frame studied in this paper.Table 6 further shows that WTI is the Granger cause of diesel and gasoline at the 1%significance level in both two stages;BRENT is the Granger cause of gasoline at the 5% significance level in both two stages,and becomes the Granger cause of diesel at the 5% significance level in the second stage.
Table6 Granger causality test results by time stage.
By comparing the two-stage variance decomposition results(Fig.4),it is found that the contribution of the BRENT price variation in the first stage is lower than that in the second stage,reflecting that with the improvement of China's oil-pricing mechanism and the relaxation of the oil market,the impact of the BRENT price variation on the variation of China's refined oil wholesale prices is increasing.The contribution of the WTI price variation for diesel in the second stage is smaller than that in the first stage when the period is greater than 4,indicating that with the mechanism adjustment,the impact of the WTI price variation on China's diesel wholesale price variation is weakening to some extent.
The volatility spillover by time stage is given in Table 7.For diesel,WTI and BRENT have a significant GARCH effect in both stages,indicating that the international crude oil market has already formed a significant persistent volatility transmission to China's diesel wholesale market since 2013.In the second stage,WTI and BRENT also have a significant ARCH effect.For gasoline,in the first stage,WTI and BRENT have a significant ARCH effect but no significant GARCH effect,indicating that the international crude oil market did not form a significant persistent volatility transmission to China's gasoline wholesale market at that time.However,in the second stage,WTI and BRENT are shown to have both significant ARCH and GARCH effects on gasoline,implying that a significant persistent transmission has been gradually formed since 2016.In summary,with the improved oil-pricing mechanism,the volatility spillover from the international crude oil market to China's refined oil market gradually increases.
Fig.4.Variance decomposition in two time stages.Notes:‘I’indicates the first time stage(before 2016/1/13)and‘II’indicates the second time stage(after 2016/1/13).
Table7 Volatility spillover for refined oil wholesale market by time stage.
Wholesale prices provide useful information to assist the production and operation of oil companies.On the basic of the literature(e.g.,Borenstein and Shepard,2002;Zhang et al.,2018;Li et al.,2019;Yang et al.,2021),the spillover analysis above,the reality of
These three principal components are linear combinations of the original seven input variables,as given in Eq.(3).For gasolinerelated data,two principal components are extracted following the same steps,as given in Eq.(4).In both equations,X1to X7represent WTI spot prices,BRENT spot prices,Daqing spot prices,Shengli spot prices,diesel(gasoline)productions,crude oil productions,and consumer price indices,respectively.Note that these components are based on monthly data between 2013 and 2020.Future applications could dynamically update according to prediction accuracy requirements and the latest available data.
China's refined oil market,and data availability,the original input variables selected here for forecasting diesel(gasoline)wholesale prices include WTI and BRENT crude oil spot prices,Daqing and Shengli crude oil spot prices in China,diesel(gasoline)productions in China,crude oil productions in China,and consumer price indices in China.Daqing and Shengli crude oil spot prices are obtained from the WIND database.China's crude oil,diesel and gasoline productions and consumer price indices are obtained from the EPS DATA platform(www.epsnet.com.cn/index.html#/Home).Since these data are monthly-frequency,this section provides a monthly forecast for wholesale prices,and uses the average daily prices for each month in Fig.2.Overall,seven original input variables are used to forecast monthly diesel and gasoline wholesale prices in this section.
For diesel-related data,the KMO and Butterlet tests find that the original input variables are correlated,so the PCA processing is performed first.As mentioned earlier,using PCA to preprocess data is helpful to reduce biases caused by potential issues such as data autocorrelation and collinearity.The correlation coefficient matrix of the original variables is calculated to derive the total variance explained table(Table 8).According to the principle that the cumulative percentage of variance is greater than 80%,the first three components are extracted as the principal components for diesel.
Table8 Total variance explained for refined oil wholesale prices.
In this paper,a three-layer BP neural network is applied with an input layer,a hidden layer and an output layer,and the optimal number of nodes in the hidden layer is determined through debugging.Of the monthly samples between January 2013 and December 2020,70%are used as the training set,15%are used as the validation set,and the remaining 15% are used as the test set.The training target for error is set to 0.5%,the threshold of learning rate is set to 1%,the minimum allowable gradient value in training is set to 1×10-6,and the maximum number of training times is set to 10,000.As long as one of these settings is reached,the training will stop.
For diesel,the input of the BP network is the three principal components identified in Eq.(3),and the output is predicted diesel wholesale prices.Forecasting results are illustrated in Fig.5a-d.It is shown that predicted values of the training set are highly consistent with actual prices(the R-value reaches 0.97),indicating that the model has been well trained.Predicted values of the validation set are also consistent with actual prices,indicating that the model learned by the training set is reliable.The difference between predicted values of the test set and actual prices are small(the R-value reaches 0.99),further suggesting that the trained and validated BP neural network has a good capability to forecast China's diesel wholesale prices.For all sample data,predicted values can be considered to capture the fluctuation and magnitude
of actual prices(the average error is only 3.92%).For gasoline,the input is the two principal components identified in Eq.(4),and the output is predicted gasoline wholesale prices.As presented in Fig.5e-h,the R-values of the training set,verification set,and test set all exceed 0.96.Predicted values and their trend are highly consistent with actual prices(the average error is only 2.58%),suggesting that the trained and validated BP neural network also has a good capability to forecast gasoline wholesale prices.In summary,the PCA-BP neural network model is shown to be an applicable approach to forecasting China's refined oil wholesale prices.
Fig.5.Comparison of predicted values(Output)and actual values(Target)of wholesale prices.
In this paper,for the first time,China's daily-frequency refined oil prices were contributed to the literature.The results obtained from the VAR-BEKK-GARCH model justified that international crude oil prices had significant mean spillover and volatility spillover effects on China's refined oil prices.With the improvement of China's oil-pricing mechanism,the volatility spillover from the international crude oil market was further enhanced.The PCA-BP model was competent for providing a reference for China's refined oil wholesale prices.
Oil will continue to play an important role in China's economy in the coming several decades(Pan et al.,2017,2020).In the wave of marketization of refined oil prices,some suggestions can be made here.First,China could continue to improve its oil-pricing mechanism.The government could try to further shorten the oil price adjustment period and relax restrictions on the price adjustment range,which could promote a closer linkage of China's refined oil prices with international crude oil prices.Second,the government might also consider issuing refined oil futures.Refined oil futures could further increase the vitality of China's oil market,enable oil prices to accurately reflect the real supply-demand relationship,and drive the reform of the spot market.Third,the government could promote reasonable and orderly competition in the market,and gradually shift from a regulator to a supervisor,so that China's refined oil market can be steadily integrated with the international market and eventually fully marketized.
There are limitations in this paper.For example,due to the time difference between the large-scale outbreak of COVID-19 in China and across the world,this paper didn't assess the impact of COVID-19.Due to China's regulations on retail prices,the global outbreak of COVID-19 would likely have a greater impact on international crude oil prices than on China's refined oil prices.Although novel daily-frequency data of China's refined oil prices were contributed,the models used in this paper(while competent for research questions here)were not innovative.Future studies could consider developing improved or more advanced models to assess these new data or forecast prices.
Acknowledgements
The authors are grateful to the financial support from the Science Foundation of China University of Petroleum,Beijing(2462020YXZZ038).
Appendix A.Supplementary data
Supplementary data to this article can be found online at https://doi.org/10.1016/j.petsci.2022.03.013.