<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OJS</journal-id><journal-title-group><journal-title>Open Journal of Statistics</journal-title></journal-title-group><issn pub-type="epub">2161-718X</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ojs.2020.103030</article-id><article-id pub-id-type="publisher-id">OJS-100794</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Forecasting the Monthly Reported Cases of Human Immunodeficiency Virus (HIV) at Minna Niger State, Nigeria
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Nwanne</surname><given-names>Christiana Umunna</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Samuel</surname><given-names>Olayemi Olanrewaju</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Statistics, University of Abuja, Abuja, Nigeria</addr-line></aff><pub-date pub-type="epub"><day>08</day><month>05</month><year>2020</year></pub-date><volume>10</volume><issue>03</issue><fpage>494</fpage><lpage>515</lpage><history><date date-type="received"><day>8,</day>	<month>April</month>	<year>2020</year></date><date date-type="rev-recd"><day>7,</day>	<month>June</month>	<year>2020</year>	</date><date date-type="accepted"><day>10,</day>	<month>June</month>	<year>2020</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  There has been a moderate increase in newly diagnosed HIV-infected Minna populace, which calls for serious attention.
   
  This study
   
  used time series data based on monthly HIV cases from January 2007 to December 2018 taken from the statistical data document on HIV prevalence recorded in General Hospital Minna, Niger State.
   
  The methodology employed to analyze the data is base
  d
   on mathematical models of ARMA, ARIMA and SARIMA which were computed and diagnosed. From the results of parameter estimation of the models, ARMA(2, 1) model was the best model among the other ARMA models using information criteria (AIC). Diagnostic test was run on the ARMA(2, 1) model where the results show that the model was adequate and normally distributed using Box-Lung test and Q
  -
  Q plot respectively. Fur
  thermore, ARIMA of first and second differences w
  as
   estimated and ARIMA(1,
   
  0,
   
  1) was the best model from the result of the AIC and diagnostic test carried out which revealed that the model was adequate and normally distributed using Box-Lung and Q-Q plot respectively. Furthermore, the results obtained in the ARMA and ARIMA models were used to arrive at a combined model given as ARIMA(1, 0, 1) 
  &#215; SARIMA(1, 0, 1)<sub>12</sub>
   
  which was subsequently estimated and found to be adequate from the result of the Box-Lung and Q-Q plot respectively. Post forecasting estimation and performance evolution were evaluated using the RMSE and MAE. The results showed that, ARIMA(1, 0, 1) 
  &#215; SARIMA(1, 0, 1)<sub>12</sub> is the best forecasting model followed by ARIMA(1, 0, 2) on monthly HIV prevalence in Minna, Niger state.
 
</p></abstract><kwd-group><kwd>Human Immunodeficiency Virus</kwd><kwd> Autoregressive Moving Average</kwd><kwd> Autoregressive Integrated Moving Average</kwd><kwd> Seasonal Autoregressive Integrated Moving Average</kwd><kwd> Forecasting</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>HIV infection has spread over the last 30 years and has a great impact on health, welfare, employment and criminal justice sectors; affecting all social and ethnic groups throughout the world. Recent epidemiological data indicate that HIV remains a public health issue that persistently drains our economic sector having claimed more than 25 million lives over the last three decades [<xref ref-type="bibr" rid="scirp.100794-ref1">1</xref>]. The estimated overall number of People Living with HIV (PLWHIV) by the end of 2014 was approximately 36.9 (34.3 - 41.4) million and Sub-Saharan Africa was the most affected region, having 25.8 (24.0 - 28.7) million PLWHIV and 66% of all people with HIV infection living in the region (Yi, 2007). Of all people living with HIV globally, 9% of them live in Nigeria [<xref ref-type="bibr" rid="scirp.100794-ref2">2</xref>]. Most cases of HIV infection in Nigeria occur via heterosexual means with epidemics more pronounced among the females [<xref ref-type="bibr" rid="scirp.100794-ref3">3</xref>]. The country already burdened by political instability and endemic political corruption as a result of almost 33 years of military rule now seems prepared to “wipe out” the virus within a few decades [<xref ref-type="bibr" rid="scirp.100794-ref3">3</xref>]. Notwithstanding the progress in institutional reforms and political commitment to tackle the disease, the country has seen more citizens placed on life-saving medication of active antiretroviral therapy (AART) to increase the survival of such HIV seropositive individuals [<xref ref-type="bibr" rid="scirp.100794-ref3">3</xref>].</p><p>This study reviewed a discussion on the prevalence of HIV in Minna, Niger State and developed a best model that predicts the monthly HIV cases in Minna by means of the Seasonal Autoregressive Integrated Moving Average (SARIMA) with Box-Jenkins Method. HIV which stands for “Human Immunodeficiency Virus” is a serious disease that is caused by a virus that spread through the body fluids which attacks the body immune system just like cancer and can lead to death. Dissimilar to some different infections, the human body can’t dispose of HIV. That implies that once you have HIV, you have it forever [<xref ref-type="bibr" rid="scirp.100794-ref2">2</xref>]. HIV is found throughout the world and is prevalent in sub-Saharan Africa, accounting for 70% of new infections yearly [<xref ref-type="bibr" rid="scirp.100794-ref2">2</xref>]. Worldwide, an estimated 36.9 million people are living with HIV and about 2 million people became newly infected in 2014 [<xref ref-type="bibr" rid="scirp.100794-ref4">4</xref>].</p><p>The earliest report of HIV dates back to 1981 with five cases of Pneumocystis carinii pneumonia in healthy young homosexual men in Los Angeles, CA. At the time, it was described as “cellular-immune dysfunction” related to “sexual contact” [<xref ref-type="bibr" rid="scirp.100794-ref5">5</xref>]. Since then, tremendous efforts have been made worldwide for the diagnosis, control and prevention of HIV. Thirty-five million people are currently living with human immunodeficiency virus (HIV) globally. While 9.7 million infected people are receiving antiretroviral therapy, 2.3 million people are newly infected every year. Transmission via semen is one of the most prevalent methods of HIV-1 transmission, accounting for up to 80% of new infections every year.</p><p>In the majority of cases, HIV is a sexually-transmitted infection. However, HIV can also be transmitted from a mother to her child, during pregnancy or childbirth (through blood or fluid exposure), or through breastfeeding. Non-sexual transmission can also occur through the sharing of injection equipment such as needles.</p><p>Today, scientists are still working to find a treatment for HIV and the recent studies show that a new vaccine will be developed by 2025 [<xref ref-type="bibr" rid="scirp.100794-ref6">6</xref>]. These are quite promising studies for the whole world. However, it is important to understand people who are living with that virus are also struggling with social, economic and psychological problems. UNAID and the National Agency for the Control of AIDS estimate that there are 1.9 million people living with HIV in Nigeria (Punch News Paper).</p><p>Results from the Nigeria HIV/AIDS Indicator and Impact Survey (NAISS) indicate a national HIV prevalence in Nigeria of 1.5% among adults aged 15 - 49 years. The survey revealed an improvement in the national prevalence rate from 3.4% in 2012 to 1.9% in 2018.</p><p>The President of Nigeria, Muhammadu Buhari early last year (2019) launched the Revised National HIV and AIDS Strategic Framework 2019-2021, which will guide the country’s future response to the epidemic.</p>Aim and Objectives<p>The general objective of this study is to develop a best model that can predict the monthly HIV cases in Minna. This is to be achieved through the following Specific objectives:</p><p>1) Formulate time series models on the data collected.</p><p>2) Conduct a diagnostic check on the models formulated to determine the most suitable model.</p><p>3) Estimate the parameters of the various models and forecast the HIV prevalence.</p></sec><sec id="s2"><title>2. Empirical Framework and Theoretical Issues</title><p>A few related works of the use of SARIMA methodology to model epidemic incidence include the following; [<xref ref-type="bibr" rid="scirp.100794-ref7">7</xref>] worked on forecasting monthly cases of Human immunodeficiency syndrome (HIV) of the Philippines. The researchers utilized advanced statistical tool in developing the model using univariate Box-Jekins method in forecasting the HIV cases per month. The result showed that monthly cases of HIV in the Philippines had an upward trend. The researchers came up with the best model based on AIC which is (2, 1, 0) &#215; (0, 0, 1)<sub>12</sub>.</p><p>[<xref ref-type="bibr" rid="scirp.100794-ref8">8</xref>] used HIV infection data from 1985 to 2012 to fit ARIMA models. Akaike Information Criterion and Schwartz Bayesian Criterion statistics were used to evaluate the constructed models. Estimation was via the maximum likelihood method. To assess the validity of the proposed models, the mean absolute percentage error (MAPE) between the number of observed and fitted HIV infections from 1985 to 2012 was calculated. The fitted ARIMA models were used to forecast the number of HIV infections from 2013 to 2017 and the result showed that the fitted number of HIV infections was calculated by optimum ARIMA(2, 2, 1) model from 1985-2012 and the number was similar to the observed number of HIV infections, with a MAPE of 13.7%.</p><p>[<xref ref-type="bibr" rid="scirp.100794-ref9">9</xref>] conducted a study with the aim of formulating a model to determine the trend, prevalence and projecting HIV/AIDS epidemics in Ethiopia. Data were obtained from UNAIDS and Ministry of Health bulletin in Ethiopia. The data was analyzed using Autoregressive Integrated Moving Average (ARIMA) time series analysis model and the ARIMA(2, 3, 2) appeared to be providing the best fit for the observed data.</p><p>[<xref ref-type="bibr" rid="scirp.100794-ref10">10</xref>] worked on Epidemiology and ARIMA model of positive-rate of influenza viruses among children in Wuhan, China. The study aims to describe the epidemiology of influenza viruses among children in Wuhan, China during the past nine influenza seasons (2007-2015) and to predict the positive rate of different types of influenza virus in the future. Their study suggests that the ARIMA model can be used to forecast the positive rate of different types of influenza virus.</p><p>The estimated results of model showed that Peads incoming is influenced by seasonal variation of data, [<xref ref-type="bibr" rid="scirp.100794-ref11">11</xref>] works on Energy Consumption Forecasting Using Seasonal ARIMA with Artificial Neural Networks Models. The quarterly energy consumption of the United States from January 1973 to June 2015 is used. It aimed to forecast the residential energy consumption in U.S. using the Box-Jenkins methodology and Artificial Neural Network approach and compared their results in order to know the best model for predicting energy consumption in U.S. From their results they concluded that the forecasting accuracy is not quite significant. But, the performance of ANN model is better than SARIMA model in terms of forecasting accuracy from the test data using MAE and MAPE, the opposite result happens for MSE. While the SARIMA model fits better the historical data (training data) than ANN models using all performance parameters.</p><p>[<xref ref-type="bibr" rid="scirp.100794-ref12">12</xref>] also worked on Forecasting Precipitation Using SARIMA Model: A Case Study of Mt. Kenya Region. Two objectives were formulated from their research which is to determine the forecasted values of precipitation in Mt. Kenya region and also to determine the accuracy of the SARIMA model in forecasting precipitation in the same region. Monthly data collected from Kenya meteorological department covering a period of 1995 to 2010 for wind data and 1970 to 2011 for precipitation data but will be limited to the available wind data. SARIMA models were fitted and the least AIC and BIC value was picked which is SARIMA(1, 0, 1) &#215; (1, 0, 0)<sub>12</sub> that turns out to be the best model since it has the least values of the information criteria and forecasting evaluation was conducted using the RMSE.</p></sec><sec id="s3"><title>3. Research Methodology</title><sec id="s3_1"><title>3.1. Research Design</title><p>The research design adopted for this study is a descriptive and Box-Jenkins research design. Descriptive survey design is a research design in which data is collected consistently to explain and predict the given situation. For this purpose, non-seasonal Box Jenkins approach is used to find the best fitted, the best forecasting model and the accuracy of the forecasting values are checked by comparing residuals. The steps of the suggested model and its forecasting can be explained in the following steps. Determining whether the time series is stationary or not is a very important concept before making any inferences in time series analysis. Therefore, Augmented Dickey Fuller (ADF) and Phillips-Person (PP) tests will be used to check the stationarity of the data series. There are several methods that can be used to fit a time series model, among them, ARMA, ARIMA, and SARIMA model which will be used on the stationary data of this study.</p></sec><sec id="s3_2"><title>3.2. Population of the Study and Research Sample</title><p>The study was carried out based on monthly data on HIV prevalence as secondary data, which was collected from document based on January 2007 to December 2018 retrievable document from the Statistical data record on HIV prevalence from the record of Communicable diseases in Minna general hospital for both male and female.</p></sec><sec id="s3_3"><title>3.3. Method of Data Collection</title><p>Documentary evidence constitutes the instrument of data collection. The major sources of data are from Minna general hospital Statistical record on communicable diseases. The data for this study are secondary monthly HIV data sourced from the General hospital Minna in Niger state from January 2007 to December 2018.</p></sec><sec id="s3_4"><title>3.4. Technique of Data Analysis and Model Specification</title><p>The advances in Time Series enable researchers to use those techniques in their analysis to re-analyze the traditional rotation analysis applied in earlier studies [<xref ref-type="bibr" rid="scirp.100794-ref13">13</xref>]. The central idea behind model identification is a time series derived from ARIMA process which has some sort of theoretical autocorrelation properties. Fitting the empirical autocorrelation patterns with the theoretical ones helps to identify the potential tentative model for the given time series data. In this step, transformation of observed time series to stationary is inevitable.</p><p>The software that was used for the test is Eviews 4.0 version.</p></sec><sec id="s3_5"><title>3.5. Autoregressive Moving Average (ARMA) Models</title><p>We can have combinations of the two processes to give a new series of models called ARMA(p, q) models. The Autoregressive model (AR) and moving average (MA).</p><p>Where</p><p>AR of order p is:</p><p>X n = m + e n + φ 1 X n − 1 + φ 2 X n − 2 + ⋯ + φ p X n − p (3.4)</p><p>for n ≥ 0, where {e<sub>n</sub>} n ≥ 0 is a series of independent, identically distributed (iid) random variables, and m is a constant.</p><p>MA of order q is:</p><p>X n = m + e n + θ 1 e n − 1 + θ 2 e n − 2 + ⋯ + θ q e n − q , (3.5)</p><p>for n ≥ 1 where θ 1 , ⋯ , θ q are real numbers and m is a real number.</p><p>The general form of the ARMA(p, q) models where p is used for the number of autoregressive components, and q for the number of moving average components is written as:</p><p>X n = m 1 + ∑ k = 1 p φ k X n − k + ∑ j = 1 q θ j e n − j + e n , n ≥ 0 , (3.6)</p><p>where {X<sub>n</sub>} n ≥ 1, is some constant, and the φ<sub>k</sub> and θ<sub>j</sub> are defined as for AR and MA models respectively.</p></sec><sec id="s3_6"><title>3.6. Autoregressive Integrated Moving Average (ARIMA) Models</title><p>Autoregressive (AR), Moving Average (MA) or Autoregressive Moving Average (ARMA) models in which differences have been taken are collectively called Autoregressive Integrated Moving Average or ARIMA models. A time series {Y<sub>t</sub>} is said to follow an integrated autoregressive moving average model if the d<sup>th</sup> difference W t = ∇ d Y t is a stationary ARMA process. If {W<sub>t</sub>} follows an ARMA(p, q) model, we say that {Y<sub>t</sub>} is an ARIMA(p, d, q) process. For example, for practical purposes, we can usually take d = 1 or at most 2.</p><p>Consider then an ARIMA(p, 1, q) process. With W t = Y t − Y t − 1 , we have</p><p>W t = ϕ 1 W t − 1 + ϕ 2 W t − 2 + ⋯ + ϕ p W t − p + ε t − θ 1 ε t − 1 − θ 2 ε t − 2 − ⋯ − θ q ε t − q (3.7)</p><p>Or, in terms of the observed series,</p><p>Y t − Y t − 1 = ϕ 1 ( Y t − 1 − Y t − 2 ) + ϕ 2 ( Y t − 2 − Y t − 3 ) + ⋯ + ϕ p ( Y t − p − Y t − p − 1 )     + ε t − θ 1 ε t − 1 − θ 2 ε t − 2 − ⋯ − θ q ε t − q . (3.8)</p></sec><sec id="s3_7"><title>3.7. Seasonal Autoregressive Integrated Moving Average (SARIMA) Models</title><p>The ARIMA model (3.7) is for non-seasonal non-stationary data. A purely seasonal time series is the one that has only seasonal AR or MA parameters. Seasonal autoregressive models are built with parameter called seasonal autoregressive (SAR) parameters. The SAR parameters represent the autoregressive relationships that exist between time series data separated by multiples of the number of periods per season. Box and Jenkins have generalized this model to deal with seasonality. Their proposed model is known as the Seasonal ARIMA (SARIMA) model. In this model seasonal differencing of appropriate order is used to remove non-stationarity from the series. A first order seasonal difference is the difference between an observation and the corresponding observation from the previous year and is calculated as X t = Y t − Y t − s . For monthly time series S = 12 and for quarterly time series S = 4 This model is generally termed as the SARIMA(p, d, q) &#215; (P, D, Q)<sub>S</sub>.</p><p>For a seasonal time series of order s, [<xref ref-type="bibr" rid="scirp.100794-ref14">14</xref>] proposed that {X<sub>t</sub>} be modelled by:</p><p>A ( L ) Φ ( L s ) ∇ s d X t = B ( L ) Θ ( L s ) ε t (3.9)</p><p>where the series must have been subjected to seasonal differencing D times and non-seasonal differencing d times, ∇ s = 1 − L s , being the seasonal differencing operator. Moreover, Φ(L) and Θ(L) are the seasonal autoregressive and moving average operators respectively. These seasonal operators are polynomials in L.</p><p>Suppose that Φ ( L ) = 1 + φ 1 L + φ 2 L 2 + ⋯ + φ P L P and Θ ( L ) = 1 + θ 1 L + θ 2 L 2 + ⋯ + θ Q L Q , then the time series {X<sub>t</sub>} is said to follow a multiplicative seasonal autoregressive integrated moving average model of orders p, d, q, P, D, Q and s, designated (p, d, q) &#215; (P, D, Q)<sub>s</sub> SARIMA model.</p></sec></sec><sec id="s4"><title>4. Presentation of Result and Finding</title><p>To really come out with a good forecasting model of the HIV Prevalence Recorded in General Hospital Minna (2007-2018) data, ARMA, ARIMA and SARIMA models were fitted to the series. Furthermore, this section also explains the behavior of the rate of contracting HIV in Minna general hospital of Nigeria, test for unit root, specification of the models, estimation of the parameters of the forecasting model using the above model, selection of the best competing forecasting models using AIC while forecast evaluation of these models using Root Mean Square Error, Mean Absolute Error and Mean Absolute Percentage Error and forecast plot for seasonal models were critically looked into.</p><sec id="s4_1"><title>4.1. Descriptive Statistics of the HIV Data</title><p>In this section, we discuss empirical results beginning with preliminary analysis conducted with the aim to determine the normality of the data. Skewness, kurtosis and Jarque-Bera show the normality of the distribution. A distribution is said to be normal when skewness is approximately zero and kurtosis is three. Also, the probability of the Jarque-Bera statistics tells whether the series is normal or not. The null hypothesis of the Jarque-Bera test says that the distribution is a normal one. Therefore, if the probability is less than 0.05, we reject the null hypothesis and conclude that the distribution is not normal (<xref ref-type="table" rid="table1">Table 1</xref>).</p><p>Furthermore, from the Jarque-Bera test for normality of each of the variables, it was observed in the above table that the variables “HIV prevalence” p-value is less than 0.1 (10%) level of significance and not at 5% level. Thus, the enter variable is normally distributed at 10% level of significance. This is a strong factor of the fundamental assumptions of the application of ARMA, ARIMA and SARIMA models. Hence, data differencing transformation is considered in order to correct for the normality assumption violation (<xref ref-type="table" rid="table2">Table 2</xref>).</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Descriptive statistics of the HIV prevalence recorded in General Hospital Minna (2007-2018)</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Statistics</th><th align="center" valign="middle" >HIV Prevalence</th></tr></thead><tr><td align="center" valign="middle" >Mean</td><td align="center" valign="middle" >85.51389</td></tr><tr><td align="center" valign="middle" >Median</td><td align="center" valign="middle" >80.00000</td></tr><tr><td align="center" valign="middle" >Maximum</td><td align="center" valign="middle" >228.0000</td></tr><tr><td align="center" valign="middle" >Minimum</td><td align="center" valign="middle" >0.000000</td></tr><tr><td align="center" valign="middle" >Std. Dev.</td><td align="center" valign="middle" >46.75049</td></tr><tr><td align="center" valign="middle" >Skewness</td><td align="center" valign="middle" >0.487059</td></tr><tr><td align="center" valign="middle" >Kurtosis</td><td align="center" valign="middle" >2.559941</td></tr><tr><td align="center" valign="middle" >Jarque-Bera</td><td align="center" valign="middle" >6.855339</td></tr><tr><td align="center" valign="middle" >Probability</td><td align="center" valign="middle" >0.032463</td></tr><tr><td align="center" valign="middle" >Sum</td><td align="center" valign="middle" >12314.00</td></tr><tr><td align="center" valign="middle" >Sum Sq. Dev.</td><td align="center" valign="middle" >312542.0</td></tr><tr><td align="center" valign="middle" >Observations</td><td align="center" valign="middle" >144</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Augmented dickey-fuller test of stationarity (ADF) of the HIV Prevalence Recorded in General Hospital Minna (2007-2018) data</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="3"  >Null Hypothesis: HIV has a unit root</th><th align="center" valign="middle" ></th></tr></thead><tr><td align="center" valign="middle"  colspan="3"  >Exogenous: Constant, Linear Trend</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle"  colspan="4"  >Lag Length: 0 (Automatic—based on SIC, maxlag = 13)</td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" >t-Statistic</td><td align="center" valign="middle" >Prob.*</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Augmented Dickey-Fuller test statistic</td><td align="center" valign="middle" >−4.411370</td><td align="center" valign="middle" >0.0029</td></tr><tr><td align="center" valign="middle" >Test critical values:</td><td align="center" valign="middle" >1% level</td><td align="center" valign="middle" >−4.023506</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" >5% level</td><td align="center" valign="middle" >−3.441552</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" >10% level</td><td align="center" valign="middle" >−3.145341</td><td align="center" valign="middle" ></td></tr></tbody></table></table-wrap><p>*MacKinnon (1996) one-sided p-values.</p></sec><sec id="s4_2"><title>4.2. Parameter Estimation of ARMA Models and Models Selection</title><p><xref ref-type="table" rid="table3">Table 3</xref> shows the results of parameter estimation and model selection for the ARMA models, where results of the different estimation parameter of ARMA were estimated with most of the parameter significant at 1% and 5%. AIC was used to select the best model that will be used for ARIMA and SARIMA model because it is the combination of AR and MA model. From the AIC, ARMA(2, 1) was selected to be the best model since it has the smallest AIC. With this selection, our ARIMA model will be AR(2) and MA(1) while the integrated difference will be of one (1) and two (2).</p></sec><sec id="s4_3"><title>4.3. Diagnostic Tests for ARMA Models</title><p>Using the best model in <xref ref-type="table" rid="table2">Table 2</xref>, the result of <xref ref-type="table" rid="table3">Table 3</xref> shows the P-value for ARMA(2, 1) indicates there is no evidence that the residuals are dependent. This further confirms that the ARMA(2, 1) model is adequate.</p><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Parameter estimation of ARMA models and models selection</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >ARMA(1, 1)</th><th align="center" valign="middle" >ARMA(2, 1)</th><th align="center" valign="middle" >ARMA(1, 2)</th><th align="center" valign="middle" >ARMA(2, 2)</th></tr></thead><tr><td align="center" valign="middle" >Intercept</td><td align="center" valign="middle" >−527.6081</td><td align="center" valign="middle" >−489.1910</td><td align="center" valign="middle" >−618.0471</td><td align="center" valign="middle" >−489.2602</td></tr><tr><td align="center" valign="middle" >AR1</td><td align="center" valign="middle" >−0.2819*</td><td align="center" valign="middle" >−0.7670*</td><td align="center" valign="middle" >−0.9625*</td><td align="center" valign="middle" >−0.7684*</td></tr><tr><td align="center" valign="middle" >AR2</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >−0.6447*</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >−0.6443*</td></tr><tr><td align="center" valign="middle" >MA1</td><td align="center" valign="middle" >−0.7131**</td><td align="center" valign="middle" >−0.2148**</td><td align="center" valign="middle" >0.1985*</td><td align="center" valign="middle" >−0.2130</td></tr><tr><td align="center" valign="middle" >MA2</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >-</td><td align="center" valign="middle" >−0.7837**</td><td align="center" valign="middle" >−0.0041</td></tr><tr><td align="center" valign="middle" >Log Likelihood</td><td align="center" valign="middle" >−2149.62</td><td align="center" valign="middle" >−2111.91</td><td align="center" valign="middle" >−2153.72</td><td align="center" valign="middle" >−2111.91</td></tr><tr><td align="center" valign="middle" >AIC</td><td align="center" valign="middle" >24.8858</td><td align="center" valign="middle" >24.6035</td><td align="center" valign="middle" >24.9447</td><td align="center" valign="middle" >24.6152</td></tr><tr><td align="center" valign="middle" >BIC</td><td align="center" valign="middle" >24.9406</td><td align="center" valign="middle" >24.6768</td><td align="center" valign="middle" >25.0176</td><td align="center" valign="middle" >24.7067</td></tr></tbody></table></table-wrap><p>* at 1%, ** at 5%.</p><p><xref ref-type="fig" rid="fig1">Figure 1</xref> presents the trends analysis of the monthly data on HIV prevalence during the period of 2007 to 2018. The HIV prevalence started in January 2007 at a very slow prevalence rate. Until about September, 2008 when there was a sharp increase on the prevalence from 50 units to about 170 units. This clearly suggests an outbreak in the HIV virus. Although a relative decline in this trend was similarly observed as from July 2009 through to mid-year 2012. Another sharp increase in the trend is also observed in November 2012 but declined to almost zero in May 2015. With a steady gradual steady increase observed from march 2016 till date. This shows that if something is not done immediately the trend will go out of control.</p></sec><sec id="s4_4"><title>4.4. Parameter Estimation of AR, MA, ARMA AND SARMA Models and Models Selection</title><p><xref ref-type="table" rid="table3">Table 3</xref> shows the results of parameter estimation and model selection for the AR, MA, ARMA &amp; SARIMA models, where results of the different estimation parameter of the models were estimated with most of the parameter significant at 1% and 5%. AIC was used to select the best model. The models AR, MA, ARMA AND SARIMA were considered because the data set is in stationary at its original state and thus requires no differencing and transformation. Hence, the order and combination of the AR and MA component of the model is determined from the Correlogram plot below (<xref ref-type="table" rid="table4">Table 4</xref>).</p><p>These plots are used to choose the order parameters for candidates ARMA model. The simple moving average (MA) model is a parsimonious time series model used to account for very short-run autocorrelation. It does have a regression like form, but here each observation is regressed on the previous innovation, which is not actually observed. A weighted sum of previous and current noise is called Moving Average (MA) model.</p><p>Model identification started with autocorrelation analysis. Plots of autocorrelation function (ACF) and partial autocorrelation function (PACF) (<xref ref-type="fig" rid="fig2">Figure 2</xref>) showed only the first lag of the ACF was significant (i.e. laying outside the grey</p><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> Correlogram plot</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="3"  >ACF and PACF Model Description</th></tr></thead><tr><td align="center" valign="middle"  colspan="2"  >Model Name</td><td align="center" valign="middle" >MOD_5</td></tr><tr><td align="center" valign="middle" >Series Name</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >HIV</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Transformation</td><td align="center" valign="middle" >None</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Non-Seasonal Differencing</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Seasonal Differencing</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Length of Seasonal Period</td><td align="center" valign="middle" >12</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Maximum Number of Lags</td><td align="center" valign="middle" >16</td></tr><tr><td align="center" valign="middle"  colspan="2"  >Process Assumed for Calculating the Standard Errors of the Autocorrelations</td><td align="center" valign="middle" >Independence (white noise)<sup>a</sup></td></tr><tr><td align="center" valign="middle"  colspan="2"  >Display and Plot</td><td align="center" valign="middle" >All lags</td></tr><tr><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td><td align="center" valign="middle" ></td></tr></tbody></table></table-wrap><p>95% CI band). It was also observed that the first few lags of ACF did not decay with time. Based on the autocorrelation structure, several potential models were <xref ref-type="table" rid="table5">Table 5</xref>. Candidate models proposed.</p><p>identified.</p><p>ACF plots display correlation between a series and its lags. In addition to suggesting the order of differencing, ACF plots can help in determining the order of the MA(q) model. Thus, as observed from the ACF plots we have MA(1, 2, 3, 4, 5, 6).</p><p>Based on the ACF/PACF plots the following candidate models was proposed (<xref ref-type="table" rid="table5">Table 5</xref>).</p><p>The candidate model with the smallest value of the residual sums of squares is the model that best fit the data at hand. Also, using order selection strategy proposed in Hannan and Rissanan (1982) and used by [<xref ref-type="bibr" rid="scirp.100794-ref15">15</xref>] and [<xref ref-type="bibr" rid="scirp.100794-ref16">16</xref>], the model with the least Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) is the best among other models under consideration.</p></sec><sec id="s4_5"><title>4.5. Parameter Estimation for Candidate Models Codes and Summary Using R-Console</title><p>&gt; library(forecast)</p><p>&gt; library(“ggplot2”)</p><p>&gt; library(“forecast”)</p><p>&gt; library(“tseries”)</p><p>&gt; data = ts(read.csv(“data.hiv.csv”, header = TRUE, stringsAsFactors = FALSE))</p><p>&gt; ma1 &lt;- arima(data, order = c(0, 0, 1))</p><p>&gt; ma2 &lt;- arima(data, order = c(0, 0, 2))</p><p>&gt; ma3 &lt;- arima(data, order = c(0, 0, 3))</p><p>&gt; ma4 &lt;- arima(data, order = c(0, 0, 4))</p><p>&gt; ma5 &lt;- arima(data, order = c(0, 0, 5))</p><p>&gt; ma6 &lt;- arima(data, order = c(0, 0, 6))</p><p>&gt; summary(ma1)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 1))</p><p>Coefficients:</p><p>ma1 intercept</p><p>0.6401 85.2213</p><p>s.e. 0.0594 4.8635</p><p>sigma^2 estimated as 1273: log likelihood = −719.33, aic = 1444.66</p><p>&gt; summary(ma2)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 2))</p><p>Coefficients:</p><p>ma1 ma2 intercept</p><p>0.6542 0.3323 85.0295</p><p>s.e. 0.0869 0.0709 5.5283</p><p>sigma^2 estimated as 1125: log likelihood = −710.43, aic = 1428.87</p><p>&gt; summary(ma3)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 3))</p><p>Coefficients:</p><p>ma1 ma2 ma3 intercept</p><p>0.6543 0.4557 0.4208 85.0665</p><p>s.e. 0.0847 0.0761 0.0764 6.4169</p><p>sigma^2 estimated as 939.9: log likelihood = −697.68, aic = 1405.37</p><p>&gt; summary(ma4)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 4))</p><p>Coefficients:</p><p>ma1 ma2 ma3 ma4 intercept</p><p>0.6724 0.5009 0.5015 0.1576 85.0292</p><p>s.e. 0.0812 0.0890 0.0862 0.0740 7.0642</p><p>sigma^2 estimated as 912.2: log likelihood = −695.54, aic = 1403.08</p><p>&gt; summary(ma5)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 5))</p><p>Coefficients:</p><p>ma1 ma2 ma3 ma4 ma5 intercept</p><p>0.6746 0.5279 0.5277 0.2259 0.1656 84.9291</p><p>s.e. 0.0869 0.1014 0.0925 0.0848 0.0793 7.6515</p><p>sigma^2 estimated as 884.3: log likelihood = −693.31, aic = 1400.63</p><p>&gt; summary(ma6)</p><p>Call:</p><p>arima(x = data, order = c(0, 0, 6))</p><p>Coefficients:</p><p>ma1 ma2 ma3 ma4 ma5 ma6 intercept</p><p>0.6370 0.493 0.5412 0.2747 0.2829 0.1507 84.9344</p><p>s.e. 0.0871 0.100 0.0939 0.0884 0.1044 0.0952 8.1966</p><p>sigma^2 estimated as 869.9: log likelihood = −692.15, aic = 1400.31</p><p>&gt; ar1 &lt;- arima(data, order = c(1,0,0))</p><p>&gt; summary(ar1)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 0))</p><p>Coefficients:</p><p>ar1 intercept</p><p>0.7637 84.4499</p><p>s.e. 0.0532 10.3551</p><p>sigma^2 estimated as 900.1: log likelihood = −694.55, aic = 1395.1</p><p>&gt; arma1&lt;-arima(data, order = c(1, 0, 1))</p><p>&gt; arma2&lt;-arima(data, order = c(1, 0, 2))</p><p>&gt; arma3&lt;-arima(data, order = c(1, 0, 3))</p><p>&gt; arma4&lt;-arima(data, order = c(1, 0, 4))</p><p>&gt; arma5&lt;-arima(data, order = c(1, 0, 5))</p><p>&gt; arma6&lt;-arima(data, order = c(1, 0, 6))</p><p>&gt; summary(arma1)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 1))</p><p>Coefficients:</p><p>ar1 ma1 intercept</p><p>0.8448 −0.1980 84.4697</p><p>s.e. 0.0555 0.0981 12.3268</p><p>sigma^2 estimated as 878.1: log likelihood = −692.79, aic = 1393.58</p><p>&gt; summary(arma2)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 2))</p><p>Coefficients:</p><p>ar1 ma1 ma2 intercept</p><p>0.8311 −0.2073 0.0587 84.5462</p><p>s.e. 0.0640 0.1039 0.1051 12.0457</p><p>sigma^2 estimated as 876.1: log likelihood = −692.63, aic = 1395.27</p><p>&gt; summary(arma3)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 3))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 intercept</p><p>0.768 −0.1151 0.026 0.1704 84.6665</p><p>s.e. 0.096 0.1311 0.109 0.0967 11.0960</p><p>sigma^2 estimated as 858: log likelihood = −691.16, aic = 1394.33</p><p>&gt; summary(arma4)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 4))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 intercept</p><p>0.8049 −0.1446 0.0054 0.1729 −0.0938 84.6406</p><p>s.e. 0.0928 0.1231 0.1021 0.0932 0.1069 11.4072</p><p>sigma^2 estimated as 853.3: log likelihood = −690.78, aic = 1395.57</p><p>&gt; summary(arma5)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 5))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 ma5 intercept</p><p>0.7282 −0.0884 0.0604 0.2236 −0.0561 0.1425 84.8513</p><p>s.e. 0.1266 0.1438 0.1100 0.0963 0.1093 0.1021 11.1302</p><p>sigma^2 estimated as 841.4: log likelihood = −689.83, aic = 1395.66</p><p>&gt; summary(arma6)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 6))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 ma5 ma6 intercept</p><p>0.6865 −0.0478 0.0853 0.2518 −0.0288 0.1512 0.0439 84.9150</p><p>s.e. 0.1731 0.1839 0.1286 0.1198 0.1288 0.1054 0.0970 10.9617</p><p>sigma^2 estimated as 840.2: log likelihood = −689.73, aic = 1397.46</p><p>&gt; sarma1&lt;-arima(data, order = c(1, 0, 1), seasonal = list(order = c(1, 0, 1), period = 12))</p><p>&gt; sarma2&lt;-arima(data, order = c(1, 0, 2), seasonal = list(order = c(1, 0, 2), period = 12))</p><p>&gt; sarma3&lt;-arima(data, order = c(1, 0, 3), seasonal = list(order = c(1, 0, 3), period = 12))</p><p>&gt; sarma4&lt;-arima(data, order = c(1, 0, 4), seasonal = list(order = c(1, 0, 4), period = 12))</p><p>&gt; sarma5&lt;-arima(data, order = c(1, 0, 5), seasonal = list(order = c(1, 0, 5), period = 12))</p><p>&gt; sarma6&lt;-arima(data, order = c(1, 0, 6), seasonal = list(order = c(1, 0, 6), period = 12))</p><p>&gt; summary(sarma1)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 1), seasonal = list(order = c(1, 0, 1), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 sar1 sma1 intercept</p><p>0.8399 −0.1750 −0.6316 0.7791 84.4652</p><p>s.e. 0.0552 0.0977 0.3538 0.3155 13.1139</p><p>sigma^2 estimated as 845.2: log likelihood = −690.59, aic = 1393.17</p><p>&gt; summary(sarma2)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 2), seasonal = list(order = c(1, 0, 2), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 ma2 sar1 sma1 sma2 intercept</p><p>0.8245 −0.1879 0.0717 −0.6498 0.8021 0.0059 84.5483</p><p>s.e. 0.0630 0.1054 0.1039 0.6473 0.6575 0.1714 12.8811</p><p>sigma^2 estimated as 841.7: log likelihood = −690.35, aic = 1396.7</p><p>&gt; summary(sarma3)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 3), seasonal = list(order = c(1, 0, 3), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 sar1 sma1 sma2 sma3</p><p>0.7548 −0.0856 0.0523 0.1840 −0.1493 0.3056 −0.0128 0.1610</p><p>s.e. 0.0966 0.1308 0.1067 0.0948 0.5515 0.5475 0.1286 0.1154</p><p>intercept</p><p>83.514</p><p>s.e. 13.396</p><p>sigma^2 estimated as 812.1: log likelihood = −688.04, aic = 1396.08</p><p>&gt; summary(sarma4)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 4), seasonal = list(order = c(1, 0, 4), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 sar1 sma1 sma2 sma3</p><p>0.7795 −0.1115 0.034 0.1849 −0.0602 0.5665 −0.4278 −0.1129 0.1898</p><p>s.e. 0.0991 0.1308 0.106 0.0937 0.1185 1.4295 1.4192 0.2226 0.1184</p><p>sma4 intercept</p><p>−0.1457 83.3802</p><p>s.e. 0.2379 12.8059</p><p>sigma^2 estimated as 808.5: log likelihood = −687.79, aic = 1399.58</p><p>&gt; summary(sarma5)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 5), seasonal = list(order = c(1, 0, 5), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 ma5 sar1 sma1</p><p>0.7210 −0.0708 0.0679 0.2325 −0.0364 0.1106 0.2541 −0.1284</p><p>s.e. 0.1306 0.1528 0.1115 0.1026 0.1226 0.1020 1.3827 1.3780</p><p>sma2 sma3 sma4 sma5 intercept</p><p>−0.0660 0.1698 −0.0931 −0.0366 83.5674</p><p>s.e. 0.1874 0.1119 0.2567 0.1631 12.2876</p><p>sigma^2 estimated as 802.2: log likelihood = −687.19, aic = 1402.38</p><p>&gt; summary(sarma6)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 6), seasonal = list(order = c(1, 0, 6), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 ma2 ma3 ma4 ma5 ma6 sar1 sma1</p><p>0.6824 −0.0383 0.0986 0.2631 −0.0172 0.1171 0.0532 0.476 −0.3471</p><p>s.e. 0.1673 0.1804 0.1330 0.1244 0.1307 0.1025 0.1065 NaN NaN</p><p>sma2 sma3 sma4 sma5 sma6 intercept</p><p>−0.0877 0.1773 −0.1227 −0.0133 0.0171 83.6397</p><p>s.e. NaN 0.0892 NaN 0.1252 NaN 12.6138</p><p>sigma^2 estimated as 801.4: log likelihood = −687.08, aic = 1406.16</p><p>Estimated value of the parameter of the best model</p><p>&gt; summary(sarma1)</p><p>Call:</p><p>arima(x = data, order = c(1, 0, 1), seasonal = list(order = c(1, 0, 1), period = 12))</p><p>Coefficients:</p><p>ar1 ma1 sar1 sma1 intercept</p><p>0.8399 −0.1750 −0.6316 0.7791 84.4652</p><p>s.e. 0.0552 0.0977 0.3538 0.3155 13.1139</p><p>sigma^2 estimated as 845.2: log likelihood = −690.59, aic = 1393.17.</p><p>The result shows the estimation of the best model and also identifies the significance of its parameter. Based on the computed value of the coefficient for each parameter and its standard error, the absolute quotient value of the AR1, MA1, SAR1, SMA1 respectively, is greater than 0.05, it means that there is statistical sufficient evidence to say that the parameters are significant (<xref ref-type="table" rid="table6">Table 6</xref>).</p><table-wrap id="table5" ><label><xref ref-type="table" rid="table6">Table 6</xref></label><caption><title> Candidate models performance summary based on the Akaike information criterion (AIC)</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >sn</th><th align="center" valign="middle" >MODEL</th><th align="center" valign="middle" >log likelihood</th><th align="center" valign="middle" >Akaike info criterion (AIC)</th><th align="center" valign="middle" >Model Rank</th></tr></thead><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >MA(1)</td><td align="center" valign="middle" >−719.33</td><td align="center" valign="middle" >1444.66</td><td align="center" valign="middle" >3</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >MA(2)</td><td align="center" valign="middle" >−710.43</td><td align="center" valign="middle" >1428.87</td><td align="center" valign="middle" >6</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >MA(3)</td><td align="center" valign="middle" >−697.68</td><td align="center" valign="middle" >1405.37</td><td align="center" valign="middle" >12</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >MA(4)</td><td align="center" valign="middle" >−695.54</td><td align="center" valign="middle" >1403.08</td><td align="center" valign="middle" >11</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >MA(5)</td><td align="center" valign="middle" >−693.31</td><td align="center" valign="middle" >1400.63</td><td align="center" valign="middle" >10</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >MA(6)</td><td align="center" valign="middle" >−692.15</td><td align="center" valign="middle" >1400.31</td><td align="center" valign="middle" >13</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >AR(1)</td><td align="center" valign="middle" >−694.55</td><td align="center" valign="middle" >1395.1</td><td align="center" valign="middle" >5</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >ARMA(1, 1)</td><td align="center" valign="middle" >−692.79</td><td align="center" valign="middle" >1393.58</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >ARMA(1, 2)</td><td align="center" valign="middle" >−692.63</td><td align="center" valign="middle" >1395.27</td><td align="center" valign="middle" >7</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >ARMA(1, 3)</td><td align="center" valign="middle" >−691.16</td><td align="center" valign="middle" >1394.33</td><td align="center" valign="middle" >4</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >ARMA(1, 4)</td><td align="center" valign="middle" >−690.78</td><td align="center" valign="middle" >1395.57</td><td align="center" valign="middle" >8</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >ARMA(1, 5)</td><td align="center" valign="middle" >−689.83</td><td align="center" valign="middle" >1395.66</td><td align="center" valign="middle" >16</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >ARMA(1, 6)</td><td align="center" valign="middle" >−689.73</td><td align="center" valign="middle" >1397.46</td><td align="center" valign="middle" >17</td></tr><tr><td align="center" valign="middle" >14</td><td align="center" valign="middle" >SARIMA(1, 0, 1) (1, 0, 1)<sub>12</sub></td><td align="center" valign="middle" >−690.59</td><td align="center" valign="middle" >1393.17</td><td align="center" valign="middle" >1*</td></tr><tr><td align="center" valign="middle" >15</td><td align="center" valign="middle" >SARIMA(1, 0, 2) (1, 0, 2)<sub>12</sub></td><td align="center" valign="middle" >−690.35</td><td align="center" valign="middle" >1396.7</td><td align="center" valign="middle" >15</td></tr><tr><td align="center" valign="middle" >16</td><td align="center" valign="middle" >SARIMA(1, 0, 3) (1, 0, 3)<sub>12</sub></td><td align="center" valign="middle" >−688.04</td><td align="center" valign="middle" >1396.08</td><td align="center" valign="middle" >18</td></tr><tr><td align="center" valign="middle" >17</td><td align="center" valign="middle" >SARIMA(1, 0, 4) (1, 0, 4)<sub>12</sub></td><td align="center" valign="middle" >−687.79</td><td align="center" valign="middle" >1399.58</td><td align="center" valign="middle" >14</td></tr><tr><td align="center" valign="middle" >18</td><td align="center" valign="middle" >SARIMA(1, 0, 5) (1, 0, 5)<sub>12</sub></td><td align="center" valign="middle" >−687.19</td><td align="center" valign="middle" >1402.38</td><td align="center" valign="middle" >19</td></tr><tr><td align="center" valign="middle" >19</td><td align="center" valign="middle" >SARIMA(1, 0, 6) (1, 0, 6)<sub>12</sub></td><td align="center" valign="middle" >−687.08</td><td align="center" valign="middle" >1406.16</td><td align="center" valign="middle" >9</td></tr></tbody></table></table-wrap><p>*The best performing model.</p><p><xref ref-type="fig" rid="fig3">Figure 3</xref> shows the residual plot of the best model created as part of residual diagnostics of the model. This shows that the variance of the error term are seems to be constant. It also shows that the average of the residual is approximately equal to zero.</p><p><xref ref-type="fig" rid="fig3">Figure 3</xref> further shows the residual analysis to identify the normality of error terms. Since the computed p-value of Jarque-Bera test with p-value is greater than 0.05 level of significance, there is statistical evidence not to reject or fail to reject the null hypothesis of the normality of error term. This means that the error term is normally distributed.</p><p><xref ref-type="table" rid="table7">Table 7</xref> shows the residual analysis in identifying the independency of error term for Autoregressive Conditional Heteroskedasticity (ARCH). Since the computed p-value Box-Ljung test is equal to 0.1846 which is greater than the assigned alpha 5%, there is a statistical sufficient evidence to say that the error term is independent.</p><p><xref ref-type="fig" rid="fig4">Figure 4</xref> shows the Independency of error term generalized autoregressive conditional heteroskedasticity (GARCH) (informal way). It is however noticed</p><p>that no spike hits the line at any lag, this strongly suggests that the model is free of white noise (<xref ref-type="fig" rid="fig5">Figure 5</xref>).</p></sec><sec id="s4_6"><title>4.6. Forecast with the Fitted Model</title><p>One of the objectives of fitting and selecting the best model from AR/MA/ ARMA/SARIMA model to data is to be able to forecast its future values. The model that best fits the data going by the various statistics given in <xref ref-type="table" rid="table8">Table 8</xref> below is SARIMA(1, 0, 1) &#215; (1, 0, 1)<sub>12</sub>.</p><p><xref ref-type="fig" rid="fig6">Figure 6</xref> shows the point forecast (blue), it indicates that the forecasted value from the created model has an increasing and decreasing trend from 2019 January-2019 October with a semi-continuous increase in January, 2019 till October, 2019.</p><table-wrap id="table6" ><label><xref ref-type="table" rid="table7">Table 7</xref></label><caption><title> Ljung-box test. Independency of error term for Autoregressive Conditional Heteroskedasticity (ARCH)</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="7"  >data: Residuals from ARIMA(1, 0, 1)(1, 0, 1) [<xref ref-type="bibr" rid="scirp.100794-ref12">12</xref>] with non-zero mean</th></tr></thead><tr><td align="center" valign="middle"  rowspan="2"  >Model</td><td align="center" valign="middle"  rowspan="2"  >Number of Predictors</td><td align="center" valign="middle" >Model Fit statistics</td><td align="center" valign="middle"  colspan="3"  >Ljung-Box Q (18)</td><td align="center" valign="middle"  rowspan="2"  >Number of Outliers</td></tr><tr><td align="center" valign="middle" >Stationary R-squared</td><td align="center" valign="middle" >Statistics</td><td align="center" valign="middle" >DF</td><td align="center" valign="middle" >Sig.</td></tr><tr><td align="center" valign="middle" >HIV-Model_1</td><td align="center" valign="middle" >0</td><td align="center" valign="middle" >0.603</td><td align="center" valign="middle" >7.5221</td><td align="center" valign="middle" >5</td><td align="center" valign="middle" >0.1846</td><td align="center" valign="middle" >0</td></tr></tbody></table></table-wrap><p>Total lags used: 10.</p><table-wrap id="table7" ><label><xref ref-type="table" rid="table8">Table 8</xref></label><caption><title> Forecast of data using SARIMA(1, 0, 1) &#215; (1, 0, 1)<sub>12</sub></title></caption><table><tbody><thead><tr><th align="center" valign="middle"  colspan="8"  >Forecast data</th></tr></thead><tr><td align="center" valign="middle"  colspan="2"  >Date</td><td align="center" valign="middle" >Point.Forecast</td><td align="center" valign="middle" >Lo.80</td><td align="center" valign="middle" >Hi.80</td><td align="center" valign="middle" >Lo.95</td><td align="center" valign="middle"  colspan="2"  >Hi.95</td></tr><tr><td align="center" valign="middle"  rowspan="10"  >2019</td><td align="center" valign="middle" >JANUARY</td><td align="center" valign="middle" >96.94</td><td align="center" valign="middle" >59.68</td><td align="center" valign="middle" >134.20</td><td align="center" valign="middle" >39.96</td><td align="center" valign="middle" >153.93</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >FEBRUARY</td><td align="center" valign="middle" >91.71</td><td align="center" valign="middle" >46.97</td><td align="center" valign="middle" >136.45</td><td align="center" valign="middle" >23.28</td><td align="center" valign="middle" >160.14</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >MARCH</td><td align="center" valign="middle" >91.21</td><td align="center" valign="middle" >41.87</td><td align="center" valign="middle" >140.56</td><td align="center" valign="middle" >15.75</td><td align="center" valign="middle" >166.68</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >APRIL</td><td align="center" valign="middle" >80.85</td><td align="center" valign="middle" >28.50</td><td align="center" valign="middle" >133.20</td><td align="center" valign="middle" >0.79</td><td align="center" valign="middle" >160.91</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >MAY</td><td align="center" valign="middle" >76.16</td><td align="center" valign="middle" >21.79</td><td align="center" valign="middle" >130.53</td><td align="center" valign="middle" >−6.99</td><td align="center" valign="middle" >159.31</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >JUNE</td><td align="center" valign="middle" >80.46</td><td align="center" valign="middle" >24.71</td><td align="center" valign="middle" >136.20</td><td align="center" valign="middle" >−4.80</td><td align="center" valign="middle" >165.72</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >JULY</td><td align="center" valign="middle" >83.64</td><td align="center" valign="middle" >26.94</td><td align="center" valign="middle" >140.34</td><td align="center" valign="middle" >−3.08</td><td align="center" valign="middle" >170.36</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >AUGUST</td><td align="center" valign="middle" >82.33</td><td align="center" valign="middle" >24.97</td><td align="center" valign="middle" >139.70</td><td align="center" valign="middle" >−5.40</td><td align="center" valign="middle" >170.06</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >SEPTEMBER</td><td align="center" valign="middle" >85.62</td><td align="center" valign="middle" >27.79</td><td align="center" valign="middle" >143.45</td><td align="center" valign="middle" >−2.82</td><td align="center" valign="middle" >174.06</td><td align="center" valign="middle" ></td></tr><tr><td align="center" valign="middle" >OCTOBER</td><td align="center" valign="middle" >89.81</td><td align="center" valign="middle" >31.66</td><td align="center" valign="middle" >147.96</td><td align="center" valign="middle" >0.87</td><td align="center" valign="middle" >178.75</td><td align="center" valign="middle" ></td></tr></tbody></table></table-wrap><p>The fitted number of HIV infections was calculated by optimum SARIMA(1, 0, 1) model from 2019 January-2019 October. The fitted number or the inbound forecast was similar to the observed number of HIV cases.</p></sec></sec><sec id="s5"><title>5. Summary</title><p>This study revealed that SARIMA(1, 0, 1) (1, 0, 1)<sub>12</sub> without drift is the best fit mathematical model forecasting monthly cases of Human Immunodeficiency Virus (HIV) of Minna population. Time series data which is monthly HIV new cases in Minna General Hospital (year 2007-2018) was used. Models such as ARMA, ARIMA and SARIMA were used with a monthly dataset from “January 2007”, to “December, 2018”. The preliminary analysis of the data obtained shows that the distribution of the monthly HIV cases in Minna is stationary at first difference and result of Jarque-Bera statistic revealed that Minna HIV data is not normally distributed as the probability-values is less than 1% and 5%. The Parameter of the ARMA models and Models selection were estimated with most of the parameter significant at 1% and 5%. AIC was used to select the best model that was used for ARIMA and SARIMA models because it is the combination of AR and MA model. From the AIC, ARMA(1, 1) was selected to be the best model since it has the smallest AIC. The diagnostic test shows that ARMA(1, 1) shows no evidence that the residual is dependent, also the Q-Q plot result confirmed that the model is normally distributed.</p><p>More so, ARIMA of first and second difference were estimated and ARIMA(1, 0, 1) was the best model from the result of the AIC and diagnostic test carried out which revealed that the model was adequate and normally distributed using Box-Lung and Q-Q plot respectively. From the results of the parameter estimated, most of the parameters were significant and SARIMA(1, 0, 1) was selected to be the best model since it has the smallest AIC. A diagnostic test also was evaluated which confirms that SARIMA(1, 0, 1) is an adequate model because the residual is not dependent and the Q-Q plot is normally distributed.</p><p>Furthermore, estimating the SARIMA model, shows that the parameter are significant at 1% and 5% and the diagnostic test indicate that SARIMA(1, 0, 1) &#215; (1, 0, 1)<sub>12</sub> without drift is an adequate model since there is no evidence of dependent in the residual of the model and the Q-Q plot is normally distributed. The monthly HIV cases in Minna time series were normal on its level but stationary at first difference. The range of monthly cases that occurred from year 2007 to 2018 is from 147 to 845 cases and the highest peak happened in May 2009 and May 2015 with 182 cases.</p></sec><sec id="s6"><title>6. Conclusion</title><p>The following conclusions are derived from the findings presented:</p><p>1) The monthly HIV cases from 2017 to 2018 show an increasing trend, somewhat have a cycle and seasonality as well.</p><p>2) It found out that the highest increase of the HIV cases is on November 2012 to September 2013 and the highest decrease of the HIV cases is on January 2007 to September 2008.</p><p>3) The best model that can predict the HIV monthly cases is SARIMA(1, 0, 1) &#215; (1, 0, 1)<sub>12</sub> without drift.</p><p>4) The forecasted value of the created model has moderate increasing trend.</p><p>5) The average forecasted value is half of the actual value from January 2007.</p><p>Therefore, in this study based on the seasonal pattern of HIV prevalence in Minna, the SARIMA model is proposed as a useful tool for monitoring prevalence. The results of the study will be beneficial specifically to Niger State Government for prevention and control of HIV and Nigeria Government.</p></sec><sec id="s7"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s8"><title>Cite this paper</title><p>Umunna, N.C. and Olanrewaju, S.O. (2020) Forecasting the Monthly Reported Cases of Human Immunodeficiency Virus (HIV) at Minna Niger State, Nigeria. Open Journal of Statistics, 10, 494-515. https://doi.org/10.4236/ojs.2020.103030</p></sec></body><back><ref-list><title>References</title><ref id="scirp.100794-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">World Health Organization Fact Sheet (2014) Global Update on the Health Sector Response to HIV. Geneva.</mixed-citation></ref><ref id="scirp.100794-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">UNAIDS (2013) UNAIDS Report on the Global AIDS Epidemic 2013. Geneva.</mixed-citation></ref><ref id="scirp.100794-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Nigeria National Agency for the Control of AIDS (2012) Global AIDS Response: Country Progress Report. GARPR, Abuja.</mixed-citation></ref><ref id="scirp.100794-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Nigeria National Agency for the Control of AIDS (2010) United Nations General Assembly Special Session (UNGASS) Country Progress Report. Nigeria: January 2008 to December 2009.</mixed-citation></ref><ref id="scirp.100794-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Kee, M.K., Lee, J.H., Chu, C., et al. (2009) Characteristics of HIV Seroprevalence of Visitors to Public Health Centers under the National HIV Surveillance System in Korea: Cross Sectional Study. BMC Public Health, 9, Article No. 123. https://doi.org/10.1186/1471-2458-9-123</mixed-citation></ref><ref id="scirp.100794-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Fritzer, F., Gabriel, M. and Johann, S. (2002) Forecasting Austrian HICP and Its Components Using VAR and ARIMA Models. Working Papers 73, Oesterreichische National Bank (Austrian Central Bank).</mixed-citation></ref><ref id="scirp.100794-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Apa-Ap, R. and Tolosa, H.L. (2017) Forecasting the Monthly Cases of Human Immunodeficiency Virus (HIV) of the Philippines. Indian Journal of Science and Technology, 11, 1-10. https://doi.org/10.17485/ijst/2018/v11i47/121923</mixed-citation></ref><ref id="scirp.100794-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Yu, H.-K., et al. (2013) Forecasting the Number of Human Immunodeficiency Virus Infections in the Korean Population Using the Autoregressive Integrated Moving Average Model. Osong Public Health and Research Perspectives, 4, 358-362. https://doi.org/10.1016/j.phrp.2013.10.009</mixed-citation></ref><ref id="scirp.100794-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Demissew, T.G. (2015) Modeling and Projection of HIV/AIDS Epidemics in Ethiopia Using ARIMA. Master’s Thesis, University of Nairobi College of Physical and Biological Sciences, School of Mathematics, Nairobi.</mixed-citation></ref><ref id="scirp.100794-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">He, Z.R. and Tao, H.B. (2018) Epidemiology and ARIMA Model of Positive-Rate of Influenza Viruses among Children in Wuhan, China: A Nine-Year Retrospective Study. International Journal of Infectious Diseases, 74, 61-70. https://doi.org/10.1016/j.ijid.2018.07.003</mixed-citation></ref><ref id="scirp.100794-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Abdoulaye, C., Wang, F. and Liu, X. (2016) Energy Consumption Forecasting Using Seasonal ARIMA with Artificial Neural Networks Models. International Journal of Business and Management, 11, 231-243. https://doi.org/10.5539/ijbm.v11n5p231</mixed-citation></ref><ref id="scirp.100794-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Kibunja, H.W., Kihoro, J.M., Orwa, G.O. and Yodah, W.O. (2014) Forecasting Precipitation Using SARIMA Model: A Case Study of Mt. Kenya Region. Mathematical Theory and Modeling, 4, 50-58.</mixed-citation></ref><ref id="scirp.100794-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Dickey, D. and Fuller, W. (1997) Distribution of the Estimators for Autoregressive Time Series with a Unit Root. Journal of the American Statistical Association, 74, 427-431. https://doi.org/10.1080/01621459.1979.10482531</mixed-citation></ref><ref id="scirp.100794-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Box, G.P. and Jenkins, G.M. (1976) Time Series Analysis, Forecasting and Control. Holden-Day, San Francisco.</mixed-citation></ref><ref id="scirp.100794-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Eni, D. and Adesola, A.W. (2013) Sarima Modelling of Passenger Flow at Cross Line Limited, Nigeria. Journal of Emerging Trends in Economics and Management Sciences, 4, 427-432.</mixed-citation></ref><ref id="scirp.100794-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Yi, J., Du, C.T., Wang, R.H., et al. (2007) Applications of Multiple Seasonal Autoregressive Integrated Moving Average (ARIMA) Model on Predictive Incidence of Tuberculosis. Chinese Journal of Preventive Medicine, 41, 118-121.</mixed-citation></ref></ref-list></back></article>