<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">EPE</journal-id><journal-title-group><journal-title>Energy and Power Engineering</journal-title></journal-title-group><issn pub-type="epub">1949-243X</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/epe.2022.148018</article-id><article-id pub-id-type="publisher-id">EPE-119354</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Engineering</subject></subj-group></article-categories><title-group><article-title>
 
 
  An Application of Decision Trees Algorithm to Project Hourly Electricity Spot Price as Support for Decision Making on Electricity Trading in Brazil
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Cosme</surname><given-names>Rodolfo R. dos Santos</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Roberto</surname><given-names>Castro</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Rafael</surname><given-names>Marques</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Independent Consultant, Vinhedo/SP, Brazil</addr-line></aff><aff id="aff3"><addr-line>Brazilian Association of Photovoltaic Solar Energy (ABSOLAR), S&amp;amp;atilde;o Paulo, Brazil</addr-line></aff><aff id="aff1"><addr-line>State University of Campinas, School of Electrical and Computer Engineering, Campinas, Brazil</addr-line></aff><pub-date pub-type="epub"><day>11</day><month>08</month><year>2022</year></pub-date><volume>14</volume><issue>08</issue><fpage>327</fpage><lpage>342</lpage><history><date date-type="received"><day>21,</day>	<month>July</month>	<year>2022</year></date><date date-type="rev-recd"><day>20,</day>	<month>August</month>	<year>2022</year>	</date><date date-type="accepted"><day>23,</day>	<month>August</month>	<year>2022</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Estimating the price of a financial asset or any tradable product is a complex task that depends on the availability of a reasonable amount of data samples. In the Brazilian electricity market environment, where spot prices are centrally calculated by computational models, the projection of hourly energy prices at the spot market is essential for decision-making, and with the particularities of this sector, this task becomes even more complex due to the stochastic behavior of some variables, such as the inflow to hydroelectric power plants and the correlation between variables that affect electricity generation, traditional statistical techniques of time series forecasting present an additional complexity when one tries to project scenarios of spot prices on different time horizons. To address these complexities of traditional forecasting methods, this study presents a new approach based on Machine Learning methodology applied to the electricity spot prices forecasting process. The model’s Learning Base is obtained from public information provided by the Brazilian official computational models: NEWAVE, DECOMP, and DESSEM. The application of the methodology to real cases, using back-testing with actual information from the Brazilian electricity sector demonstrates that the research is promising, as the adherence of the projections with the realized values is significant.
 
</p></abstract><kwd-group><kwd>Artificial Intelligence</kwd><kwd> Machine Learning</kwd><kwd> Price Estimation</kwd><kwd> Energy Planning</kwd><kwd> Spot Electricity Market</kwd><kwd> Spot Prices Forecast</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The use of computational techniques in time series forecasting is one of the most active fields in academic research. With the advent of machine learning techniques, new algorithms have been developed and often made available through libraries for large-scale use.</p><p>This study discusses some characteristics of electricity price formation for the spot electricity market in Brazil, also known as by the initials SPD (Settlement Price for the Differences), and its applications through a new methodology for time series forecasting, using Machine Learning techniques, more specifically the XGBoost algorithm.</p><p>When XGBoost is compared to ARIMA [<xref ref-type="bibr" rid="scirp.119354-ref1">1</xref>], it is observed that XGBoost has advantages, such as the lack of a need to preprocess the data, a fast operation speed, complete feature extraction, a good fitting effect, and high prediction accuracy. When XGBoost is compared to Deep Learning techniques [<xref ref-type="bibr" rid="scirp.119354-ref2">2</xref>], the main conclusion points out that boosting methods demand fewer data and features. Other studies [<xref ref-type="bibr" rid="scirp.119354-ref3">3</xref>], beyond time series scope, also point out that XGBoost is more suitable for tabular data, in the case of this study. The main reasons are specific features of tabular structure: irregular patterns in the target function, uninformative features, and non-rotationally-invariant data where linear combinations of features misrepresent the information.</p><p>In this study, the time series features will be passed to the model as input variables, allowing the representation of seasonality and time cycles. A variable representing the characteristics of the Brazilian Interconnected Electricity System (BSIN), from the official models NEWAVE, DECOMP, and DESSEM, is also inserted in the model. Different forecast horizons will be provided, and especially for the very short and short term, the forecast performance will be measured [<xref ref-type="bibr" rid="scirp.119354-ref4">4</xref>] and presented.</p></sec><sec id="s2"><title>2. Characteristics of the Brazilian Electricity Price Definition</title><p>The BSIN is characterized by the interconnection, through the Basic Power Transmission Network, of four of the country’s subsystems: Northeast, North, South, and Southeast, the latter together with the Center-West. The interconnected system is operated centrally by the National Electric System Operator (ONS) and is integrated by different electricity generation and transmission companies, which may be public or privately owned.</p><p>The Brazilian electricity generation profile makes the system predominantly hydrothermal; that is; most of the energy consumed is generated in hydroelectric powerplants, in addition to thermal plants (nuclear, natural gas, biomass, coal, and fuel oil) and renewables such as wind and solar plants.</p><p>Data from ANEEL, published in January 2022, indicate that 60.12% of the electric energy demand is met by hydroelectric powerplants (HPP), 8.95% by thermal power plants (TPP) operating on natural gas, 5.02% by TPPs operating on liquid petroleum-derived fuels, and 1.97% by coal-fired TPPs. These Hydro and Thermal sources meet about 76% of the energy demand [<xref ref-type="bibr" rid="scirp.119354-ref5">5</xref>].</p><p>Renewable sources, such as wind, small hydro, and solar power plants, play an important role in the composition of the electricity generation matrix, but in the operational optimization process, the expected renewable generation from these plants is deducted directly from the expected Gross Demand for electricity and is not explicitly considered in the centralized operation optimization process.</p><p>Due to the stochastic nature of the Gross Demand and the renewable generation, this operation allows a single parcel to be presented to the hydrothermal optimization model, and this parcel is known as the Net Energy Demand, adopted as deterministic (single scenario) in the SPD calculation process.</p><p>The system’s hydrothermal characteristic impacts the electricity generation’s price since the cost of generating electricity is a function of the optimal dispatch of the hydraulic and thermal sources to minimize the cost over an operation horizon.</p><p>In pricing the SPD, the computer models used by ONS to operate the BSIN optimize the operation and calculate the Marginal Cost of Operation (MCO). These models take into account the constant changes in the operating condition of the system, especially related to meteorological issues—favorable or not of the Affluent Natural Energy (ANE) and Energy Stored (ES) in the reservoirs of the hydroelectric power plants—as well as the Unit Generation Costs (UGC) of each thermal power plant.</p><sec id="s2_1"><title>2.1. Variable Related to the Hydrothermal System</title><p>The chain of computer models, more specifically NEWAVE, DECOMP, and DESSEM, are used by both ONS, the system operator, and the Chamber of Electric Energy Commercialization (CCEE), the market operator but for different purposes. ONS seeks the best way to operate the power system to guarantee the supply of demand at the lowest cost. CCEE aims to determine the SPD, for each submarket with hourly granularity, which will be used in the short-term market accounting and financial settlement (i.e., spot market) [<xref ref-type="bibr" rid="scirp.119354-ref6">6</xref>].</p><p>To guarantee a connection between the results generated by the two institutions, ONS runs the models first, and then CCEE uses these results and treats the electric constraints and the generating units under tests, processes the models again, and finally publishes the hourly prices. The coupling among the three models occurs through the so-called Future Cost Function (FCF). This function represents a cost associated with each of the several possible trajectories of the state variables and energy storage in the reservoirs of the hydroelectric power plants, together with the respective thermal complements, to satisfy the Net energy demand. <xref ref-type="fig" rid="fig1">Figure 1</xref> below shows a relation of each model, using FCF as a coupling parameter.</p><p>The hydrological scenario fluctuation is the main responsible for the FCF variability, so it is mandatory to include a variable in the Machine Learning model that represents this oscillation between the various scenarios. In this research, it will be used a weekly version of the SPD—also called SPD Week Level—as this</p><p>independent input variable. It represents the electricity price level, divided into the blocks of heavy, intermediate, and low electricity consumption throughout the week, as follows:</p><p>• Very Short and Short Term:</p><p>• SPD Week Level from the MCO of the DESSEM and DECOMP output deck, specifically the deterministic part of the models.</p><p>• Medium Term:</p><p>• SPD Week Level from the MCO from the DECOMP output deck, specifically the stochastic part of the model.</p><p>• Long Term</p><p>• SPD Week Level from the MCO of the NEWAVE output deck, presenting a stochastic approach for longer-term hydrological scenarios.</p></sec><sec id="s2_2"><title>2.2. Temporal Variables</title><p>The price forecast model considers and represents the patterns associated with the time profile of the various horizons considered. Thus, the following independent variables were included, as shown in <xref ref-type="table" rid="table1">Table 1</xref>:</p><p>• Hour, it can represent the hourly fluctuations (i.e., peak and valley hours of prices), essential for very short-term analysis;</p><p>• Day of the Week and Day of the Month, which can represent oscillations within the same week and month, is suitable for short-term analysis (i.e., separation of weekdays and weekends, holidays, beginning of the month, etc.);</p><p>• Month, to represent the effects of annual economic and meteorological events (i.e., dry season, rainy season, scholar vacations, year-end festivals, carnival, etc.).</p></sec><sec id="s2_3"><title>2.3. Hourly Pricing Variable (Hourly SPD)</title><p>This is the dependent and target variable of the forecast. To obtain the correlation of this variable with all the possible values assigned to the others, it is part of the database used in the learning process.</p><p>The hourly SPD history used starts on April 17, 2018, and extends until July 16, 2021, with daily updates, and the hourly SPD value started to be effectively used in CCEE’s accounting and financial settlement as of 1/1/2021. Between</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Southeast sub Market XGBoost model variables for April 17, 2018</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Hourly SPD (R$/MWh)</th><th align="center" valign="middle" >Weekly SPD (R$/MWh)</th><th align="center" valign="middle" >Month</th><th align="center" valign="middle" >Day</th><th align="center" valign="middle" >Day of the Week</th><th align="center" valign="middle" >Hour</th></tr></thead><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >1</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >2</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >3</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >4</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >5</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >118.17</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >6</td></tr><tr><td align="center" valign="middle" >40.16</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >7</td></tr><tr><td align="center" valign="middle" >116.86</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >8</td></tr><tr><td align="center" valign="middle" >119</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >9</td></tr><tr><td align="center" valign="middle" >121.45</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >10</td></tr><tr><td align="center" valign="middle" >121.45</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >11</td></tr><tr><td align="center" valign="middle" >119.07</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >12</td></tr><tr><td align="center" valign="middle" >121.41</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >13</td></tr><tr><td align="center" valign="middle" >122.74</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >14</td></tr><tr><td align="center" valign="middle" >123.81</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >15</td></tr><tr><td align="center" valign="middle" >121.35</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >16</td></tr><tr><td align="center" valign="middle" >119.11</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >17</td></tr><tr><td align="center" valign="middle" >120.76</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >18</td></tr><tr><td align="center" valign="middle" >119.16</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >19</td></tr><tr><td align="center" valign="middle" >119.11</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >20</td></tr><tr><td align="center" valign="middle" >119.1</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >21</td></tr><tr><td align="center" valign="middle" >118.31</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >22</td></tr><tr><td align="center" valign="middle" >114.26</td><td align="center" valign="middle" >125.33</td><td align="center" valign="middle" >4</td><td align="center" valign="middle" >17</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >23</td></tr></tbody></table></table-wrap><p>April 2018 and December 2020, the parameter was calculated on an experimental basis, known as “Shadow Operation”.</p></sec></sec><sec id="s3"><title>3. XGBoost: A Decision Tree Based Algorithm</title><p>Decision Trees are branching structures with three types of nodes used in data classification. The root node represents the entire data set. After it, there are internal nodes, which represent the variables in the data set and the decision criteria for further branching. In summary, each internal node will contain a comparison of a given variable x<sub>i</sub> ∈ X—also called the independent or input variable of the model—against a specific value and criterion, such as: “is x<sub>i</sub> ≥ 23.7?” [<xref ref-type="bibr" rid="scirp.119354-ref7">7</xref>]. From the answer to this comparison presented at this inner node, which can assume “TRUE” or “FALSE” values, there will be a new branch to the left or right. This branching will continue until there is no more possibility to proceed, either because all variables have been entered into the model or because all samples have been correctly classified. The last node of the tree is called the leaf and consists of the target variable of the prediction commonly called the dependent variable and notated as y<sub>i</sub>.</p><p>The most common and used structures are called CART (Classification and Regression Trees) because of their applicability in several classes of problems, which involve classification and regression.</p><p>As advantages of using this type of structure, one can list, as presented by [<xref ref-type="bibr" rid="scirp.119354-ref7">7</xref>]:</p><p>• Non-linearity, due to the possibility of representing complex data classification boundaries through logical branching;</p><p>• Support for categorical variables, which result in binary outcomes, TRUE or FALSE, for example;</p><p>• Interpretability, due to the easily built of a self-explanatory structure;</p><p>• Robustness, due to the possibility of exponential growth of new variables and possible tests;</p><p>• Application in regression problems, in addition to the traditional classification applications in the Data Science universe. This advantage is crucial to obtain a prediction of a numerical value, the target of this study.</p><p>The XGBoost stands for Extreme Gradient Boosting and is a scalable and distributed gradient-boosted decision tree (GBDT) machine learning library. It is built upon distributed computing and parallel tree boosting, and the model uses the following concepts: supervised machine learning, decision trees, ensemble learning, and gradient boosting.</p><p>In this study, two XGBoost model metrics will be assessed. The first is called Weight and represents the frequency of use of which variable to generate new branches. A higher value of Weight for a variable also indicates its greater importance in the definition of the forecast model. A higher value of Weight for a variable indicates its greater importance in the definition of the forecast model.</p><p>The second is called Gain split, based on Equation (1) and indicates the average Gain value of a variable used in the creation of new branches. A higher Gain value for a variable, relative to the others also means its greater importance in generating forecasts.</p><p>L s p l i t = L E + L D − L R − γ = 1 2 [ ( ∑ i ∈ I E g i ) 2 ∑ i ∈ I E h i + λ + ( ∑ i ∈ I D g i ) 2 ∑ i ∈ I D h i + λ − ( ∑ i ∈ I R g i ) 2 ∑ i ∈ I R h i + λ ] − γ (1)</p></sec><sec id="s4"><title>4. Results</title><p>To facilitate the presentation of graphs and results, only the Southeast submarket will be addressed, but the machine learning model can also be applied to other electrical submarkets, with very similar results. The main metropolitan regions of Brazil are part of the Southeast submarket, making this chosen scenario a good example to discuss the proposed methodology.</p><p>Additionally, the performances of the very short-term and short-term forecasts were evaluated with two metrics commonly used in linear regression assessment, which are MAPE and RMSE (Mean Absolute Percentage Error and Root Mean Square Error, respectively).</p><p>A third metric was specially created by the authors and was named TAI (Trend Accuracy Index), defined according to Equation (2). This index represents the adherence between the actual hourly SPD variation, verified between subsequent hours, with the forecasted variation in these same periods. To exemplify, consider that between 00:00 and 01:00 on a given day, an increase in the SPD was predicted, and when verifying the actual SPD for the same period, it was found that the actual SPD also showed an increase. In this example, a score of 1 will be assigned for this hit, and if it is not similar, a score of 0 will be assigned.</p><p>This metric aims to provide the decision-maker with the sensibility on what to expect from the price behavior in the analysis horizon, answering with certain accuracy the question: In this horizon will the SPD go up or down?</p><p>After the analysis of all intervals, all scores are added up and the result is divided by the maximum possible score, which is 24 (number of hours in a day). The result, multiplied by 100, will represent a percentage score of the adherence between the Forecast and the Realized, in the variation of the SPD.</p><p>s c o r e i = { 1 ,   if   F o r i &gt; F o r i + 1   and   R e a l i &gt; R e a l i + 1 ,     or   F o r i &lt; F o r i + 1   and   R e a l i &lt; R e a l i + 1 0 , otherwise .</p><p>T A I = ∑ i = 1 24 s c o r e i 24 &#215; 100 (2)</p><p>where:</p><p>F o r i = Variable of the predicted hourly SPD increase or decrease, for period i.</p><p>R e a l i = Variable of the real hourly SPD increase or decrease, for period i.</p><p>s c o r e i = grade assigned for the forecast accuracy, for period i.</p><sec id="s4_1"><title>4.1. Weight Criterion</title><p>The first model indicator is called Weight, and <xref ref-type="fig" rid="fig2">Figure 2</xref> shows an example obtained for Southeast. As per the results, the SPD Week Level variable was the highlight, with a utilization varying from 607 times for the Northeast to 717 times for the South. And with the worst performance in this aspect, the Day of the Week variable presented a usage varying from 86 for the Southeast to 108 for the North.</p><p>The second variable of note here is Month for Southeast, where it was relevant to define new branches. The same result was obtained for South as well, but for Northeast and North, the variable Day was the second more relevant.</p></sec><sec id="s4_2"><title>4.2. Gain Criterion</title><p><xref ref-type="fig" rid="fig3">Figure 3</xref> presents the results for the second metric called Gain, obtained for Southeast. It was verified that the SPD Week Level excels the other variables, just like the Weight criterion, with average Gain values ranging from 0.78 for the Northeast to 0.9 for the North.</p><p>It is worth mentioning that the Gain values are normalized so that the sum of all the Gains equals 1. Additionally, in <xref ref-type="fig" rid="fig3">Figure 3</xref>, the model variables are represented by the acronyms f0 (SPD Week Level), f1 (Month), f2 (Day), f3 (Day of the Week), and f4 (Hour).</p><p>The second variable of note here is Month, which was relevant to all submarkets. The third more relevant variable was Hour for Southeast and South models and Day for Northeast and North model.</p><p>Concerning the Day of the Week, they presented the smallest gains Southeast, South and North. And as the model evolves, with the inclusion of new samples and other independent BSIN variables, this variable may be candidate to be removed, depending on the improvement of two factors: accuracy and processing time.</p></sec><sec id="s4_3"><title>4.3. 1-Day Ahead Forecast Performance</title><p>In this method of performance analysis, the one-day ahead forecast took into consideration the inclusion of the previous day in the Learning Base. This strategy allowed keeping the model as up-to-date as possible regarding the correlations of variables of the BSIN model, embedded in the SPD Week Level variable, with the hourly SPD to be predicted.</p><p>The forecast period was divided into hourly discretizations from 00:00 AM on July 17, 2021, to 11:00 PM on July 23, 2021, <xref ref-type="fig" rid="fig4">Figure 4</xref> shows the forecasted hourly SPD and the actual one for July 19<sup>th</sup> and the MAPE, RMSE, and TAI partials for the whole week forecast is shown in <xref ref-type="table" rid="table2">Table 2</xref>.</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> 1-day ahead forecast results</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  ></th><th align="center" valign="middle"  colspan="3"  >Southeast</th><th align="center" valign="middle"  colspan="3"  >South</th></tr></thead><tr><td align="center" valign="middle" >MAPE (%)</td><td align="center" valign="middle" >RMSE (R$/MWh)</td><td align="center" valign="middle" >TAI (%)</td><td align="center" valign="middle" >MAPE (%)</td><td align="center" valign="middle" >RMSE (R$/MWh)</td><td align="center" valign="middle" >TAI (%)</td></tr><tr><td align="center" valign="middle" >July 17, 2021 Saturday</td><td align="center" valign="middle" >1.67</td><td align="center" valign="middle" >11.73</td><td align="center" valign="middle" >65.22</td><td align="center" valign="middle" >3.04</td><td align="center" valign="middle" >20.79</td><td align="center" valign="middle" >56.52</td></tr><tr><td align="center" valign="middle" >July 18, 2021 Sunday</td><td align="center" valign="middle" >5.92</td><td align="center" valign="middle" >40.32</td><td align="center" valign="middle" >82.61</td><td align="center" valign="middle" >4.49</td><td align="center" valign="middle" >30.75</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >July 19, 2021 Monday</td><td align="center" valign="middle" >1.25</td><td align="center" valign="middle" >10.6</td><td align="center" valign="middle" >73.91</td><td align="center" valign="middle" >1.06</td><td align="center" valign="middle" >8.06</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >July 20, 2021 Tuesday</td><td align="center" valign="middle" >1.09</td><td align="center" valign="middle" >7.08</td><td align="center" valign="middle" >73.91</td><td align="center" valign="middle" >0.92</td><td align="center" valign="middle" >6</td><td align="center" valign="middle" >60.87</td></tr><tr><td align="center" valign="middle" >July 21, 2021 Wednesday</td><td align="center" valign="middle" >2.38</td><td align="center" valign="middle" >15.52</td><td align="center" valign="middle" >82.61</td><td align="center" valign="middle" >2.19</td><td align="center" valign="middle" >16.52</td><td align="center" valign="middle" >65.22</td></tr><tr><td align="center" valign="middle" >July 22, 2021 Thursday</td><td align="center" valign="middle" >1.21</td><td align="center" valign="middle" >8.25</td><td align="center" valign="middle" >65.22</td><td align="center" valign="middle" >1.26</td><td align="center" valign="middle" >8.65</td><td align="center" valign="middle" >56.52</td></tr><tr><td align="center" valign="middle" >July 23, 2021 Friday</td><td align="center" valign="middle" >0.81</td><td align="center" valign="middle" >5.15</td><td align="center" valign="middle" >60.87</td><td align="center" valign="middle" >0.85</td><td align="center" valign="middle" >5.67</td><td align="center" valign="middle" >78.26</td></tr><tr><td align="center" valign="middle" >Average</td><td align="center" valign="middle" >2.05</td><td align="center" valign="middle" >11.3</td><td align="center" valign="middle" >72.05</td><td align="center" valign="middle" >1.97</td><td align="center" valign="middle" >13.78</td><td align="center" valign="middle" >66.46</td></tr><tr><td align="center" valign="middle"  rowspan="2"  ></td><td align="center" valign="middle"  colspan="3"  >Northeast</td><td align="center" valign="middle"  colspan="3"  >North</td></tr><tr><td align="center" valign="middle" >MAPE (%)</td><td align="center" valign="middle" >RMSE (R$/MWh)</td><td align="center" valign="middle" >TAI (%)</td><td align="center" valign="middle" >MAPE (%)</td><td align="center" valign="middle" >RMSE (R$/MWh)</td><td align="center" valign="middle" >TAI (%)</td></tr><tr><td align="center" valign="middle" >July 17, 2021 Saturday</td><td align="center" valign="middle" >1.33</td><td align="center" valign="middle" >10.81</td><td align="center" valign="middle" >47.83</td><td align="center" valign="middle" >2.18</td><td align="center" valign="middle" >14.53</td><td align="center" valign="middle" >60.87</td></tr><tr><td align="center" valign="middle" >July 18, 2021 Sunday</td><td align="center" valign="middle" >5.08</td><td align="center" valign="middle" >32.12</td><td align="center" valign="middle" >69.57</td><td align="center" valign="middle" >4.72</td><td align="center" valign="middle" >30.38</td><td align="center" valign="middle" >69.57</td></tr><tr><td align="center" valign="middle" >July 19, 2021 Monday</td><td align="center" valign="middle" >1.30</td><td align="center" valign="middle" >9.69</td><td align="center" valign="middle" >69.57</td><td align="center" valign="middle" >1.48</td><td align="center" valign="middle" >12.65</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >July 20, 2021 Tuesday</td><td align="center" valign="middle" >1.49</td><td align="center" valign="middle" >10.94</td><td align="center" valign="middle" >73.91</td><td align="center" valign="middle" >0.56</td><td align="center" valign="middle" >4.1</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >July 21, 2021 Wednesday</td><td align="center" valign="middle" >2.48</td><td align="center" valign="middle" >16.63</td><td align="center" valign="middle" >52.17</td><td align="center" valign="middle" >1.84</td><td align="center" valign="middle" >14.97</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >July 22, 2021 Thursday</td><td align="center" valign="middle" >1.67</td><td align="center" valign="middle" >11.13</td><td align="center" valign="middle" >47.83</td><td align="center" valign="middle" >0.81</td><td align="center" valign="middle" >7.23</td><td align="center" valign="middle" >65.22</td></tr><tr><td align="center" valign="middle" >July 23, 2021 Friday</td><td align="center" valign="middle" >1.86</td><td align="center" valign="middle" >16.31</td><td align="center" valign="middle" >69.57</td><td align="center" valign="middle" >1.00</td><td align="center" valign="middle" >6.66</td><td align="center" valign="middle" >73.91</td></tr><tr><td align="center" valign="middle" >Average</td><td align="center" valign="middle" >2.17</td><td align="center" valign="middle" >15.38</td><td align="center" valign="middle" >61.49</td><td align="center" valign="middle" >1.80</td><td align="center" valign="middle" >12.93</td><td align="center" valign="middle" >70.19</td></tr></tbody></table></table-wrap></sec><sec id="s4_4"><title>4.4. 7-Days Ahead Forecast Performance</title><p>In this method, an hourly SPD forecast was performed with a horizon of seven days ahead. The model was trained only at the beginning of the process; that is, as the days evolved within the horizon, there were no new model updates. At the end of the seven days, its assertiveness was measured based on the initial view. <xref ref-type="fig" rid="fig5">Figure 5</xref> shows the results obtained for the Southeast submarket.</p><p><xref ref-type="table" rid="table3">Table 3</xref> shows the results of the forecast performed on July 16, 2021, for the period from July 17 to July 23, 2021.</p></sec><sec id="s4_5"><title>4.5. 1-Month Ahead Forecast Performance</title><p>Like the seven-day ahead forecast and performance analysis, in this method, the model update occurs only on the day the forecast is calculated, and the model is not updated during the next 30 days. The model’s performance is measured at the end of the first month period. For the analysis, a one-month forecast limit was set, however, this analysis can be extended to a period up to two months, according to public files provided by CCEE. From the third month on, the forecast acquires an informative character of conjectural trends in the sector.</p><p>For this horizon, the publication date coincides with the Monthly Operation Programming (MOP), held by ONS monthly on the last Friday of the month before the validity of this programming. In the case of the forecast below, the MOP date was June 25, 2021, for the programming of July 2021. <xref ref-type="fig" rid="fig6">Figure 6</xref> shows how the forecast and real SPD movement behaved on this horizon.</p><p>The results indicate that the short-term forecast presents higher volatility than the very short-term forecast due to the updates of the SPD Week Level that occur between the availability of the short-term study, known as revision 0 or rv0</p><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> 7-days ahead forecast results</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >MAPE (%)</th><th align="center" valign="middle" >RMSE (R$/MWh)</th><th align="center" valign="middle" >TAI (%)</th></tr></thead><tr><td align="center" valign="middle" >Southeast</td><td align="center" valign="middle" >1.77</td><td align="center" valign="middle" >15.10</td><td align="center" valign="middle" >73.81</td></tr><tr><td align="center" valign="middle" >South</td><td align="center" valign="middle" >2.14</td><td align="center" valign="middle" >17.22</td><td align="center" valign="middle" >69.64</td></tr><tr><td align="center" valign="middle" >Northeast</td><td align="center" valign="middle" >2.08</td><td align="center" valign="middle" >17.32</td><td align="center" valign="middle" >60.71</td></tr><tr><td align="center" valign="middle" >North</td><td align="center" valign="middle" >1.84</td><td align="center" valign="middle" >14.56</td><td align="center" valign="middle" >72.62</td></tr></tbody></table></table-wrap><p>until the actual disclosure of the SPD Week Level that will be in effect for the week in question. The results for all submarkets can be seen in <xref ref-type="table" rid="table4">Table 4</xref>.</p></sec><sec id="s4_6"><title>4.6. Medium-Term Projection</title><p>The medium-term forecast starts from the end of the horizon defined by the short-term analysis; in other words, it starts in the third month and extends until the 14th month. It represents a prospective study carried out by CCEE, based on the Market Operator view of SPD Week Level for the horizon.</p><p>In the forecasting strategy without coupling short-term results, the Learning Base included only the actual values of the Hourly SPD, published at the time of this study.</p><p>In <xref ref-type="fig" rid="fig7">Figure 7</xref>, it is possible to visualize the behavior of the Hourly SPD along the period, specific for working days. For comparison purposes, the SPDs Week Level used in the input was included in the graphics.</p><p>In this forecasting strategy with the coupling of short-term results, the Learning Base incorporated the short-term predicted results, so the predicted results for the first and second months were incorporated into the model.</p><p>In <xref ref-type="table" rid="table5">Table 5</xref>, it is possible to visualize the effect of the Average variation of the Hourly SPD after this coupling.</p><table-wrap id="table4" ><label><xref ref-type="table" rid="table4">Table 4</xref></label><caption><title> 1-month ahead forecast results</title></caption><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >MAPE (%)</th><th align="center" valign="middle" >RMSE (R$/MWh)</th><th align="center" valign="middle" >TAI (%)</th></tr></thead><tr><td align="center" valign="middle" >Southeast</td><td align="center" valign="middle" >14.00</td><td align="center" valign="middle" >87.27</td><td align="center" valign="middle" >60.08</td></tr><tr><td align="center" valign="middle" >South</td><td align="center" valign="middle" >13.00</td><td align="center" valign="middle" >81.97</td><td align="center" valign="middle" >46.64</td></tr><tr><td align="center" valign="middle" >Northeast</td><td align="center" valign="middle" >9.06</td><td align="center" valign="middle" >63.94</td><td align="center" valign="middle" >46.24</td></tr><tr><td align="center" valign="middle" >North</td><td align="center" valign="middle" >13.16</td><td align="center" valign="middle" >82.41</td><td align="center" valign="middle" >54.03</td></tr></tbody></table></table-wrap><table-wrap id="table5" ><label><xref ref-type="table" rid="table5">Table 5</xref></label><caption><title> Hourly SPD with and without short-term coupling for Southeast</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Month</th><th align="center" valign="middle" >Average Hourly SPD with coupling (R$/MWh)</th><th align="center" valign="middle" >Average Hourly SPD without coupling (R$/MWh)</th><th align="center" valign="middle" >Variation (%)</th></tr></thead><tr><td align="center" valign="middle" >Sep-21</td><td align="center" valign="middle" >573.95</td><td align="center" valign="middle" >500.85</td><td align="center" valign="middle" >14.60</td></tr><tr><td align="center" valign="middle" >Oct-21</td><td align="center" valign="middle" >528.11</td><td align="center" valign="middle" >547.98</td><td align="center" valign="middle" >−3.63</td></tr><tr><td align="center" valign="middle" >Nov-21</td><td align="center" valign="middle" >291.72</td><td align="center" valign="middle" >301.59</td><td align="center" valign="middle" >−3.27</td></tr><tr><td align="center" valign="middle" >Dec-21</td><td align="center" valign="middle" >284.59</td><td align="center" valign="middle" >293.05</td><td align="center" valign="middle" >−2.89</td></tr><tr><td align="center" valign="middle" >Jan-22</td><td align="center" valign="middle" >320</td><td align="center" valign="middle" >306.88</td><td align="center" valign="middle" >4.28</td></tr><tr><td align="center" valign="middle" >Feb-22</td><td align="center" valign="middle" >267.51</td><td align="center" valign="middle" >269.03</td><td align="center" valign="middle" >−0.56</td></tr><tr><td align="center" valign="middle" >Mar-22</td><td align="center" valign="middle" >238.6</td><td align="center" valign="middle" >236.2</td><td align="center" valign="middle" >1.02</td></tr><tr><td align="center" valign="middle" >Apr-22</td><td align="center" valign="middle" >42.63</td><td align="center" valign="middle" >41.93</td><td align="center" valign="middle" >1.67</td></tr><tr><td align="center" valign="middle" >May-22</td><td align="center" valign="middle" >45.65</td><td align="center" valign="middle" >48.12</td><td align="center" valign="middle" >−5.13</td></tr><tr><td align="center" valign="middle" >Jun-22</td><td align="center" valign="middle" >45.84</td><td align="center" valign="middle" >46.84</td><td align="center" valign="middle" >−2.13</td></tr><tr><td align="center" valign="middle" >Jul-22</td><td align="center" valign="middle" >110.21</td><td align="center" valign="middle" >111.2</td><td align="center" valign="middle" >−0.89</td></tr><tr><td align="center" valign="middle" >Aug-22</td><td align="center" valign="middle" >165.91</td><td align="center" valign="middle" >166.24</td><td align="center" valign="middle" >−0.20</td></tr><tr><td align="center" valign="middle" >Averages</td><td align="center" valign="middle" >242.89</td><td align="center" valign="middle" >239.16</td><td align="center" valign="middle" >0.24</td></tr></tbody></table></table-wrap></sec><sec id="s4_7"><title>4.7. Long-Term Projection</title><p>The long-term forecast starts at the end of the horizon defined by the medium-term analysis, that is, at the 15th month. However, it is possible to start the long-term horizon as early as the third month, thus dispensing the medium-term prospective study made available by the CCEE or another source.</p><p>Given the uncertainty inherent to this horizon, NEWAVE’s official input files make available 2000 synthetic series, each representing a possible hydrological scenario, and with this, the series were grouped to create three possible scenarios for the SPD Week Level in each month of this horizon:</p><p>• Pessimistic scenario:</p><p>Comprised of 20% of the largest weighted average MCOs, calculated from the range between the 80th percentile and the highest MCO value found,</p><p>• Probable scenario:</p><p>Comprised of 60% of the weighted averages of MCOs, calculated from the range between the 20th percentile and the 80th percentile,</p><p>• Optimistic scenario:</p><p>Comprised of 20% of the lowest weighted average MCOs, calculated from the range between the 20th percentile and the lowest MCO value found.</p><p>As a result, <xref ref-type="fig" rid="fig8">Figure 8</xref> presents the three scenarios for the projection of Hourly SPD. The Probable scenario was represented by the black lines, the Optimistic scenarios by the blue lines, and finally, the Pessimistic scenarios by the red lines.</p><p>The coupling technique, using medium-term horizon results, was also applied in the long-term study. And to understand the impact of this strategy, <xref ref-type="table" rid="table6">Table 6</xref></p><table-wrap id="table6" ><label><xref ref-type="table" rid="table6">Table 6</xref></label><caption><title> Variation of Hourly SPD with and without medium-term coupling for Southeast</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Month</th><th align="center" valign="middle" >variation of Probable scenario (%)</th><th align="center" valign="middle" >variation of Optimistic scenario (%)</th><th align="center" valign="middle" >variation of Pessimistic scenario (%)</th></tr></thead><tr><td align="center" valign="middle" >Sep-22</td><td align="center" valign="middle" >4.59</td><td align="center" valign="middle" >3.53</td><td align="center" valign="middle" >2.55</td></tr><tr><td align="center" valign="middle" >Oct-22</td><td align="center" valign="middle" >12.94</td><td align="center" valign="middle" >14.13</td><td align="center" valign="middle" >−1.81</td></tr><tr><td align="center" valign="middle" >Nov-22</td><td align="center" valign="middle" >−1.78</td><td align="center" valign="middle" >−1.78</td><td align="center" valign="middle" >−2.15</td></tr><tr><td align="center" valign="middle" >Dec-22</td><td align="center" valign="middle" >1.65</td><td align="center" valign="middle" >1.65</td><td align="center" valign="middle" >0.83</td></tr><tr><td align="center" valign="middle" >Jan-23</td><td align="center" valign="middle" >−11.19</td><td align="center" valign="middle" >−11.19</td><td align="center" valign="middle" >2.92</td></tr><tr><td align="center" valign="middle" >Feb-23</td><td align="center" valign="middle" >4.79</td><td align="center" valign="middle" >4.79</td><td align="center" valign="middle" >0.78</td></tr><tr><td align="center" valign="middle" >Mar-23</td><td align="center" valign="middle" >1.31</td><td align="center" valign="middle" >1.31</td><td align="center" valign="middle" >1.69</td></tr><tr><td align="center" valign="middle" >Apr-23</td><td align="center" valign="middle" >1.12</td><td align="center" valign="middle" >1.12</td><td align="center" valign="middle" >3.49</td></tr><tr><td align="center" valign="middle" >May-23</td><td align="center" valign="middle" >7.64</td><td align="center" valign="middle" >7.64</td><td align="center" valign="middle" >1.80</td></tr><tr><td align="center" valign="middle" >Jun-23</td><td align="center" valign="middle" >5.01</td><td align="center" valign="middle" >5.01</td><td align="center" valign="middle" >4.08</td></tr><tr><td align="center" valign="middle" >Jul-23</td><td align="center" valign="middle" >−2.58</td><td align="center" valign="middle" >−2.58</td><td align="center" valign="middle" >0.30</td></tr><tr><td align="center" valign="middle" >Aug-23</td><td align="center" valign="middle" >−1.48</td><td align="center" valign="middle" >−1.48</td><td align="center" valign="middle" >−0.26</td></tr><tr><td align="center" valign="middle" >Sep-23</td><td align="center" valign="middle" >3.59</td><td align="center" valign="middle" >3.59</td><td align="center" valign="middle" >1.16</td></tr><tr><td align="center" valign="middle" >Oct-23</td><td align="center" valign="middle" >12.24</td><td align="center" valign="middle" >12.24</td><td align="center" valign="middle" >9.07</td></tr><tr><td align="center" valign="middle" >Nov-23</td><td align="center" valign="middle" >−1.61</td><td align="center" valign="middle" >−1.61</td><td align="center" valign="middle" >−0.83</td></tr><tr><td align="center" valign="middle" >Dec-23</td><td align="center" valign="middle" >4.02</td><td align="center" valign="middle" >4.02</td><td align="center" valign="middle" >1.88</td></tr><tr><td align="center" valign="middle" >Jan-24</td><td align="center" valign="middle" >−11.84</td><td align="center" valign="middle" >−11.84</td><td align="center" valign="middle" >−16.73</td></tr><tr><td align="center" valign="middle" >Feb-24</td><td align="center" valign="middle" >5.03</td><td align="center" valign="middle" >5.03</td><td align="center" valign="middle" >−6.45</td></tr><tr><td align="center" valign="middle" >Mar-24</td><td align="center" valign="middle" >2.85</td><td align="center" valign="middle" >2.85</td><td align="center" valign="middle" >2.37</td></tr><tr><td align="center" valign="middle" >Apr-24</td><td align="center" valign="middle" >−0.33</td><td align="center" valign="middle" >−0.33</td><td align="center" valign="middle" >4.55</td></tr><tr><td align="center" valign="middle" >May-24</td><td align="center" valign="middle" >8.17</td><td align="center" valign="middle" >8.17</td><td align="center" valign="middle" >1.72</td></tr><tr><td align="center" valign="middle" >Jun-24</td><td align="center" valign="middle" >5.62</td><td align="center" valign="middle" >5.62</td><td align="center" valign="middle" >1.52</td></tr><tr><td align="center" valign="middle" >Jul-24</td><td align="center" valign="middle" >−3.72</td><td align="center" valign="middle" >−3.72</td><td align="center" valign="middle" >−0.03</td></tr><tr><td align="center" valign="middle" >Aug-24</td><td align="center" valign="middle" >−0.92</td><td align="center" valign="middle" >−0.92</td><td align="center" valign="middle" >0.47</td></tr><tr><td align="center" valign="middle" >Sep-24</td><td align="center" valign="middle" >2.56</td><td align="center" valign="middle" >2.56</td><td align="center" valign="middle" >4.50</td></tr><tr><td align="center" valign="middle" >Oct-24</td><td align="center" valign="middle" >11.09</td><td align="center" valign="middle" >11.09</td><td align="center" valign="middle" >3.84</td></tr><tr><td align="center" valign="middle" >Nov-24</td><td align="center" valign="middle" >−0.07</td><td align="center" valign="middle" >−0.07</td><td align="center" valign="middle" >−1.78</td></tr><tr><td align="center" valign="middle" >Dec-24</td><td align="center" valign="middle" >2.02</td><td align="center" valign="middle" >2.02</td><td align="center" valign="middle" >0.86</td></tr><tr><td align="center" valign="middle" >Jan-25</td><td align="center" valign="middle" >−10.72</td><td align="center" valign="middle" >−10.72</td><td align="center" valign="middle" >−8.36</td></tr><tr><td align="center" valign="middle" >Feb-25</td><td align="center" valign="middle" >5.64</td><td align="center" valign="middle" >5.64</td><td align="center" valign="middle" >−4.74</td></tr><tr><td align="center" valign="middle" >Mar-25</td><td align="center" valign="middle" >2.57</td><td align="center" valign="middle" >2.57</td><td align="center" valign="middle" >−1.19</td></tr><tr><td align="center" valign="middle" >Apr-25</td><td align="center" valign="middle" >−0.79</td><td align="center" valign="middle" >−0.79</td><td align="center" valign="middle" >−4.60</td></tr><tr><td align="center" valign="middle" >May-25</td><td align="center" valign="middle" >8.91</td><td align="center" valign="middle" >8.91</td><td align="center" valign="middle" >6.12</td></tr><tr><td align="center" valign="middle" >Jun-25</td><td align="center" valign="middle" >4.71</td><td align="center" valign="middle" >4.71</td><td align="center" valign="middle" >1.44</td></tr><tr><td align="center" valign="middle" >Jul-25</td><td align="center" valign="middle" >−3.07</td><td align="center" valign="middle" >−3.07</td><td align="center" valign="middle" >0.03</td></tr><tr><td align="center" valign="middle" >Aug-25</td><td align="center" valign="middle" >−0.63</td><td align="center" valign="middle" >−0.63</td><td align="center" valign="middle" >1.42</td></tr><tr><td align="center" valign="middle" >Sep-25</td><td align="center" valign="middle" >2.47</td><td align="center" valign="middle" >2.47</td><td align="center" valign="middle" >7.44</td></tr><tr><td align="center" valign="middle" >Oct-25</td><td align="center" valign="middle" >12.31</td><td align="center" valign="middle" >12.31</td><td align="center" valign="middle" >−7.37</td></tr><tr><td align="center" valign="middle" >Nov-25</td><td align="center" valign="middle" >−1.12</td><td align="center" valign="middle" >−1.12</td><td align="center" valign="middle" >−2.96</td></tr><tr><td align="center" valign="middle" >Dec-25</td><td align="center" valign="middle" >−0.64</td><td align="center" valign="middle" >−0.64</td><td align="center" valign="middle" >−0.76</td></tr><tr><td align="center" valign="middle" >Averages</td><td align="center" valign="middle" >2.01</td><td align="center" valign="middle" >2.01</td><td align="center" valign="middle" >0.17</td></tr></tbody></table></table-wrap><p>displays the differences between the average hourly SPD for a specific month with coupling, minus the average hourly SPD without coupling. Additionally, this strategy was applied to three scenarios created.</p><p>In both approaches, it is possible to see that the Probable and Optimistic scenarios are practically the same in most of the months of this horizon. This happens because of the low values of MCO, pointing to a well-known characteristic of the official models: optimism about the future inflows of the rivers. This optimism points to a favorable scenario of rains in the future, thus reducing the CMO by favoring the use of hydraulic sources in energy generation.</p><p>It is important to mention that if the values in both scenarios are lower than the floor allowed for the hourly SPD; this value must be discarded and replaced by this minimum value defined by CCEE.</p></sec></sec><sec id="s5"><title>5. Conclusions</title><p>This study presented the XGBoost model as an interesting tool in the projection of multivariate time series, with a relevant application in the hourly projection of the SPD for the Brazilian electricity market.</p><p>This projection encompassed different forecast horizons and proved to be flexible when incorporating temporal and BSIN variables, dealing satisfactorily with tabular-type information.</p><p>As shown in this study, the official models have an optimistic bias in the medium and long term, so there is much to evolve in the strategy presented of basing the creation of scenarios on a group of series of the SPD Week Level. Potential new studies can be developed to anticipate this bias more consistently by including other variables that can translate potential water crises ahead, such as ANE and ES. Although the official models include these indicators in the SPD Week Level, the inclusion of specific variables for this purpose may be a promising field for further studies.</p><p>From a model robustness point of view, with the increase of samples inserted into the Learning Base, the XGBoost model will gradually become more comprehensive, serving as an important decision-making support tool.</p></sec><sec id="s6"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s7"><title>Cite this paper</title><p>dos Santos, C.R.R., Castro, R., Marques, R. and da Silva, L.C.P. (2022) An Application of Decision Trees Algorithm to Project Hourly Electricity Spot Price as Support for Decision Making on Electricity Trading in Brazil. Energy and Power Engineering, 14, 327-342. https://doi.org/10.4236/epe.2022.148018</p></sec></body><back><ref-list><title>References</title><ref id="scirp.119354-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Alim, M., Ye, G.-H., Guan, P., Huang, D.-S. and Zhou, B.-S. (2020) Comparison of ARIMA Model and XGBoost Model for Prediction of Human Brucellosis in Mainland China: A Time-Series Study. British Medical Journal Open, 10, e039676.https://doi.org/10.1136/bmjopen-2020-039676</mixed-citation></ref><ref id="scirp.119354-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, L., Bian, W., Qu, W., Tuo, L. and Wang, Y. (2021) Time Series Forecast of Sales Volume Based on XGBoost. Journal of Physics: Conference Series, 1873, Article ID: 012067. https://doi.org/10.1088/1742-6596/1873/1/012067</mixed-citation></ref><ref id="scirp.119354-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Grinsztajn, L., Oyallon, E. and Varoquaux, G. (2022) Why Do Tree-Based Models Still Outperform Deep Learning on Tabular Data? Cornell University, Ithaca.</mixed-citation></ref><ref id="scirp.119354-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">dos Santos, C.R.R., Castro, R., Marques, R.F. and Pereira, L.C. (2022) Aplica&amp;atilde;o de Aprendizado de Máquina para proje&amp;atilde;o do Preo Horário de Liquida&amp;atilde;o das Diferenas, como suporte às estratégias de comercializa&amp;atilde;o de energia elétrica. Revista Brasileira de Energia, 28, 243-279.</mixed-citation></ref><ref id="scirp.119354-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">ANEEL (2021) Sistema de Informa&amp;atilde;o de Gera&amp;atilde;o da ANEEL SIGA. https://bit.ly/2IGf4Q0</mixed-citation></ref><ref id="scirp.119354-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Centro de Pesquisas de Energia Elétrica (2019) Manual de Referência: Modelo NEWAVE. Centro de Pesquisas de Energia Elétrica, Rio de Janeiro.</mixed-citation></ref><ref id="scirp.119354-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Skiena, S.S. (2017) The Data Science Design Manual. Springer, New York.</mixed-citation></ref></ref-list></back></article>