<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    ijg
   </journal-id>
   <journal-title-group>
    <journal-title>
     International Journal of Geosciences
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2156-8359
   </issn>
   <issn publication-format="print">
    2156-8367
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/ijg.2025.1610037
   </article-id>
   <article-id pub-id-type="publisher-id">
    ijg-146806
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Earth 
     </subject>
     <subject>
       Environmental Sciences
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    Performance Evaluation of Soft Computing Techniques in Blast-Induced Predictions: The Case of Backbreak
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Festus
      </surname>
      <given-names>
       Kunkyin-Saadaari
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref> 
     <xref ref-type="aff" rid="aff2"> 
      <sup>2</sup>
     </xref>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Charles
      </surname>
      <given-names>
       Asiedu
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref> 
     <xref ref-type="aff" rid="aff2"> 
      <sup>2</sup>
     </xref>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Ephraim
      </surname>
      <given-names>
       Atta-Duncan
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Victor Kwaku
      </surname>
      <given-names>
       Agadzie
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref> 
     <xref ref-type="aff" rid="aff2"> 
      <sup>2</sup>
     </xref>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Mary
      </surname>
      <given-names>
       Adu-Gyamfi
      </given-names>
     </name> 
     <xref ref-type="aff" rid="aff1"> 
      <sup>1</sup>
     </xref>
    </contrib>
   </contrib-group> 
   <aff id="aff1">
    <addr-line>
     aDepartment of Mining Engineering, Faculty of Mining and Minerals Technology, University of Mines and Technology (UMaT), Tarkwa, Ghana
    </addr-line> 
   </aff> 
   <aff id="aff2">
    <addr-line>
     aDepartment of Mining Systems and Rock Engineering, Division of Geo-Industrial Sciences Technology, Festari Institute of Professional Studies, Tarkwa, Ghana
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     15
    </day> 
    <month>
     10
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    16
   </volume> 
   <issue>
    10
   </issue>
   <fpage>
    738
   </fpage>
   <lpage>
    761
   </lpage>
   <history>
    <date date-type="received">
     <day>
      23,
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      26,
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      26,
     </day>
     <month>
      October
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    The accurate prediction of backbreak, a crucial parameter in mining operations, has a significant influence on safety and operational efficiency. The occurrence of this phenomenon is detrimental to the safety, capital and human resources of a mine. This framework applies machine learning algorithms to predict backbreak. An enhanced precision will be explored specifically employing gradient boosting decision trees (GBDT), light gradient boosting machines (LightGBM), backpropagation neural network (BPNN), Graph Neural Networks (GNNs) and Kolmogorov-Arnold Network (KAN) algorithm. The study utilises a comprehensive dataset from the Goldfields Ghana Limited, Damang Mine comprising geomechanical, drilling, and blasting parameters (burden, spacing, stemming height, geometric stiffness, and powder factor) as well as backbreak data. The potential of each algorithm to learn the intricate relationships between the input features and backbreak values is investigated. To quantitatively assess the predictive performance of the models, the evaluation metrics, coefficient of determination (R
    <sup>2</sup>), mean absolute error (MAE), and mean square error (MSE) are employed. The results revealed that GBDT and BPNN algorithms exhibited robust predictive capabilities, capturing the complex non-linear patterns in the dataset, achieving higher R
    <sup>2</sup> values (97% and 92% respectively) and lower MAE scores (0.2603 and 0.456, respectively) and MSE scores (0.1456 and 0.3798, respectively). The LightGBM and KAN models also showed their predictive potential and captured the complex non-linear patterns in the dataset but not as efficiently as GBDT and BPNN. GNN showed the least performance in backbreak prediction. The findings highlighted the potential of the GBDT model to enhance backbreak prediction accuracy, thereby aiding in safer and more efficient excavation practices.
   </abstract>
   <kwd-group> 
    <kwd>
     Backbreak Prediction
    </kwd> 
    <kwd>
      Machine Learning
    </kwd> 
    <kwd>
      Open-Pit Mining
    </kwd> 
    <kwd>
      Gradient Boosting
    </kwd> 
    <kwd>
      Neural Networks
    </kwd> 
    <kwd>
      Blast Design Parameters
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction</title>
   <p>Backbreak is defined as the undesirable fracturing of rock behind the final wall of an open-pit mine due to blasting. This phenomenon can lead to instability of mine walls, equipment damage, poor fragmentation, and increased mining costs <xref ref-type="bibr" rid="scirp.146806-1">
     [1]
    </xref>, as resources must be spent to treat and correct its occurrence and effects. To prevent backbreak occurrence, various factors must be considered, including the blasting pattern’s geometrical properties, explosives’ qualities, and the geomechanical characteristics of the rock mass <xref ref-type="bibr" rid="scirp.146806-2">
     [2]
    </xref>. In a blast pattern, backbreak is characterized by fractured rocks extending past the rear row of holes, with its quantification typically involving measuring the distance between the final excavation line and the point where the rock breaks <xref ref-type="bibr" rid="scirp.146806-3">
     [3]
    </xref>.</p>
   <p>The evolution of backbreak prediction methods in mining operations has undergone a significant transformation over the past decades. According to <xref ref-type="bibr" rid="scirp.146806-1">
     [1]
    </xref>, blast design in the past relied on empirical modelling approaches that sought to achieve optimal outcomes such as appropriate fragmentation, minimized backbreak, optimal muck pile configuration, and reduced boulder formation. Further stating that, the empirical methods lacked a direct mechanism for backbreak prediction and typically considered only a limited subset of the relevant parameters that influence blasting performance. The early 2000s saw advancement through numerical modeling approaches, with finite element analysis demonstrating improved accuracy against field data, though these methods often required extensive computational resources <xref ref-type="bibr" rid="scirp.146806-4">
     [4]
    </xref>. <xref ref-type="bibr" rid="scirp.146806-5">
     [5]
    </xref> Kuz-Ram model, while primarily focused on fragmentation, provided valuable insights into backbreak mechanisms.</p>
   <p>The transition to computational methods marked a significant shift in prediction approaches. The Distinct Element Method (DEM) offered detailed insights into rock mass behaviour and fracture propagation, though its computational demands often made it impractical for real-time applications <xref ref-type="bibr" rid="scirp.146806-6">
     [6]
    </xref>. <xref ref-type="bibr" rid="scirp.146806-7">
     [7]
    </xref> then introduced neural networks to evaluate blasting pattern parameters’ effects on backbreak, demonstrating the potential of computational methods in improving prediction accuracy. This was followed by <xref ref-type="bibr" rid="scirp.146806-1">
     [1]
    </xref> application of fuzzy set theory, which provided a more nuanced approach to handling the uncertainties inherent in backbreak prediction.</p>
   <p>Machine learning applications in mining have expanded considerably, show+casing the versatility and effectiveness of these techniques across different operational aspects. <xref ref-type="bibr" rid="scirp.146806-8">
     [8]
    </xref> demonstrated the successful application of machine learning in predicting slope deformation in open-pit mines, while <xref ref-type="bibr" rid="scirp.146806-9">
     [9]
    </xref> extended its use to predict haul truck fuel consumption. The implementation of deep learning algorithms for dust concentration monitoring by <xref ref-type="bibr" rid="scirp.146806-10">
     [10]
    </xref> and rockburst prediction by <xref ref-type="bibr" rid="scirp.146806-11">
     [11]
    </xref> further exemplifies the growing sophistication of machine learning applications in mining operations. These developments have established a strong foundation for the application of advanced algorithms in solving complex mining challenges.</p>
   <p>Recent developments in machine learning techniques have introduced more sophisticated approaches to backbreak prediction. The emergence of ensemble learning methods, particularly gradient boosting algorithms, has opened new possibilities for improving prediction accuracy. <xref ref-type="bibr" rid="scirp.146806-12">
     [12]
    </xref> development of LightGBM as an efficient Gradient Boosting Decision Tree framework, coupled with <xref ref-type="bibr" rid="scirp.146806-13">
     [13]
    </xref> comparative analysis of GBDT, XGBoost, and LightGBM algorithms in rock stability prediction, has demonstrated the potential of these methods in mining applications. The advancement in neural network architectures, as reviewed by <xref ref-type="bibr" rid="scirp.146806-14">
     [14]
    </xref>, has further enhanced the capability to capture complex non-linear relationships in mining parameters. These developments, combined with improvements in computational efficiency and data processing capabilities, have set the stage for more accurate and reliable backbreak prediction methods.</p>
   <p>The accurate prediction of backbreak in open-pit mining operations remains a significant challenge that has not been adequately resolved by existing methods. While previous studies have explored various approaches-from <xref ref-type="bibr" rid="scirp.146806-7">
     [7]
    </xref> evaluation of blasting parameters using neural networks to <xref ref-type="bibr" rid="scirp.146806-15">
     [15]
    </xref> RES-based model for risk assessment, none have comprehensively addressed the complex non-linear relationships between blasting pattern geometry, explosive properties, and geomechanical characteristics. Although machine learning has shown promise in mining applications, as demonstrated by <xref ref-type="bibr" rid="scirp.146806-8">
     [8]
    </xref> in slope deformation prediction and <xref ref-type="bibr" rid="scirp.146806-10">
     [10]
    </xref> in dust concentration monitoring, its full potential for backbreak prediction remains unexplored. The continued occurrence of unwanted rock fracturing beyond the final pit wall, resulting in safety hazards, equipment damage, poor fragmentation, and increased mining costs <xref ref-type="bibr" rid="scirp.146806-1">
     [1]
    </xref>, underscores the limitations of current prediction methods.</p>
   <p>This research specifically addresses the gap in applying advanced machine learning techniques to predict backbreak more accurately. The study has three primary objectives: (1) to develop and implement a comparative analysis of GBDT, LightGBM, BPNN, GNN and KAN algorithms for backbreak prediction; (2) to evaluate the predictive performance of these algorithms using real-world data from Goldfields Ghana Limited, Damang Mine; and (3) to determine the most effective algorithm combination for improving backbreak prediction accuracy in open-pit mining operations. While <xref ref-type="bibr" rid="scirp.146806-3">
     [3]
    </xref> and <xref ref-type="bibr" rid="scirp.146806-16">
     [16]
    </xref> have explored individual machine learning approaches for backbreak prediction, there has not been a comprehensive comparative analysis of these specific algorithms in this context. Despite the proven efficiency of LightGBM in complex datasets <xref ref-type="bibr" rid="scirp.146806-12">
     [12]
    </xref> and the established capabilities of GBDT and BPNN in mining applications <xref ref-type="bibr" rid="scirp.146806-13">
     [13]
    </xref> <xref ref-type="bibr" rid="scirp.146806-14">
     [14]
    </xref>, their combined potential for backbreak prediction remains unexplored.</p>
   <p>The remainder of this paper is structured as follows: Section 2 describes the study area and presents the methodology, including detailed descriptions of the GBDT, LightGBM, BPNN, GNN and KAN algorithms and their implementation. Section 3 presents the results and discusses the comparative performance of the algorithms using evaluation metrics such as coefficient of determination (R<sup>2</sup>), mean absolute error (MAE), and mean square error (MSE). Finally, Section 4 concludes the study by summarizing the key findings and their implications for improving backbreak prediction in open-pit mining operations. This systematic approach represents a novel contribution to the field by combining advanced ML techniques in a way that has not been previously attempted for backbreak prediction, with the potential to significantly improve safety and operational efficiency in open-pit mining operations.</p>
  </sec><sec id="s2">
   <title>2. Materials and Methods Used</title>
   <sec id="s2_1">
    <title>2.1. Study Area Description</title>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>The Damang Gold Mine is in southwest of Ghana, West Africa approximately 300 km west of Accra (5˚11'N, 1˚57'W), and about 30 km north of Tarkwa. The deposit is found within the Tarkwaian sediments of the Ashanti Belt. The Ashanti Belt strikes northeast with a broadly synclinal structure and comprises lower Proterozoic sediments, as well as volcanic rocks and the Birimian System metasediments. The Tarkwaian sedimentary sequence within the Ashanti Belt exhibits lower-intensity metamorphism compared to other rock formations in the region. It is composed of immature sedimentary units and coarse-grains that include several series, such as the Huni series, the Banket series, the Damang phyllite series, and the Kawere series.</p>
    <p>The mine mineralisation occurs within the Tarkwaian’s paleoplacer deposits. These deposits are composed of distinct tabular units of quartz pebble conglomerates, commonly referred to as reefs. These reefs are interspersed between argillaceous sandstone blocks and quartzites. The mineralisation in the Damang deposit is likely associated with the concentration of gold within these paleoplacer deposits. Hydrothermal, oxide and Witwatersrand-style paleoplacer mineralisation are exploited at the mine. The hydrothermal deposits occur within the pyrite and pyrrhotite alteration selvedge and are found next to the echelon quartz veins. Accessory vein minerals such as ilmenite, carbonate, tourmaline and apatite are also linked to the gold. Conventional open-pit methods are employed, which utilise standard truck-shovel techniques to mine the ore. Loading and hauling activities are undertaken using excavators in backhoe configurations. To support drill-and-blast and haulage operations, vehicle, road, and bench maintenance, dust, and erosion control are undertaken. For optimal fragmentation and blast control, both conventional (Nonel) and electronic detonators are employed in rock blasting. The location of the study area is depicted in <xref ref-type="fig" rid="fig1">
      Figure 1
     </xref>.</p>
    <fig id="fig1" position="float">
     <label>Figure 1</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 1. Location of study area (source: designed by the authors).</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId18.jpeg?20251029033931" />
    </fig>
   </sec>
   <sec id="s2_2">
    <title>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>2.2. Datasets</title>
    <p>A dataset comprising the parameters: spacing (S), burden (B), geometric stiffness (K), stemming height (T), backbreak (BB) and powder factor (P) was used. A total of 60 data points were obtained from the Goldfields Ghana Limited, Damang Mine. <xref ref-type="table" rid="table1">
      Table 1
     </xref> represents the statistical description of the dataset employed.</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>Table 1. Statistical description of dataset.</p>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td acenter" width="29.76%"><p style="text-align:center">Parameter</p></td> 
      <td class="custom-bottom-td acenter" width="12.53%"><p style="text-align:center">Unit</p></td> 
      <td class="custom-bottom-td acenter" width="14.43%"><p style="text-align:center">Maximum</p></td> 
      <td class="custom-bottom-td acenter" width="14.43%"><p style="text-align:center">Minimum</p></td> 
      <td class="custom-bottom-td acenter" width="14.43%"><p style="text-align:center">Mean</p></td> 
      <td class="custom-bottom-td acenter" width="14.44%"><p style="text-align:center">Standard Deviation</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="29.76%"><p style="text-align:center">Burden (B)</p></td> 
      <td class="custom-top-td acenter" width="12.53%"><p style="text-align:center">m</p></td> 
      <td class="custom-top-td acenter" width="14.43%"><p style="text-align:center">6.50</p></td> 
      <td class="custom-top-td acenter" width="14.43%"><p style="text-align:center">0.73</p></td> 
      <td class="custom-top-td acenter" width="14.43%"><p style="text-align:center">3.11</p></td> 
      <td class="custom-top-td acenter" width="14.44%"><p style="text-align:center">0.75</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="29.76%"><p style="text-align:center">Spacing (S)</p></td> 
      <td class="acenter" width="12.53%"><p style="text-align:center">m</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">6.00</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">0.57</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">3.82</p></td> 
      <td class="acenter" width="14.44%"><p style="text-align:center">0.73</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="29.76%"><p style="text-align:center">Stemming (T)</p></td> 
      <td class="acenter" width="12.53%"><p style="text-align:center">m</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">4.50</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">0.73</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">2.93</p></td> 
      <td class="acenter" width="14.44%"><p style="text-align:center">0.64</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="29.76%"><p style="text-align:center">Powder Factor (P)</p></td> 
      <td class="acenter" width="12.53%"><p style="text-align:center">kg/m<sup>3</sup></p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">1.64</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">0.15</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">0.57</p></td> 
      <td class="acenter" width="14.44%"><p style="text-align:center">0.25</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="29.76%"><p style="text-align:center">Geometric Stiffness (K)</p></td> 
      <td class="acenter" width="12.53%"><p style="text-align:center">-</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">8.65</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">0.44</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">4.69</p></td> 
      <td class="acenter" width="14.44%"><p style="text-align:center">1.72</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="29.76%"><p style="text-align:center">Backbreak (BB)</p></td> 
      <td class="acenter" width="12.53%"><p style="text-align:center">m</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">10.00</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">1.00</p></td> 
      <td class="acenter" width="14.43%"><p style="text-align:center">3.88</p></td> 
      <td class="acenter" width="14.44%"><p style="text-align:center">2.11</p></td> 
     </tr> 
    </table>
   </sec>
   <sec id="s2_3">
    <title>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>2.3. Data Preparation</title>
    <p>The train_test_split was used on the dataset to separate into training and testing sets for all algorithms. This is important to assess how effectively the model generalizes to new, unseen, or future data. The dataset was split into 80:20, where 80% was for training with the corresponding 20% for testing. 12 out of 60 data points were for testing, and 48 out of 60 data points were for training. The training set (comprising input parameters and target variable) is used in a process of adjusting the model’s internal parameters to make it capable of learning patterns and relationships from the data after which the testing set is employed to assess the trained model’s performance in real-world scenarios.</p>
   </sec>
   <sec id="s2_4">
    <title>2.4. Method Used</title>
    <p>This prediction task employs machine learning algorithms for regression modelling using GBDT, LightGBM, BPNN, GNN and KAN. These algorithms were selected for their proven capabilities in handling non-linear relationships and complex datasets in mining applications <xref ref-type="bibr" rid="scirp.146806-17">
      [17]
     </xref>. Using a standard 80:20 training-testing split ratio, all three algorithms process the same input parameters (burden, spacing, stemming height, powder factor, and geometric stiffness) to predict backbreak values.</p>
    <p>Theory</p>
    <p>GBDT ensures that the errors produced in the past are fixed by adding the prior under-fitted forecasts to the ensemble one after the other. In the gradient boosting process, models are iteratively built, with the errors and losses (loss function) from previous models being calculated and used as target variables to improve predictions on subsequent models and combining the predictions from the multiple models to minimise errors <xref ref-type="bibr" rid="scirp.146806-13">
      [13]
     </xref>. This process repeats until a stopping condition is satisfied or a maximum number of models is reached <xref ref-type="bibr" rid="scirp.146806-18">
      [18]
     </xref>. The gradient descent method is utilised in this algorithm to minimise the loss function. According to <xref ref-type="bibr" rid="scirp.146806-19">
      [19]
     </xref> for the initial model prediction Equation (1). <xref ref-type="fig" rid="fig2">
      Figure 2
     </xref> illustrates the implementation of the GBDT algorithm.</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          F 
        </mi> 
        <mrow> 
         <mn>
           0 
         </mn> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            x 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mi>
         arg 
       </mi> 
       <mi>
         min 
       </mi> 
       <mi>
         γ 
       </mi> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </msubsup> 
        <mrow> 
         <mi>
           n 
         </mi> 
         <mi>
           L 
         </mi> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              y 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
           <mo>
             , 
           </mo> 
           <mi>
             γ 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (1)</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>where; L is the loss function, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        γ 
      </mi> 
     </math> is the predicted value, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math> is the observed value, n is the number of observations, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         arg 
       </mi> 
       <mi>
         min 
       </mi> 
       <mi>
         γ 
       </mi> 
      </mrow> 
     </math> denotes finding the value of 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        γ 
      </mi> 
     </math> that minimizes the loss function</p>
    <p>The loss function is calculated by Equation (2),</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         L 
       </mi> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          n 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </msubsup> 
        <mrow> 
         <msup> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                y 
              </mi> 
              <mi>
                i 
              </mi> 
             </msub> 
             <mo>
               − 
             </mo> 
             <mi>
               γ 
             </mi> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mn>
            2 
          </mn> 
         </msup> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (2)</p>
    <fig id="fig2" position="float">
     <label>Figure 2</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 2. Architecture of GBDT.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId31.jpeg?20251029033932" />
    </fig>
    <p>Implementation</p>
    <p>The dataset (n = 60) was initially sectioned using stratified partitioning into training (n_train = 48, 80%) and test (n_test = 12, 20%) sets depicted in <xref ref-type="fig" rid="fig3">
      Figure 3
     </xref>. To enhance model accuracy and ensure robust performance assessment, particularly given the limited dataset size, k-fold cross-validation with k = 5 was implemented on the training data. This process systematically divided the training dataset into five equally sized folds. The GBDT model underwent five iterations of training and evaluation, with each iteration utilizing a distinct fold as the validation set while the remaining four folds served as training data. This approach allowed for a more reliable estimation of the model’s generalization performance and demonstrated its stability across different subsets of the training data.</p>
    <fig id="fig3" position="float">
     <label>Figure 3</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 3. Model implementation process.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId32.jpeg?20251029033932" />
    </fig>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>Through systematic hyperparameter optimization, it was determined that 300 estimators yielded optimal model performance. The model’s configuration was further refined through additional hyperparameters, notably the maximum depth and learning rate. An exhaustive grid search strategy, implemented via GridSearchCV, was employed to identify the optimal hyperparameter combination. This methodological approach systematically explored the hyperparameter space, evaluating various permutations to determine the configuration that maximized the predictive model’s performance. A summary of the optimal tuned hyperparameters is presented in <xref ref-type="table" rid="table2">
      Table 2
     </xref>.</p>
    <p>Theory</p>
    <p>According to <xref ref-type="bibr" rid="scirp.146806-20">
      [20]
     </xref>, LightGBM is a robust gradient boosting framework that leverages decision trees to enhance model efficiency and minimize memory consumption. It introduces two innovative techniques known as Gradient-based One Side Sampling (GOSS) and Exclusive Feature Bundling (EFB) <xref ref-type="bibr" rid="scirp.146806-12">
      [12]
     </xref> <xref ref-type="bibr" rid="scirp.146806-21">
      [21]
     </xref>. The GOSS technique gives more weight to instances with larger gradients, indicating that they are under-trained. These instances contribute more to the calculation of information gain. GOSS selectively keeps instances with large gradients (above a threshold or within the top percentiles) and only randomly discards instances with small gradients <xref ref-type="bibr" rid="scirp.146806-22">
      [22]
     </xref>. This improves the accuracy of information gain estimation compared to uniform random sampling, especially when the range of information gain values is large. EFB bundles exclusive features (mutually exclusive features that have no simultaneous non-zero values) into a single feature. This significantly reduces the complexity of histogram building from being proportional to the number of data points multiplied by the number of features to being proportional to the number of data points multiplied by the number of bundles. Since the number of bundles is much smaller than the number of features, this approach improves the training speed of the framework without sacrificing accuracy. The architecture of LightGBM is depicted in <xref ref-type="fig" rid="fig4">
      Figure 4
     </xref>.</p>
    <p>Parameters that can be tuned in LightGBM are max depth (maximum limit of tree depth is effective in controlling overfitting), categorical feature (feature to be used for model training), bagging fraction (proportion of data selected for each iteration.) num iterations (total count of iterations to be executed), num leaves (total count of leaves in a tree), max bin (maximum count of bins for grouping feature values), min data in a bin (least amount of data contained in a bin), task (operation to be performed, either training or prediction) and feature fraction (proportion of features selected for each iteration). Overfitting, which occurs when models excessively capture details and noise from the training data, resulting to poor performance on new data, should be watched for in decision trees. The model accuracy is enhanced and the risk of overfitting is reduced through K-fold cross-validation technique <xref ref-type="bibr" rid="scirp.146806-23">
      [23]
     </xref>.</p>
    <fig id="fig4" position="float">
     <label>Figure 4</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 4. Architecture of LightGBM.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId33.jpeg?20251029033932" />
    </fig>
    <p>Implementation</p>
    <p>The experimental methodology employed a systematic data partitioning approach, where the dataset (n = 60) was stratified and randomly split into training (n_train = 48, 80%) and test (n_test = 12, 20%) sets. For hyperparameter optimization, an exhaustive grid search strategy was implemented using GridSearchCV, coupled with a k-fold cross-validation mechanism (k = 5) to ensure robust model selection and mitigate overfitting risks. The grid search systematically explored the defined hyperparameter space to identify the optimal configuration that maximized model performance. The k-fold cross-validation procedure partitioned the training data into five stratified subsets, enabling comprehensive model evaluation across multiple data combinations while maintaining the statistical distribution of the target variable. This rigorous validation framework ensured the selected hyperparameters generalized well across different data subsets, while the held-out test set provided an unbiased assessment of the final model’s performance. <xref ref-type="table" rid="table2">
      Table 2
     </xref> presents the optimal hyperparameters for the model.</p>
    <table-wrap id="table1">
     <label>
      <xref ref-type="table" rid="table1">
       Table 1
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Table 2. Hyperparameter tuning for algorithms.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="27.01%"><p style="text-align:center">Algorithm</p></td> 
       <td class="custom-bottom-td acenter" width="31.02%"><p style="text-align:center">Hyperparameter</p></td> 
       <td class="custom-bottom-td acenter" width="41.97%"><p style="text-align:center">Optimal Tuned Values</p></td> 
      </tr> 
      <tr> 
       <td rowspan="3" class="custom-top-td acenter" width="27.01%"><p style="text-align:center">GBDT</p></td> 
       <td class="custom-top-td acenter" width="31.02%"><p style="text-align:center">n_estimators</p></td> 
       <td class="custom-top-td acenter" width="41.97%"><p style="text-align:center">300</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">learning_rate</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">0.01</p></td> 
      </tr> 
      <tr> 
       <td class="custom-bottom-td acenter" width="31.02%"><p style="text-align:center">max_depth</p></td> 
       <td class="custom-bottom-td acenter" width="41.97%"><p style="text-align:center">5</p></td> 
      </tr> 
      <tr> 
       <td rowspan="3" class="custom-top-td acenter" width="27.01%"><p style="text-align:center">LightGBM</p></td> 
       <td class="custom-top-td acenter" width="31.02%"><p style="text-align:center">n_estimators</p></td> 
       <td class="custom-top-td acenter" width="41.97%"><p style="text-align:center">100, 200, 300, 400, 500</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">learning_rate</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">0.1, 0.05, 0.01, 0.001</p></td> 
      </tr> 
      <tr> 
       <td class="custom-bottom-td acenter" width="31.02%"><p style="text-align:center">max_depth</p></td> 
       <td class="custom-bottom-td acenter" width="41.97%"><p style="text-align:center">3, 4, 5</p></td> 
      </tr> 
      <tr> 
       <td rowspan="5" class="custom-top-td acenter" width="27.01%"><p style="text-align:center">BPNN</p></td> 
       <td class="custom-top-td acenter" width="31.02%"><p style="text-align:center">hidden_layer_sizes</p></td> 
       <td class="custom-top-td acenter" width="41.97%"><p style="text-align:center">64</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">activation</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">Relu</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">solver</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">adam</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">max_iter</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">1000</p></td> 
      </tr> 
      <tr> 
       <td class="custom-bottom-td acenter" width="31.02%"><p style="text-align:center">random_state</p></td> 
       <td class="custom-bottom-td acenter" width="41.97%"><p style="text-align:center">42</p></td> 
      </tr> 
      <tr> 
       <td rowspan="4" class="custom-top-td acenter" width="27.01%"><p style="text-align:center">GNN</p></td> 
       <td class="custom-top-td acenter" width="31.02%"><p style="text-align:center">hidden_channels</p></td> 
       <td class="custom-top-td acenter" width="41.97%"><p style="text-align:center">224</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">dropout_rate</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">0.3</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">learning_rate</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">0.00425</p></td> 
      </tr> 
      <tr> 
       <td class="custom-bottom-td acenter" width="31.02%"><p style="text-align:center">epochs</p></td> 
       <td class="custom-bottom-td acenter" width="41.97%"><p style="text-align:center">500</p></td> 
      </tr> 
      <tr> 
       <td rowspan="4" class="custom-top-td acenter" width="27.01%"><p style="text-align:center">KAN</p></td> 
       <td class="custom-top-td acenter" width="31.02%"><p style="text-align:center">width</p></td> 
       <td class="custom-top-td acenter" width="41.97%"><p style="text-align:center">[5, 5]</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">grid</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">3</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">k (spline order)</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">2</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="31.02%"><p style="text-align:center">lambda</p></td> 
       <td class="acenter" width="41.97%"><p style="text-align:center">0.001</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>Theory</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>BPNN, originally discovered in 1969, has been the most widely used neural network because of its ability to learn complex relations <xref ref-type="bibr" rid="scirp.146806-24">
      [24]
     </xref>. BPNN enables the computation of the loss function gradient concerning all network weights <xref ref-type="bibr" rid="scirp.146806-14">
      [14]
     </xref>. The BPNN learning process involves forward propagation and backward propagation.</p>
    <p>During forward propagation, input signals enter the neural network from the input layer through hidden layers to the output layer <xref ref-type="bibr" rid="scirp.146806-25">
      [25]
     </xref>. The network’s weights and biases remain unchanged, and the current layer’s neuron outputs affect only the following layer. If the desired output isn’t reached, the process can switch to backpropagation to correct errors <xref ref-type="bibr" rid="scirp.146806-26">
      [26]
     </xref>. With backpropagation, the error signal is the difference between the actual and desired network outputs and sent from the output layer back through the layers to the input layer <xref ref-type="bibr" rid="scirp.146806-25">
      [25]
     </xref> <xref ref-type="bibr" rid="scirp.146806-27">
      [27]
     </xref>. The network’s weights are remodelled based on this error feedback, continuously adjusting them to bring the actual output closer to the expected output <xref ref-type="bibr" rid="scirp.146806-25">
      [25]
     </xref>. The weights of the network are then updated according to a gradient descent rule <xref ref-type="bibr" rid="scirp.146806-14">
      [14]
     </xref>. The architecture of BPNN is shown in <xref ref-type="fig" rid="fig5">
      Figure 5
     </xref>.</p>
    <p>Implementation</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>Feature preprocessing was implemented through the MinMaxScaler class from scikit-learn, which normalized input parameters to a uniform range, thereby preventing the dominance of features with larger magnitudes in the learning process. The dataset (n = 60) underwent stratified partitioning into training (n_train = 48, 80%) and test (n_test = 12, 20%) sets. To ensure robust model evaluation and hyperparameter tuning, especially given the limited dataset size, k-fold cross-validation with k = 5 was applied to the training data. This involved iteratively training and validating the model across five distinct subsets of the training data.</p>
    <fig id="fig5" position="float">
     <label>Figure 5</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 5. BPNN Architecture.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId34.jpeg?20251029033932" />
    </fig>
    <p>The neural network architecture was configured with specific hyperparameters: ReLU activation function, hidden layer dimensions of 64 neurons, and a maximum iteration limit of 1000 epochs (representing complete forward and backward passes of the entire dataset). A random state of 42 was established to ensure experimental reproducibility, while the network’s weight optimization employed the Adam (Adaptive Moment Estimation) optimizer. The Adam optimizer is known for its adaptive learning rate mechanism, achieved by tracking exponentially decaying averages of previous gradients and their squared values. This allows it to adjust the learning rate dynamically, improving convergence rates in various optimization problems <xref ref-type="bibr" rid="scirp.146806-28">
      [28]
     </xref>. The ReLU activation function is mathematically as shown in Equation (3):</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         h 
       </mi> 
       <mo>
         = 
       </mo> 
       <mi>
         max 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mn>
           0 
         </mn> 
         <mo>
           , 
         </mo> 
         <mi>
           a 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (3)</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>where a represents any real number, such that the function returns 0 for inputs less than or equal to 0, and returns a otherwise. A summary of the BPNN optimally tuned hyperparameter configuration is summarized in <xref ref-type="table" rid="table2">
      Table 2
     </xref>.</p>
    <p>Theory</p>
    <p>Graph Neural Networks (GNNs) extend traditional neural networks to operate on graph-structured data, making them effective for capturing complex relationships between data points. GNNs represent data as nodes interconnected by edges, where each node contains feature information, and the edges define the relationships between nodes <xref ref-type="bibr" rid="scirp.146806-29">
      [29]
     </xref>.</p>
    <p>The Graph Convolutional Network (GCN), first proposed by <xref ref-type="bibr" rid="scirp.146806-30">
      [30]
     </xref>, is a specific variant of GNN that generalizes convolutional operations to graph-structured data. In GCNs, each node’s representation is updated by aggregating information from its neighbours. The basic operation in a GCN layer is shown in Equation (4):</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             l 
           </mi> 
           <mo>
             + 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msup> 
       <mo>
         = 
       </mo> 
       <mi>
         σ 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msup> 
          <mover accent="true"> 
           <mi>
             D 
           </mi> 
           <mo>
             ˜ 
           </mo> 
          </mover> 
          <mrow> 
           <mo>
             − 
           </mo> 
           <mfrac> 
            <mn>
              1 
            </mn> 
            <mn>
              2 
            </mn> 
           </mfrac> 
          </mrow> 
         </msup> 
         <mover accent="true"> 
          <mi>
            A 
          </mi> 
          <mo>
            ˜ 
          </mo> 
         </mover> 
         <msup> 
          <mover accent="true"> 
           <mi>
             D 
           </mi> 
           <mo>
             ˜ 
           </mo> 
          </mover> 
          <mrow> 
           <mo>
             − 
           </mo> 
           <mfrac> 
            <mn>
              1 
            </mn> 
            <mn>
              2 
            </mn> 
           </mfrac> 
          </mrow> 
         </msup> 
         <msup> 
          <mi>
            H 
          </mi> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              l 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </msup> 
         <msup> 
          <mi>
            W 
          </mi> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              l 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </msup> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (4)</p>
    <p>where 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          H 
        </mi> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            l 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msup> 
      </mrow> 
     </math> represents the node features at layer 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        l 
      </mi> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         Ã 
       </mi> 
       <mo>
         = 
       </mo> 
       <mi>
         A 
       </mi> 
       <mo>
         + 
       </mo> 
       <mi>
         I 
       </mi> 
      </mrow> 
     </math> is the adjacency matrix with added self-connections, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mover accent="true"> 
       <mi>
         D 
       </mi> 
       <mo>
         ˜ 
       </mo> 
      </mover> 
     </math> is the degree matrix of 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        Ã 
      </mi> 
     </math>, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          W 
        </mi> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            l 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msup> 
      </mrow> 
     </math> is the trainable weight matrix for layer 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        l 
      </mi> 
     </math>, and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        σ 
      </mi> 
     </math> is a non-linear activation function <xref ref-type="bibr" rid="scirp.146806-31">
      [31]
     </xref>.</p>
    <p>For regression tasks like backbreak prediction, GNNs offer advantages by considering the relationships between different blasting scenarios rather than treating each instance independently. This approach enables the model to leverage similarities and patterns across different blast design parameters to improve prediction accuracy <xref ref-type="bibr" rid="scirp.146806-32">
      [32]
     </xref>.</p>
    <p>Implementation</p>
    <p>Feature normalization was implemented through MinMaxScaler to ensure uniform feature scaling, preventing features with larger magnitudes from dominating the learning process. The dataset (n = 60) underwent stratified partitioning into training (n_train = 48, 80%) and test (n_test = 12, 20%) sets depicted in <xref ref-type="fig" rid="fig3">
      Figure 3
     </xref>. To ensure robust model evaluation and hyperparameter optimization, particularly given the limited dataset size, k-fold cross-validation with k = 5 was applied to the training data. This involved iteratively training and validating the model across five distinct subsets of the training data.</p>
    <p>A k-nearest neighbors (kNN) graph was constructed to establish the topological structure of the data. The choice of a neighborhood size of k=10 for the kNN graph was determined through empirical evaluation during the hyperparameter tuning process, where various values of k were tested to identify the configuration that best captured the local data manifold while balancing computational efficiency and predictive performance. This selection aimed to ensure sufficient connectivity within the graph to propagate information effectively without introducing excessive noise from distant neighbors.</p>
    <p>A three-layer Graph Convolutional Network (GCN) architecture was implemented using the PyTorch Geometric framework. The network architecture incorporated dropout regularization to prevent overfitting. The model configuration featured the following components:</p>
    <p>i. Input layer: Accepting normalized blasting parameters,</p>
    <p>ii. Three graph convolutional layers with ReLU activations,</p>
    <p>iii. Two fully connected layers for the final prediction,</p>
    <p>iv. Dropout regularization between layers.</p>
    <p>Hyperparameter tuning for the GNN model, including the learning rate, number of hidden units in each layer, and dropout rates, was conducted using an exhaustive grid search strategy. This process systematically explored a predefined range of values for each hyperparameter, with the optimal combination selected based on the model’s performance during the 5-fold cross-validation. The training was conducted using the Adam optimizer, known for its adaptive learning rate mechanism, which dynamically adjusts learning rates to improve convergence. The mean squared error (MSE) loss function was employed for optimization, with model checkpoints saved periodically during training to capture the best performing model throughout the training epochs.</p>
    <p>Theory</p>
    <p>The Kolmogorov-Arnold Network (KAN) is a neural network architecture inspired by the Kolmogorov-Arnold representation theorem, which states that any multivariate continuous function can be represented as a composition of continuous functions of a single variable and addition operations <xref ref-type="bibr" rid="scirp.146806-33">
      [33]
     </xref>. Unlike traditional neural networks that use fixed activation functions, KANs employ learnable univariate activation functions represented as splines, providing greater flexibility in function approximation. KAN architecture is presented in <xref ref-type="fig" rid="fig6">
      Figure 6
     </xref>.</p>
    <p>The KAN’s mathematical foundation stems from Kolmogorov’s superposition theorem, which stipulates that any continuous function 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         f 
       </mi> 
       <mo>
         : 
       </mo> 
       <msup> 
        <mrow> 
         <mrow> 
          <mo>
            [ 
          </mo> 
          <mrow> 
           <mn>
             0 
           </mn> 
           <mo>
             , 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
          <mo>
            ] 
          </mo> 
         </mrow> 
        </mrow> 
        <mi>
          n 
        </mi> 
       </msup> 
       <mo>
         → 
       </mo> 
       <mi>
         R 
       </mi> 
      </mrow> 
     </math> is expressed in Equation (5)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         f 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mn>
            2 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            n 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            q 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mrow> 
          <mn>
            2 
          </mn> 
          <mi>
            n 
          </mi> 
          <mo>
            + 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
        </msubsup> 
        <mrow> 
         <msub> 
          <mi>
            Φ 
          </mi> 
          <mi>
            q 
          </mi> 
         </msub> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mstyle displaystyle="true"> 
            <msubsup> 
             <mo>
               ∑ 
             </mo> 
             <mrow> 
              <mi>
                p 
              </mi> 
              <mo>
                = 
              </mo> 
              <mn>
                1 
              </mn> 
             </mrow> 
             <mi>
               n 
             </mi> 
            </msubsup> 
            <mrow> 
             <msub> 
              <mi>
                ϕ 
              </mi> 
              <mrow> 
               <mi>
                 q 
               </mi> 
               <mo>
                 , 
               </mo> 
               <mi>
                 p 
               </mi> 
              </mrow> 
             </msub> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mrow> 
               <msub> 
                <mi>
                  x 
                </mi> 
                <mi>
                  p 
                </mi> 
               </msub> 
              </mrow> 
              <mo>
                ) 
              </mo> 
             </mrow> 
            </mrow> 
           </mstyle> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (5)</p>
    <p>where 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          Φ 
        </mi> 
        <mi>
          q 
        </mi> 
       </msub> 
      </mrow> 
     </math> and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          φ 
        </mi> 
        <mrow> 
         <mrow> 
          <mo>
            { 
          </mo> 
          <mrow> 
           <mi>
             q 
           </mi> 
           <mo>
             , 
           </mo> 
           <mi>
             p 
           </mi> 
          </mrow> 
          <mo>
            } 
          </mo> 
         </mrow> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> are continuous univariate functions.</p>
    <p>KANs implement this theorem by representing each univariate function as a spline with learnable parameters. The spline function of order 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        k 
      </mi> 
     </math> with grid size 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        g 
      </mi> 
     </math> has a form shown in Equation (6)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         s 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          x 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            0 
          </mn> 
         </mrow> 
         <mi>
           g 
         </mi> 
        </msubsup> 
        <mrow> 
         <msub> 
          <mi>
            c 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <msubsup> 
          <mi>
            B 
          </mi> 
          <mi>
            i 
          </mi> 
          <mi>
            k 
          </mi> 
         </msubsup> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            x 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (6)</p>
    <p>where 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msubsup> 
        <mi>
          B 
        </mi> 
        <mi>
          i 
        </mi> 
        <mi>
          k 
        </mi> 
       </msubsup> 
      </mrow> 
     </math> are basis splines and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          c 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math> are learnable coefficients <xref ref-type="bibr" rid="scirp.146806-34">
      [34]
     </xref>. This approach provides KANs with high expressivity while maintaining interpretability, as the network can be decomposed into simpler univariate functions.</p>
    <fig id="fig6" position="float">
     <label>Figure 6</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 6. KAN Architecture <xref ref-type="bibr" rid="scirp.146806-35">
        [35]
       </xref>.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId73.jpeg?20251029033933" />
    </fig>
    <p>Implementation</p>
    <p>The KAN implementation used a systematic approach for feature preprocessing, model optimization, and evaluation. The dataset (n = 60) underwent feature standardization using StandardScaler to normalize all input parameters to a consistent scale. The dataset was then stratified and randomly split into training (n_train = 48, 80%) and test (n_test = 12, 20%) sets.</p>
    <p>To ensure robust model evaluation and hyperparameter tuning, particularly given the limited dataset size, k-fold cross-validation with k = 5 was applied to the training data (the 48 data points). This involved iteratively training and validating the model across five distinct subsets of the training data, providing a more reliable estimate of the model’s generalization performance and assessing its stability.</p>
    <p>The architecture featured an input layer dimensioned according to the feature count, followed by a hidden layer of neurons, and an output layer with a single neuron. The model was configured with a specific grid size and spline order, which provided a balanced trade-off between model complexity and generalization capability. A regularization parameter was applied to prevent overfitting. The training was conducted using the Adam optimizer. The mean squared error loss function was employed for optimization, with model checkpoints saved periodically during training. The final model was then validated with the held-out test set (the remaining 12 datasets) to provide an unbiased assessment of its performance.</p>
    <p>K-fold cross-validation (CV) is a widely adopted and robust technique for evaluating machine learning models, particularly when aiming to ensure generalization capability and prevent overfitting <xref ref-type="bibr" rid="scirp.146806-36">
      [36]
     </xref>. Unlike a single train-test split, which can yield biased performance estimates, k-fold CV partitions the dataset into k equal subsets. The model is then iteratively trained on k − 1 folds and validated on the remaining fold, repeating this process k times so that each data point serves as both a training and validation example <xref ref-type="bibr" rid="scirp.146806-36">
      [36]
     </xref>.</p>
    <p>This method provides a more reliable estimate of a model’s generalization error by averaging performance metrics across all k iterations <xref ref-type="bibr" rid="scirp.146806-37">
      [37]
     </xref>. Furthermore, it offers crucial insights into model stability; consistent performance across folds indicates robustness and reduced sensitivity to specific data compositions <xref ref-type="bibr" rid="scirp.146806-38">
      [38]
     </xref>. Consequently, k-fold cross-validation is an essential tool for developing reliable and generalizable predictive models. The performance metrics (e.g., R<sup>2</sup>, MAE) are computed for each iteration, and the final reported performance is typically the average of these k measurements <xref ref-type="bibr" rid="scirp.146806-38">
      [38]
     </xref>.</p>
    <p>Theory</p>
    <p>Hyperparameters govern model capacity and regularisation but are not learned directly from the data; their values must be specified a priori and strongly influence generalisation performance <xref ref-type="bibr" rid="scirp.146806-38">
      [38]
     </xref>. Grid search with cross-validation remains a widely adopted, reproducible strategy for hyperparameter selection because it exhaustively evaluates a discrete, user-defined search space under a consistent validation protocol <xref ref-type="bibr" rid="scirp.146806-36">
      [36]
     </xref> <xref ref-type="bibr" rid="scirp.146806-39">
      [39]
     </xref>. Formally, let 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         ℋ 
       </mi> 
       <mo>
         = 
       </mo> 
       <mi>
         h 
       </mi> 
      </mrow> 
     </math> denote the Cartesian product of candidate values for each hyperparameter, 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          { 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            V 
          </mi> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           , 
         </mo> 
         <mo>
           ⋯ 
         </mo> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            V 
          </mi> 
          <mi>
            J 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          } 
        </mo> 
       </mrow> 
      </mrow> 
     </math>. For a K-fold scheme, the cross-validated empirical risk for configuration h is represented in Equation (7).</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mover accent="true"> 
         <mi>
           R 
         </mi> 
         <mo>
           ^ 
         </mo> 
        </mover> 
        <mrow> 
         <mi>
           C 
         </mi> 
         <mi>
           V 
         </mi> 
        </mrow> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          h 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          K 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            k 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           K 
         </mi> 
        </msubsup> 
        <mrow> 
         <msubsup> 
          <mi>
            L 
          </mi> 
          <mrow> 
           <mi>
             v 
           </mi> 
           <mi>
             a 
           </mi> 
           <mi>
             l 
           </mi> 
          </mrow> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mi>
              k 
            </mi> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
         </msubsup> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msubsup> 
            <mi>
              f 
            </mi> 
            <mi>
              h 
            </mi> 
            <mrow> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mrow> 
               <mo>
                 − 
               </mo> 
               <mi>
                 k 
               </mi> 
              </mrow> 
              <mo>
                ) 
              </mo> 
             </mrow> 
            </mrow> 
           </msubsup> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (7)</p>
    <p>where 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msubsup> 
        <mi>
          f 
        </mi> 
        <mi>
          h 
        </mi> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mo>
             − 
           </mo> 
           <mi>
             k 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msubsup> 
      </mrow> 
     </math> is the model trained on the K-1 non-held-out folds and 
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msubsup> 
        <mi>
          L 
        </mi> 
        <mrow> 
         <mi>
           v 
         </mi> 
         <mi>
           a 
         </mi> 
         <mi>
           l 
         </mi> 
        </mrow> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mi>
            k 
          </mi> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </msubsup> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mo>
          ⋅ 
        </mo> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the loss evaluated on the k-th validation fold. The selected configuration minimises this estimate expressed in Equation (8)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          h 
        </mi> 
        <mo>
          * 
        </mo> 
       </msup> 
       <mo>
         = 
       </mo> 
       <mi>
         arg 
       </mi> 
       <mi>
         arg 
       </mi> 
       <msub> 
        <mover accent="true"> 
         <mi>
           R 
         </mi> 
         <mo>
           ^ 
         </mo> 
        </mover> 
        <mrow> 
         <mi>
           C 
         </mi> 
         <mi>
           V 
         </mi> 
        </mrow> 
       </msub> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          h 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (8)</p>
    <p>Because the grid is discrete, its size grows multiplicatively with the number of tuned hyperparameters, this is expressed in Equation (9).</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mrow> 
        <mo>
          | 
        </mo> 
        <mi>
          H 
        </mi> 
        <mo>
          | 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∏ 
         </mo> 
         <mrow> 
          <mi>
            j 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           J 
         </mi> 
        </msubsup> 
        <mrow> 
         <mrow> 
          <mo>
            | 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              V 
            </mi> 
            <mi>
              j 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            | 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (9)</p>
    <p>so practitioners often balance coverage and computational cost by focusing on the most influential knobs (e.g., learning rate, tree depth, number of estimators for boosting; hidden width and regularisation for neural networks). Compared with alternatives such as random search <xref ref-type="bibr" rid="scirp.146806-40">
      [40]
     </xref> or Bayesian optimisation <xref ref-type="bibr" rid="scirp.146806-41">
      [41]
     </xref>, grid search is simple, deterministic, and easy to reproduce; however, it can be sample-inefficient in high-dimensional spaces. Cross-validation provides an approximately unbiased estimate of out-of-sample error when folds are i.i.d., but model selection on the same resamples can introduce selection bias; a strictly unbiased assessment may use a held-out test set or nested CV <xref ref-type="bibr" rid="scirp.146806-42">
      [42]
     </xref>.</p>
    <p>Several best practices from the literature improve the robustness of GridSearchCV in small-n regression: (i) construct pipelines that apply all preprocessing (e.g., scaling, encoding) within each fold to prevent leakage; (ii) stratify folds by target quantiles when feasible to stabilise error estimates in heteroscedastic settings; (iii) use evaluation metrics aligned with the study objective (e.g., MAE/MSE/R² for this work); and (iv) fix random seeds and shuffle splits to enhance reproducibility <xref ref-type="bibr" rid="scirp.146806-36">
      [36]
     </xref> <xref ref-type="bibr" rid="scirp.146806-39">
      [39]
     </xref>. For boosted trees, literature commonly reports strong sensitivity to learning rate and depth lower learning rates with more estimators tend to generalise better, provided early-stopping or adequate regularisation is used <xref ref-type="bibr" rid="scirp.146806-38">
      [38]
     </xref> <xref ref-type="bibr" rid="scirp.146806-43">
      [43]
     </xref>. For neural networks, grid sizes are often kept modest and centred on architecture (width, depth) and optimiser settings because training is comparatively expensive <xref ref-type="bibr" rid="scirp.146806-44">
      [44]
     </xref>.</p>
    <p>GridSearchCV in the Context of This Study</p>
    <p>Consistent with prior work, this study employs scikit-learn’s GridSearchCV to systematise hyperparameter selection for tabular learners. For GBDT and LightGBM, grids spanned the principal capacity and optimisation controls (e.g., n_estimators, learning_rate, max_depth), evaluated under K = 10 folds, with performance aggregated via the same loss functions later used for testing (MAE, MSE, and R<sup>2</sup>). The resulting configurations (see <xref ref-type="table" rid="table2">
      Table 2
     </xref>) reflect the literature’s guidance on conservative learning rates with sufficient estimators for boosted ensembles. For neural models (BPNN, GNN, KAN), hyperparameters were tuned using compact, problem-informed grids around architecture and regularisation settings while preserving the same K-fold protocol to ensure comparability across learners. All preprocessing (e.g., scaling via MinMaxScaler/StandardScaler) was embedded in fold-wise pipelines to eliminate information leakage, and the final generalisation estimates were computed on the held-out test partition to guard against selection bias.</p>
   </sec>
   <sec id="s2_5">
    <title>2.5. Evaluation Metrics Considered</title>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>Machine learning prediction models’ efficacy and performance can be assessed using evaluation metrics. Mean absolute error (MAE), coefficient of determination (R<sup>2</sup>), and mean squared error (MSE) are the three metrics used in this study to assess how well the models performed. R<sup>2</sup> evaluates the extent to which the observed results align with the predicted values from the regression line. This assessment provides insights into the degree to which the independent variable accounts for the variability in the model. Essentially, it gauges the effectiveness of the model in explaining the dataset. Mathematically R<sup>2</sup> can be expressed as shown in Equation (10) <xref ref-type="bibr" rid="scirp.146806-45">
      [45]
     </xref></p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mtext>
          R 
        </mtext> 
        <mn>
          2 
        </mn> 
       </msup> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mtext>
           SSR 
         </mtext> 
        </mrow> 
        <mrow> 
         <mtext>
           SST 
         </mtext> 
        </mrow> 
       </mfrac> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mrow> 
         <mstyle displaystyle="true"> 
          <mo>
            ∑ 
          </mo> 
          <mrow> 
           <msup> 
            <mrow> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mrow> 
               <msub> 
                <mover accent="true"> 
                 <mi>
                   y 
                 </mi> 
                 <mo>
                   ^ 
                 </mo> 
                </mover> 
                <mi>
                  i 
                </mi> 
               </msub> 
               <mo>
                 − 
               </mo> 
               <munder accentunder="true"> 
                <mi>
                  y 
                </mi> 
                <mo>
                  _ 
                </mo> 
               </munder> 
              </mrow> 
              <mo>
                ) 
              </mo> 
             </mrow> 
            </mrow> 
            <mn>
              2 
            </mn> 
           </msup> 
          </mrow> 
         </mstyle> 
        </mrow> 
        <mrow> 
         <mstyle displaystyle="true"> 
          <mo>
            ∑ 
          </mo> 
          <mrow> 
           <msup> 
            <mrow> 
             <mrow> 
              <mo>
                ( 
              </mo> 
              <mrow> 
               <msub> 
                <mi>
                  y 
                </mi> 
                <mi>
                  i 
                </mi> 
               </msub> 
               <mo>
                 − 
               </mo> 
               <munder accentunder="true"> 
                <mi>
                  y 
                </mi> 
                <mo>
                  _ 
                </mo> 
               </munder> 
              </mrow> 
              <mo>
                ) 
              </mo> 
             </mrow> 
            </mrow> 
            <mn>
              2 
            </mn> 
           </msup> 
          </mrow> 
         </mstyle> 
        </mrow> 
       </mfrac> 
      </mrow> 
     </math> (10)</p>
    <p>where SSR is Sum Square of Residuals (squared difference between the predicted ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mover accent="true"> 
         <mi>
           y 
         </mi> 
         <mo>
           ^ 
         </mo> 
        </mover> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math>) and the average value ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math>)). SST is Sum Square of Total (squared difference between the actual ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        y 
      </mi> 
     </math>) and average value ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          y 
        </mi> 
        <mi>
          i 
        </mi> 
       </msub> 
      </mrow> 
     </math>)). The MAE is a measure of the prediction error of a model; it is measured by taking the average of the absolute difference between actual values and the predictions. For this, the number of datapoints (n), actual output value (y) and predicted output value ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mover accent="true"> 
       <mi>
         y 
       </mi> 
       <mo>
         ^ 
       </mo> 
      </mover> 
     </math>) are considered Equation (11) <xref ref-type="bibr" rid="scirp.146806-46">
      [46]
     </xref>.</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mtext>
         MAE 
       </mtext> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          n 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </msubsup> 
        <mrow> 
         <mrow> 
          <mo>
            | 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              y 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
           <mo>
             − 
           </mo> 
           <msub> 
            <mover accent="true"> 
             <mi>
               y 
             </mi> 
             <mo>
               ^ 
             </mo> 
            </mover> 
            <mi>
              i 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            | 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (11)</p>
    <p>MSE is a key metric in statistical analysis and machine learning, used to measure the variation between actual and expected results. The average of the squared deviations between these values is used to compute it, which serves to magnify larger errors, making it particularly useful for models that aim to minimise prediction errors. The mathematical expression for MSE is the expectation of the squared deviation of an estimator from the true parameter value, encompassing both the variance and the bias squared of the estimator <xref ref-type="bibr" rid="scirp.146806-47">
      [47]
     </xref>. Mathematically, MSE can be expressed as shown in Equation (12)</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mtext>
         MSE 
       </mtext> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mi>
          n 
        </mi> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           n 
         </mi> 
        </msubsup> 
        <mrow> 
         <msup> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                y 
              </mi> 
              <mi>
                i 
              </mi> 
             </msub> 
             <mo>
               − 
             </mo> 
             <msub> 
              <mover accent="true"> 
               <mi>
                 y 
               </mi> 
               <mo>
                 ^ 
               </mo> 
              </mover> 
              <mi>
                i 
              </mi> 
             </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mn>
            2 
          </mn> 
         </msup> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (12)</p>
    <p>
     <xref ref-type="bibr" rid="scirp.146806-"></xref>where y is the true value, 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mover accent="true"> 
       <mi>
         y 
       </mi> 
       <mo>
         ^ 
       </mo> 
      </mover> 
     </math> is the predicted value and n is the number of datapoints.</p>
   </sec>
  </sec><sec id="s3">
   <title>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>3. Results and Discussion</title>
   <p>Model performance</p>
   <p>The integration of k-fold cross-validation into the model development process significantly enhanced the robustness and reliability of the performance evaluation for the five machine learning models: Gradient Boosting Decision Trees (GBDT), Light Gradient Boosting Machine (LightGBM), Back Propagation Neural Network (BPNN), Graph Neural Network (GNN), and Kolmogorov-Arnold Network (KAN). This rigorous validation approach, particularly crucial given the dataset’s size, provides a more accurate representation of each model’s generalization capabilities. The comparative analysis, encompassing both cross-validation (CV) and independent test set results, revealed distinct patterns in their predictive efficacy for backbreak estimation. The evaluation metrics; Coefficient of Determination (R<sup>2</sup>), Mean Absolute Error (MAE), and Mean Squared Error (MSE) were employed to quantitatively assess the models’ performance.</p>
   <p>Cross-Validation Performance</p>
   <p>
    <xref ref-type="table" rid="table3">
     Table 3
    </xref> presents the detailed cross-validation performance results for each model. These metrics, averaged across five folds, offer a more stable and less biased estimate of the models’ true performance. GBDT demonstrated a robust performance with an R<sup>2</sup> of 0.6438 ± 0.2196, MAE of 0.5812 ± 0.1999, and MSE of 1.0482 ± 0.5176. While these values might appear lower than typical R<sup>2</sup> values for highly accurate models, the standard deviations indicate a degree of variability across the folds, which is expected in cross-validation and provides a more realistic view of performance. KAN exhibited the highest R<sup>2</sup> in cross-validation at 0.7753 ± 0.1901, coupled with a MAE of 0.5498 ± 0.1266 and MSE of 0.6178 ± 0.3083, suggesting its strong and consistent performance across different data subsets. BPNN showed an R<sup>2</sup> of 0.3455 ± 0.4226, MAE of 1.0498 ± 0.4577, and MSE of 2.0794 ± 1.0477. LightGBM and GNN, however, displayed less favorable performance in the cross-validation phase, with negative R<sup>2</sup> values, indicating that their predictions were worse than simply predicting the mean of the target variable. Specifically, LightGBM had an R<sup>2</sup> of −0.2728 ± 0.2903, MAE of 1.7055 ± 0.2245, and MSE of 4.2453 ± 1.1320, while GNN showed an R<sup>2</sup> of −0.2917 ± 0.1771, MAE of 1.7302 ± 0.2405, and MSE of 4.4082 ± 1.2973. The negative R<sup>2</sup> values for LightGBM and GNN in CV suggest potential issues with model stability or suitability for this specific dataset under cross-validation conditions.</p>
   <table-wrap id="table2">
    <label>
     <xref ref-type="table" rid="table2">
      Table 2
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.146806-"></xref>Table 3. Model CV performance results.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td rowspan="2" class="acenter" width="16.81%"><p style="text-align:center">ML Algorithm</p></td> 
      <td class="custom-bottom-td acenter" width="83.19%" colspan="3"><p style="text-align:center">Evaluation Metrics</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td custom-top-td acenter" width="27.72%"><p style="text-align:center">R<sup>2</sup></p></td> 
      <td class="custom-bottom-td custom-top-td acenter" width="27.74%"><p style="text-align:center">MAE</p></td> 
      <td class="custom-bottom-td custom-top-td acenter" width="27.74%"><p style="text-align:center">MSE</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="16.81%"><p style="text-align:center">GBDT</p></td> 
      <td class="custom-top-td acenter" width="27.72%"><p style="text-align:center">0.6438 Â ± 0.2196</p></td> 
      <td class="custom-top-td acenter" width="27.74%"><p style="text-align:center">0.5812 Â ± 0.1999</p></td> 
      <td class="custom-top-td acenter" width="27.74%"><p style="text-align:center">1.0482 Â ± 0.5176</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="16.81%"><p style="text-align:center">LightGBM</p></td> 
      <td class="acenter" width="27.72%"><p style="text-align:center">−0.2728 Â ± 0.2903</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">1.7055 Â ± 0.2245</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">4.2453 Â ± 1.1320</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="16.81%"><p style="text-align:center">BPNN</p></td> 
      <td class="acenter" width="27.72%"><p style="text-align:center">0.3455 Â ± 0.4226</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">1.0498 Â ± 0.4577</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">2.0794 Â ± 1.0477</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="16.81%"><p style="text-align:center">GNN</p></td> 
      <td class="acenter" width="27.72%"><p style="text-align:center">−0.2917 Â ± 0.1771</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">1.7302 Â ± 0.2405</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">4.4082 Â ± 1.2973</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="16.81%"><p style="text-align:center">KAN</p></td> 
      <td class="acenter" width="27.72%"><p style="text-align:center">0.7753 Â ± 0.1901</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">0.5498 Â ± 0.1266</p></td> 
      <td class="acenter" width="27.74%"><p style="text-align:center">0.6178 Â ± 0.3083</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <p>Test Set Performance</p>
   <p>
    <xref ref-type="table" rid="table4">
     Table 4
    </xref> summarizes the performance of the models on the independent test set, providing an unbiased assessment of their generalization capabilities on unseen data. In this crucial evaluation, GBDT emerged as the top-performing model, achieving an impressive R<sup>2</sup> of 0.9728, a MAE of 0.2603, and an MSE of 0.1456. This indicates that GBDT can explain approximately 97.28% of the variance in backbreak predictions on new data, with a low average absolute error and mean squared error. BPNN also demonstrated strong performance on the test set, with an R<sup>2</sup> of 0.9289, MAE of 0.456, and MSE of 0.3798, making it the second-best performer. KAN, despite its strong CV performance, showed a slightly lower R<sup>2</sup> of 0.8719 on the test set, with a MAE of 0.6107 and MSE of 0.6842. LightGBM improved significantly from its CV performance, achieving an R<sup>2</sup> of 0.7276, MAE of 0.9778, and MSE of 1.4553. GNN, however, continued to struggle, exhibiting a negative R<sup>2</sup> of −0.0184, MAE of 1.9642, and MSE of 5.4418, reinforcing the observation that it is not well-suited for this specific prediction task.</p>
   <table-wrap id="table3">
    <label>
     <xref ref-type="table" rid="table3">
      Table 3
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.146806-"></xref>Table 4. Model test performance results.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td rowspan="2" class="acenter" width="26.17%"><p style="text-align:center">ML Algorithm</p></td> 
      <td class="custom-bottom-td acenter" width="73.83%" colspan="3"><p style="text-align:center">Evaluation Metrics</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td custom-top-td acenter" width="24.60%"><p style="text-align:center">R<sup>2</sup></p></td> 
      <td class="custom-bottom-td custom-top-td acenter" width="24.62%"><p style="text-align:center">MAE</p></td> 
      <td class="custom-bottom-td custom-top-td acenter" width="24.62%"><p style="text-align:center">MSE</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="26.17%"><p style="text-align:center">GBDT</p></td> 
      <td class="custom-top-td acenter" width="24.60%"><p style="text-align:center">0.9728</p></td> 
      <td class="custom-top-td acenter" width="24.62%"><p style="text-align:center">0.2603</p></td> 
      <td class="custom-top-td acenter" width="24.62%"><p style="text-align:center">0.1456</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="26.17%"><p style="text-align:center">LightGBM</p></td> 
      <td class="acenter" width="24.60%"><p style="text-align:center">0.7276</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">0.9778</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">1.4553</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="26.17%"><p style="text-align:center">BPNN</p></td> 
      <td class="acenter" width="24.60%"><p style="text-align:center">0.9289</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">0.456</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">0.3798</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="26.17%"><p style="text-align:center">GNN</p></td> 
      <td class="acenter" width="24.60%"><p style="text-align:center">−0.0184</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">1.9642</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">5.4418</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="26.17%"><p style="text-align:center">KAN</p></td> 
      <td class="acenter" width="24.60%"><p style="text-align:center">0.8719</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">0.6107</p></td> 
      <td class="acenter" width="24.62%"><p style="text-align:center">0.6842</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <p>Comparison of CV and Test Results</p>
   <p>
    <xref ref-type="fig" rid="fig7">
     Figure 7
    </xref> visually represents the model performance comparison of test and CV results. The results presented in <xref ref-type="table" rid="table3">
     Table 3
    </xref> and <xref ref-type="table" rid="table4">
     Table 4
    </xref> align with the general trends observed in <xref ref-type="fig" rid="fig7">
     Figure 7
    </xref>, particularly highlighting the superior performance of GBDT and BPNN. The figure illustrates the R<sup>2</sup> values, MAE, and MSE for both CV and test sets, allowing for a direct visual comparison. The significant improvement in R<sup>2</sup> for GBDT from CV (0.6438) to test (0.9728) underscores the effectiveness of the k-fold cross-validation in hyperparameter tuning and model selection, leading to a highly generalized model. Similarly, BPNN also shows strong and consistent performance across both CV and test sets. KAN, while performing exceptionally well in CV, shows a slight drop in R<sup>2</sup> on the test set, which is still a commendable performance.</p>
   <fig id="fig7" position="float">
    <label>Figure 7</label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 7. Model Performance Comparison of Test and CV results.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId106.jpeg?20251029033935" />
   </fig>
   <p>The negative R<sup>2</sup> values for LightGBM and GNN in CV, and GNN in the test set, are critical observations. A negative R<sup>2</sup> indicates that the model performs worse than a simple horizontal line (mean of the observed data), suggesting that these models are not suitable for predicting backbreak with this dataset and methodology. This could be due to various factors, including the complexity of the models relative to the dataset size, or their inability to capture the underlying relationships effectively.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>Analysis of the Best Performing Model: GBDT</p>
   <p>The Gradient Boosting Decision Trees (GBDT) model consistently demonstrated superior performance, particularly on the independent test set, making it the best-performing model for backbreak prediction in this study. Its high R<sup>2</sup> value of 0.9728 signifies that the model effectively captures the complex, non-linear relationships between the input blasting parameters and backbreak. The low MAE (0.2603) and MSE (0.1456) further confirm its accuracy, indicating minimal deviation between predicted and actual backbreak values. The strength of GBDT lies in its ensemble learning approach, where it sequentially builds decision trees, with each new tree correcting the errors of the previous ones. This iterative refinement process, coupled with its ability to handle various data types and capture intricate interactions between features, contributes to its robust predictive power.</p>
   <p>The implementation of k-fold cross-validation during the training phase further optimized GBDT’s hyperparameters, ensuring its generalization capability and preventing overfitting. The model’s ability to accurately predict backbreak has significant practical implications for mining operations, enabling more precise blast design, improved safety, and enhanced operational efficiency.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>The sensitivity analysis also conducted for the GBDT model in <xref ref-type="fig" rid="fig8">
     Figure 8
    </xref> provided crucial insights into the relative importance of input parameters in predicting backbreak. Powder factor emerged as the most influential parameter, followed by burden and stemming height, indicating their critical role in determining backbreak extent. Geometric stiffness demonstrated moderate influence, while stemming showed relatively lower effects. This hierarchical understanding of parameter importance translates directly to practical mining operations.</p>
   <fig id="fig8" position="float">
    <label>Figure 8</label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.146806-"></xref>Figure 8. GBDT model performance analysis.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2802695-rId107.jpeg?20251029033934" />
   </fig>
   <p>Implications to Research and Practice</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>Despite the inherent limitations of a relatively small dataset (n = 60) and data from a single mine site, this study offers significant insights into machine learning applications for backbreak prediction. The integration of k-fold cross-validation provides a more robust assessment of model generalization, particularly highlighting the Gradient Boosting Decision Trees (GBDT) model’s exceptional performance on the independent test set (R<sup>2</sup> = 0.9728). This high accuracy suggests GBDT’s strong potential for practical application in optimizing blast designs, leading to improved safety, reduced operational costs, and enhanced efficiency in open-pit mining. The consistent performance of BPNN and the promising, albeit slightly lower, performance of KAN also indicate their utility. While the results are preliminary and necessitate further validation with larger, more diverse datasets, they underscore the transformative potential of advanced machine learning techniques in addressing complex mining challenges and guiding more precise and safer blasting operations.</p>
  </sec><sec id="s4">
   <title>4. Conclusions</title>
   <p>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>This study rigorously evaluated the effectiveness of five machine learning algorithms Gradient Boosting Decision Trees (GBDT), Light Gradient Boosting Machine (LightGBM), Back Propagation Neural Network (BPNN), Graph Neural Network (GNN), and Kolmogorov-Arnold Network (KAN) in predicting blast-induced backbreak in open-pit mining operations at Goldfields Ghana Limited, Damang Mine. By integrating k-fold cross-validation and assessing performance on an independent test set, the analysis provided a comprehensive understanding of each model’s generalization capabilities.</p>
   <p>The GBDT model consistently demonstrated superior performance, particularly on the independent test set, achieving an impressive R<sup>2</sup> of 0.9728, a Mean Absolute Error (MAE) of 0.2603, and a Mean Squared Error (MSE) of 0.1456. This highlights its robust predictive power and ability to capture complex non-linear relationships in backbreak prediction. The BPNN model also exhibited strong performance on the test set (R<sup>2</sup> = 0.9289, MAE = 0.456, MSE = 0.3798), positioning it as a highly effective alternative. KAN, while showing excellent cross-validation performance (R<sup>2</sup> = 0.7753 ± 0.1901), maintained a commendable R<sup>2</sup> of 0.8719 on the test set. In contrast, LightGBM showed moderate performance on the test set (R<sup>2</sup> = 0.7276), while GNN consistently struggled, exhibiting negative R<sup>2</sup> values in both cross-validation and test sets, indicating its unsuitability for this specific task.</p>
   <p>The findings underscore the significant potential of advanced machine learning approaches, particularly GBDT and BPNN, for enhancing backbreak prediction accuracy. While these promising results are based on a limited dataset and require further validation with larger, more diverse datasets from multiple mine sites, they establish a strong foundation for the application of machine learning in optimizing blast design and improving safety and efficiency in open-pit mining. Future research should focus on expanding data collection, incorporating additional geological and operational parameters, and exploring hybrid algorithms or ensemble methods that leverage the strengths of top-performing models like GBDT and BPNN. Further investigation into interpretable models like KAN could also provide valuable insights into the underlying physical relationships. Ultimately, integrating these predictive models with real-time monitoring systems holds the key to achieving more robust and adaptable backbreak prediction solutions.</p>
  </sec><sec id="s5">
   <title>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>5. Limitations</title>
   <p>This study provides a comprehensive analysis of various machine learning algorithms for backbreak prediction; however, it is important to acknowledge certain limitations, primarily concerning the scope of input variables. While our dataset from Goldfields Ghana Limited, Damang Mine, included critical parameters such as burden, spacing, stemming height, powder factor, and geometric stiffness, other potentially influential variables were not incorporated. Specifically, factors such as bench height, rock density, and explosive type were omitted from the analysis. This exclusion was primarily due to the inconsistent availability and recording of these specific data points within the operational dataset provided.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.146806-"></xref>It is recognized that these omitted variables can significantly influence blast-induced backbreak. For instance, bench height plays a crucial role in determining the burden and spacing design, and variations in bench geometry can alter stress distribution within the rock mass, potentially affecting the extent and pattern of backbreak <xref ref-type="bibr" rid="scirp.146806-48">
     [48]
    </xref>. Similarly, rock density is a fundamental geomechanical property that dictates the rock’s response to explosive energy; variations in density can lead to differential fragmentation and backbreak. Furthermore, the type of explosive used, including its energy content, detonation velocity, and coupling, directly impacts the energy transfer to the rock and the resulting blast performance, including backbreak <xref ref-type="bibr" rid="scirp.146806-49">
     [49]
    </xref>. Future research could benefit from incorporating these variables to develop more robust and universally applicable backbreak prediction models, provided that comprehensive and consistent data collection for these parameters becomes feasible.</p>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.146806-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Monjezi, M., Rezaei, M. and Yazdian, A. (2010) Prediction of Backbreak in Open-Pit Blasting Using Fuzzy Set Theory. Expert Systems with Applications, 37, 2637-2643. &gt;https://doi.org/10.1016/j.eswa.2009.08.014
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Shamsoddin Saeed, M., Jalalifar, H., Shamsoddini, H. and Darbor, M. (2022) A Comparative Study on the Application of Regression-PSO and ANN Methods to Predict Backbreak in Open-Pit Mines. Analytical and Numerical Methods in Mining Engineering, 11, 55-66.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ghasemi, E., Amnieh, H.B. and Bagherpour, R. (2016) Assessment of Backbreak Due to Blasting Operation in Open Pit Mines: A Case Study. Environmental Earth Sciences, 75, Article No. 552. &gt;https://doi.org/10.1007/s12665-016-5354-6
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jing, L. (2003) A Review of Techniques, Advances and Outstanding Issues in Numerical Modelling for Rock Mechanics and Rock Engineering. International Journal of Rock Mechanics and Mining Sciences, 40, 283-353. &gt;https://doi.org/10.1016/s1365-1609(03)00013-3
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Cunningham, C.V.B. (2005) The Kuz-Ram Fragmentation Model—20 Years on. Brighton Conference Proceedings, European Federation of Explosives Engineers, Brighton, 13-16 September 2005, 201-210.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Morris, J.P., Rubin, M.B., Block, G.I. and Bonner, M.P. (2006) Simulations of Fracture and Fragmentation of Geologic Materials Using Combined FEM/DEM Analysis. International Journal of Impact Engineering, 33, 463-473. &gt;https://doi.org/10.1016/j.ijimpeng.2006.09.006
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Monjezi, M. and Dehghani, H. (2008) Evaluation of Effect of Blasting Pattern Parameters on Back Break Using Neural Networks. International Journal of Rock Mechanics and Mining Sciences, 45, 1446-1453. &gt;https://doi.org/10.1016/j.ijrmms.2008.02.007
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Du, S., Feng, G., Wang, J., Feng, S., Malekian, R. and Li, Z. (2019) A New Machine-Learning Prediction Model for Slope Deformation of an Open-Pit Mine: An Evaluation of Field Data. Energies, 12, Article No. 1288. &gt;https://doi.org/10.3390/en12071288
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Alamdari, S., Basiri, M., Mousavi, A. and Soofastaei, A. (2022) Application of Machine Learning Techniques to Predict Haul Truck Fuel Consumption in Open-Pit Mines. Journal of Mining and Environment, 13, 69-85.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Li, L., Zhang, R., Sun, J., He, Q., Kong, L. and Liu, X. (2021) Monitoring and Prediction of Dust Concentration in an Open-Pit Mine Using a Deep-Learning Algorithm. Journal of Environmental Health Science and Engineering, 19, 401-414. &gt;https://doi.org/10.1007/s40201-021-00613-0
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Xue, Y., Bai, C., Qiu, D., Kong, F. and Li, Z. (2020) Predicting Rockburst with Database Using Particle Swarm Optimization and Extreme Learning Machine. Tunnelling and Underground Space Technology, 98, Article ID: 103287. &gt;https://doi.org/10.1016/j.tust.2020.103287
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ke, G., et al. (2017) LightGBM: A highly efficient Gradient Boosting Decision Tree. Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 3146-3154.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref13">
    <label>13</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Liang, W., Luo, S., Zhao, G. and Wu, H. (2020) Predicting Hard Rock Pillar Stability Using GBDT, XGBoost, and LightGBM Algorithms. Mathematics, 8, Article No. 765. &gt;https://doi.org/10.3390/math8050765
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref14">
    <label>14</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Li, M. (2024) Comprehensive Review of Backpropagation Neural Networks. Academic Journal of Science and Technology, 9, 150-154. &gt;https://doi.org/10.54097/51y16r47
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref15">
    <label>15</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Faramarzi, F., Ebrahimi Farsangi, M.A. and Mansouri, H. (2012) A Res-Based Model for Risk Assessment and Prediction of Backbreak in Bench Blasting. Rock Mechanics and Rock Engineering, 46, 877-887. &gt;https://doi.org/10.1007/s00603-012-0298-y
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref16">
    <label>16</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Khandelwal, M. and Monjezi, M. (2012) Prediction of Backbreak in Open-Pit Blasting Operations Using the Machine Learning Method. Rock Mechanics and Rock Engineering, 46, 389-396. &gt;https://doi.org/10.1007/s00603-012-0269-3
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref17">
    <label>17</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Russell, R. (2018) Machine Learning: Step-by-Step Guide to Implement Machine Learning Algorithms with Python.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref18">
    <label>18</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Tatsat, H., Puri, S. and Lookabaugh, B. (2020) From Building Trading Strategies to Robo-Advisors Using Python.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref19">
    <label>19</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Saied, M., Guirguis, S. and Madbouly, M. (2023) A Comparative Study of Using Boosting-Based Machine Learning Algorithms for IoT Network Intrusion Detection. International Journal of Computational Intelligence Systems, 16, Article No. 177. &gt;https://doi.org/10.1007/s44196-023-00355-x
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref20">
    <label>20</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Alcolea, A. and Resano, J. (2021) FPGA Accelerator for Gradient Boosting Decision Trees. Electronics, 10, Article No. 314. &gt;https://doi.org/10.3390/electronics10030314
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref21">
    <label>21</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Sai, M.J., Chettri, P., Panigrahi, R., Garg, A., Bhoi, A.K. and Barsocchi, P. (2023) An Ensemble of Light Gradient Boosting Machine and Adaptive Boosting for Prediction of Type-2 Diabetes. International Journal of Computational Intelligence Systems, 16, Article No. 14. &gt;https://doi.org/10.1007/s44196-023-00184-y
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref22">
    <label>22</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Chen, T., Xu, J., Ying, H., Chen, X., Feng, R., Fang, X., et al. (2019) Prediction of Extubation Failure for Intensive Care Unit Patients Using Light Gradient Boosting Machine. IEEE Access, 7, 150960-150968. &gt;https://doi.org/10.1109/access.2019.2946980
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref23">
    <label>23</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Soper, D.S. (2021) Greed Is Good: Rapid Hyperparameter Optimization and Model Selection Using Greedy K-Fold Cross Validation. Electronics, 10, Article No. 1973. &gt;https://doi.org/10.3390/electronics10161973
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref24">
    <label>24</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hecht-Nielsen (1989) Theory of the Backpropagation Neural Network. International Joint Conference on Neural Networks, New York, 18-22 June 1989, 593-605. &gt;https://doi.org/10.1109/ijcnn.1989.118638
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref25">
    <label>25</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Li, J., Cheng, J., Shi, J. and Huang, F. (2012) Brief Introduction of Back Propagation (BP) Neural Network Algorithm and Its Improvement. In: Jin, D. and Lin, S., Eds., Advances in Computer Science and Information Engineering, Springer, 553-558. &gt;https://doi.org/10.1007/978-3-642-30223-7_87
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref26">
    <label>26</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Suliman, A. and Zhang, Y. (2015) A Review on Back-Propagation Neural Networks in the Application of Remote Sensing Image Classification. Journal of Earth Science and Engineering, 5, 1. &gt;https://doi.org/10.17265/2159-581x/2015.01.004
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref27">
    <label>27</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Asaad, R.R. and Ali, R.I. (2019) Back Propagation Neural Network (BPNN) and Sigmoid Activation Function in Multi-Layer Networks. Academic Journal of Nawroz University, 8, 216-221. &gt;https://doi.org/10.25007/ajnu.v8n4a464
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref28">
    <label>28</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Dereich, S. and Jentzen, A. (2024) Convergence Rates for the Adam Optimizer.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref29">
    <label>29</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Zhou, J., Cui, G., Hu, S., Zhang, Z., Yang, C., Liu, Z., et al. (2020) Graph Neural Networks: A Review of Methods and Applications. AI Open, 1, 57-81. &gt;https://doi.org/10.1016/j.aiopen.2021.01.001
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref30">
    <label>30</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kipf, T.N. and Welling, M. (2017) Semi-Supervised Classification with Graph Convolutional Networks.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref31">
    <label>31</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hamilton, W.L. (2020) Graph Representation Learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 14, 1-159.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref32">
    <label>32</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C. and Yu, P.S. (2019) A Comprehensive Survey on Graph Neural Networks. Journal of Latex Class Files, 20, 1-22.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref33">
    <label>33</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kilani, B.H. (2024) Kolmogorov-Arnold Networks: Key Developments and Uses.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref34">
    <label>34</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Karlik, B. and Olgac, A.V. (2011) Performance Analysis of Various Activation Functions in Generalized MLP Architectures of Neural Networks. International Journal of Artificial Intelligence and Expert Systems, 1, 111-122.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref35">
    <label>35</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Liu, Z., et al. (2025) KAN: Kolmogorov-Arnold Networks.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref36">
    <label>36</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kohavi, R. (1995) A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, 20-25 August 1995, 1137-1145.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref37">
    <label>37</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Stone, M. (1974) Cross-Validatory Choice and Assessment of Statistical Predictions. Journal of the Royal Statistical Society Series B: Statistical Methodology, 36, 111-133. &gt;https://doi.org/10.1111/j.2517-6161.1974.tb00994.x
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref38">
    <label>38</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hastie, T., Tibshirani, R. and Friedman, J. (2009) The Elements of Statistical Learning Data Mining, Inference, and Prediction. 2nd Edition, Springer. &gt;https://doi.org/10.1007/978-0-387-84858-7
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref39">
    <label>39</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Pedregosa, F., et al. (2011) Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref40">
    <label>40</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Bergstra, J. and Bengio, Y. (2012) Random Search for Hyper-Parameter Optimization. Journal of Machine Learning Research, 13, 281-305.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref41">
    <label>41</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Snoek, J., Larochelle, H. and Adams, R.P. (2012) Practical Bayesian Optimization of Machine Learning Algorithms. Advances in Neural Information Processing Systems, Vol. 5, 2951-2959.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref42">
    <label>42</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Cawley, G.C. and Talbot, N.L.C. (2010) On Over-Fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation. Journal of Machine Learning Research, 11, 2079-2107.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref43">
    <label>43</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Friedman, J.H. (2001) Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics, 29, 1189-1232. &gt;https://doi.org/10.1214/aos/1013203451
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref44">
    <label>44</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning. The MIT Press. &gt;http://www.deeplearningbook.org/
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref45">
    <label>45</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hahn, G.J. (1973) The Coefficient of Determination Exposed! Chemical Technology, 3, 609.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref46">
    <label>46</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Robeson, S.M. and Willmott, C.J. (2023) Decomposition of the Mean Absolute Error (MAE) into Systematic and Unsystematic Components. PLOS ONE, 18, e0279774. &gt;https://doi.org/10.1371/journal.pone.0279774
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref47">
    <label>47</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Schluchter, M.D. (2014) Mean Square Error. Wiley.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref48">
    <label>48</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Prasad, S., Choudhary, B.S. and Mishra, A.K. (2017) Effect of Blast Design Parameters on Blast Induced Rock Fragmentation Size—A Case Study. The International Conference on Deep Excavation, Energy Resources and Production, IIT Kharagpur, 24-26 January 2017, 1-7.
    </mixed-citation>
   </ref>
   <ref id="scirp.146806-ref49">
    <label>49</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Dotto, M.S. and Pourrahimian, Y. (2024) The Influence of Explosive and Rock Mass Properties on Blast Damage in a Single-Hole Blasting. Mining, 4, 168-188. &gt;https://doi.org/10.3390/mining4010011
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>