<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN" "JATS-journalpublishing1-4.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.4" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">jcc</journal-id>
      <journal-title-group>
        <journal-title>Journal of Computer and Communications</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2327-5227</issn>
      <issn pub-type="ppub">2327-5219</issn>
      <publisher>
        <publisher-name>Scientific Research Publishing</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.4236/jcc.2026.146008</article-id>
      <article-id pub-id-type="publisher-id">jcc-152116</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
        <subj-group>
          <subject>Computer Science</subject>
          <subject>Communications</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Adaptive Dynamic K-NN Spatio-Temporal Graph Convolutional Network for Bearing Fault Diagnosis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Hu</surname>
            <given-names>Xiuwei</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Fang</surname>
            <given-names>Ting</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Chang</surname>
            <given-names>Wenhui</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Pang</surname>
            <given-names>Cunrui</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name name-style="western">
            <surname>Wang</surname>
            <given-names>Tonghui</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="aff1"><label>1</label> Zhangjiakou Cigarette Factory Co., Ltd., Zhangjiakou, China </aff>
      <author-notes>
        <fn fn-type="conflict" id="fn-conflict">
          <p>The authors declare no conflicts of interest regarding the publication of this paper.</p>
        </fn>
      </author-notes>
      <pub-date pub-type="epub">
        <day>11</day>
        <month>06</month>
        <year>2026</year>
      </pub-date>
      <pub-date pub-type="collection">
        <month>06</month>
        <year>2026</year>
      </pub-date>
      <volume>14</volume>
      <issue>06</issue>
      <fpage>102</fpage>
      <lpage>114</lpage>
      <history>
        <date date-type="received">
          <day>29</day>
          <month>05</month>
          <year>2026</year>
        </date>
        <date date-type="accepted">
          <day>22</day>
          <month>06</month>
          <year>2026</year>
        </date>
        <date date-type="published">
          <day>25</day>
          <month>06</month>
          <year>2026</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>© 2026 by the authors and Scientific Research Publishing Inc.</copyright-statement>
        <copyright-year>2026</copyright-year>
        <license license-type="open-access">
          <license-p> This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link> ). </license-p>
        </license>
      </permissions>
      <self-uri content-type="doi" xlink:href="https://doi.org/10.4236/jcc.2026.146008">https://doi.org/10.4236/jcc.2026.146008</self-uri>
      <abstract>
        <p>Rolling bearing fault diagnosis is an important means to ensure the reliable operation of mechanical equipment. Aiming at the problems of poor working condition adaptability, over-smoothing in deep networks, and insufficient mining of fault temporal evolution features existing when the traditional K-nearest neighbor graph convolutional network is applied to bearing fault diagnosis, this paper proposes an adaptive dynamic K-NN spatio-temporal graph convolutional network (AdaDKNN-STGCN). Experimental validations are implemented on the CWRU bearing dataset. The proposed model attains a test accuracy of 99.78% in ten-class classification. It exhibits excellent generalization capability and anti-interference performance under noisy and variable load environments, which provides a reliable solution for intelligent fault diagnosis of rotating machinery bearings.</p>
      </abstract>
      <kwd-group kwd-group-type="author-generated" xml:lang="en">
        <kwd>Rolling Bearing</kwd>
        <kwd>Fault Diagnosis</kwd>
        <kwd>Graph Convolutional Network</kwd>
        <kwd>Adaptive Dynamic KNN</kwd>
        <kwd>Spatio-Temporal Feature Fusion</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
      <title>1. Introduction</title>
      <p>Rolling bearings are key vulnerable parts of rotating machinery transmission systems, and their operating conditions largely determine the safety, stability, and service life of the entire mechanical equipment [<xref ref-type="bibr" rid="B1">1</xref>]. In industrial applications, equipment shutdowns caused by bearing failures occur frequently. Accordingly, exploring high-precision and robust intelligent diagnosis methods for rolling bearing faults possesses important practical value for equipment health management and predictive maintenance. </p>
      <p>Traditional bearing fault diagnosis methods rely on manual experience to extract time-domain indicators including kurtosis, root mean square, and margin index, as well as frequency-domain spectral features [<xref ref-type="bibr" rid="B2">2</xref>]. Fault classification is then realized by combining the support vector machine (SVM) and the back propagation (BP) neural network. Such methods are highly dependent on handcrafted features and suffer from poor generalization ability, making them difficult to adapt to complex industrial noise environments [<xref ref-type="bibr" rid="B3">3</xref>]. With the development of deep learning, end-to-end fault diagnosis methods based on convolutional neural networks (CNN) and recurrent neural networks (LSTM/GRU) have become the mainstream research direction [<xref ref-type="bibr" rid="B4">4</xref>]. CNN performs well in local feature extraction but fails to model the long-range dependencies of signal sequences [<xref ref-type="bibr" rid="B5">5</xref>]. Although LSTM is capable of mining temporal information, it ignores the topological correlation among samples and shows limited performance in characterizing weak faults with vibration impulses. </p>
      <p>Graph Neural Networks (GNNs) are capable of processing non-Euclidean data. They can convert one-dimensional vibration sequences into graph-structured data to mine correlation information between samples, and have been widely applied to bearing fault diagnosis in recent years [<xref ref-type="bibr" rid="B6">6</xref>]. Most existing graph network-based diagnosis methods construct static graph structures following the fixed K-Nearest Neighbor (K-NN) rule, where a fixed number of adjacent nodes is assigned to all sample nodes. Feature learning is then implemented by using basic Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT) [<xref ref-type="bibr" rid="B7">7</xref>]-[<xref ref-type="bibr" rid="B9">9</xref>]. Although the fixed KNN-GCN method achieves higher diagnosis accuracy than traditional deep learning models, it still suffers from a rigid graph structure, separated spatio-temporal information, and deep-layer over-smoothing. </p>
      <p>To address the aforementioned research limitations, this paper proposes AdaDKNN-STGCN. The main contributions are summarized as follows: </p>
      <p>1) A dynamic weighted K-NN graph construction algorithm is proposed. It integrates feature similarity and temporal distance to build an adaptive adjacency matrix. Breaking away from the traditional fixed-K strategy, the method realizes adaptive updates of graph topology under different working conditions. </p>
      <p>2) A spatio-temporal dual-branch graph convolution architecture is constructed. The spatial branch explores topological correlations among nodes, while the temporal branch extracts temporal evolution features, realizing the fusion of complete spatio-temporal information of vibration signals. </p>
      <p>3) Residual skip connections and a node attention mechanism are introduced, which effectively alleviate the over-smoothing problem of graph convolution and enhance the extraction capability of weak fault features. </p>
    </sec>
    <sec id="sec2">
      <title>2. Related Theoretical Foundations</title>
      <sec id="sec2dot1">
        <title>2.1. Vibration Signal Characteristics of Rolling Bearings</title>
        <p>Rolling bearing faults are mainly divided into four categories: normal state, inner race fault, outer race fault, and rolling element fault. Different damage sizes correspond to various fault severity levels [<xref ref-type="bibr" rid="B10">10</xref>]. Bearing vibration signals are non-stationary impact time-series signals, and fault impacts manifest as amplitude fluctuations in the time domain and resonance peak shifts in the frequency domain [<xref ref-type="bibr" rid="B11">11</xref>]. <xref ref-type="fig" rid="fig1">Figure 1</xref> presents the time-frequency domain analysis of vibration signals collected from bearings with ball faults. The characteristics of these signals serve as the basis for the time-frequency fusion features extracted by the model, and also provide physical support for subsequent fault diagnosis based on spatio-temporal graph convolution. Therefore, combining time-domain statistical features and frequency-domain spectral features can fully characterize fault information.</p>
        <fig id="fig1">
          <label>Figure 1</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId15.jpeg?20260625032433" />
        </fig>
        <p><bold>Figure 1.</bold>Time-domain and frequency-domain analysis of typical fault signals.</p>
      </sec>
      <sec id="sec2dot2">
        <title>2.2. Fundamentals of Graph Convolutional Networks</title>
        <p>As a dominant graph neural network for non-Euclidean graph-structured data, GCN is capable of capturing node attributes and topological correlations, which remedies the drawback of conventional CNNs limited to regular data [<xref ref-type="bibr" rid="B12">12</xref>][<xref ref-type="bibr" rid="B13">13</xref>].</p>
        <p>A graph <inline-formula><mml:math display="inline"><mml:mi> G </mml:mi></mml:math></inline-formula> is formally defined as <inline-formula><mml:math display="inline"><mml:mrow><mml:mi> G </mml:mi><mml:mo> = </mml:mo><mml:mrow><mml:mo> ( </mml:mo><mml:mrow><mml:mi> V </mml:mi><mml:mo> , </mml:mo><mml:mi> E </mml:mi></mml:mrow><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> , where <inline-formula><mml:math display="inline"><mml:mi> V </mml:mi></mml:math></inline-formula> . represents the set of nodes and <italic>E</italic> denotes the set of edges. The core idea of graph convolution is to aggregate neighborhood features and perform nonlinear transformations, so as to fuse each node’s own features and its topological neighborhood features, and finally generate node representations containing both attribute and structural information. </p>
      </sec>
      <sec id="sec2dot3">
        <title>2.3. Traditional KNN Graph Construction Method</title>
        <p>As a typical approach to constructing graphs from unstructured data, KNN graph building screens neighbors based on node feature similarity and creates topological connections [<xref ref-type="bibr" rid="B14">14</xref>]-[<xref ref-type="bibr" rid="B17">17</xref>]. It converts raw feature data into graph data and is extensively used in topology-free deep learning tasks. </p>
        <p>Firstly, we compute the feature similarity across all nodes using Euclidean or cosine distance. Next, each node picks its top-K most similar neighbors [<xref ref-type="bibr" rid="B10">10</xref>]. Eventually, undirected edges are created to build the entire graph topology. The calculation formula for Euclidean distance is as follows: </p>
        <disp-formula id="FD1">
          <label>(1)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:mi>d</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:msub>
                    <mml:mi>x</mml:mi>
                    <mml:mi>i</mml:mi>
                  </mml:msub>
                  <mml:mo>,</mml:mo>
                  <mml:msub>
                    <mml:mi>x</mml:mi>
                    <mml:mi>j</mml:mi>
                  </mml:msub>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>=</mml:mo>
              <mml:msqrt>
                <mml:mrow>
                  <mml:mstyle displaystyle="true">
                    <mml:msubsup>
                      <mml:mo>∑</mml:mo>
                      <mml:mrow>
                        <mml:mi>c</mml:mi>
                        <mml:mo>=</mml:mo>
                        <mml:mn>1</mml:mn>
                      </mml:mrow>
                      <mml:mi>C</mml:mi>
                    </mml:msubsup>
                    <mml:mrow>
                      <mml:msup>
                        <mml:mrow>
                          <mml:mrow>
                            <mml:mo>(</mml:mo>
                            <mml:mrow>
                              <mml:msub>
                                <mml:mi>x</mml:mi>
                                <mml:mrow>
                                  <mml:mi>i</mml:mi>
                                  <mml:mi>c</mml:mi>
                                </mml:mrow>
                              </mml:msub>
                              <mml:mo>−</mml:mo>
                              <mml:msub>
                                <mml:mi>x</mml:mi>
                                <mml:mrow>
                                  <mml:mi>j</mml:mi>
                                  <mml:mi>c</mml:mi>
                                </mml:mrow>
                              </mml:msub>
                            </mml:mrow>
                            <mml:mo>)</mml:mo>
                          </mml:mrow>
                        </mml:mrow>
                        <mml:mn>2</mml:mn>
                      </mml:msup>
                    </mml:mrow>
                  </mml:mstyle>
                </mml:mrow>
              </mml:msqrt>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>In the formula, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mi> i </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mi> j </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> denote the feature vectors of node <italic>i</italic> and node <italic>j</italic> respectively, and <italic>C</italic> stands for the feature dimension. A smaller distance value indicates higher similarity between node features. </p>
        <p>Compared to random and fully-connected graph construction methods, traditional KNN graphing is structurally simple and computationally efficient, and can establish semantically meaningful topologies by capturing feature-level relationships [<xref ref-type="bibr" rid="B18">18</xref>]-[<xref ref-type="bibr" rid="B21">21</xref>]. However, it relies solely on original feature similarity while ignoring global data distribution and implicit connections. The resulting locally optimal topology will impair the learning performance of graph convolution.</p>
      </sec>
    </sec>
    <sec id="sec3">
      <title>3. Dynamic KNN Spatio-Temporal Graph Convolutional Network</title>
      <sec id="sec3dot1">
        <title>3.1. Overall Model Framework</title>
        <fig id="fig2">
          <label>Figure 2</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId28.jpeg?20260625032434" />
        </fig>
        <p><bold>Figure 2.</bold> Structure of the proposed model. </p>
        <p>The overall architecture of the proposed model is illustrated in <xref ref-type="fig" rid="fig2">Figure 2</xref>, which consists of five parts: data input layer, spatial-temporal dual-branch graph convolution layer (STDBGCN Layer), residual attention layer (RA Layer), feature fusion layer (FF Layer), and fault classification output layer (FCO Layer). </p>
        <p>The raw one-dimensional vibration time-series signals of rolling bearings are fed into the model. The original data are divided into samples via sliding window segmentation, and fused time-domain and frequency-domain features are extracted as node features of the graph. A dynamic weighted K-NN algorithm combining temporal distance and cosine similarity is adopted to generate the adaptive adjacency matrix. The spatial-temporal dual-branch network learns spatial and temporal features in parallel, and residual connections together with the node attention module are used to optimize feature representation. Finally, global pooling and fully connected layers in the fault classification output layer realize the classification of bearing fault patterns.</p>
      </sec>
      <sec id="sec3dot2">
        <title>3.2. Vibration Signal Preprocessing and Multi-Domain Node Feature Construction</title>
        <p>The original vibration acceleration signals are segmented via non-overlapping sliding windows. The length of each sample is set to <inline-formula><mml:math><mml:math xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math" xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi> L </mml:mi></mml:math></mml:math></inline-formula>, and each window segment is regarded as an individual node of the graph network. </p>
        <p>We extract time-domain features for each signal segment, including mean value, standard deviation, peak value, kurtosis, waveform indicator, and impulse indicator. Meanwhile, frequency-domain features such as spectral centroid, mean square frequency, and variance frequency are also computed. The concatenation of time-domain and frequency-domain features yields a high-dimensional node feature vector <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mi> i </mml:mi></mml:msub><mml:mo> ∈ </mml:mo><mml:msup><mml:mi> ℝ </mml:mi><mml:mi> d </mml:mi></mml:msup></mml:mrow></mml:math></inline-formula> , which realizes the mapping from raw non-sequential signals to graph node features. </p>
        <p>The feature cosine similarity between node <inline-formula><mml:math display="inline"><mml:mi> i </mml:mi></mml:math></inline-formula> and node <inline-formula><mml:math display="inline"><mml:mi> j </mml:mi></mml:math></inline-formula> is defined as: </p>
        <disp-formula id="FD2">
          <label>(2)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:mi>S</mml:mi>
              <mml:mi>i</mml:mi>
              <mml:msub>
                <mml:mi>m</mml:mi>
                <mml:mrow>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:msub>
                    <mml:mi>x</mml:mi>
                    <mml:mi>i</mml:mi>
                  </mml:msub>
                  <mml:mo>⋅</mml:mo>
                  <mml:msub>
                    <mml:mi>x</mml:mi>
                    <mml:mi>j</mml:mi>
                  </mml:msub>
                </mml:mrow>
                <mml:mrow>
                  <mml:mrow>
                    <mml:mo>‖</mml:mo>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>x</mml:mi>
                        <mml:mi>i</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>‖</mml:mo>
                  </mml:mrow>
                  <mml:mrow>
                    <mml:mo>‖</mml:mo>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>x</mml:mi>
                        <mml:mi>j</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>‖</mml:mo>
                  </mml:mrow>
                </mml:mrow>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>The temporal position distance between node <inline-formula><mml:math display="inline"><mml:mi> i </mml:mi></mml:math></inline-formula> and node <inline-formula><mml:math display="inline"><mml:mi> j </mml:mi></mml:math></inline-formula> is defined as: </p>
        <disp-formula id="FD3">
          <label>(3)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:mi>D</mml:mi>
              <mml:mi>i</mml:mi>
              <mml:msub>
                <mml:mi>s</mml:mi>
                <mml:mrow>
                  <mml:mi>t</mml:mi>
                  <mml:mo>,</mml:mo>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mn>1</mml:mn>
                <mml:mrow>
                  <mml:mrow>
                    <mml:mo>|</mml:mo>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>t</mml:mi>
                        <mml:mi>i</mml:mi>
                      </mml:msub>
                      <mml:mo>−</mml:mo>
                      <mml:msub>
                        <mml:mi>t</mml:mi>
                        <mml:mi>j</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>|</mml:mo>
                  </mml:mrow>
                  <mml:mo>+</mml:mo>
                  <mml:mi>ε</mml:mi>
                </mml:mrow>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Here, <inline-formula><mml:math display="inline"><mml:mi> ε </mml:mi></mml:math></inline-formula> is a small constant preventing a zero denominator. <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> t </mml:mi><mml:mi> i </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> t </mml:mi><mml:mi> j </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are sample temporal indices. Linear normalization is performed on node feature similarity and temporal position distance, respectively, to scale them into the range of [0, 1]. </p>
        <disp-formula id="FD4">
          <label>(4)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mrow>
                  <mml:mover accent="true">
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:mi>m</mml:mi>
                    </mml:mrow>
                    <mml:mo stretchy="true">˜</mml:mo>
                  </mml:mover>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mi>S</mml:mi>
                  <mml:mi>i</mml:mi>
                  <mml:msub>
                    <mml:mi>m</mml:mi>
                    <mml:mrow>
                      <mml:mi>i</mml:mi>
                      <mml:mi>j</mml:mi>
                    </mml:mrow>
                  </mml:msub>
                  <mml:mo>−</mml:mo>
                  <mml:mi>min</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>m</mml:mi>
                        <mml:mrow>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>max</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>m</mml:mi>
                        <mml:mrow>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                  <mml:mo>−</mml:mo>
                  <mml:mi>min</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>m</mml:mi>
                        <mml:mrow>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <disp-formula id="FD5">
          <label>(5)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mrow>
                  <mml:mover accent="true">
                    <mml:mrow>
                      <mml:mi>D</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:mi>s</mml:mi>
                    </mml:mrow>
                    <mml:mo stretchy="true">˜</mml:mo>
                  </mml:mover>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>t</mml:mi>
                  <mml:mo>,</mml:mo>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mi>D</mml:mi>
                  <mml:mi>i</mml:mi>
                  <mml:msub>
                    <mml:mi>s</mml:mi>
                    <mml:mrow>
                      <mml:mi>t</mml:mi>
                      <mml:mo>,</mml:mo>
                      <mml:mi>i</mml:mi>
                      <mml:mi>j</mml:mi>
                    </mml:mrow>
                  </mml:msub>
                  <mml:mo>−</mml:mo>
                  <mml:mi>min</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>D</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>s</mml:mi>
                        <mml:mrow>
                          <mml:mi>t</mml:mi>
                          <mml:mo>,</mml:mo>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>max</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>D</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>s</mml:mi>
                        <mml:mrow>
                          <mml:mi>t</mml:mi>
                          <mml:mo>,</mml:mo>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                  <mml:mo>−</mml:mo>
                  <mml:mi>min</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:mi>D</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:msub>
                        <mml:mi>s</mml:mi>
                        <mml:mrow>
                          <mml:mi>t</mml:mi>
                          <mml:mo>,</mml:mo>
                          <mml:mi>a</mml:mi>
                          <mml:mi>b</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>where <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi> S </mml:mi><mml:mi> i </mml:mi><mml:mi> m </mml:mi></mml:mrow><mml:mo stretchy="true"> ˜ </mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi> i </mml:mi><mml:mi> j </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi> D </mml:mi><mml:mi> i </mml:mi><mml:mi> s </mml:mi></mml:mrow><mml:mo stretchy="true"> ˜ </mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi> t </mml:mi><mml:mo> , </mml:mo><mml:mi> i </mml:mi><mml:mi> j </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> denote the normalized feature similarity and temporal distance, respectively. The adjacency weights are obtained after weighted fusion. The overall weighted fusion distance is given by: </p>
        <disp-formula id="FD6">
          <label>(6)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mi>S</mml:mi>
                <mml:mrow>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mi>λ</mml:mi>
              <mml:msub>
                <mml:mrow>
                  <mml:mover accent="true">
                    <mml:mrow>
                      <mml:mi>S</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:mi>m</mml:mi>
                    </mml:mrow>
                    <mml:mo stretchy="true">˜</mml:mo>
                  </mml:mover>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>+</mml:mo>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mn>1</mml:mn>
                  <mml:mo>−</mml:mo>
                  <mml:mi>λ</mml:mi>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>⋅</mml:mo>
              <mml:msub>
                <mml:mrow>
                  <mml:mover accent="true">
                    <mml:mrow>
                      <mml:mi>D</mml:mi>
                      <mml:mi>i</mml:mi>
                      <mml:mi>s</mml:mi>
                    </mml:mrow>
                    <mml:mo stretchy="true">˜</mml:mo>
                  </mml:mover>
                </mml:mrow>
                <mml:mrow>
                  <mml:mi>t</mml:mi>
                  <mml:mo>,</mml:mo>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Here, <inline-formula><mml:math display="inline"><mml:mi> λ </mml:mi></mml:math></inline-formula> denotes the weight balancing coefficient. To satisfy the requirements of graph convolution operations for the adjacency matrix, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> S </mml:mi><mml:mrow><mml:mi> i </mml:mi><mml:mi> j </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is normalized via the Softmax function. Here, <italic>k</italic> represents the indices of all neighboring nodes of node <italic>i</italic>, which means the summation is conducted over all <italic>j</italic> corresponding to node <italic>i</italic>. </p>
        <disp-formula id="FD7">
          <label>(7)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mover accent="true">
                  <mml:mi>S</mml:mi>
                  <mml:mo>^</mml:mo>
                </mml:mover>
                <mml:mrow>
                  <mml:mi>i</mml:mi>
                  <mml:mi>j</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mi>exp</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>S</mml:mi>
                        <mml:mrow>
                          <mml:mi>i</mml:mi>
                          <mml:mi>j</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
                <mml:mrow>
                  <mml:mstyle displaystyle="true">
                    <mml:msub>
                      <mml:mo>∑</mml:mo>
                      <mml:mi>k</mml:mi>
                    </mml:msub>
                    <mml:mrow>
                      <mml:mi>exp</mml:mi>
                      <mml:mrow>
                        <mml:mo>(</mml:mo>
                        <mml:mrow>
                          <mml:msub>
                            <mml:mi>S</mml:mi>
                            <mml:mrow>
                              <mml:mi>i</mml:mi>
                              <mml:mi>k</mml:mi>
                            </mml:mrow>
                          </mml:msub>
                        </mml:mrow>
                        <mml:mo>)</mml:mo>
                      </mml:mrow>
                    </mml:mrow>
                  </mml:mstyle>
                </mml:mrow>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Traditional methods adopt a globally fixed <italic>K</italic>. In this work, the number of neighbors is adaptively adjusted according to the density of sample features. </p>
        <disp-formula id="FD8">
          <label>(8)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mi>K</mml:mi>
                <mml:mi>i</mml:mi>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:msub>
                <mml:mi>K</mml:mi>
                <mml:mrow>
                  <mml:mi>b</mml:mi>
                  <mml:mi>a</mml:mi>
                  <mml:mi>s</mml:mi>
                  <mml:mi>e</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>+</mml:mo>
              <mml:mi>η</mml:mi>
              <mml:mo>⋅</mml:mo>
              <mml:mfrac>
                <mml:mrow>
                  <mml:mstyle displaystyle="true">
                    <mml:msub>
                      <mml:mo>∑</mml:mo>
                      <mml:mi>j</mml:mi>
                    </mml:msub>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>S</mml:mi>
                        <mml:mrow>
                          <mml:mi>i</mml:mi>
                          <mml:mi>j</mml:mi>
                        </mml:mrow>
                      </mml:msub>
                    </mml:mrow>
                  </mml:mstyle>
                </mml:mrow>
                <mml:mi>N</mml:mi>
              </mml:mfrac>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Here, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> K </mml:mi><mml:mrow><mml:mi> b </mml:mi><mml:mi> a </mml:mi><mml:mi> s </mml:mi><mml:mi> e </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> stands for the base neighbor count, <inline-formula><mml:math display="inline"><mml:mi> η </mml:mi></mml:math></inline-formula> is the regulation coefficient, and <italic>N</italic> is the total node number. Higher feature similarity corresponds to more adaptive neighbors, forming a dynamic topology. </p>
      </sec>
      <sec id="sec3dot3">
        <title>3.3. Spatio-Temporal Dual-Branch Graph Convolution Module</title>
        <p>The spatial branch adopts a multi-head graph attention network to explore fault topological correlations among nodes. It performs spatial aggregation on the features of dynamic graph nodes and learns the common and distinctive characteristics of samples. </p>
        <p>Vibration signals exhibit continuous temporal evolution characteristics. In this paper, gated temporal convolution is introduced to model node sequences in the time dimension, so as to capture the time-varying dependencies of fault impacts and compensate for the lack of temporal information in pure spatial graph networks. </p>
        <disp-formula id="FD9">
          <label>(9)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mi>X</mml:mi>
                <mml:mi>T</mml:mi>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mi>C</mml:mi>
              <mml:mi>o</mml:mi>
              <mml:mi>n</mml:mi>
              <mml:mi>v</mml:mi>
              <mml:mn>1</mml:mn>
              <mml:mi>d</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:msub>
                    <mml:mi>x</mml:mi>
                    <mml:mi>t</mml:mi>
                  </mml:msub>
                  <mml:mo>,</mml:mo>
                  <mml:msub>
                    <mml:mi>W</mml:mi>
                    <mml:mi>f</mml:mi>
                  </mml:msub>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
              <mml:mo>⊙</mml:mo>
              <mml:mi>σ</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:mi>C</mml:mi>
                  <mml:mi>o</mml:mi>
                  <mml:mi>n</mml:mi>
                  <mml:mi>v</mml:mi>
                  <mml:mn>1</mml:mn>
                  <mml:mi>d</mml:mi>
                  <mml:mrow>
                    <mml:mo>(</mml:mo>
                    <mml:mrow>
                      <mml:msub>
                        <mml:mi>x</mml:mi>
                        <mml:mi>t</mml:mi>
                      </mml:msub>
                      <mml:mo>,</mml:mo>
                      <mml:msub>
                        <mml:mi>W</mml:mi>
                        <mml:mi>g</mml:mi>
                      </mml:msub>
                    </mml:mrow>
                    <mml:mo>)</mml:mo>
                  </mml:mrow>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Here, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> x </mml:mi><mml:mi> t </mml:mi></mml:msub><mml:mo> ∈ </mml:mo><mml:msup><mml:mi> ℝ </mml:mi><mml:mrow><mml:mi> L </mml:mi><mml:mo> × </mml:mo><mml:msub><mml:mi> d </mml:mi><mml:mi> S </mml:mi></mml:msub></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> . <inline-formula><mml:math display="inline"><mml:mi> L </mml:mi></mml:math></inline-formula> is the temporal length, <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> d </mml:mi><mml:mi> S </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the output feature dimension of the spatial branch. <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> W </mml:mi><mml:mi> f </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> W </mml:mi><mml:mi> g </mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are one-dimensional convolution kernels for the feature branch and gating branch, and <inline-formula><mml:math display="inline"><mml:mrow><mml:mi> σ </mml:mi><mml:mrow><mml:mo> ( </mml:mo><mml:mo> ⋅ </mml:mo><mml:mo> ) </mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> is the Sigmoid activation function. </p>
        <p>The spatial and temporal feature vectors are concatenated along the channel dimension to construct the global spatio-temporal feature representation. The fused feature is defined as: </p>
        <disp-formula id="FD10">
          <label>(10)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mi>X</mml:mi>
                <mml:mrow>
                  <mml:mi>S</mml:mi>
                  <mml:mi>T</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:mi>C</mml:mi>
              <mml:mi>o</mml:mi>
              <mml:mi>n</mml:mi>
              <mml:mi>c</mml:mi>
              <mml:mi>a</mml:mi>
              <mml:mi>t</mml:mi>
              <mml:mrow>
                <mml:mo>(</mml:mo>
                <mml:mrow>
                  <mml:msub>
                    <mml:mi>X</mml:mi>
                    <mml:mi>S</mml:mi>
                  </mml:msub>
                  <mml:mo>,</mml:mo>
                  <mml:msub>
                    <mml:mi>X</mml:mi>
                    <mml:mi>T</mml:mi>
                  </mml:msub>
                  <mml:mo>;</mml:mo>
                  <mml:mi>dim</mml:mi>
                  <mml:mo>=</mml:mo>
                  <mml:mi>c</mml:mi>
                  <mml:mi>h</mml:mi>
                  <mml:mi>a</mml:mi>
                  <mml:mi>n</mml:mi>
                  <mml:mi>n</mml:mi>
                  <mml:mi>e</mml:mi>
                  <mml:mi>l</mml:mi>
                </mml:mrow>
                <mml:mo>)</mml:mo>
              </mml:mrow>
            </mml:mrow>
          </mml:math>
        </disp-formula>
      </sec>
      <sec id="sec3dot4">
        <title>3.4. Residual Attention Module</title>
        <p>To address the over-smoothing problem of deep graph convolution, we design residual skip connections and a node attention weighting module. </p>
        <disp-formula id="FD11">
          <label>(11)</label>
          <mml:math display="inline">
            <mml:mrow>
              <mml:msub>
                <mml:mi>H</mml:mi>
                <mml:mrow>
                  <mml:mi>o</mml:mi>
                  <mml:mi>u</mml:mi>
                  <mml:mi>t</mml:mi>
                </mml:mrow>
              </mml:msub>
              <mml:mo>=</mml:mo>
              <mml:msub>
                <mml:mi>H</mml:mi>
                <mml:mn>1</mml:mn>
              </mml:msub>
              <mml:mo>+</mml:mo>
              <mml:msub>
                <mml:mi>H</mml:mi>
                <mml:mrow>
                  <mml:mi>l</mml:mi>
                  <mml:mo>−</mml:mo>
                  <mml:mn>1</mml:mn>
                </mml:mrow>
              </mml:msub>
              <mml:mo>+</mml:mo>
              <mml:mi>α</mml:mi>
              <mml:msub>
                <mml:mi>H</mml:mi>
                <mml:mrow>
                  <mml:mi>g</mml:mi>
                  <mml:mi>c</mml:mi>
                  <mml:mi>n</mml:mi>
                </mml:mrow>
              </mml:msub>
            </mml:mrow>
          </mml:math>
        </disp-formula>
        <p>Shallow original features are directly delivered to deep networks via skip connections to avoid feature homogenization after multi-layer aggregation. The node attention mechanism amplifies important fault features by weighting, suppresses noisy and redundant information, and enhances the representation of weak faults. </p>
      </sec>
      <sec id="sec3dot5">
        <title>3.5. Fault Classification Output Layer</title>
        <p>The fused spatio-temporal features are processed by global average pooling and then fed into the fully connected neural network. The Softmax activation function outputs the probability of each fault category to implement fault classification. </p>
      </sec>
    </sec>
    <sec id="sec4">
      <title>4. Experiments and Analysis</title>
      <sec id="sec4dot1">
        <title>4.1. Datasets</title>
        <p>We use the CWRU rolling bearing dataset from Case Western Reserve University. The CWRU dataset consists of vibration time series of various rolling bearing conditions collected from the test rig, as shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The test bearing is the 6205-2RS deep groove ball bearing with a sampling frequency of 12 kHz. Four load conditions are set, namely 0 hp, 1 hp, 2 hp, and 3 hp. The fault types include normal condition (NC), inner race fault (IR), outer race fault (OR), and ball fault (B). Fault defect sizes are 0.1778 mm, 0.3556 mm, and 0.5334 mm. The ten fault modes are shown in <bold>Table 1</bold>. To strictly meet the generalization requirements for industrial applications, the training, validation, and test sets are divided according to experimental working conditions and acquisition batches. Specifically, the data collected under 1 hp and 2 hp loads are used as the training and validation sets, while the data from the independent 3 hp working condition is adopted as the test set. All sample windows segmented from the original vibration signals are assigned to a single dataset only, without distribution across the training and test sets. The dataset is split into a training set (70%), a validation set (20%), and a test set (10%).</p>
        <fig id="fig3">
          <label>Figure 3</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId87.jpeg?20260625032437" />
        </fig>
        <p><bold>Figure 3.</bold> Test rig for rolling bearings. </p>
        <p><bold>Table 1.</bold> Classification information of rolling bearings.</p>
        <table-wrap id="tbl1">
          <label>Table 1</label>
          <table>
            <tbody>
              <tr>
                <td>Fault Name</td>
                <td>Label</td>
                <td>Fault Location</td>
                <td>Bearing Damage Level (mm)</td>
                <td>Sample Collection</td>
              </tr>
              <tr>
                <td>Normal</td>
                <td>0</td>
                <td>Normal Bearing</td>
                <td>0</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>007-BALL</td>
                <td>1</td>
                <td>Ball Fault</td>
                <td>0.18</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>007-IR</td>
                <td>2</td>
                <td>Inner Race Fault</td>
                <td>0.18</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>007-OUT</td>
                <td>3</td>
                <td>Outer Race Fault</td>
                <td>0.18</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>014-BALL</td>
                <td>4</td>
                <td>Ball Fault</td>
                <td>0.36</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>014-IR</td>
                <td>5</td>
                <td>Inner Race Fault</td>
                <td>0.36</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>014-OUT</td>
                <td>6</td>
                <td>Outer Race Fault</td>
                <td>0.36</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>021-BALL</td>
                <td>7</td>
                <td>Ball Fault</td>
                <td>0.54</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>021-IR</td>
                <td>8</td>
                <td>Inner Race Fault</td>
                <td>0.54</td>
                <td>3000</td>
              </tr>
              <tr>
                <td>021-OUT</td>
                <td>9</td>
                <td>Outer Race Fault</td>
                <td>0.54</td>
                <td>3000</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
      </sec>
      <sec id="sec4dot2">
        <title>4.2. Data Preprocessing</title>
        <p>Sliding window segmentation is applied to the original vibration signals to construct training samples. The sliding window length is set to 1024 sampling points with a sliding step of 512 sampling points, corresponding to an overlap rate of 50%, which fully preserves the temporal characteristics of fault impulses. Variational Mode Decomposition (VMD) is used for signal denoising and reconstruction. Afterwards, each sample window is standardized to zero mean and unit variance to eliminate dimensional and amplitude interference . </p>
      </sec>
      <sec id="sec4dot3">
        <title>4.3. Baseline Models</title>
        <p>Four mainstream methods are selected for comparative experiments: </p>
        <p>1) SVM: A classic traditional machine learning algorithm. It constructs an optimal hyperplane to achieve sample classification, and performs well on small-scale datasets. It is widely used in traditional bearing fault diagnosis as a benchmark method. </p>
        <p>2) LSTM: A typical recurrent neural network. It is designed to capture long-range temporal dependencies of time-series signals and is widely applied to vibration signal analysis and time-series fault classification tasks. </p>
        <p>3) KNN-GCN: It first constructs a graph structure based on K-nearest neighbor relationships of samples, then leverages graph convolution to aggregate node features. This method combines traditional distance metric learning with graph neural networks. </p>
        <p>4) GAT: It introduces the attention mechanism into graph learning, adaptively assigns different weights to neighboring nodes during feature aggregation, and effectively highlights important associative relationships between nodes. </p>
      </sec>
      <sec id="sec4dot4">
        <title>4.4. Hyperparameters</title>
        <p>All baseline models adopt the same dataset partition. The Adam optimizer is used in the experiments with a learning rate of 0.001 and a batch size of 32. The total training epochs are set to 100. CrossEntropyLoss is selected as the loss function, and the basic adjacency number <inline-formula><mml:math display="inline"><mml:mrow><mml:msub><mml:mi> K </mml:mi><mml:mrow><mml:mi> b </mml:mi><mml:mi> a </mml:mi><mml:mi> s </mml:mi><mml:mi> e </mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is set to 8. </p>
      </sec>
      <sec id="sec4dot5">
        <title>4.5. Experimental Results</title>
        <p>Multi-class fault diagnosis experiments are conducted on the bearing fault dataset. We comprehensively evaluate the overall performance of the proposed AdaDKNN-STGCN model from multiple perspectives, including model convergence, classification accuracy, confusion matrix , and feature visualization. Meanwhile, comparative analysis is carried out against mainstream algorithms such as SVM, LSTM, KNN-GCN, and GAT. </p>
        <p>As shown in <bold>Table 2</bold>, the proposed AdaDKNN-STGCN model achieves an accuracy of 99.78% in the 10-class fault diagnosis task, which is remarkably higher than that of the comparative models including SVM, LSTM, KNN-GCN, and GAT. The standard deviation is merely 0.08, indicating the optimal stability of the model. According to the paired t-test for significance, the p-values between all comparative models and the proposed model are less than 0.05. At the 95% confidence level, the performance difference is statistically significant. It is statistically verified that the diagnostic performance of the proposed model is remarkably superior to existing mainstream algorithms. This verifies the effectiveness and superiority of the proposed method for fault diagnosis. </p>
        <p><bold>Table 2</bold><bold>.</bold>Table comparison of fault diagnosis accuracy of different models.</p>
        <table-wrap id="tbl2">
          <label>Table 2</label>
          <table>
            <tbody>
              <tr>
                <td>Model</td>
                <td>SVM</td>
                <td>LSTM</td>
                <td>KNN-GCN</td>
                <td>GAT</td>
                <td>AdaDKNN-STGCN (Ours)</td>
              </tr>
              <tr>
                <td>Accuracy (%)</td>
                <td>62.33</td>
                <td>57.27</td>
                <td>60.87</td>
                <td>95.67</td>
                <td>
                  <bold>99</bold>
                  <bold>.</bold>
                  <bold>78</bold>
                </td>
              </tr>
              <tr>
                <td>Std (5 runs)</td>
                <td>0.31</td>
                <td>0.35</td>
                <td>0.28</td>
                <td>0.22</td>
                <td>
                  <bold>0.08</bold>
                </td>
              </tr>
              <tr>
                <td>95% Confidence Interval (CI)</td>
                <td>[62.02, 62.64]</td>
                <td>[56.99, 57.55]</td>
                <td>[60.62, 61.12]</td>
                <td>[95.47, 95.87]</td>
                <td>[99.72, 99.82]</td>
              </tr>
              <tr>
                <td>p-value</td>
                <td>p &lt; 0.001</td>
                <td>p &lt; 0.001</td>
                <td>p &lt; 0.001</td>
                <td>p &lt; 0.001</td>
                <td>-</td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p>The training and validation accuracy curves of different models are presented in <xref ref-type="fig" rid="fig4">Figure 4</xref>. It can be observed that AdaDKNN-STGCN converges faster. In addition, the training and validation accuracy curves of the model closely overlap, indicating its outstanding generalization ability and effective suppression of overfitting. <xref ref-type="fig" rid="fig5">Figure 5</xref> illustrates the loss curve of AdaDKNN-STGCN, which further demonstrates stable training performance. The classification details are analyzed via the confusion matrix in <xref ref-type="fig" rid="fig6">Figure 6</xref>. The results reveal that the proposed model can accurately identify samples under normal conditions as well as bearing samples with different damage severities and fault locations.</p>
        <p>To explore the feature learning capability of the model, t-SNE is adopted to reduce the dimensionality of high-dimensional features from the output layer for 2D visualization, as shown in <xref ref-type="fig" rid="fig7">Figure 7</xref>. The feature distribution shows that samples of the same fault category are highly clustered in the feature space. This proves that AdaDKNN-STGCN can fully extract deep features from vibration signals and effectively distinguish different fault types and damage degrees. The excellent feature representation capability lays a solid foundation for the high diagnosis accuracy of the model.</p>
        <fig id="fig4">
          <label>Figure 4</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId90.jpeg?20260625032438" />
        </fig>
        <p><bold>Figure 4.</bold> Comparison curves of training and validation accuracy of models.</p>
        <fig id="fig5">
          <label>Figure 5</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId91.jpeg?20260625032438" />
        </fig>
        <p><bold>Figure 5.</bold> Training and validation curve for loss.</p>
        <fig id="fig6">
          <label>Figure 6</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId92.jpeg?20260625032438" />
        </fig>
        <p><bold>Figure 6.</bold> Confusion matrix for each category. </p>
        <fig id="fig7">
          <label>Figure 7</label>
          <graphic xlink:href="https://html.scirp.org/file/1733577-rId93.jpeg?20260625032438" />
        </fig>
        <p><bold>Figure 7.</bold>Visualization based on t-SNE algorithm. </p>
      </sec>
    </sec>
    <sec id="sec5">
      <title>5. Conclusion</title>
      <p>Aiming at the drawbacks of traditional K-nearest neighbor graph convolutional networks in bearing fault diagnosis, such as poor working condition adaptability, feature over-smoothing, and insufficient mining of temporal features, this paper proposes an AdaDKNN-STGCN model. The model constructs graph nodes by fusing time-frequency features and adopts a spatio-temporal dual-branch structure to simultaneously learn the spatial topology and temporal evolution features of signals. Experimental results demonstrate that the proposed model achieves an accuracy of 99.78% on the 10-classification task and presents excellent generalization and robustness under complex working conditions. This method effectively remedies the deficiencies of traditional algorithms and provides a new technical solution for intelligent fault diagnosis of rotating machinery. </p>
    </sec>
    <sec id="sec6">
      <title>Acknowledgements</title>
      <p>This research was partially supported by the Science and Technology Project of China Tobacco Hebei Industrial Co., Ltd. (Grant No. HBZY2026A016).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      <ref id="B1">
        <label>1.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Ni, Q., Ji, J.C., Halkon, B., Feng, K. and Nandi, A.K. (2023) Physics-informed Residual Network (PIResNet) for Rolling Element Bearing Fault Diagnostics. <italic>Mechanical Systems and Signal Processing</italic>, 200, Article 110544. https://doi.org/10.1016/j.ymssp.2023.110544 <pub-id pub-id-type="doi">10.1016/j.ymssp.2023.110544</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ymssp.2023.110544">https://doi.org/10.1016/j.ymssp.2023.110544</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Ni, Q.</string-name>
              <string-name>Ji, J.C.</string-name>
              <string-name>Halkon, B.</string-name>
              <string-name>Feng, K.</string-name>
              <string-name>Nandi, A.K.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Physics-informed Residual Network (PIResNet) for Rolling Element Bearing Fault Diagnostics</article-title>
            <source>Mechanical Systems and Signal Processing</source>
            <volume>200</volume>
            <elocation-id>110544</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.ymssp.2023.110544</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B2">
        <label>2.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Geetha, G. and Geethanjali, P. (2024) An Efficient Method for Bearing Fault Diagnosis. <italic>Systems Science &amp; Control Engineering</italic>, 12, Article 2329264. https://doi.org/10.1080/21642583.2024.2329264 <pub-id pub-id-type="doi">10.1080/21642583.2024.2329264</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/21642583.2024.2329264">https://doi.org/10.1080/21642583.2024.2329264</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Geetha, G.</string-name>
              <string-name>Geethanjali, P.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>An Efficient Method for Bearing Fault Diagnosis</article-title>
            <source>Systems Science &amp; Control Engineering</source>
            <volume>12</volume>
            <elocation-id>2329264</elocation-id>
            <pub-id pub-id-type="doi">10.1080/21642583.2024.2329264</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B3">
        <label>3.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Wang, L. and Zhao, W. (2025) An Ensemble Deep Learning Network Based on 2D Convolutional Neural Network and 1D LSTM with Self-Attention for Bearing Fault Diagnosis. <italic>Applied Soft Computing</italic>, 172, Article 112889. https://doi.org/10.1016/j.asoc.2025.112889 <pub-id pub-id-type="doi">10.1016/j.asoc.2025.112889</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.asoc.2025.112889">https://doi.org/10.1016/j.asoc.2025.112889</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Wang, L.</string-name>
              <string-name>Zhao, W.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>An Ensemble Deep Learning Network Based on 2D Convolutional Neural Network and 1D LSTM with Self-Attention for Bearing Fault Diagnosis</article-title>
            <source>Applied Soft Computing</source>
            <volume>172</volume>
            <elocation-id>112889</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.asoc.2025.112889</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B4">
        <label>4.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Wang, J., Mo, Z., Zhang, H. and Miao, Q. (2019) A Deep Learning Method for Bearing Fault Diagnosis Based on Time-Frequency Image. <italic>IEEE Access</italic>, 7, 42373-42383. https://doi.org/10.1109/access.2019.2907131 <pub-id pub-id-type="doi">10.1109/access.2019.2907131</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/access.2019.2907131">https://doi.org/10.1109/access.2019.2907131</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Wang, J.</string-name>
              <string-name>Mo, Z.</string-name>
              <string-name>Zhang, H.</string-name>
              <string-name>Miao, Q.</string-name>
            </person-group>
            <year>2019</year>
            <article-title>A Deep Learning Method for Bearing Fault Diagnosis Based on Time-Frequency Image</article-title>
            <source>IEEE Access</source>
            <volume>7</volume>
            <pub-id pub-id-type="doi">10.1109/access.2019.2907131</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B5">
        <label>5.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Abtane, M., Dahi, K., Martinez, H., Sedki, M., El Kimi, H., Dahhassi, C., <italic>et al</italic>. (2025) Axle Bearing Fault Diagnosis for High-Speed Trains: A Comprehensive Review of Methodologies, Technologies, Challenges and Emerging Trends. <italic>Measurement</italic>, 251, Article 117098. https://doi.org/10.1016/j.measurement.2025.117098 <pub-id pub-id-type="doi">10.1016/j.measurement.2025.117098</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.measurement.2025.117098">https://doi.org/10.1016/j.measurement.2025.117098</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Abtane, M.</string-name>
              <string-name>Dahi, K.</string-name>
              <string-name>Martinez, H.</string-name>
              <string-name>Sedki, M.</string-name>
              <string-name>Kimi, H.</string-name>
              <string-name>Dahhassi, C.</string-name>
              <string-name>Methodologies, T</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Axle Bearing Fault Diagnosis for High-Speed Trains: A Comprehensive Review of Methodologies, Technologies, Challenges and Emerging Trends</article-title>
            <source>Measurement</source>
            <volume>251</volume>
            <elocation-id>117098</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.measurement.2025.117098</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B6">
        <label>6.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Zhu, H., He, Z., Wei, J., Wang, J. and Zhou, H. (2021) Bearing Fault Feature Extraction and Fault Diagnosis Method Based on Feature Fusion. <italic>Sensors</italic>, 21, Article 2524. https://doi.org/10.3390/s21072524 <pub-id pub-id-type="doi">10.3390/s21072524</pub-id><pub-id pub-id-type="pmid">33916563</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/s21072524">https://doi.org/10.3390/s21072524</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Zhu, H.</string-name>
              <string-name>He, Z.</string-name>
              <string-name>Wei, J.</string-name>
              <string-name>Wang, J.</string-name>
              <string-name>Zhou, H.</string-name>
            </person-group>
            <year>2021</year>
            <article-title>Bearing Fault Feature Extraction and Fault Diagnosis Method Based on Feature Fusion</article-title>
            <source>Sensors</source>
            <volume>21</volume>
            <elocation-id>2524</elocation-id>
            <pub-id pub-id-type="doi">10.3390/s21072524</pub-id>
            <pub-id pub-id-type="pmid">33916563</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B7">
        <label>7.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Keshun, Y., Puzhou, W., Peng, H. and Yingkui, G. (2025) A Sound-Vibration Physical-Information Fusion Constraint-Guided Deep Learning Method for Rolling Bearing Fault Diagnosis. <italic>Reliability Engineering &amp; System Safety</italic>, 253, Article 110556. https://doi.org/10.1016/j.ress.2024.110556 <pub-id pub-id-type="doi">10.1016/j.ress.2024.110556</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ress.2024.110556">https://doi.org/10.1016/j.ress.2024.110556</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Keshun, Y.</string-name>
              <string-name>Puzhou, W.</string-name>
              <string-name>Peng, H.</string-name>
              <string-name>Yingkui, G.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>A Sound-Vibration Physical-Information Fusion Constraint-Guided Deep Learning Method for Rolling Bearing Fault Diagnosis</article-title>
            <source>Reliability Engineering &amp; System Safety</source>
            <volume>253</volume>
            <elocation-id>110556</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.ress.2024.110556</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B8">
        <label>8.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Dong, Z., Jiang, Y., Jiao, W., Zhang, F., Wang, Z., Huang, J., <italic>et al</italic>. (2025) Double Attention-Guided Tree-Inspired Grade Decision Network: A Method for Bearing Fault Diagnosis of Unbalanced Samples under Strong Noise Conditions. <italic>Advanced Engineering Informatics</italic>, 64, Article 103004. https://doi.org/10.1016/j.aei.2024.103004 <pub-id pub-id-type="doi">10.1016/j.aei.2024.103004</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.aei.2024.103004">https://doi.org/10.1016/j.aei.2024.103004</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Dong, Z.</string-name>
              <string-name>Jiang, Y.</string-name>
              <string-name>Jiao, W.</string-name>
              <string-name>Zhang, F.</string-name>
              <string-name>Wang, Z.</string-name>
              <string-name>Huang, J.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>Double Attention-Guided Tree-Inspired Grade Decision Network: A Method for Bearing Fault Diagnosis of Unbalanced Samples under Strong Noise Conditions</article-title>
            <source>Advanced Engineering Informatics</source>
            <volume>64</volume>
            <elocation-id>103004</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.aei.2024.103004</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B9">
        <label>9.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Chen, Z., Zhu, L., Lu, H., Chen, S., Zhu, F., Liu, S., <italic>et al</italic>. (2024) Research on Bearing Fault Diagnosis Based on Improved Genetic Algorithm and BP Neural Network. <italic>Scientific Reports</italic>, 14, Article No. 15527. https://doi.org/10.1038/s41598-024-66318-0 <pub-id pub-id-type="doi">10.1038/s41598-024-66318-0</pub-id><pub-id pub-id-type="pmid">38969797</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/s41598-024-66318-0">https://doi.org/10.1038/s41598-024-66318-0</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Chen, Z.</string-name>
              <string-name>Zhu, L.</string-name>
              <string-name>Lu, H.</string-name>
              <string-name>Chen, S.</string-name>
              <string-name>Zhu, F.</string-name>
              <string-name>Liu, S.</string-name>
            </person-group>
            <year>2024</year>
            <article-title>Research on Bearing Fault Diagnosis Based on Improved Genetic Algorithm and BP Neural Network</article-title>
            <source>Scientific Reports</source>
            <volume>14</volume>
            <elocation-id>No</elocation-id>
            <pub-id pub-id-type="doi">10.1038/s41598-024-66318-0</pub-id>
            <pub-id pub-id-type="pmid">38969797</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B10">
        <label>10.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Yu, L., Qu, J., Gao, F. and Tian, Y. (2019) A Novel Hierarchical Algorithm for Bearing Fault Diagnosis Based on Stacked LSTM. <italic>Shock and Vibration</italic>, 2019, Article 2756284. https://doi.org/10.1155/2019/2756284 <pub-id pub-id-type="doi">10.1155/2019/2756284</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1155/2019/2756284">https://doi.org/10.1155/2019/2756284</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Yu, L.</string-name>
              <string-name>Qu, J.</string-name>
              <string-name>Gao, F.</string-name>
              <string-name>Tian, Y.</string-name>
            </person-group>
            <year>2019</year>
            <article-title>A Novel Hierarchical Algorithm for Bearing Fault Diagnosis Based on Stacked LSTM</article-title>
            <source>Shock and Vibration</source>
            <volume>2019</volume>
            <elocation-id>2756284</elocation-id>
            <pub-id pub-id-type="doi">10.1155/2019/2756284</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B11">
        <label>11.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Zou, P., Hou, B., Lei, J. and Zhang, Z. (2020) Bearing Fault Diagnosis Method Based on EEMD and LSTM. <italic>International Journal of Computers Communications &amp; Control</italic>, 15, 1-14. https://doi.org/10.15837/ijccc.2020.1.3780 <pub-id pub-id-type="doi">10.15837/ijccc.2020.1.3780</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.15837/ijccc.2020.1.3780">https://doi.org/10.15837/ijccc.2020.1.3780</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Zou, P.</string-name>
              <string-name>Hou, B.</string-name>
              <string-name>Lei, J.</string-name>
              <string-name>Zhang, Z.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>Bearing Fault Diagnosis Method Based on EEMD and LSTM</article-title>
            <source>International Journal of Computers Communications &amp; Control</source>
            <volume>15</volume>
            <pub-id pub-id-type="doi">10.15837/ijccc.2020.1.3780</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B12">
        <label>12.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Ruan, D., Chen, X., Gühmann, C. and Yan, J. (2023) Improvement of Generative Adversarial Network and Its Application in Bearing Fault Diagnosis: A Review. <italic>Lubricants</italic>, 11, Article 74. https://doi.org/10.3390/lubricants11020074 <pub-id pub-id-type="doi">10.3390/lubricants11020074</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/lubricants11020074">https://doi.org/10.3390/lubricants11020074</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Ruan, D.</string-name>
              <string-name>Chen, X.</string-name>
              <string-name>Yan, J.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Improvement of Generative Adversarial Network and Its Application in Bearing Fault Diagnosis: A Review</article-title>
            <source>Lubricants</source>
            <volume>11</volume>
            <elocation-id>74</elocation-id>
            <pub-id pub-id-type="doi">10.3390/lubricants11020074</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B13">
        <label>13.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">He, D., Wu, J., Jin, Z., Huang, C., Wei, Z. and Yi, C. (2025) AGFCN: A Bearing Fault Diagnosis Method for High-Speed Train Bogie under Complex Working Conditions. <italic>Reliability Engineering &amp; System Safety</italic>, 258, Article 110907. https://doi.org/10.1016/j.ress.2025.110907 <pub-id pub-id-type="doi">10.1016/j.ress.2025.110907</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ress.2025.110907">https://doi.org/10.1016/j.ress.2025.110907</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>He, D.</string-name>
              <string-name>Wu, J.</string-name>
              <string-name>Jin, Z.</string-name>
              <string-name>Huang, C.</string-name>
              <string-name>Wei, Z.</string-name>
              <string-name>Yi, C.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>AGFCN: A Bearing Fault Diagnosis Method for High-Speed Train Bogie under Complex Working Conditions</article-title>
            <source>Reliability Engineering &amp; System Safety</source>
            <volume>258</volume>
            <elocation-id>110907</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.ress.2025.110907</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B14">
        <label>14.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Hou, L., Yi, H., Jin, Y., Gui, M., Sui, L., Zhang, J., <italic>et al</italic>. (2023) Inter-Shaft Bearing Fault Diagnosis Based on Aero-Engine System: A Benchmarking Dataset Study. <italic>Journal of Dynamics</italic>, <italic>Monitoring and Diagnostics</italic>, 2, 228-242. https://doi.org/10.37965/jdmd.2023.314 <pub-id pub-id-type="doi">10.37965/jdmd.2023.314</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.37965/jdmd.2023.314">https://doi.org/10.37965/jdmd.2023.314</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Hou, L.</string-name>
              <string-name>Yi, H.</string-name>
              <string-name>Jin, Y.</string-name>
              <string-name>Gui, M.</string-name>
              <string-name>Sui, L.</string-name>
              <string-name>Zhang, J.</string-name>
              <string-name>Dynamics, M</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Inter-Shaft Bearing Fault Diagnosis Based on Aero-Engine System: A Benchmarking Dataset Study</article-title>
            <source>Journal of Dynamics</source>
            <volume>2</volume>
            <pub-id pub-id-type="doi">10.37965/jdmd.2023.314</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B15">
        <label>15.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Hu, W., Xin, G., Wu, J., An, G., Li, Y., Feng, K., <italic>et al</italic>. (2023) Vibration-Based Bearing Fault Diagnosis of High-Speed Trains: A Literature Review. <italic>High</italic>- <italic>Speed Railway</italic>, 1, 219-223. https://doi.org/10.1016/j.hspr.2023.11.001 <pub-id pub-id-type="doi">10.1016/j.hspr.2023.11.001</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.hspr.2023.11.001">https://doi.org/10.1016/j.hspr.2023.11.001</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Hu, W.</string-name>
              <string-name>Xin, G.</string-name>
              <string-name>Wu, J.</string-name>
              <string-name>An, G.</string-name>
              <string-name>Li, Y.</string-name>
              <string-name>Feng, K.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Vibration-Based Bearing Fault Diagnosis of High-Speed Trains: A Literature Review</article-title>
            <source>High-Speed Railway</source>
            <volume>1</volume>
            <pub-id pub-id-type="doi">10.1016/j.hspr.2023.11.001</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B16">
        <label>16.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Liang, B. and Feng, W. (2023) Bearing Fault Diagnosis Based on ICEEMDAN Deep Learning Network. <italic>Processes</italic>, 11, Article 2440. https://doi.org/10.3390/pr11082440 <pub-id pub-id-type="doi">10.3390/pr11082440</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/pr11082440">https://doi.org/10.3390/pr11082440</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Liang, B.</string-name>
              <string-name>Feng, W.</string-name>
            </person-group>
            <year>2023</year>
            <article-title>Bearing Fault Diagnosis Based on ICEEMDAN Deep Learning Network</article-title>
            <source>Processes</source>
            <volume>11</volume>
            <elocation-id>2440</elocation-id>
            <pub-id pub-id-type="doi">10.3390/pr11082440</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B17">
        <label>17.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Tao, L., Liu, H., Ning, G., Cao, W., Huang, B. and Lu, C. (2025) LLM-Based Framework for Bearing Fault Diagnosis. <italic>Mechanical Systems and Signal Processing</italic>, 224, Article 112127. https://doi.org/10.1016/j.ymssp.2024.112127 <pub-id pub-id-type="doi">10.1016/j.ymssp.2024.112127</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.ymssp.2024.112127">https://doi.org/10.1016/j.ymssp.2024.112127</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Tao, L.</string-name>
              <string-name>Liu, H.</string-name>
              <string-name>Ning, G.</string-name>
              <string-name>Cao, W.</string-name>
              <string-name>Huang, B.</string-name>
              <string-name>Lu, C.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>LLM-Based Framework for Bearing Fault Diagnosis</article-title>
            <source>Mechanical Systems and Signal Processing</source>
            <volume>224</volume>
            <elocation-id>112127</elocation-id>
            <pub-id pub-id-type="doi">10.1016/j.ymssp.2024.112127</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B18">
        <label>18.</label>
        <citation-alternatives>
          <mixed-citation publication-type="journal">Eren, L., Ince, T. and Kiranyaz, S. (2018) A Generic Intelligent Bearing Fault Diagnosis System Using Compact Adaptive 1D CNN Classifier. <italic>Journal of Signal Processing Systems</italic>, 91, 179-189. https://doi.org/10.1007/s11265-018-1378-3 <pub-id pub-id-type="doi">10.1007/s11265-018-1378-3</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11265-018-1378-3">https://doi.org/10.1007/s11265-018-1378-3</ext-link></mixed-citation>
          <element-citation publication-type="journal">
            <person-group person-group-type="author">
              <string-name>Eren, L.</string-name>
              <string-name>Ince, T.</string-name>
              <string-name>Kiranyaz, S.</string-name>
            </person-group>
            <year>2018</year>
            <article-title>A Generic Intelligent Bearing Fault Diagnosis System Using Compact Adaptive 1D CNN Classifier</article-title>
            <source>Journal of Signal Processing Systems</source>
            <volume>91</volume>
            <pub-id pub-id-type="doi">10.1007/s11265-018-1378-3</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B19">
        <label>19.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Yuan, L., Lian, D., Kang, X., Chen, Y. and Zhai, K. (2020) Rolling Bearing Fault Diagnosis Based on Convolutional Neural Network and Support Vector Machine. <italic>IEEE Access</italic>, 8, 137395-137406. https://doi.org/10.1109/access.2020.3012053 <pub-id pub-id-type="doi">10.1109/access.2020.3012053</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/access.2020.3012053">https://doi.org/10.1109/access.2020.3012053</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Yuan, L.</string-name>
              <string-name>Lian, D.</string-name>
              <string-name>Kang, X.</string-name>
              <string-name>Chen, Y.</string-name>
              <string-name>Zhai, K.</string-name>
            </person-group>
            <year>2020</year>
            <article-title>Rolling Bearing Fault Diagnosis Based on Convolutional Neural Network and Support Vector Machine</article-title>
            <source>IEEE Access</source>
            <volume>8</volume>
            <pub-id pub-id-type="doi">10.1109/access.2020.3012053</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B20">
        <label>20.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Fu, W., Li, S., Wen, B., Zheng, B., Liao, W. and Tan, C. (2025) A Review of Rolling Bearing Fault Diagnosis: Data Preprocessing and Model Optimization. <italic>Measurement Science and Technology</italic>, 36, Article 062002. https://doi.org/10.1088/1361-6501/add7fb <pub-id pub-id-type="doi">10.1088/1361-6501/add7fb</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1088/1361-6501/add7fb">https://doi.org/10.1088/1361-6501/add7fb</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Fu, W.</string-name>
              <string-name>Li, S.</string-name>
              <string-name>Wen, B.</string-name>
              <string-name>Zheng, B.</string-name>
              <string-name>Liao, W.</string-name>
              <string-name>Tan, C.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>A Review of Rolling Bearing Fault Diagnosis: Data Preprocessing and Model Optimization</article-title>
            <source>Measurement Science and Technology</source>
            <volume>36</volume>
            <elocation-id>062002</elocation-id>
            <pub-id pub-id-type="doi">10.1088/1361-6501/add7fb</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
      <ref id="B21">
        <label>21.</label>
        <citation-alternatives>
          <mixed-citation publication-type="other">Wang, J., Li, T., Yang, Y., Chen, S. and Zhai, W. (2025) DiagLLM: Multimodal Reasoning with Large Language Model for Explainable Bearing Fault Diagnosis. <italic>Science China Information Sciences</italic>, 68, Article 160103. https://doi.org/10.1007/s11432-024-4333-7 <pub-id pub-id-type="doi">10.1007/s11432-024-4333-7</pub-id><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s11432-024-4333-7">https://doi.org/10.1007/s11432-024-4333-7</ext-link></mixed-citation>
          <element-citation publication-type="other">
            <person-group person-group-type="author">
              <string-name>Wang, J.</string-name>
              <string-name>Li, T.</string-name>
              <string-name>Yang, Y.</string-name>
              <string-name>Chen, S.</string-name>
              <string-name>Zhai, W.</string-name>
            </person-group>
            <year>2025</year>
            <article-title>DiagLLM: Multimodal Reasoning with Large Language Model for Explainable Bearing Fault Diagnosis</article-title>
            <source>Science China Information Sciences</source>
            <volume>68</volume>
            <elocation-id>160103</elocation-id>
            <pub-id pub-id-type="doi">10.1007/s11432-024-4333-7</pub-id>
          </element-citation>
        </citation-alternatives>
      </ref>
    </ref-list>
  </back>
</article>