<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    abb
   </journal-id>
   <journal-title-group>
    <journal-title>
     Advances in Bioscience and Biotechnology
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2156-8456
   </issn>
   <issn publication-format="print">
    2156-8502
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/abb.2025.169024
   </article-id>
   <article-id pub-id-type="publisher-id">
    abb-145550
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Biomedical 
     </subject>
     <subject>
       Life Sciences
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    Prediction Modeling: Basic Metabolic Panel
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Philip de
      </surname>
      <given-names>
       Melo
      </given-names>
     </name>
    </contrib>
   </contrib-group> 
   <aff id="affnull">
    <addr-line>
     aDepartment of Nursing and Allied Health, Norfolk State University, Norfolk, VA, USA
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     03
    </day> 
    <month>
     09
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    16
   </volume> 
   <issue>
    09
   </issue>
   <fpage>
    360
   </fpage>
   <lpage>
    378
   </lpage>
   <history>
    <date date-type="received">
     <day>
      2,
     </day>
     <month>
      July
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      9,
     </day>
     <month>
      July
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      9,
     </day>
     <month>
      September
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    Blood test informatics is a field that combines data science, medical informatics, and research to improve management, treatment, and understanding of diseases. This field uses health data, wearable technology, artificial intelligence (AI), and electronic health records (EHRs) to optimize healthcare. EHR informatics focuses on the following: 1) Using AI and data analytics to tailor EHR data management for individuals, 2) Identifying early signs of complications or predicting blood sugar fluctuations, 3) Using continuous glucose monitors (CGMs) and insulin pumps to collect real-time data, 4) Assisting doctors and patients with real-time recommendations. In this paper, we will discuss the basic principles of EHR informatics focusing on assisting doctors and patients with accurate recommendations and data management. We will demonstrate a new prediction method that improves accuracy compared to other forecasting technologies.
   </abstract>
   <kwd-group> 
    <kwd>
     Artificial Intelligence
    </kwd> 
    <kwd>
      Electronic Health Records
    </kwd> 
    <kwd>
      Prediction Modeling
    </kwd> 
    <kwd>
      Metabolic Panel Prediction
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction</title>
   <p>Predictive modeling in healthcare is the application of statistical, machine learning, or artificial intelligence methods to forecast outcomes based on historical or real-time health data. These models help in anticipating disease progression, readmission risk, treatment outcomes, and more, ultimately improving clinical decision-making and resource allocation.</p>
   <p>Key applications include Disease risk prediction using Logistic Regression, Random Forest, Neural Networks, etc. Predictive modeling plays an important role in patient readmission. Hospitals use models to predict if a patient is likely to be readmitted within 30 days of discharge. It helps reduce readmission penalties and improve care plans.</p>
   <p>Length of Stay (LOS) estimation focuses on predicting how long a patient will stay in the hospital. It is critical for hospital resource management and treatment response prediction.</p>
   <p>Personalized medicine forecasts how a patient might respond to a drug or therapy. It represents an important element in early warning systems based on vital signs, lab results, and clinical notes.</p>
   <p>Predictive modeling is often powered by real-time data and deep learning. It is widely used in public health forecasting such as modeling the spread of infectious diseases (e.g., COVID-19), syndromic surveillance and outbreak detection.</p>
   <p>Predictive modeling uses the following types of data: Electronic Health Records (EHR), laboratory test results, imaging data (X-ray, CT scans), wearable device data, genomics and biomarkers, social determinants of health.</p>
   <p>Over the past decade, the rapid increase in the availability of healthcare data and dramatic advances in machine learning stimulated predictive models, which estimate the probability of some event of interest occurring in a specified time frame in the future. These models have been developed for events such as heart failure, inpatient mortality, and patient deterioration. However, there have been relatively few success stories where these models led to impact on what matters to patients, providers, and healthcare decision makers.</p>
   <p>Blood tests are widely utilized in primary care and serve critical roles in diagnosis, prognosis, and treatment <xref ref-type="bibr" rid="scirp.145550-1">
     [1]
    </xref>. Beyond their clinical functions, qualitative studies have shown that blood tests also play psychosocial roles during consultations. They can offer reassurance, foster empathetic interactions, and validate patients’ symptoms <xref ref-type="bibr" rid="scirp.145550-2">
     [2]
    </xref>.</p>
   <p>Inflammatory markers, a specific category of blood test, are frequently used to diagnose and monitor infections, autoimmune disorders, and cancers. They are also employed as broad screening tools for patients presenting with nonspecific or unexplained symptoms <xref ref-type="bibr" rid="scirp.145550-3">
     [3]
    </xref>. While inflammatory markers seldom yield definitive diagnoses, they can provide diagnostic clues.</p>
   <p>However, false-positive results may lead to unnecessary follow-up appointments, repeated testing, and referrals <xref ref-type="bibr" rid="scirp.145550-4">
     [4]
    </xref>. This highlights the importance of a shared understanding between clinicians and patients regarding both the rationale for ordering inflammatory marker tests and the limitations associated with their use.</p>
   <p>Shared decision-making—a collaborative approach where clinicians and patients make health decisions together, integrating clinical evidence and patient values—is widely endorsed as a best practice <xref ref-type="bibr" rid="scirp.145550-5">
     [5]
    </xref>. Although its importance is well recognized, most existing research focuses on treatment decisions rather than diagnostic testing. When diagnostic decision-making is studied, attention is often limited to tests, such as prostate-specific antigen (PSA) for prostate cancer <xref ref-type="bibr" rid="scirp.145550-6">
     [6]
    </xref>.</p>
   <p>This study aimed to explore how communication and decision-making occur around the use of inflammatory marker blood tests in primary care. While the concept of shared decision-making provided a framework for our future investigation, we deliberately adopted a broader perspective to encompass the full communication process—from the initial consultation through to the delivery of test results.</p>
   <p>Let us discuss major components of the blood test chemistry. Blood urea nitrogen (BUN) test: This test measures the volume of nitrogen in the blood. High levels can be caused by kidney damage or disease, while low levels may be a sign of malnutrition or severe liver damage.</p>
   <p>Calcium test: This test measures the levels of calcium in the blood. If testing indicates low levels, this can indicate under-active parathyroid glands, insufficient calcium in the diet, or Vit D insufficiency, and other less common conditions. High levels can indicate conditions including over-active parathyroid glands, excessive Vit D supplementation, kidney problems as well as more concerning causes that usually require further investigation. Proper interpretation of calcium levels often requires testing of other parameters such as total serum protein and albumin.</p>
   <p>Chloride test: This test measures the body’s chloride levels. An increased level of chloride can indicate dehydration as well as kidney disorders and adrenal gland dysfunction.</p>
   <p>Sodium test: Sodium is a mineral that aids nerve impulses and muscle contractions, as well as balancing water levels. Irregularities are a possible indication of dehydration, adrenal gland disorders, corticosteroids, and kidney or liver disorders.</p>
   <p>Creatinine test: Creatinine is a chemical waste molecule that is important for creating muscle energy. Increased levels of creatinine can be a sign of kidney dysfunction.</p>
   <p>Potassium test: Potassium aids communication between nerves and muscles, regulates the heart and maintains muscle function. Diuretics (a substance or medication used to increase urination) can cause potassium levels to fall.</p>
   <p>Blood glucose: A blood glucose test measures the amount of sugar in the blood. It’s a standard part of the electrolyte test panel. Blood sugar will vary depending on when you last ate. A normal range for a fasting blood sugar (meaning you haven’t eaten for 8 hours) is 99 mg/dL or less. A normal range for a random blood sugar is less than 200 mg/dL.</p>
   <p>Anion gap (or serum anion gap) is a measurement to check for how acidic blood is. Too much acid or too little acid in blood can be a sign of certain health conditions.</p>
   <p>A carbon dioxide (CO<sub>2</sub>) blood test helps healthcare providers determine if the body is balancing electrolytes properly. Results outside the normal range of 20 to 29 millimoles per liter (mmol/L) may point to a more serious issue with the kidneys or lungs. The following Table (<xref ref-type="table" rid="table1">
     Table 1
    </xref>) depicts normal values of BMP:</p>
   <table-wrap id="table1">
    <label>
     <xref ref-type="table" rid="table1">
      Table 1
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.145550-"></xref>Table 1. Normal values for BMP panel.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td acenter" width="47.42%"><p style="text-align:center">BMP element</p></td> 
      <td class="custom-bottom-td acenter" width="52.58%"><p style="text-align:center">Normal numbers</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="47.42%"><p style="text-align:center">Glucose:</p></td> 
      <td class="custom-top-td acenter" width="52.58%"><p style="text-align:center">74 to 99 mg/dL</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Calcium:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">8.5 to 10.2 mg/dL</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">BUN:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">7 to 21 mg/dL</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Creatinine:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">0.58 to 0.96 mg/dL</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Sodium:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">136 to 144 mmol/L</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Potassium:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">3.7 to 5.1 mmol/L</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Bicarbonate (CO<sub>2</sub>):</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">22 to 30 mmol/L</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="47.42%"><p style="text-align:center">Chloride:</p></td> 
      <td class="acenter" width="52.58%"><p style="text-align:center">98 to 107 mmol/L</p></td> 
     </tr> 
    </table>
   </table-wrap>
  </sec><sec id="s2">
   <title>2. Methods</title>
   <sec id="s2_1">
    <title>2.1. Interpolation of Sparse Data</title>
    <p>In this section, we will consider a traditional linear interpolation between the recorded data and show that the correlation between the features does not change much proving that unlike extrapolation, the interpolation is a stable process. In the next sections, we will demonstrate 2 methods PM GenAI and CAIHA that are able to make accurate predictions beyond the recorded data. a 73-year-old man complained about hypertension. The blood test was taken. The history of the bmp panel is given below (<xref ref-type="table" rid="table2">
      Table 2
     </xref>):</p>
    <table-wrap id="table2">
     <label>
      <xref ref-type="table" rid="table2">
       Table 2
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Table 2. BMP panel.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="21.03%"><p style="text-align:center">Glucose</p></td> 
       <td class="custom-bottom-td acenter" width="22.88%"><p style="text-align:center">Potassium</p></td> 
       <td class="custom-bottom-td acenter" width="16.83%"><p style="text-align:center">Sodium</p></td> 
       <td class="custom-bottom-td acenter" width="20.75%"><p style="text-align:center">Chloride</p></td> 
       <td class="custom-bottom-td acenter" width="10.75%"><p style="text-align:center">CO<sub>2</sub></p></td> 
       <td class="custom-bottom-td acenter" width="24.18%"><p style="text-align:center">Creatinine</p></td> 
       <td class="custom-bottom-td acenter" width="10.61%"><p style="text-align:center">AG</p></td> 
       <td class="custom-bottom-td acenter" width="10.78%"><p style="text-align:center">BUN</p></td> 
       <td class="custom-bottom-td acenter" width="13.15%"><p style="text-align:center">Month</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="21.03%"><p style="text-align:center">98</p></td> 
       <td class="custom-top-td acenter" width="22.88%"><p style="text-align:center">4.0</p></td> 
       <td class="custom-top-td acenter" width="16.83%"><p style="text-align:center">142</p></td> 
       <td class="custom-top-td acenter" width="20.75%"><p style="text-align:center">106</p></td> 
       <td class="custom-top-td acenter" width="10.75%"><p style="text-align:center">24</p></td> 
       <td class="custom-top-td acenter" width="24.18%"><p style="text-align:center">0.84</p></td> 
       <td class="custom-top-td acenter" width="10.61%"><p style="text-align:center">12</p></td> 
       <td class="custom-top-td acenter" width="10.78%"><p style="text-align:center">30</p></td> 
       <td class="custom-top-td acenter" width="13.15%"><p style="text-align:center">1</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="21.03%"><p style="text-align:center">93</p></td> 
       <td class="acenter" width="22.88%"><p style="text-align:center">4.3</p></td> 
       <td class="acenter" width="16.83%"><p style="text-align:center">142</p></td> 
       <td class="acenter" width="20.75%"><p style="text-align:center">108</p></td> 
       <td class="acenter" width="10.75%"><p style="text-align:center">25</p></td> 
       <td class="acenter" width="24.18%"><p style="text-align:center">0.97</p></td> 
       <td class="acenter" width="10.61%"><p style="text-align:center">9</p></td> 
       <td class="acenter" width="10.78%"><p style="text-align:center">21</p></td> 
       <td class="acenter" width="13.15%"><p style="text-align:center">9</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="21.03%"><p style="text-align:center">99</p></td> 
       <td class="acenter" width="22.88%"><p style="text-align:center">4.0</p></td> 
       <td class="acenter" width="16.83%"><p style="text-align:center">141</p></td> 
       <td class="acenter" width="20.75%"><p style="text-align:center">107</p></td> 
       <td class="acenter" width="10.75%"><p style="text-align:center">22</p></td> 
       <td class="acenter" width="24.18%"><p style="text-align:center">1.03</p></td> 
       <td class="acenter" width="10.61%"><p style="text-align:center">12</p></td> 
       <td class="acenter" width="10.78%"><p style="text-align:center">17</p></td> 
       <td class="acenter" width="13.15%"><p style="text-align:center">10</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="21.03%"><p style="text-align:center">112</p></td> 
       <td class="acenter" width="22.88%"><p style="text-align:center">3.8</p></td> 
       <td class="acenter" width="16.83%"><p style="text-align:center">138</p></td> 
       <td class="acenter" width="20.75%"><p style="text-align:center">105</p></td> 
       <td class="acenter" width="10.75%"><p style="text-align:center">23</p></td> 
       <td class="acenter" width="24.18%"><p style="text-align:center">0.95</p></td> 
       <td class="acenter" width="10.61%"><p style="text-align:center">10</p></td> 
       <td class="acenter" width="10.78%"><p style="text-align:center">22</p></td> 
       <td class="acenter" width="13.15%"><p style="text-align:center">19</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="21.03%"><p style="text-align:center">100</p></td> 
       <td class="acenter" width="22.88%"><p style="text-align:center">4.3</p></td> 
       <td class="acenter" width="16.83%"><p style="text-align:center">139</p></td> 
       <td class="acenter" width="20.75%"><p style="text-align:center">109</p></td> 
       <td class="acenter" width="10.75%"><p style="text-align:center">21</p></td> 
       <td class="acenter" width="24.18%"><p style="text-align:center">0.95</p></td> 
       <td class="acenter" width="10.61%"><p style="text-align:center">9</p></td> 
       <td class="acenter" width="10.78%"><p style="text-align:center">27</p></td> 
       <td class="acenter" width="13.15%"><p style="text-align:center">24</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>Time is measured in months. <xref ref-type="fig" rid="fig1">
      Figure 1
     </xref> shows the correlation between features (heatmap). For example, potassium highly correlates with chloride values while CO<sub>2</sub> correlates with sodium.</p>
    <p>
     <xref ref-type="fig" rid="fig2">
      Figure 2
     </xref> shows the interpolated data for Chloride (a) and CO<sub>2</sub> (b).</p>
    <p>
     <xref ref-type="fig" rid="fig3">
      Figure 3
     </xref> shows the heatmap (correlation between interpolated features). It demonstrates slightly different correlation values compared to original data correlation. This shows that correlation is a stable process.</p>
   </sec>
   <sec id="s2_2">
    <title>2.2. Augmentation of Data</title>
    <p>Augmented data can be calculated in the form of Gaussian posterior probabilities:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         p 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mi>
          x 
        </mi> 
        <mo>
          ) 
        </mo> 
       </mrow> 
       <mo>
         = 
       </mo> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            k 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           N 
         </mi> 
        </msubsup> 
        <mrow> 
         <msub> 
          <mi>
            π 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              x 
            </mi> 
            <mi>
              k 
            </mi> 
           </msub> 
           <mo>
             , 
           </mo> 
           <mi>
             μ 
           </mi> 
           <mo>
             , 
           </mo> 
           <msub> 
            <mi>
              Σ 
            </mi> 
            <mi>
              k 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (1)</p>
    <p>which is the multivariate normal distribution of the data set x with mean μ and covariance matrix Σ<sub>k</sub>. K is the number of Gaussian components and 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          π 
        </mi> 
        <mi>
          k 
        </mi> 
       </msub> 
      </mrow> 
     </math> is a coefficient for k-Gaussian distribution:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            k 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           N 
         </mi> 
        </msubsup> 
        <mrow> 
         <msub> 
          <mi>
            π 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
       </mstyle> 
       <mo>
         = 
       </mo> 
       <mn>
         1 
       </mn> 
      </mrow> 
     </math></p>
    <p>The covariance matrix is calculated from:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          Σ 
        </mi> 
        <mi>
          k 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mfrac> 
        <mn>
          1 
        </mn> 
        <mrow> 
         <msub> 
          <mi>
            N 
          </mi> 
          <mi>
            k 
          </mi> 
         </msub> 
        </mrow> 
       </mfrac> 
       <mstyle displaystyle="true"> 
        <msubsup> 
         <mo>
           ∑ 
         </mo> 
         <mrow> 
          <mi>
            i 
          </mi> 
          <mo>
            = 
          </mo> 
          <mn>
            1 
          </mn> 
         </mrow> 
         <mi>
           N 
         </mi> 
        </msubsup> 
        <mrow> 
         <msub> 
          <mi>
            γ 
          </mi> 
          <mrow> 
           <mi>
             i 
           </mi> 
           <mi>
             k 
           </mi> 
          </mrow> 
         </msub> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <msub> 
            <mi>
              x 
            </mi> 
            <mi>
              i 
            </mi> 
           </msub> 
           <mo>
             − 
           </mo> 
           <msub> 
            <mi>
              μ 
            </mi> 
            <mi>
              k 
            </mi> 
           </msub> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
         <msup> 
          <mrow> 
           <mrow> 
            <mo>
              ( 
            </mo> 
            <mrow> 
             <msub> 
              <mi>
                x 
              </mi> 
              <mi>
                i 
              </mi> 
             </msub> 
             <mo>
               − 
             </mo> 
             <msub> 
              <mi>
                μ 
              </mi> 
              <mi>
                k 
              </mi> 
             </msub> 
            </mrow> 
            <mo>
              ) 
            </mo> 
           </mrow> 
          </mrow> 
          <mtext>
            T 
          </mtext> 
         </msup> 
        </mrow> 
       </mstyle> 
      </mrow> 
     </math> (2)</p>
    <p>In this formula,</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          N 
        </mi> 
        <mi>
          k 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <munderover> 
        <mstyle mathsize="140%" displaystyle="true"> 
         <mo>
           ∑ 
         </mo> 
        </mstyle> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mo>
           = 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
        <mi>
          N 
        </mi> 
       </munderover> 
       <mtext>
           
       </mtext> 
       <msub> 
        <mi>
          γ 
        </mi> 
        <mrow> 
         <mi>
           i 
         </mi> 
         <mi>
           k 
         </mi> 
        </mrow> 
       </msub> 
      </mrow> 
     </math></p>
    <fig id="fig1" position="float">
     <label>Figure 1</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 1. Heatmap (correlation) of features.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId25.jpeg?20250912020743" />
    </fig>
    <fig id="fig2" position="float">
     <label>Figure 2</label>
     <caption>
      <title>(a)<p class="imgGroupCss_v"><img class=" imgMarkCss lazy" data-original="https://html.scirp.org/file/7302204-rId27.jpeg?20250912020742" /></p>(b)<xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 2. Original (blue dots) and interpolated data (red dots). (a) Potassium, (b) Glucose.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId26.jpeg?20250912020743" />
    </fig>
    <fig id="fig3" position="float">
     <label>Figure 3</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 3. Heatmap of the interpolated data.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId28.jpeg?20250912020742" />
    </fig>
    <p>
     <xref ref-type="fig" rid="fig4(a)">
      Figure 4(a)
     </xref> and <xref ref-type="fig" rid="fig4(b)">
      Figure 4(b)
     </xref> shows the scattered plots of augmented data. <xref ref-type="fig" rid="fig5(a)">
      Figure 5(a)
     </xref> and <xref ref-type="fig" rid="fig5(b)">
      Figure 5(b)
     </xref> depicts Gaussian and Gibbs data augmentation. <xref ref-type="fig" rid="fig6">
      Figure 6
     </xref> depicts original + interpolated data and augmented data (red)</p>
    <p>Let’s build now the confidence ellipse. Given a 2D Gaussian distribution of variables X = original chloride and Y = augmented data. The ellipse is defined by:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          χ 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
       <mo>
         = 
       </mo> 
       <msup> 
        <mrow> 
         <mrow> 
          <mo>
            ( 
          </mo> 
          <mrow> 
           <mi>
             x 
           </mi> 
           <mo>
             − 
           </mo> 
           <mi>
             μ 
           </mi> 
          </mrow> 
          <mo>
            ) 
          </mo> 
         </mrow> 
        </mrow> 
        <mtext>
          T 
        </mtext> 
       </msup> 
       <msup> 
        <mi>
          Σ 
        </mi> 
        <mrow> 
         <mo>
           − 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
       </msup> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <mi>
           x 
         </mi> 
         <mo>
           − 
         </mo> 
         <mi>
           μ 
         </mi> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (3)</p>
    <p>Here 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msup> 
        <mi>
          χ 
        </mi> 
        <mn>
          2 
        </mn> 
       </msup> 
       <mo>
         = 
       </mo> 
       <mn>
         5.99 
       </mn> 
      </mrow> 
     </math> chi-squared critical value with 2 degrees of freedom.</p>
    <p>Σ is the 2 × 2 covariance matrix between X and Y.</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <mi>
         μ 
       </mi> 
       <mo>
         = 
       </mo> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            μ 
          </mi> 
          <mi>
            X 
          </mi> 
         </msub> 
         <mo>
           , 
         </mo> 
         <msub> 
          <mi>
            μ 
          </mi> 
          <mi>
            Y 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> is the mean vector of X and Y.</p>
    <p>
     <xref ref-type="fig" rid="fig5(a)">
      Figure 5(a)
     </xref> and <xref ref-type="fig" rid="fig5(b)">
      Figure 5(b)
     </xref> illustrates Gaussian and Gibbs augmentation of the recorded data.</p>
    <p>
     <xref ref-type="fig" rid="fig6(a)">
      Figure 6(a)
     </xref> depicts the confidence ellipse of augmented and original + interpolated data (BUN vs. Chloride) and <xref ref-type="fig" rid="fig6(b)">
      Figure 6(b)
     </xref> Chloride vs. creatinine. <xref ref-type="fig" rid="fig7">
      Figure 7
     </xref> shows the confidence ellipse of augmented and original + interpolated data (BUN vs. creatinine.</p>
    <fig id="fig4" position="float">
     <label>Figure 4</label>
     <caption>
      <title>(a)<p class="imgGroupCss_v"><img class=" imgMarkCss lazy" data-original="https://html.scirp.org/file/7302204-rId36.jpeg?20250912020743" /></p>(b)<xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 4. Gaussian (a) BUN vs. Chloride and (b) Creatinine vs. Chloride data augmentation.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId35.jpeg?20250912020743" />
    </fig>
    <fig id="fig5" position="float">
     <label>Figure 5</label>
     <caption>
      <title>(a)<p class="imgGroupCss_v"><img class=" imgMarkCss lazy" data-original="https://html.scirp.org/file/7302204-rId38.jpeg?20250912020742" /></p>(b)<xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 5. Gaussian (a) and Gibbs (b) data augmentation.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId37.jpeg?20250912020743" />
    </fig>
    <p>Let us build now the confidence ellipse.</p>
    <p>A confidence ellipse is a graphical representation of the uncertainty (or confidence region) around an estimated mean vector in two dimensions. It is often used in statistics, regression analysis, and multivariate analysis to show where the “true” values are likely to fall with a certain probability (e.g., 95%).</p>
    <p>Key Ideas:</p>
    <fig id="fig6" position="float">
     <label>Figure 6</label>
     <caption>
      <title>(a)<p class="imgGroupCss_v"><img class=" imgMarkCss lazy" data-original="https://html.scirp.org/file/7302204-rId40.jpeg?20250912020743" /></p>(b)<xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 6. 95% confidence ellipse between original + interpolated and augmented data. (a) Bun-Creatinine and (b) Chloride-Creatinine.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId39.jpeg?20250912020742" />
    </fig>
    <p>Shape: The ellipse is centered at the sample mean of two features. Axes: The major and minor axes of the ellipse correspond to the eigenvectors of the covariance matrix of the data, and their lengths depend on the eigenvalues (which reflect variance along each direction).</p>
    <p>Confidence Level: A 95% confidence ellipse means that if you were to repeat the sampling many times, the true population mean would lie within the ellipse about 95% of the time. <xref ref-type="fig" rid="fig7">
      Figure 7
     </xref> shows the 95% confidence ellipse of creatinine and BUN.</p>
    <fig id="fig7" position="float">
     <label>Figure 7</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 7. 95% confidence ellipse between original + interpolated and augmented data.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId41.jpeg?20250912020743" />
    </fig>
    <p>The characteristics of the confidence ellipse are given by:</p>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="acenter" width="41.74%"><p style="text-align:center">Size of ellipse</p></td> 
      <td class="acenter" width="95.10%"><p style="text-align:center">Variance or spread in original/augmented data</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="41.74%"><p style="text-align:center">Shape</p></td> 
      <td class="acenter" width="95.10%"><p style="text-align:center">Strength and direction of correlation</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="41.74%"><p style="text-align:center">Tilted ellipse</p></td> 
      <td class="acenter" width="95.10%"><p style="text-align:center">Indicates linear correlation (positive/negative)</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="41.74%"><p style="text-align:center">Circular ellipse</p></td> 
      <td class="acenter" width="95.10%"><p style="text-align:center">No correlation</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="41.74%"><p style="text-align:center">Overlap with 45˚ line</p></td> 
      <td class="acenter" width="95.10%"><p style="text-align:center">High consistency between original and augmented values</p></td> 
     </tr> 
    </table>
   </sec>
   <sec id="s2_3">
    <title>2.3. Prediction Algorithms to Forecast the Blood Lab Tests</title>
    <p>LSTM (Long Short-Term Memory) is a refined version of Recurrent Neural Networks (RNNs), designed to better handle long-term dependencies by controlling information flow through special structures called gates. The core of LSTM consists of four interacting components:</p>
    <p>1) Forget Gate</p>
    <p>Controls what portion of the previous cell state 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          c 
        </mi> 
        <mrow> 
         <mi>
           t 
         </mi> 
         <mo>
           − 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
       </msub> 
      </mrow> 
     </math> to retain ( 
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mi>
        σ 
      </mi> 
     </math> is a sigmoid function).</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          f 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mi>
         σ 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            W 
          </mi> 
          <mi>
            f 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            t 
          </mi> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            U 
          </mi> 
          <mi>
            f 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            h 
          </mi> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mo>
             − 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            b 
          </mi> 
          <mi>
            f 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (4)</p>
    <p>2) Input Gate</p>
    <p>Controls what new information to write into the cell.</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          i 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mi>
         σ 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            W 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            t 
          </mi> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            U 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            h 
          </mi> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mo>
             − 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            b 
          </mi> 
          <mi>
            i 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (5)</p>
    <p>3) Cell Candidate (New Content)</p>
    <p>Proposes new content to be added to the cell state.</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mover accent="true"> 
         <mi>
           c 
         </mi> 
         <mo>
           ˜ 
         </mo> 
        </mover> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mi>
         tanh 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            W 
          </mi> 
          <mi>
            c 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            t 
          </mi> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            U 
          </mi> 
          <mi>
            c 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            h 
          </mi> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mo>
             − 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            b 
          </mi> 
          <mi>
            c 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (6)</p>
    <p>4) Cell State Update combines old state and new candidate using the gates:</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          c 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <msub> 
        <mi>
          f 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         ⊙ 
       </mo> 
       <msub> 
        <mi>
          c 
        </mi> 
        <mrow> 
         <mi>
           t 
         </mi> 
         <mo>
           − 
         </mo> 
         <mn>
           1 
         </mn> 
        </mrow> 
       </msub> 
       <mo>
         + 
       </mo> 
       <msub> 
        <mi>
          i 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         ⊙ 
       </mo> 
       <msub> 
        <mover accent="true"> 
         <mi>
           c 
         </mi> 
         <mo>
           ˜ 
         </mo> 
        </mover> 
        <mi>
          t 
        </mi> 
       </msub> 
      </mrow> 
     </math> (7)</p>
    <p>5) Output Gate decides how much of the cell state to expose:</p>
    <p>
     <math xmlns="http://www.w3.org/1998/Math/MathML"> <mrow> 
       <msub> 
        <mi>
          o 
        </mi> 
        <mi>
          t 
        </mi> 
       </msub> 
       <mo>
         = 
       </mo> 
       <mi>
         σ 
       </mi> 
       <mrow> 
        <mo>
          ( 
        </mo> 
        <mrow> 
         <msub> 
          <mi>
            W 
          </mi> 
          <mi>
            o 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            x 
          </mi> 
          <mi>
            t 
          </mi> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            U 
          </mi> 
          <mi>
            o 
          </mi> 
         </msub> 
         <msub> 
          <mi>
            h 
          </mi> 
          <mrow> 
           <mi>
             t 
           </mi> 
           <mo>
             − 
           </mo> 
           <mn>
             1 
           </mn> 
          </mrow> 
         </msub> 
         <mo>
           + 
         </mo> 
         <msub> 
          <mi>
            b 
          </mi> 
          <mi>
            o 
          </mi> 
         </msub> 
        </mrow> 
        <mo>
          ) 
        </mo> 
       </mrow> 
      </mrow> 
     </math> (8)</p>
    <p>LSTM prediction is highly inaccurate and unstable for small data sets. This is why the sparce data were interpolated (Time index 40 corresponds to month 24). Interpolation significantly stabilizes the LSTM algorithm (<xref ref-type="fig" rid="fig8">
      Figure 8
     </xref> and <xref ref-type="fig" rid="fig9">
      Figure 9
     </xref>).</p>
    <fig id="fig8" position="float">
     <label>Figure 8</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 8. LSTM prediction of BUN (red solid line) for months 25 - 30.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId56.jpeg?20250912020746" />
    </fig>
    <fig id="fig9" position="float">
     <label>Figure 9</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 9. LSTM prediction of Glucose (red solid line) for months 25 - 30.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId57.jpeg?20250912020746" />
    </fig>
    <p>PM GenAI (Principal Model Generative Algorithm) was introduced by Philip de Melo <xref ref-type="bibr" rid="scirp.145550-7">
      [7]
     </xref> <xref ref-type="bibr" rid="scirp.145550-8">
      [8]
     </xref>.</p>
    <p>The method consists of the following steps:</p>
    <p>PM GenAI generates new data by sampling from the posterior distribution of the model weights. This allows it to create diverse outputs based on the learned distributions, rather than relying on a fixed, deterministic model.</p>
    <p>By modeling uncertainty in the parameters, PM GenAI produces a distribution of outputs instead of a single prediction. This is particularly useful for tasks that benefit from multiple plausible outcomes, such as data generation or decision-making under uncertainty.</p>
    <p>Design the neural network architecture and specify prior distributions for the model parameters. Use techniques such as variational inference or Markov Chain Monte Carlo (MCMC) to approximate the posterior distribution of the weights.</p>
    <p>Once the posterior is approximated (typically via backpropagation), sample weights from the distribution and perform feedforward passes through the network to generate predictions or synthetic data points that reflect the learned uncertainty. PM GenAI generates data that resembles real data, which is especially valuable in scenarios with limited labeled examples. By incorporating uncertainty, PM GenAI enables models to be more robust to outliers and noise, ultimately improving generalization.</p>
    <p>In summary, using Bayesian inference in deep learning for data generation enables the incorporation of uncertainty into models, generate realistic synthetic data, and make more reliable and accurate predictions as expressed in <xref ref-type="fig" rid="figFigures 10-12">
      Figures 10-12
     </xref>.</p>
    <p>To analyze the structure of the original data and predict (such as sharp variations), we use the CWT method (Continuous Wavelet Transform). The Continuous Wavelet Transform transforms a 1D signal into a 2D representation that shows how the frequency content of the data variations changes over time, using wavelets—localized oscillatory functions.</p>
    <p>Larger scales capture low frequencies (slow trends) while smaller scales correspond to high frequencies (sharp changes). <xref ref-type="fig" rid="fig13">
      Figure 13
     </xref> and <xref ref-type="fig" rid="fig14">
      Figure 14
     </xref> show CWT of original and predicted values of BUN and Glucose.</p>
    <fig id="fig10" position="float">
     <label>Figure 10</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 10. PM GenAI prediction of BUN (red solid line) for months 25 - 30.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId58.jpeg?20250912020747" />
    </fig>
    <fig id="fig11" position="float">
     <label>Figure 11</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 11. PM GenAI prediction of Glucose (red solid line) for months 25 - 30.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId59.jpeg?20250912020747" />
    </fig>
    <fig id="fig12" position="float">
     <label>Figure 12</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 12. PM GenAI prediction of Creatinine (red solid line) for months 25 - 30.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId60.jpeg?20250912020748" />
    </fig>
    <fig id="fig13" position="float">
     <label>Figure 13</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 13. CWT analysis of original and predicted data (Glucose).</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId61.jpeg?20250912020748" />
    </fig>
    <fig id="fig14" position="float">
     <label>Figure 14</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 14. CWT analysis of original and predicted data (BUN).</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId62.jpeg?20250912020748" />
    </fig>
    <p>We will follow now <xref ref-type="bibr" rid="scirp.145550-9">
      [9]
     </xref> and consider a new algorithm that uses stochastic characteristics of the data to predict values such as vitals, etc. beyond the time of observations. We will consider real data (irregular, sparse, incomplete). <xref ref-type="bibr" rid="scirp.145550-10">
      [10]
     </xref> considered predicting disease progress with imprecise lab test results. <xref ref-type="bibr" rid="scirp.145550-11">
      [11]
     </xref> discussed time series prediction with Long Short-Term Memory (LSTM). <xref ref-type="bibr" rid="scirp.145550-12">
      [12]
     </xref> introduce fuzzy logics in biomedical prediction of time series. <xref ref-type="bibr" rid="scirp.145550-13">
      [13]
     </xref> demonstrated an advanced machine learning algorithm in diabetes dynamics. <xref ref-type="bibr" rid="scirp.145550-14">
      [14]
     </xref> used machine learning applied to blood glucose forecasting.</p>
    <p>We generate augmented data in effort to identify the most likely predicted value for BUN from 20 to 30. We have chosen the center at 25 (<xref ref-type="fig" rid="fig15">
      Figure 15
     </xref>). Now we can explore other possibilities. <xref ref-type="table" rid="table3">
      Table 3
     </xref> shows statistical data of other possibilities with different centers. <xref ref-type="fig" rid="fig16">
      Figure 16
     </xref> shows KDE curves and BUN = 22 was chosen in accordance with <xref ref-type="table" rid="table3">
      Table 3
     </xref>.</p>
    <fig id="fig15" position="float">
     <label>Figure 15</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 15. Histogram of augmented data center at BUN=25.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId63.jpeg?20250912020748" />
    </fig>
    <fig id="fig16" position="float">
     <label>Figure 16</label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Figure 16. KDE values for different prediction values.</title>
     </caption>
     <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/7302204-rId64.jpeg?20250912020748" />
    </fig>
    <table-wrap id="table3">
     <label>
      <xref ref-type="table" rid="table3">
       Table 3
      </xref></label>
     <caption>
      <title>
       <xref ref-type="bibr" rid="scirp.145550-"></xref>Table 3. Statistical characteristics of augmented data centered around various possibilities.</title>
     </caption>
     <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
      <tr> 
       <td class="custom-bottom-td acenter" width="24.78%"><p style="text-align:center">BUN</p></td> 
       <td class="custom-bottom-td acenter" width="24.87%"><p style="text-align:center">Center</p></td> 
       <td class="custom-bottom-td acenter" width="26.36%"><p style="text-align:center">Histogram</p></td> 
       <td class="custom-bottom-td acenter" width="24.78%"><p style="text-align:center">KDE</p></td> 
       <td class="custom-bottom-td acenter" width="25.06%"><p style="text-align:center">LogProb</p></td> 
       <td class="custom-bottom-td acenter" width="24.81%"><p style="text-align:center">Score</p></td> 
      </tr> 
      <tr> 
       <td class="custom-top-td acenter" width="24.78%"><p style="text-align:center">22</p></td> 
       <td class="custom-top-td acenter" width="24.87%"><p style="text-align:center">22</p></td> 
       <td class="custom-top-td acenter" width="26.36%"><p style="text-align:center">166</p></td> 
       <td class="custom-top-td acenter" width="24.78%"><p style="text-align:center">0.168</p></td> 
       <td class="custom-top-td acenter" width="25.06%"><p style="text-align:center">−0.99</p></td> 
       <td class="custom-top-td acenter" width="24.81%"><p style="text-align:center">0.997183</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">24</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">24</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">166</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.168</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−1.02</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.996479</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">20</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">20</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">166</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.168</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−1.70</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.972535</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">26</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">26</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">166</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.168</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−1.76</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.970423</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">23</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">24</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">160</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.160</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−0.92</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.967261</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">21</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">22</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">160</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.160</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−1.26</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.955290</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">25</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">26</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">160</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.160</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−1.30</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.953881</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">18</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">18</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">166</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.169</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−3.13</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.922535</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">23</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">22</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">153</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.148</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−0.92</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.921718</p></td> 
      </tr> 
      <tr> 
       <td class="acenter" width="24.78%"><p style="text-align:center">19</p></td> 
       <td class="acenter" width="24.87%"><p style="text-align:center">20</p></td> 
       <td class="acenter" width="26.36%"><p style="text-align:center">160</p></td> 
       <td class="acenter" width="24.78%"><p style="text-align:center">0.160</p></td> 
       <td class="acenter" width="25.06%"><p style="text-align:center">−2.32</p></td> 
       <td class="acenter" width="24.81%"><p style="text-align:center">0.917966</p></td> 
      </tr> 
     </table>
    </table-wrap>
    <p>To decide which BUN value to choose, you want the value that best balances:</p>
    <p>From this Table, it is seen that the next BUN value will be 22 which very well approximates the PM GenAI prediction (<xref ref-type="fig" rid="fig10">
      Figure 10
     </xref>).</p>
    <p>BUN values 22, 24, 20, and 26 all have the same KDE value (0.168) and identical histogram count (166). However, BUN = 22 was still chosen because of its higher (less negative) log-probability, which leads to the highest weighted score. BUN = 22 served as the initial condition for AI time-series prediction.</p>
    <p>The method is called “Self-Adaptive Informed Healthcare Algorithm” because it concurrently uses synthetic data and stochastic property of the recorded data. The method gives a number of solutions, but the choice is based on the maximum of log probability.</p>
   </sec>
  </sec><sec id="s3">
   <title>3. Results</title>
   <p>Predicting laboratory test results is becoming increasingly vital in modern health-care due to its potential to improve diagnostic accuracy, optimize resource allocation, and support proactive clinical decision-making. As healthcare systems shift toward precision medicine and data-driven strategies, predictive modeling of lab tests—using machine learning, artificial intelligence, and statistical methods—offers transformative advantages across several domains:</p>
   <p>Predictive models can forecast abnormal lab values before they manifest clinically. For instance, trends in blood urea nitrogen (BUN), creatinine, or glucose levels can indicate impending kidney dysfunction or diabetes. Early warnings allow clinicians to intervene before the disease progresses, reducing morbidity and associated healthcare costs.</p>
   <p>By anticipating which lab results are likely to be within normal limits, predictive algorithms can reduce redundant or low-yield testing. This not only minimizes patient discomfort and financial burden but also alleviates the strain on laboratory resources.</p>
   <p>Prediction models can help tailor patient management plans based on expected future values. For example, forecasting a drop in hemoglobin or rise in inflammatory markers can prompt preemptive action such as medication adjustment or further diagnostics, improving patient outcomes.</p>
   <p>Integrating lab prediction tools into electronic health record (EHR) systems empowers clinicians with real-time, data-driven insights. This enhances diagnostic accuracy, especially in complex cases where patterns in lab data may not be immediately obvious.</p>
   <p>For patients with chronic illnesses (e.g., diabetes, renal failure, or cardiovascular disease), predictive modeling of lab trends supports longitudinal care. Forecasts of future lab values help monitor disease progression, treatment efficacy, and risk of complications.</p>
   <p>In virtual care settings, where physical lab access may be limited, predicting lab results allows providers to make informed decisions even in the absence of immediate testing, ensuring continuity and quality of care.</p>
   <p>On a broader scale, predictive lab analytics can be used to monitor public health trends, identify emerging disease patterns, and allocate resources effectively. This is especially relevant in epidemic preparedness and management.</p>
   <p>Predictive models serve as educational tools for students and medical trainees, helping them understand the interplay between symptoms, disease progression, and laboratory findings.</p>
   <p>The ability to predict laboratory test results represents a paradigm shift in clinical medicine—from reactive care to anticipatory care. As computational methods and access to large-scale health data improve, predictive lab testing will become a cornerstone of intelligent, efficient, and patient-centered healthcare delivery.</p>
   <p>It is well known that many conventional prediction methods fail to give accurate results <xref ref-type="bibr" rid="scirp.145550-15">
     [15]
    </xref>. In most existing studies, the algorithm is designed under the implicit assumption that the target value for each sample is the only correct outcome, and the data quality is high (completeness, uncertainty-free, etc.). However, this assumption does not hold for laboratory test data known by irregularities and a lack regular time periods of timestamps between observations.</p>
   <p>In this paper, we address the challenge of developing a robust predictive model that accounts for both measurement imprecision, lack of regular temporal information, and data sparsity. We introduce a novel accurate prediction algorithm SAIHA, which assigns probabilistic weight to each value within the defined range of augmented data, reflecting its likelihood of being the true value. The method is controlled by a stochastic controller that defines a range of possible values beyond the historical time of lab data defining the best choice within the imprecision range (<xref ref-type="fig" rid="fig16">
     Figure 16
    </xref>). The difference between SAIHA and PM GenAI is that SAIHA gives accurate results for each individual feature prediction, while PM GenAI is focused on the accuracy of the prediction of the combination of all features.</p>
   <p>Experiments conducted on real-world datasets (chemical blood panel) demonstrate that the SAIHA can achieve higher accuracy across various tasks and learning algorithms. Additionally, our approach yields more stable and consistent predictions when test samples originate from within imprecision ranges or exhibit minimal temporal variation.</p>
  </sec><sec id="s4">
   <title>4. Discussion</title>
   <sec id="s4_1">
    <title>4.1. Description</title>
    <p>This paper tackles a critical challenge in healthcare analytics: the prediction of Basic Metabolic Panel (BMP) values from sparse and irregular clinical data, which is a common issue in real-world Electronic Health Records (EHRs).</p>
    <p>Accurate forecasting of BMP components—such as blood urea nitrogen, creatinine, and electrolytes—is vital for monitoring metabolic function, especially in patients with chronic or acute conditions. However, the sparsity and irregular sampling of clinical lab data hinder the effectiveness of traditional time-series models.</p>
    <p>To address this, the study introduces a multi-component methodology consisting of 1) data interpolation, 2) Gaussian Mixture Model (GMM)-based data augmentation, and 3) the development of two novel predictive algorithms: PM GenAI (Predictive Modeling with Generative AI) and SAIHA (Self-Adaptive Informed Healthcare Algorithm).</p>
    <p>The interpolation step ensures a more continuous representation of patient data, while the GMM augmentation enriches the dataset with plausible synthetic samples that reflect the underlying statistical distribution of the observed values. This addresses the data sparsity problem and improves the robustness of downstream prediction models.</p>
    <p>The core contribution lies in the design and application of PM GenAI and SAIHA. These algorithms are specifically tailored to leverage both real and synthetic data for enhanced predictive performance. PM GenAI integrates generative modeling with feature-aware prediction, while SAIHA introduces a self-adaptive mechanism that dynamically adjusts to the patient’s evolving clinical state.</p>
   </sec>
   <sec id="s4_2">
    <title>4.2. Limitations</title>
    <p>While this study presents a novel and promising methodology for predicting Basic Metabolic Panel (BMP) values using sparse and irregular clinical data, several limitations must be acknowledged:</p>
    <p>The methodology was evaluated on a single patient case study. While this allowed for a detailed analysis, it limits the generalizability of the findings. The variability across patient populations, disease profiles, and care pathways may significantly affect the model’s performance in broader clinical settings.</p>
    <p>The PM GenAI and SAIHA algorithms, while powerful, introduce an additional layer of model complexity that may hinder interpretability for clinicians. Clinical deployment requires transparency and explainability, which remains a challenge for generative and adaptive models.</p>
    <p>The proposed methodology has not yet been validated on independent datasets or across multiple healthcare institutions. External validation is necessary to ensure robustness, reproducibility, and applicability in diverse clinical environments.</p>
   </sec>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.145550-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Whiting, P.F., Rutjes, A.W.S., Westwood, M.E. and Mallett, S. (2013) A Systematic Review Classifies Sources of Bias and Variation in Diagnostic Test Accuracy Studies. Journal of Clinical Epidemiology, 66, 1093-1104. &gt;https://doi.org/10.1016/j.jclinepi.2013.05.014
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Watson, J., de Salis, I., Banks, J. and Salisbury, C. (2017) What Do Tests Do for Doctors? A Qualitative Study of Blood Testing in UK Primary Care. Family Practice, 34, 735-739. &gt;https://doi.org/10.1093/fampra/cmx051
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Kravitz, R.L. and Callahan, E.J. (2000) Patients’ Perceptions of Omitted Examinations and Tests. Journal of General Internal Medicine, 15, 38-45. &gt;https://doi.org/10.1046/j.1525-1497.2000.12058.x
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     van Bokhoven, M.A., Pleunis-van Empel, M.C., Koch, H., Grol, R.P., Dinant, G. and van der Weijden, T. (2006) Why Do Patients Want to Have Their Blood Tested? A Qualitative Study of Patient Expectations in General Practice. BMC Family Practice, 7, Article No. 75. &gt;https://doi.org/10.1186/1471-2296-7-75
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Watson, J., Round, A. and Hamilton, W. (2012) Raised Inflammatory Markers. BMJ, 344, e454-e454. &gt;https://doi.org/10.1136/bmj.e454
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Riikonen, J.M., Guyatt, G.H., Kilpeläinen, T.P., Craigie, S., Agarwal, A., Agoritsas, T., et al. (2019) Decision Aids for Prostate Cancer Screening Choice: A Systematic Review and Meta‐Analysis. JAMA Internal Medicine, 179, 1072-1082. &gt;https://doi.org/10.1001/jamainternmed.2019.0763
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     De Melo, P. (2025) Data Evaluation in Artificial Intelligence. Journal of Data Analysis and Information Processing, 13, 281-297.
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Melo, P. (2025) Accurate Diagnostics of Lung Cancer Using Prime Model Generative Ai. Cancer Research Journal, 13, 81-97. &gt;https://doi.org/10.11648/j.crj.20251303.11
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     De Melo, P. (2024) Public Health Informatics and Technology. Library of Congress.
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Wang, M., Lin, Z., Li, R., Li, Y. and Su, J. (2022) Predicting Disease Progress with Imprecise Lab Test Results. Artificial Intelligence in Medicine, 132, Article 102373. &gt;https://doi.org/10.1016/j.artmed.2022.102373
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Yang, H., Pan, Z. and Tao, Q. (2017) Robust and Adaptive Online Time Series Prediction with Long Short-Term Memory. Computational Intelligence and Neuroscience, 2017, Article ID: 9478952. &gt;https://doi.org/10.1155/2017/9478952
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Salimi-Badr, A. and Ebadzadeh, M.M. (2022) A Novel Self-Organizing Fuzzy Neural Network to Learn and Mimic Habitual Sequential Tasks. IEEE Transactions on Cybernetics, 52, 323-332. &gt;https://doi.org/10.1109/tcyb.2020.2984646
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref13">
    <label>13</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Woldaregay, A.Z., Årsand, E., Walderhaug, S., Albers, D., Mamykina, L., Botsis, T., et al. (2019) Data-Driven Modeling and Prediction of Blood Glucose Dynamics: Machine Learning Applications in Type 1 Diabetes. Artificial Intelligence in Medicine, 98, 109-134. &gt;https://doi.org/10.1016/j.artmed.2019.07.007
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref14">
    <label>14</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ahmed, A., Aziz, S., Abd-alrazaq, A., Farooq, F., Househ, M. and Sheikh, J. (2023) The Effectiveness of Wearable Devices Using Artificial Intelligence for Blood Glucose Level Forecasting or Prediction: Systematic Review. Journal of Medical Internet Research, 25, e40259. &gt;https://doi.org/10.2196/40259
    </mixed-citation>
   </ref>
   <ref id="scirp.145550-ref15">
    <label>15</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Tejedor, M., Woldaregay, A.Z. and Godtliebsen, F. (2020) Reinforcement Learning Application in Diabetes Blood Glucose Control: A Systematic Review. Artificial Intelligence in Medicine, 104, Article 101836. &gt;https://doi.org/10.1016/j.artmed.2020.101836
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>