<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    jdaip
   </journal-id>
   <journal-title-group>
    <journal-title>
     Journal of Data Analysis and Information Processing
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    2327-7211
   </issn>
   <issn publication-format="print">
    2327-7203
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/jdaip.2025.133019
   </article-id>
   <article-id pub-id-type="publisher-id">
    jdaip-144890
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Computer Science 
     </subject>
     <subject>
       Communications, Physics 
     </subject>
     <subject>
       Mathematics
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    A Prediction Model for Estimating Egg Hatch Rate for Ghanaian Farmers Using Machine Learning
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Abraham
      </surname>
      <given-names>
       Atosona
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Stephen
      </surname>
      <given-names>
       Akobre
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Mohammed Ibrahim
      </surname>
      <given-names>
       Daabo
      </given-names>
     </name>
    </contrib>
   </contrib-group> 
   <aff id="affnull">
    <addr-line>
     aDepartment of Computer Science, Department of Cyber Security and Computer Engineering Technology, School of Computing and Information Sciences, C. K. Tedam University of Technology and Applied Sciences, Navrongo, Ghana
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     28
    </day> 
    <month>
     07
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    13
   </volume> 
   <issue>
    03
   </issue>
   <fpage>
    306
   </fpage>
   <lpage>
    313
   </lpage>
   <history>
    <date date-type="received">
     <day>
      10,
     </day>
     <month>
      January
     </month>
     <year>
      2025
     </year>
    </date>
    <date date-type="published">
     <day>
      16,
     </day>
     <month>
      January
     </month>
     <year>
      2025
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      16,
     </day>
     <month>
      August
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    Egg loss is one of the major problems in the egg hatching industry. This study aims to support farmers in optimizing their egg hatch through the development of a prediction model. This is to enable local farmers to accurately predict the hatchability of eggs before loading them into the incubator, thereby minimizing losses and reducing time wastage. By employing advanced data analysis techniques, a predictive algorithm was developed to predict the hatchability of eggs based on various parameters, such as egg storage temperature, time of egg storage, quantity processed and other egg characteristics. This predictive model provides farmers with valuable insights into the potential outcome of the incubation process, enabling them to make informed decisions in selecting eggs for incubation. The prediction model exhibited a high level of accuracy in estimating hatchability for farmers. The practical implications of this research are significant as it helps local farmers minimize losses, reduce time wastage and improve the overall efficiency of egg hatch process. This research does not only contribute to the field of agricultural technology, but also provides practical solutions for sustainable farming practices.
   </abstract>
   <kwd-group> 
    <kwd>
     Hatchability
    </kwd> 
    <kwd>
      Algorithm
    </kwd> 
    <kwd>
      Incubation
    </kwd> 
    <kwd>
      Prediction
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction</title>
   <p>Poultry farming in developing countries is a significant source of revenue generation for smallholder farmers. Also, raising poultry in Africa plays a vital role in providing the protein requirements for humans through its meat and eggs <xref ref-type="bibr" rid="scirp.144890-1">
     [1]
    </xref>.</p>
   <p>In modern times, farmers make use of egg incubators to hatch their eggs. This device provides ideal temperature and humidity in an enclosed environment for egg embryo development without human intervention <xref ref-type="bibr" rid="scirp.144890-2">
     [2]
    </xref>.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>In poultry farming, maximizing the effectiveness of incubation, early identification of viable eggs, reducing cost on wasted/spoiled eggs and boosting overall egg hatching while minimizing resource waste are very crucial to every farmer <xref ref-type="bibr" rid="scirp.144890-3">
     [3]
    </xref>. In recent times, Machine learning is used in the poultry industry to predict egg viability and hatchability, but much has not been achieved in Ghana locally. This technology has been shown to achieve accuracy rates of over 90% in predicting hatchability, making it a useful tool for hatcheries to optimize production and reduce waste and costs <xref ref-type="bibr" rid="scirp.144890-4">
     [4]
    </xref>. The use of artificial intelligence, machine learning, among other innovative tools, has allowed us to predict early hatching rates with remarkable accuracy by improving hatchability. Electricity, time and space are saved by preventing unnecessary waiting for infertile eggs in the incubator not to hatch <xref ref-type="bibr" rid="scirp.144890-5">
     [5]
    </xref>.</p>
   <p>This study aims to develop a hatch prediction model using linear regression to help farmers select good eggs for incubation to avoid wastage of eggs, time and other resources.</p>
  </sec><sec id="s2">
   <title>2. Related Literature</title>
   <p>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>Incubation is a critical aspect of poultry farming, as it allows farmers to hatch eggs under controlled conditions and ensure the best possible hatching rates. It is therefore necessary that farmers are able to predict the hatchability of the eggs to be incubated. Machine learning algorithms such as regression, decision trees or neural networks can be used to create a model that can predict the optimal results based on the data that one collected. <xref ref-type="bibr" rid="scirp.144890-6">
     [6]
    </xref> considered the online-prediction kind of standard problem of linear regression. The objective of the researchers was to estimate subsequent responses given the relevant explanatory variables and all of the prior observations. The researchers were mainly interested in predicting intervals instead of point predictions. <xref ref-type="bibr" rid="scirp.144890-7">
     [7]
    </xref> opined that the prediction distribution is a good basis for predictive inferences applied to several real-world situations. This is a distribution of the unobserved future response(s) conditional on a set of realized responses from an informative experiment. The Bayesian approach was used by the researchers to further derive the prediction distribution(s) for the multiple linear regression model. The Bayesian approach was used by the researchers to further derive the prediction distribution(s) for the multiple linear regression model. Linear regression is a popular machine learning-based prediction tool used in agriculture to predict crop yield. A study was conducted by <xref ref-type="bibr" rid="scirp.144890-8">
     [8]
    </xref> aimed to predict corn yield using linear regression. The study utilized various predictors, such as soil characteristics, weather conditions, and crop management practices, among others. The results showed that linear regression is a very powerful tool in the prediction of crop yield. The study found that soil characteristics were also significant predictors of corn yield.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>Machine learning techniques have also been applied to poultry productivity prediction. <xref ref-type="bibr" rid="scirp.144890-9">
     [9]
    </xref> conducted a review of various machine learning methods, including linear regression, neural networks and support vector machines for this purpose. Their study found that linear regression models are effective in predicting poultry productivity when appropriate features are selected and model parameters are optimized. Another literature review on this topic was conducted by <xref ref-type="bibr" rid="scirp.144890-10">
     [10]
    </xref>, who discussed the various types of linear regression models that have been used for predicting poultry performance. They analyzed the strengths and limitations of different models, such as simple linear regression, multiple linear regression, and polynomial regression, and concluded that the choice of model depends on the specific prediction task and data characteristics. Furthermore, <xref ref-type="bibr" rid="scirp.144890-11">
     [11]
    </xref> examined the application of linear regression models in predicting egg hatchability. Their study revealed that linear regression models can effectively predict egg hatchability by incorporating various factors, such as incubation temperature, humidity, and egg weight. They also highlighted the importance of using appropriate feature selection techniques and sample sizes to ensure accurate predictions. A study by <xref ref-type="bibr" rid="scirp.144890-12">
     [12]
    </xref> investigated the effectiveness of linear regression models in predicting egg hatchability in poultry farming. The authors analyzed various factors that impact egg hatchability, including egg weight, egg shape index, and incubation temperature. They developed a multiple linear regression model to predict hatchability and compared its performance with that of a simple linear regression model. The study found that the multiple linear regression model achieved a higher accuracy in predicting egg hatchability than the simple linear regression model. However, the authors also noted that the predictive power of the model was affected by the quality and quantity of the input data, as well as the model parameters.</p>
   <p>After a critical review of related studies, it has been established that Linear Regression is a crucial tool in machine learning as it has been used in numerous prediction studies. Overall, machine learning has the potential to revolutionize the poultry farming industry by providing insights into the production process, predicting outcomes with high accuracy, improving efficiency, disease management, and animal welfare. While there are challenges associated with implementing machine learning in poultry farming, such as data availability and model interpretability, the benefits are significant and warrant further investigation.</p>
  </sec><sec id="s3">
   <title>3. Methodology</title>
   <p>This study aimed to predict egg hatch rate using linear regression. The study collected data on egg viability/hatchability history from two hatchery centers in Tamale in Northern Region and Kumasi in the Ashanti Region of Ghana. The study combines data from Tamale (extreme weather) and Kumasi (favorable weather), making it robust and representative for conclusions across Ghana.</p>
   <p>One thousand and fifty (1050) dataset was collected from the two centers. From the data collected, several independent variables were extracted, including quantity of eggs processed, breed of bird, age of birds, breeding nature (free range/intensive), egg storage temperature, egg duration of storage, transport distance, average size of egg collected, fertility and hatch rate, with hatch rate as the dependent variable. These features were considered crucial by the researcher as they are major factors that drive egg hatch rate in the incubation. <xref ref-type="table" rid="table1">
     Table 1
    </xref> below shows the various descriptions of the variables that were used in this study.</p>
   <table-wrap id="table1">
    <label>
     <xref ref-type="table" rid="table1">
      Table 1
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.144890-"></xref>Table 1. Description of data collected.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td acenter" width="37.46%"><p style="text-align:center">Independent Variable</p></td> 
      <td class="custom-bottom-td acenter" width="62.54%"><p style="text-align:center">Description</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="37.46%"><p style="text-align:center">Quantity of eggs processed</p></td> 
      <td class="custom-top-td acenter" width="62.54%"><p style="text-align:center">Number of eggs collected and loaded into the incubator at a season.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Breed of bird</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">This variable captures the specific breed or species of birds under consideration. Different breeds may exhibit variations in reproductive performance and hatchability.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Age of birds</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">This indicates age of the birds involved in egg production.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Breeding nature</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Describes whether the birds are on free range breeding nature or intensive breeding nature.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Egg storage temperature</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Temperature at which eggs are stored before incubation.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Egg duration of storage</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Amount of time (days) the eggs stored before being loaded into the indicator.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Transport distance</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Distance or length of egg transportation.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Average size of egg collected</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Average physical size or weight of collected eggs.</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="37.46%"><p style="text-align:center">Fertility</p></td> 
      <td class="acenter" width="62.54%"><p style="text-align:center">Reproductive fertility of the birds.</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <sec id="s3_1">
    <title>3.1. Data Preprocessing</title>
    <p>Collected data was preprocessed before being used for the model training. All variables considered were used for the model. The preprocessing steps included removing missing or null values, removing duplicate entries and encoding categorical variables as numerical variables. The data was then divided into training dataset and test dataset with an 80:20 split ratio, respectively. The 80% of the data was used for the training and 20% used for testing the model.</p>
   </sec>
   <sec id="s3_2">
    <title>3.2. Training the Model</title>
    <p>The regression model was trained using the preprocessed dataset. The model was trained to predict hatch rate using the independent variables as predictors and performance of the model was evaluated on the test set using commonly used regression metrics such as mean squared error (MSE) and R-squared (R<sup>2</sup>).</p>
   </sec>
   <sec id="s3_3">
    <title>
     <xref ref-type="bibr" rid="scirp.144890-"></xref>3.3. Linear Regression as a Prediction</title>
    <p>Linear regression can be used as a prediction tool by fitting a linear equation to a set of data points. This equation can then be used to predict the value of the dependent variable for new values of the independent variables.</p>
    <p>The linear regression model assumes that there is a linear relationship between the dependent variable and the independent variables. This means that the change in the dependent variable is proportional to the change in the independent variables.</p>
    <p>Linear regression is used as a prediction tool in this study as the dependent variable which is the hatch rate is continuous in nature and there is no existence of multicollinearity between the independent variables.</p>
    <p>In developing an efficient model, data collected undergoes wrangling and subsequently trained using Linear Regression to ensure accurate model generation. Independent variables for training the model are: quantity of eggs, breed of bird, age of egg, mode of breeding, storage temperature, duration of storage, egg transport distance, average size of egg and egg fertility. These were subjected to heatmap correlation analysis and Pair Plot, which yielded the positive results, appropriate for training the model.</p>
   </sec>
   <sec id="s3_4">
    <title>
     <xref ref-type="bibr" rid="scirp.144890-"></xref>3.4. Hatch Rate Prediction Model</title>
    <p>The linear regression equation used to predict hatch rate of eggs is:</p>
    <p>Y = b<sub>0</sub> + b<sub>1</sub>X<sub>1</sub> + b<sub>2</sub>X<sub>2</sub> + b<sub>3</sub>X<sub>3</sub> + b<sub>4</sub>X<sub>4</sub> + b<sub>5</sub>X<sub>5</sub>+ b<sub>6</sub>X<sub>6</sub>+ b<sub>7</sub>X<sub>7</sub>+ b<sub>8</sub>X<sub>8</sub>+ b<sub>9</sub>X<sub>9</sub> (1)</p>
    <p>where:</p>
    <p>Y is dependent variable (hatch rate of eggs) and X<sub>1</sub>, X<sub>2</sub>, X<sub>3</sub>, X<sub>4</sub>, X<sub>5</sub>, X<sub>6</sub>, X<sub>7</sub>, X<sub>8</sub>, X<sub>9</sub> are the independent variables (Quantity of eggs, Breed, Age of birds, Free Range, Storage temperature, Time of storage, Distance, Average Size, Fertility), respectively.</p>
    <p>b<sub>0</sub>, b<sub>1</sub>, b<sub>2</sub>, b<sub>3</sub>, b<sub>4</sub>, and b<sub>5</sub> are regression coefficients estimated using ordinary least squares method.</p>
   </sec>
  </sec><sec id="s4">
   <title>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>4. Results and Discussions</title>
   <p>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>The Hatch Rate Prediction Model is a critical component of our Indigenous Smart Incubator, which uses machine learning algorithms to analyze and predict the viability of eggs in real-time. The goal of our model is to improve hatch rates, reduce production costs, and increase productivity in the poultry industry.</p>
   <p>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>To establish a reliable model, the data collected were wrangled and subsequently trained using Linear Regression to ensure accurate model generation. The independent variables, which include quantity of eggs, breed of bird, age of egg, mode of breeding, storage temperature, duration of storage, egg transport distance, average size of egg and egg fertility, were subjected to heatmap correlation analysis, which yielded the following result.</p>
   <p>
    <xref ref-type="fig" rid="fig1">
     Figure 1
    </xref> indicates that maximum correlation of 0.41. This indicates there is no strong correlation between the independent variables and also, this suggests that linear regression can be applied to the dataset as there is no multicollinearity between the independent variables.</p>
   <fig id="fig1" position="float">
    <label>Figure 1</label>
    <caption>
     <title>Figure 1. Heatmap correlation analysis.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/2870777-rId15.jpeg?20250819013406" />
   </fig>
   <p>
    <xref ref-type="table" rid="table2">
     Table 2
    </xref> indicates that there is no strong positive or negative correlation between the independent variables. The maximum correlation 0.2615 exists between the free range and transport distance, which is still minimal and hence we can affirm that there is no multicollinearity between the independent variables. No multicollinearity between the independent variables implies that Linear Regression can be applied to the dataset.</p>
   <table-wrap id="table2">
    <label>
     <xref ref-type="table" rid="table2">
      Table 2
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.144890-"></xref>Table 2. Correlation values for the independent variables.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td acenter" width="12.03%"><p style="text-align:center">X.corr()</p></td> 
      <td class="custom-bottom-td acenter" width="9.81%"><p style="text-align:center">QUANTITY</p></td> 
      <td class="custom-bottom-td acenter" width="8.72%"><p style="text-align:center">BREED</p></td> 
      <td class="custom-bottom-td acenter" width="8.74%"><p style="text-align:center">AGE</p></td> 
      <td class="custom-bottom-td acenter" width="8.72%"><p style="text-align:center">FREE RANGE</p></td> 
      <td class="custom-bottom-td acenter" width="11.64%"><p style="text-align:center">STORAGE TEMPERATURE</p></td> 
      <td class="custom-bottom-td acenter" width="8.74%"><p style="text-align:center">TIME OF STORAGE</p></td> 
      <td class="custom-bottom-td acenter" width="10.51%"><p style="text-align:center">TRANSPORT DISTANCE</p></td> 
      <td class="custom-bottom-td acenter" width="12.26%"><p style="text-align:center">AVERAGESIZE OF EGG</p></td> 
      <td class="custom-bottom-td acenter" width="8.82%"><p style="text-align:center">FERTILITY</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="12.03%"><p style="text-align:center">QUANTITY</p></td> 
      <td class="custom-top-td acenter" width="9.81%"><p style="text-align:center">1.00000</p></td> 
      <td class="custom-top-td acenter" width="8.72%"><p style="text-align:center">−0.00122</p></td> 
      <td class="custom-top-td acenter" width="8.74%"><p style="text-align:center">0.042461</p></td> 
      <td class="custom-top-td acenter" width="8.72%"><p style="text-align:center">0.045087</p></td> 
      <td class="custom-top-td acenter" width="11.64%"><p style="text-align:center">0.172494</p></td> 
      <td class="custom-top-td acenter" width="8.74%"><p style="text-align:center">0.248807</p></td> 
      <td class="custom-top-td acenter" width="10.51%"><p style="text-align:center">0.091236</p></td> 
      <td class="custom-top-td acenter" width="12.26%"><p style="text-align:center">−0.17239</p></td> 
      <td class="custom-top-td acenter" width="8.82%"><p style="text-align:center">−0.01187</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">BREED</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">−0.001223</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">0.261389</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.1239</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">0.237366</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">0.130912</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">−0.150602</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">0.109286</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.02229</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">AGE</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">0.042461</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.261389</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.11159</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">−0.012272</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">0.144933</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">−0.038297</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">−0.064789</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.41339</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">FREE RANGE</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">0.045087</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.1239</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.11159</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">0.085716</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.13672</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">0.261487</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">−0.010552</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">0.134942</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">STORAGE TEMPERATURE</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">0.172494</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.237366</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.01227</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.085716</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">0.139141</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">−0.065788</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">0.065435</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.13178</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">TIME OF STORAGE</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">0.248807</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.130912</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">0.144933</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.13672</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">0.139141</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">−0.163669</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">−0.096745</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.0372</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">TRANSPORT DISTANCE</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">0.091236</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.1506</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.0383</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.261487</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">−0.065788</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.16367</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">0.211022</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.15251</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">AVERAGE SIZE OF EGG</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">−0.17239</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.109286</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.06479</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.01055</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">0.065435</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.09675</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">0.211022</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">1</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">−0.23979</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="12.03%"><p style="text-align:center">FERTILITY</p></td> 
      <td class="acenter" width="9.81%"><p style="text-align:center">−0.011873</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">−0.02229</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.41339</p></td> 
      <td class="acenter" width="8.72%"><p style="text-align:center">0.134942</p></td> 
      <td class="acenter" width="11.64%"><p style="text-align:center">−0.131784</p></td> 
      <td class="acenter" width="8.74%"><p style="text-align:center">−0.0372</p></td> 
      <td class="acenter" width="10.51%"><p style="text-align:center">−0.15251</p></td> 
      <td class="acenter" width="12.26%"><p style="text-align:center">−0.239793</p></td> 
      <td class="acenter" width="8.82%"><p style="text-align:center">1</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <sec id="s4_1">
    <title>4.1. Test of Prediction Model and Its Accuracy</title>
    <p>To verify the correctness of the model, 20% of the data was reserved and used for the model testing and the result of the testing was measured using the coefficient of determination (R<sup>2</sup>) regression formula, which yielded 0.6781.</p>
    <p>The high R-squared value of 67.81 in linear regression implies that the model explains 67.81% of the total variance in the dependent variable based on the independent variables included in the model.</p>
    <p>This value can be interpreted as the proportion of the variation in the dependent variable that can be accounted for by the independent variables in the model.</p>
   </sec>
   <sec id="s4_2">
    <title>
     <xref ref-type="bibr" rid="scirp.144890-"></xref>4.2. Mean Squared Error (MSE)</title>
    <p>The MSE takes into account the squared differences between the actual hatch rates and the predicted values, providing a measure of how spread out these errors are across the dataset. The mean squared error (MSE) was computed is 14.37. A lower MSE generally indicates better model performance, suggesting that the model’s predictions are closer to the true values.</p>
    <p>It can also be said that on average, the predictions made by the linear regression model are off by approximately 14.37 units in the context of hatch rate.</p>
   </sec>
   <sec id="s4_3">
    <title>4.3. Linear Regression Equation of the Model</title>
    <p>Intercept and coefficient of the independent variables, the linear regression equation of the trained model can be expressed as follows in Equation (2):</p>
    <p>
     <math display="inline" xmlns="http://www.w3.org/1998/Math/MathML"> <mtable columnalign="left"> 
       <mtr> 
        <mtd> 
         <mtext>
           Y 
         </mtext> 
         <mo>
           = 
         </mo> 
         <mn>
           169.1382 
         </mn> 
         <mo>
           − 
         </mo> 
         <mn>
           0.0111 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            1 
          </mn> 
         </msub> 
         <mo>
           − 
         </mo> 
         <mn>
           3.9850 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            2 
          </mn> 
         </msub> 
         <mo>
           − 
         </mo> 
         <mn>
           4.9637 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            3 
          </mn> 
         </msub> 
         <mo>
           + 
         </mo> 
         <mn>
           8.9641 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            4 
          </mn> 
         </msub> 
         <mo>
           − 
         </mo> 
         <mn>
           3.0556 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            5 
          </mn> 
         </msub> 
        </mtd> 
       </mtr> 
       <mtr> 
        <mtd> 
         <mtext> 
         </mtext> 
         <mo>
           − 
         </mo> 
         <mn>
           2.2422 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            6 
          </mn> 
         </msub> 
         <mo>
           + 
         </mo> 
         <mn>
           11.9571 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            7 
          </mn> 
         </msub> 
         <mo>
           + 
         </mo> 
         <mn>
           13.14785 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            8 
          </mn> 
         </msub> 
         <mo>
           + 
         </mo> 
         <mn>
           0.0038 
         </mn> 
         <msub> 
          <mtext>
            X 
          </mtext> 
          <mn>
            9 
          </mn> 
         </msub> 
        </mtd> 
       </mtr> 
      </mtable> 
     </math>(2)</p>
    <p>where:</p>
    <p>X<sub>1</sub> = Quantity of eggs, X<sub>2</sub> = Breed, X<sub>3</sub> = Age of birds, X<sub>4</sub> = Free range, X<sub>5</sub> = Storage temperature, X<sub>6</sub> = Time of storage, X<sub>7</sub> = Distance, X<sub>8</sub> = Average size, X<sub>9</sub> = Fertility.</p>
   </sec>
  </sec><sec id="s5">
   <title>
    <xref ref-type="bibr" rid="scirp.144890-"></xref>5. Conclusions</title>
   <p>This study aimed to develop a prediction model for estimating the egg hatch rate for local farmers using a linear regression model in machine learning. Through the collection and analysis of relevant data, including various independent variables such as the quantity of eggs processed, breed of bird, age of birds, breeding nature, egg storage temperature, egg duration of storage, transport distance, average size of eggs collected, fertility, and hatch rate, we were able to derive valuable insights.</p>
   <p>The correlation analysis, as demonstrated by the heatmap and pair plot, revealed a lack of multicollinearity among the independent variables, indicating that they contributed unique information to the prediction model. The R<sup>2</sup> value of 0.6781 indicated that approximately 67.81% of the variability in the egg hatch rate could be explained by the selected independent variables. Furthermore, the mean squared error of 14.37 indicated that the model’s predictions were reasonably accurate. This study ultimately makes a significant contribution to its scope as it helps farmers in Ghana appropriately predict egg hatch rates, thereby minimizing egg losses in the process of incubation.</p>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.144890-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Attia, Y.A., Rahman, M.T., Hossain, M.J., Basiouni, S., Khafaga, A.F., Shehata, A.A., et al. (2022) Poultry Production and Sustainability in Developing Countries under the COVID-19 Crisis: Lessons Learned. Animals, 12, Article 644. &gt;https://doi.org/10.3390/ani12050644
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Radhakrishnan, K., Jose, N., Sanjay, S.G, Cherian, T. and Vishnu, K.R. (2014) Design and Implementation of a Fully Automated Egg Incubator. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 3, 7666-7672. &gt;https://www.ijareeie.com/upload/2014/february/54_Radhakrishnan.pdf 
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Bolzan, A., Machado, R. and Piaia, J. (2008) Egg Hatchability Prediction by Multiple Linear Regression and Artificial Neural Networks. Revista Brasileira de Ciência Avícola, 10, 97-102. &gt;https://doi.org/10.1590/s1516-635x2008000200004
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Ayoub Shaikh, T., Rasool, T. and Rasheed Lone, F. (2022) Towards Leveraging the Role of Machine Learning and Artificial Intelligence in Precision Agriculture and Smart Farming. Computers and Electronics in Agriculture, 198, Article ID: 107119. &gt;https://doi.org/10.1016/j.compag.2022.107119
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Çevik, K.K., Koçer, H.E. and Boğa, M. (2022) Deep Learning Based Egg Fertility Detection. Veterinary Sciences, 9, Article 574. &gt;https://doi.org/10.3390/vetsci9100574
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Vovk, V., Nouretdinov, I. and Gammerman, A. (2009) On-line Predictive Linear Regression. The Annals of Statistics, 37, 1566-1590. &gt;https://doi.org/10.1214/08-aos622
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Rahman, A. and Khan, S. (2007) Prediction Distribution for Linear Regression Model with Multivariate Student-T Errors under the Bayesian Approach. &gt;https://www.researchgate.net/publication/257939560 
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Cubillas, J.J., Ramos, M.I., Jurado, J.M. and Feito, F.R. (2022) A Machine Learning Model for Early Prediction of Crop Yield, Nested in a Web Application in the Cloud: A Case Study in an Olive Grove in Southern Spain. Agriculture, 12, Article 1345. &gt;https://doi.org/10.3390/agriculture12091345
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Nosratabadi, S., Ardabili, S., Lakner, Z., Mako, C. and Mosavi, A. (2021) Prediction of Food Production Using Machine Learning Algorithms of Multilayer Perceptron and ANFIS. Agriculture, 11, Article 408. &gt;https://doi.org/10.3390/agriculture11050408
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Fontana, I., Tullo, E., Fernández, A.P. and Berckmans, D. (2023) Frequency Analysis of Vocalisation in Relation to Growth in Broiler Chicken. 7th European Conference on Precision Livestock Farming, Milan, September 2015, 3. &gt;https://www.researchgate.net/publication/281836267_Frequency_analysis_of_vocalisation_in_relation_to_growth_in_broiler_chicken 
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Klein, D.R., Vale, M.M.d., Silva, M.F.R.d., Kuhn, M.F., Branco, T. and Santos, M.P.d. (2020) Data Mining as a Hatchery Process Evaluation Tool. Scientia Agricola, 77, e2018007. &gt;https://doi.org/10.1590/1678-992x-2018-0074
    </mixed-citation>
   </ref>
   <ref id="scirp.144890-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Oleforuh-Okoleh, V.U. (2016) Hatchability Prediction in Chickens Using Some External Egg Quality Traits. Asian Journal of Animal Sciences, 10, 159-164. &gt;https://doi.org/10.3923/ajas.2016.159.164
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>