<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JBiSE</journal-id><journal-title-group><journal-title>Journal of Biomedical Science and Engineering</journal-title></journal-title-group><issn pub-type="epub">1937-6871</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jbise.2016.910B017</article-id><article-id pub-id-type="publisher-id">JBiSE-70767</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject></subj-group></article-categories><title-group><article-title>
 
 
  A Hybrid Algorithm for Disease Association Study
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Bin</surname><given-names>Wei</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>Key Laboratory of Network &amp;amp; Information Security of APF, Engineering College of APF, Xi’an, China</addr-line></aff><author-notes><corresp id="cor1">* E-mail:</corresp></author-notes><pub-date pub-type="epub"><day>23</day><month>09</month><year>2016</year></pub-date><volume>09</volume><issue>10</issue><fpage>129</fpage><lpage>136</lpage><history><date date-type="received"><day>August</day>	<month>19,</month>	<year>2016</year></date><date date-type="rev-recd"><day>Accepted:</day>	<month>September</month>	<year>20,</year>	</date><date date-type="accepted"><day>September</day>	<month>23,</month>	<year>2016</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
   
   Single nucleotide polymorphisms (SNPs), which are the most common form of DNA variations, have great potential as a medical diagn
   ostic tool. However, compared to the number of SNPs involved, the available training data sets generally have a fairly small sample size, which is a main challenge to traditional data analysis methods. This paper proposed an improved univariate marginal distribution algorithm (UMDA) named multi-population UMDA (MPUMDA) for disease association study. In order to illustrate the effectiveness of our algorithm, we compared it with some current known methods, and the results showed that our method is potentially interesting as an alter-native tool in disease association study. 
  
 
</p></abstract><kwd-group><kwd>Single Nucleotide Polymorphisms</kwd><kwd> Disease Association Study</kwd><kwd> Feature Selection</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The risk of common diseases is likely determined by complex interplay between several genetics [<xref ref-type="bibr" rid="scirp.70767-ref1">1</xref>]. Common genetic variations, i.e. the single nucleotide polymorphisms (SNPs), are believed to modulate diseases susceptibility [<xref ref-type="bibr" rid="scirp.70767-ref2">2</xref>]. Therefore, the main task of human genetics is to understand the mapping relationship between SNPs and susceptibility of disease [<xref ref-type="bibr" rid="scirp.70767-ref3">3</xref>]. Knowing that an individual is (or is not) susceptible to (or belong to risk group for) a certain disease will help us greatly reduce the cost of preventive measures or even completely avoid disease development.</p><p>Although most of SNPs are neutral, certain of them are functional and affect the phenotype, such as skin color and different resistances to infection or drug responses [<xref ref-type="bibr" rid="scirp.70767-ref4">4</xref>]. The improvement of high-throughput SNPs genotyping methods generates huge quantities of SNPs data. However, only a small number of SNPs show strong correlation with a certain disease compared to the total number of SNPs investigated [<xref ref-type="bibr" rid="scirp.70767-ref5">5</xref>]. Thus feature selection is crucial for this study. Estimation of distribution algorithms (EDAs) are one of the population-based stochastic algorithms. In this paper, we propose an improved univariate marginal distribution algorithm (UMDA) named multi-popula- tion UMDA (MPUMDA) for diseases association study. The proposed algorithm was tested on three disease datasets: Crohn’s disease, Lung cancer and Tick-borne encephalitis [<xref ref-type="bibr" rid="scirp.70767-ref6">6</xref>]. The results were compared with those obtained by some other algorithms. Experimental results showed that the proposed method can improve classification accuracy remarkably.</p></sec><sec id="s2"><title>2. Background</title><sec id="s2_1"><title>2.1. Single Nucleotide Polymorphism</title><p>One single nucleotide of A, C, G or T in the DNA sequence is replace by any other 3 nucleotides, e.g., from CCCTAC to CCTTAC, we call this variation (C &#224; T) as single nucleotide polymorphism (SNP). It has the following three characteristics: 1) very common in the human genome (it is estimated that the number of SNPs in the human genome is about 12 million); 2) among the SNPs, two out of every three SNPs are the variation from cytosine (C) to thymine (T); 3) very stable from generation to generation.</p></sec><sec id="s2_2"><title>2.2. Diseases Association Study</title><p>Disease association study is to identify factors that may contribute to a medical condition by comparing the cases (subjects who have that condition) with controls (subjects who do not have the condition).</p><p>Assume that we have m samples (each sample has n SNPs/features and the disease status).</p><p>Let <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x2.png" xlink:type="simple"/></inline-formula> denote the value of each SNP, where 0 and 1 stand for homozygous sites with major and minor allele, respectively, and 2 stands for heterozygous sites.</p><p>We use <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x3.png" xlink:type="simple"/></inline-formula> to represent an individual sample, where<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x4.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x5.png" xlink:type="simple"/></inline-formula>(−1 stands for case and +1 stands for control), <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x6.png" xlink:type="simple"/></inline-formula>,<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x7.png" xlink:type="simple"/></inline-formula>.</p><p>Let <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x8.png" xlink:type="simple"/></inline-formula> represent the feature set, where <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x9.png" xlink:type="simple"/></inline-formula> correspond to the n SNPs.</p><p>The goal of this study is to select a least redundant feature subset<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x10.png" xlink:type="simple"/></inline-formula> with predicting power for disease status. Then the problem can be described as the following model.</p><disp-formula id="scirp.70767-formula72"><label>(1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/70767x11.png"  xlink:type="simple"/></disp-formula></sec></sec><sec id="s3"><title>3. Methods</title><sec id="s3_1"><title>3.1. Univariate Marginal Distribution Algorithm (UMDA)</title><p>EDAs have been proposed for the purpose of overcoming some drawbacks that classical EAs presented. The advantage of EDAs over classical EAs is the reduction in the number of parameters to be tuned. Univariate Marginal Distribution Algorithm (UMDA) presented for the first time by M&#252;hlenbein is one of the simplest algorithms in the EDAs family [<xref ref-type="bibr" rid="scirp.70767-ref7">7</xref>]. The UMDA sorts M individuals by their fitness value from high to low, and then selects N best ones to estimate the probability distribution <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x12.png" xlink:type="simple"/></inline-formula> at each generation. The probability distribution for the l-th iteration is defined as follow. The pseudocode of UMDA is presented in <xref ref-type="fig" rid="fig1">Figure 1</xref>.</p><disp-formula id="scirp.70767-formula73"><label>(2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/70767x13.png"  xlink:type="simple"/></disp-formula><p>where n is the number of variables, and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x14.png" xlink:type="simple"/></inline-formula></p></sec><sec id="s3_2"><title>3.2. Multi-Population UMDA</title><p>In UMDA, lack of diversity, particularly during the latter stages of the optimization, is the dominant factor for converging to local optimum solutions. To improve the capability of UMDA, multi-population UMDA (MPUMDA) is proposed in this paper. MPUMDA is an extension of traditional UMDA by dividing the population into three sub-groups (best, middle and worst) according to their performances. Each sub-group executes UMDA separately. At the end of each generation, sub-groups are reassigned to ensure information sharing.</p></sec><sec id="s3_3"><title>3.3. MPUMDA for Feature Selection and Parameters Optimization</title><p>The largest problems encountered in setting up the SVM model are how to select the kernel function and its parameters [<xref ref-type="bibr" rid="scirp.70767-ref8">8</xref>]. However, optimizing feature subsets and SVM model parameters respectively may not be reached the good results. Therefore, we use the MPUMDA to optimize the SVM model parameters and feature subsets simultaneously.</p><p>1) Individual representation</p><p>An individual of MPUMDA comprises two parts: the features mask and the SVM model parameters. The part of SVM model parameters consists of two sub-parts: kernel function type and other parameters. <xref ref-type="fig" rid="fig2">Figure 2</xref> shows the representation of an individual with N dimensions.</p><p>In <xref ref-type="fig" rid="fig2">Figure 2</xref>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x15.png" xlink:type="simple"/></inline-formula>represent features mask. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x16.png" xlink:type="simple"/></inline-formula>is the number of SNPs. For the feature mask, the bit with value ‘1’ indicates the SNP is selected and “0” indicates not.</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Pseudocode of univariate marginal distribution algorithm</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x17.png"/></fig><p><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x18.png" xlink:type="simple"/></inline-formula>represent kernel function types.</p><p>And <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x19.png" xlink:type="simple"/></inline-formula> represent the related kernel parameter and the penalty parameter C.</p><p>2) Fitness definition</p><p>Classification accuracy is often used as a criterion to design fitness function. However, due to the difficulty in obtaining clinical information on patients, the data used in disease association studies are often imbalanced. The traditional accuracy measure may be misleading since the all-negative classifier may achieve a good measure value. Thus, the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/70767x22.png" xlink:type="simple"/></inline-formula> is used as the fitness function which is able to deal with imbalanced dataset.</p></sec></sec><sec id="s4"><title>4. Experiments and Results</title><p>In this section, MPUMDA-SVM is test on three datasets: Crohn’s disease, Lung cancer and Tick-borne encephalitis [<xref ref-type="bibr" rid="scirp.70767-ref6">6</xref>] and the results are compared with several current known algorithms. The 5-fold method was used in this paper.</p><sec id="s4_1"><title>4.1. Datasets</title><p>In this subsection, we focus on the selection of the best combination of SNPs with maximal difference between case and control groups. Three datasets used in this study are summarized in <xref ref-type="table" rid="table1">Table 1</xref>.</p></sec><sec id="s4_2"><title>4.2. Results</title><p>Firstly, in order to verify the effectiveness of MPUMDA-SVM, we compared it with sequential forward search (SFS), UMDA with SVM and the method proposed in [<xref ref-type="bibr" rid="scirp.70767-ref8">8</xref>] (<xref ref-type="table" rid="table2">Table 2</xref>). The numbers of individuals and iterations were set to 60 and 100, respectively. It is clearly that, the accuracy of SVM with feature selection outperformed that without feature selection. This means that not all features are necessary to achieve total classification accuracy. The classification accuracy on Crohn’s disease data set obtained by MPUMDA-SVM was 93.3%, whereas, only 81.4%, 84.8% and 89.6% was obtained by SFS, UMDA or PSO with SVM, respectively. For the Lung cancer and Tick-borne</p><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> The expression of individual combining features mask and SVM model parameters</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x23.png"/></fig><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> The dataset description</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Dataset</th><th align="center" valign="middle" >Number of SNPs</th><th align="center" valign="middle" >Number of cases</th><th align="center" valign="middle" >Number of controls</th></tr></thead><tr><td align="center" valign="middle" >Crohn’s disease</td><td align="center" valign="middle" >103</td><td align="center" valign="middle" >144</td><td align="center" valign="middle" >243</td></tr><tr><td align="center" valign="middle" >Lung cancer</td><td align="center" valign="middle" >141</td><td align="center" valign="middle" >322</td><td align="center" valign="middle" >273</td></tr><tr><td align="center" valign="middle" >Tick-borne encephalitis</td><td align="center" valign="middle" >41</td><td align="center" valign="middle" >21</td><td align="center" valign="middle" >54</td></tr></tbody></table></table-wrap><p>encephalitis datasets, the classification accuracy obtained by our method (87.9% and 96.4%) was also better than that of other algorithms. Therefore, we may draw the conclusion that the MPUMDA is able to select more relevant SNPs.</p><p><xref ref-type="fig" rid="fig3">Figure 3</xref> shows that the number of SNPs can be reduced by MPUMDA-SVM. The percentage of SNPs selected was reduced to 0.068 and 0.171 for the Crohn’s disease and Tick-borne encephalitis datasets, respectively. However, for the Lung cancer dataset, the percentage of SNPs selected was only reduced to 0.588. That can be explained by the fact that Lung cancer is a more complex disease than the other two.</p><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> The performance of SVM with different feature selection methods</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Data set</th><th align="center" valign="middle"  rowspan="2"  >Evaluation Criteria</th><th align="center" valign="middle"  colspan="5"  >Feature selection + Classifier</th></tr></thead><tr><td align="center" valign="middle" >SVM</td><td align="center" valign="middle" >SFS + SVM</td><td align="center" valign="middle" >PSO + SVM [<xref ref-type="bibr" rid="scirp.70767-ref8">8</xref>]</td><td align="center" valign="middle" >UMDA + SVM</td><td align="center" valign="middle" >MPUMDA + SVM</td></tr><tr><td align="center" valign="middle"  rowspan="3"  >Crohn’s disease</td><td align="center" valign="middle" >Sn</td><td align="center" valign="middle" >71.6%</td><td align="center" valign="middle" >77.5%</td><td align="center" valign="middle" >76.4%</td><td align="center" valign="middle" >85.4%</td><td align="center" valign="middle" >89.6%</td></tr><tr><td align="center" valign="middle" >Sp</td><td align="center" valign="middle" >79.0%</td><td align="center" valign="middle" >83.7%</td><td align="center" valign="middle" >89.7%</td><td align="center" valign="middle" >92.2%</td><td align="center" valign="middle" >95.5%</td></tr><tr><td align="center" valign="middle" >Acc</td><td align="center" valign="middle" >76.3%</td><td align="center" valign="middle" >81.4%</td><td align="center" valign="middle" >84.8%</td><td align="center" valign="middle" >89.6%</td><td align="center" valign="middle" >93.3%</td></tr><tr><td align="center" valign="middle"  rowspan="3"  >Lung cancer</td><td align="center" valign="middle" >Sn</td><td align="center" valign="middle" >69.6%</td><td align="center" valign="middle" >72.5%</td><td align="center" valign="middle" >79.5%</td><td align="center" valign="middle" >87.6%</td><td align="center" valign="middle" >88.2%</td></tr><tr><td align="center" valign="middle" >Sp</td><td align="center" valign="middle" >80.7%</td><td align="center" valign="middle" >82.6%</td><td align="center" valign="middle" >77.6%</td><td align="center" valign="middle" >83.9%</td><td align="center" valign="middle" >87.6%</td></tr><tr><td align="center" valign="middle" >Acc</td><td align="center" valign="middle" >74.7%</td><td align="center" valign="middle" >77.1%</td><td align="center" valign="middle" >78.6%</td><td align="center" valign="middle" >85.9%</td><td align="center" valign="middle" >87.9%</td></tr><tr><td align="center" valign="middle"  rowspan="3"  >Tick-borne encephalitis</td><td align="center" valign="middle" >Sn</td><td align="center" valign="middle" >34.0%</td><td align="center" valign="middle" >67.0%</td><td align="center" valign="middle" >57.0%</td><td align="center" valign="middle" >76.0%</td><td align="center" valign="middle" >91.6%</td></tr><tr><td align="center" valign="middle" >Sp</td><td align="center" valign="middle" >85.1%</td><td align="center" valign="middle" >86.7%</td><td align="center" valign="middle" >87.1%</td><td align="center" valign="middle" >90.5%</td><td align="center" valign="middle" >98.3%</td></tr><tr><td align="center" valign="middle" >Acc</td><td align="center" valign="middle" >70.7%</td><td align="center" valign="middle" >81.1%</td><td align="center" valign="middle" >78.5%</td><td align="center" valign="middle" >86.5%</td><td align="center" valign="middle" >96.4%</td></tr></tbody></table></table-wrap><fig id="fig3"  position="float"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Number of feature (SNPs) selected for each of the 3 datasets (numbers on the bar indicate the percentage of selected SNPs)</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x24.png"/></fig><p>Finally, we did some further analysis using the selected SNPs. <xref ref-type="fig" rid="fig4">Figure 4</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref> shows the frequencies of 0, 1 and 2 in the selected SNPs for above three data sets. From <xref ref-type="fig" rid="fig4">Figure 4</xref> we can see that at 57, 86 and 95 the frequency of 2 in the controls group is higher than that in cases group. 1’s frequency is almost equal to zero at 9 and 85 in case and control group respectively. Those 7 SNPs combination is mostly association with Crohn’s disease. We can see from the <xref ref-type="fig" rid="fig5">Figure 5</xref> that almost in all the sites 0’s frequencies are higher in case group than controls. In addition, 1’s frequency is zero in SNP 1, which can be regarded as a risk factor of tick-borne encephalitis.</p><fig-group id="fig4"><label><xref ref-type="fig" rid="fig4">Figure 4</xref></label><caption><title> Frequencies of 3 types of genotype in the selected SNPs in Crohn’s disease dataset. (a) Frequency in cases group; (b) Frequency in controls group.</title></caption><fig id ="fig4_1"><label>(b)</label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x25.png"/></fig><fig id ="fig4_2"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x26.png"/></fig></fig-group><fig-group id="fig5"><label><xref ref-type="fig" rid="fig5">Figure 5</xref></label><caption><title> Frequencies of 3 types of genotype in the selected SNPs in tick-borne encephalitis dataset. (a) Frequency in cases group; (b) Frequency in controls group.</title></caption><fig id ="fig5_1"><label>(b)</label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x27.png"/></fig><fig id ="fig5_2"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/70767x28.png"/></fig></fig-group></sec></sec><sec id="s5"><title>5. Conclusion</title><p>Due to the high genotyping cost in association studies, it is desirable to find a least redundant subset of SNPs with the highest prediction accuracy for complex diseases. In this paper, a multi-population univariate marginal distribution algorithm (MPUMDA) is used to implement the feature selection, and support vector machine (SVM) serves as an evaluator of MPUMDA for disease association study. The merit of our proposed approach is that MPUMDA is capable of finding the optimal feature subset and SVM model parameters simultaneously. Experimental results show that the proposed approach achieves the highest classification accuracy for all the datasets when compared with some current known methods, that is, it is potentially interesting as an alternative tool in disease association study.</p></sec><sec id="s6"><title>Acknowledgements</title><p>The research is supported by Foundation of Engineering College of APF (WJY201518) and (JLX201648). Thanks to Dr. Dumitru Brinza for providing the dataset.</p></sec><sec id="s7"><title>Cite this paper</title><p>Wei, B. (2016) A Hybrid Algorithm for Disease Association Study. J. Biomedical Science and Engineering, 9, 129-136. http://dx.doi.org/10.4236/jbise.2016.910B017</p></sec></body><back><ref-list><title>References</title><ref id="scirp.70767-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Yang, X.Y., et al. (2015) Adiponectin Gene Polymorphisms Are Associated with Increased Risk of Colorectal Cancer. Medical Science Monitor, 21, 2595-2606.  
http://dx.doi.org/10.12659/MSM.893472</mixed-citation></ref><ref id="scirp.70767-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Yang, J.P., et al. (2015) Association Analysis of Genetic Variants of Adiponectin Gene and Risk of Pancreatic Cancer. International Journal of Clinical and Experimental Medicine, 8, 8094-8100.</mixed-citation></ref><ref id="scirp.70767-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Yi, H.G., et al. (2015) Com-parison of Dimension Reduction-Based Logistic Regression Models for Case-Control Genome-Wide Association Study: Principal Components Analysis vs. Partial Least Squares. Journal of biomedical Research, 29, 298-307.  
http://dx.doi.org/10.7555/JBR.29.20140043</mixed-citation></ref><ref id="scirp.70767-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Yu, W.B., Kwon, M.-S. and Taesung, P. (2015) Multivariate Quantitative Multifactor Di-mensionality Reduction for Detecting Gene-Gene Interactions. Human Heredity, 79, 168- 181. http://dx.doi.org/10.1159/000377723</mixed-citation></ref><ref id="scirp.70767-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Zhu, Z.H., et al. (2015) Dominance Genetic Variation Contributes Little to the Missing Heritability for Human Complex Traits. American Journal of Human Genetics, 96, 377-385.  
http://dx.doi.org/10.1016/j.ajhg.2015.01.001</mixed-citation></ref><ref id="scirp.70767-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Brinza, D. and Zelikovsky, A. (2008) Design and Validation of Methods Searching for Risk Factors in Genotype Case-Control Studies. Journal of Computational Biology, 15, 81-90.  
http://dx.doi.org/10.1089/cmb.2007.0081</mixed-citation></ref><ref id="scirp.70767-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Muehlenbein, H. (1997) The Equation for Response to Selection and Its Use of Prediction. Evolutionary Computation, 5, 303-346. http://dx.doi.org/10.1162/evco.1997.5.3.303</mixed-citation></ref><ref id="scirp.70767-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Lin, S.W., et al. (2008) Particle Swarm Optimization for Parameter Determination and Feature Selection of Support Vector Machines. Expert Systems with Applications, 35, 1817- 1824. http://dx.doi.org/10.1016/j.eswa.2007.08.088</mixed-citation></ref></ref-list></back></article>