<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JCC</journal-id><journal-title-group><journal-title>Journal of Computer and Communications</journal-title></journal-title-group><issn pub-type="epub">2327-5219</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jcc.2015.33003</article-id><article-id pub-id-type="publisher-id">JCC-54707</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Gait-Ground Reaction Force Sensors Selection Based on ROC Curve Evaluation
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Rami</surname><given-names>Alkhatib</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Mohamad</surname><given-names>Diab</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Bassam</surname><given-names>Moslem</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Christophe</surname><given-names>Corbier</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Mohamed</surname><given-names>El Badaoui</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Laboratoire LASPI, Université Jean-Monnet, Roanne, France</addr-line></aff><aff id="aff2"><addr-line>Electrical and Computer Engineering Department, College of Engineering, Rafik Hariri University, Damour, Lebanon</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>rami.alkhatib@universite-lyon.fr(RA)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>17</day><month>03</month><year>2015</year></pub-date><volume>03</volume><issue>03</issue><fpage>13</fpage><lpage>19</lpage><history><date date-type="received"><day>December</day>	<month>2014</month></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
   Classification of normal gait from pathological gait as then can be used as indicator of falling among subjects requires the correct choice of sensor location in the insole. Such a flexi force- sensor can be used underneath foot to measure vertical ground reaction force. To start with, the most relevant information (parameters) that can characterize the recorded signals are extracted from the vertical ground reaction force signals. Then Receiver Operating Characteristic curve is used to evaluate the features upon 8 sensors underneath each foot located at different locations. To confirm results obtained, features are passed upon a chosen classifier, in this paper K-nearest neighbors algorithm is chosen. Results show that the sensor located at the inner arch of the sole of the foot (i.e. at the mid foot) holds the most relevant information needed for better classification compared to other sensors. 
 
</p></abstract><kwd-group><kwd>Gait Analysis</kwd><kwd> ROC Analysis</kwd><kwd> Sensor Selection</kwd><kwd> Gait Ground Reaction Force (GRF)</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Gait Ground reaction force (GRF) is widely used and modeled by many scientist [<xref ref-type="bibr" rid="scirp.54707-ref1">1</xref>] in order to extract valuable information. For example, it has been used for the purpose of differentiating between normal and Parkinson’s subjects [<xref ref-type="bibr" rid="scirp.54707-ref2">2</xref>].</p><p>However, still many studies consider the sensor location to measure the GRF either at both toe and heel or usually by analyzing the total force from sensors underneath each foot [<xref ref-type="bibr" rid="scirp.54707-ref2">2</xref>]. Though, a need to investigate more exploration of choosing the correct position of a sensor or set of sensors in the insole becomes crucial.</p><p>This is for the reason that some features are used for classification and better work when they are extracted from sensors data at a given location. In this paper, KNN classifier is used randomly as an example to verify the concept and not to test its power in classification. The selection of feature relayed on the features used in a number of papers for classification persistence such as skewness, kurtosis, median frequency and many more.</p><p>ROC curve is then used to evaluate each feature for all sensors. This could contribute to other research conducting in the goal of anticipating the risk of falling among people and especially among elderly [<xref ref-type="bibr" rid="scirp.54707-ref3">3</xref>].</p><p>In this paper the location of each sensor is studied independently in order to identify the sensor in the best location and which reveals the most relevant information needed for better classification compared to other sensor’s data signals.</p></sec><sec id="s2"><title>2. Gait Data―Vertical Ground Reaction Force (VGRF)</title><sec id="s2_1"><title>2.1. Database Description</title><p>VGRF database is obtained from PhysioNet [<xref ref-type="bibr" rid="scirp.54707-ref4">4</xref>]. A collection of signal measures VGRF (in Newton) as a function of time extracted from 8 sensors (Ultraflex Computer DuynoGraphy, Infotronic Inc.) underneath each of the right and left foot. In addition, two signals representing the summation of each of the 8 signals recorded from each foot are taken as a reference.</p><p>Sensor locations are shown in <xref ref-type="fig" rid="fig1">Figure 1</xref> inside the insole as lying approximately at the following (X, Y) coordinates measured as a person is comfortably standing with both legs parallel to each other. The origin (0, 0) is just between the legs and the person is facing towards the positive side of the Y axis.</p><p>Two groups of persons were recorded: the normal case, also named control and they count for 18 persons, and 29 patients with Parkinson disease. Each participant walked for two minutes at their own natural pace and with acquisition sampling rate of 100 Hz. <xref ref-type="fig" rid="fig2">Figure 2</xref> shows the force generated by eight sensors underneath right and left foot in addition to their total for both normal and Parkinson diseased person of age 72 years old.</p></sec><sec id="s2_2"><title>2.2. Preprocessing</title><p>Before utilizing data for any purpose, preprocessing must be performed on the data to remove any undesirable characteristics that produced during acquisition. For instance, Filtering is used in cleaning and removing any unwanted disturbance in GRF data. That’s because the presence of noise can totally mask the true information in data. In addition, it’s significant to eliminate sources of variation on the measured VGRF like the influence of mediolateral and anterior-posterior variations. For illustration, Butterworth low pass filter of second order can be used for that purpose [<xref ref-type="bibr" rid="scirp.54707-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.54707-ref6">6</xref>]. Such a filter is used to filter the data with cut-off frequency of 30 Hz. This choose is due to studying the mean frequency of high oscillations denoted as Intrinsic mode function given by empirical mode decomposition of each signal.</p><p>In the other side, if one variable is to be normalized to another variable, it is important to understand the relation between them [<xref ref-type="bibr" rid="scirp.54707-ref3">3</xref>]. That’s why, data normalized bynorm-2 as to make the data comparable to each other.</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Sensors location in both right and left insoles</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x5.png"/></fig><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> GRF signals from eight sensors underneath right and left foot for normal and Parkinson subject</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x6.png"/></fig></sec></sec><sec id="s3"><title>3. ROC Curve Overview</title><p>Plotting sensitivity or true positive rate (TPR) against fall-out (1-specificity) or false positive rate (FPR) exemplifies the performance of binary classifier. This is designated by ROC curve. Their calculation is given in equation one and two.</p><disp-formula id="scirp.54707-formula584"><label>(1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/54707x7.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.54707-formula585"><label>(2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/54707x8.png"  xlink:type="simple"/></disp-formula><p>where T refers to true whenever the prediction matches the actual situation. Therefore, TP is true positive, TN is true negative, FP is false positive and FN is false negative. Thus, with 95% confidence interval, the area under the curve (AUC) will reflect the accuracy on how well a feature could well separate normal gait from Parkinson’s gait. The following scores could be used to evaluate the accuracy:</p><p>0.9 - 1: Excellent</p><p>0.8 - 0.9: Good</p><p>0.7 - 0.8: Fair</p><p>0.6 - 0.7: Poor</p><p>0.5 - 0.6: Fail</p><p>As a result AUC = 1 refers to a perfect discrimination and has a ROC curve that passes through the upper left corner i.e. 100% sensitivity and 100% specificity with no overlap in the two distributions. <xref ref-type="fig" rid="fig3">Figure 3</xref> show a plot of the ROC curve for skewness feature tested over sensor 5 in the right foot among normal and Parkinson. Each point on the ROC curve represents the sensitivity and specificity pair corresponding to a particular decision threshold. The diagonal line dividing the ROC space is also called line of no-discrimination in which a point on this line corresponds to a completely guess. When points are above the diagonal, this indicates a good classification results and on the other side points below this line indicate a poor predictors. Therefore the distance from the random guess line is the best indicator of how much predictive power a method has.</p><p>The evaluation of skewness shows the following:</p><fig id="fig3"  position="float"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Skewness ability in discrimination between Normal and Parkinson Gait using ROC curve</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x9.png"/></fig><p>Using the above listed data, 0.9 as area under the curve indicates an excellent performance of skewness in discriminating normal from Parkinson using VGRF from sensor 5.</p></sec><sec id="s4"><title>4. Methodology</title><sec id="s4_1"><title>4.1. Feature Extraction</title><p>In this part, simple features were extracted after the data being pre-processed. Then their performance was tested. The features used here are used from literature, we found 23 features. In this paper the nine most relevant features were retained. A list of the commonly wide used features are shown below:</p><p>・ Mean: Signal averaging</p><p>・ Median: numerical value separating the higher half of a data from the lower half.</p><p>・ standard deviation: measures the amount of variation or dispersion from the average</p><p>・ range: the difference between the maximum and minimum values</p><p>・ Interquartile range: robust estimate for the spread of the data being equal to the difference between the upper and lower quartiles IQR = Q<sub>3</sub> − Q<sub>1</sub>.</p><p>・ 95% percentiles of the distribution of the signal. A percentile (or a centile) is a measure used in statistics indicating the value below which a given percentage of observations in a group of observations fall.</p><p>・ Skewness: measure of lack of symmetry</p><p>・ Kurtosis: measure of whether the data are peaked or flat relative to a normal distribution.</p><p>・ Power of the signal</p><p>・ Mean power frequency</p><p>・ Magnitude of peak frequency</p></sec><sec id="s4_2"><title>4.2. Features Evaluation by ROC</title><p>As too many statistical features could be extracted and evaluated in time domain analysis and frequency domain, in this section one feature is used to demonstrate the evaluation. However this is done for all features among all sensors for the 47 subjects. Let’s take for instance the skewness as in many studies related to vertical ground reaction force [<xref ref-type="bibr" rid="scirp.54707-ref2">2</xref>] shows its capability in distinguishing between normal and Parkinson’s diseased person.</p><p>The results of ROC evaluation among all the 47 subjects are shown in <xref ref-type="table" rid="table1">Table 1</xref> for each sensor.</p><p>Analyzing <xref ref-type="table" rid="table1">Table 1</xref> indicates thatunlike other studies similarto [<xref ref-type="bibr" rid="scirp.54707-ref2">2</xref>] that consider total summation of force signals from all sensors as the most important, however, its clearly shown that sensor 5 is the most important sensor (AUC = 0.9) to consider in building acquisition system to acquire data for analyses. <xref ref-type="fig" rid="fig4">Figure 4</xref> shows the ROC curve of the total ground reaction force from sensors of the right foot. The sensitivity is recorded to be 0.6552 while the specificity is 0.5556. This yield AUC = 0.5460 which refers a fail level of accuracy in classification.</p><p>This conclusion is generalized as the same procedure is applied over the rest of the features chosen in this study. If expanding data is needed, then adding sensor 7 and 6 corresponds as main sensors also to be considered for classification as shown in <xref ref-type="fig" rid="fig5">Figure 5</xref>.</p><p>Not to add, the average of the strides of the 2 classes corresponding to the 120 seconds of walking is also considered. As a result, each series of strides is represented by its average, that is, one stride. Next, 3 features were extracted: the amplitude of the first peak, that is, the peak that corresponds to the heel contact (in case of total force), time to the first peak, stride time. The ROC evaluation also infers a better accuracy for sensor 5 compared to other sensors.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> ROC evaluation of skewness among all sensors</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Sensor # in Foot</th><th align="center" valign="middle"  colspan="2"  >ROC-AUC</th></tr></thead><tr><td align="center" valign="middle" >Left Foot</td><td align="center" valign="middle" >Right Foot</td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >0.645594</td><td align="center" valign="middle" >0.655172</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >0.609195</td><td align="center" valign="middle" >0.637931</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >0.609195</td><td align="center" valign="middle" >0.613027</td></tr><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >0.701149</td><td align="center" valign="middle" >0.676245</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >0.904215</td><td align="center" valign="middle" >0.842912</td></tr><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >0.787356</td><td align="center" valign="middle" >0.764368</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >0.850575</td><td align="center" valign="middle" >0.808429</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >0.62069</td><td align="center" valign="middle" >0.703065</td></tr><tr><td align="center" valign="middle" >VGRF corresponds to summation of the 8 sensors</td><td align="center" valign="middle" >0.605364</td><td align="center" valign="middle" >0.611111</td></tr></tbody></table></table-wrap><fig id="fig4"  position="float"><label><xref ref-type="fig" rid="fig4">Figure 4</xref></label><caption><title> Skewness is extracted from the total force from sensors located under the right foot and its performance in binary classification is evaluated using ROC curve</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x10.png"/></fig><fig id="fig5"  position="float"><label><xref ref-type="fig" rid="fig5">Figure 5</xref></label><caption><title> The most important locations to acquire data for Gait analysis</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x11.png"/></fig><fig id="fig6"  position="float"><label><xref ref-type="fig" rid="fig6">Figure 6</xref></label><caption><title> Skewness and Median Frequency are extracted from signals of sensor 5 in mid-foot</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/54707x12.png"/></fig></sec><sec id="s4_3"><title>4.3. Verification</title><p>In order to verify results, the features are passed through a chosen classifier, k-nearest neighbors (KNN) in this case. As mentioned, KNN is not used to test its power in classification, but to test the power of sensor 5 in a given classifier. Ten subjects from each class are chosen as training and the rest are tested by the classifier.</p><p>In this study, this is done in two ways. First, select one sensor among all subjects and then choose two features randomly and iterate between them. The feature chosen will have a high score from ROC evaluation and then feed them to KNN-classifier one example is shown in <xref ref-type="fig" rid="fig6">Figure 6</xref>.</p><p>In a second case, fix the feature and iterate a number of sensors among the KNN classifier. The results of KNN classifier indicate an accuracy of around 83% on average in most cases where sensor five exist. Other sensor shows a relatively smaller value. While the total force when used shows an accuracy of around 15% smaller than sensor five.</p></sec></sec><sec id="s5"><title>5. Conclusion and Future Research</title><p>More attention to which sensor is chosen must be made. This is especially recommended when building an acquisition system. This study shows the sensor located at the inner arch of the sole of the foot (i.e. at the mid foot) near the axis of the center of body holds the most important information given certain features for classification. This could help more in using such sensor location to model walking by 3D-link dynamics as one foot is in contact with ground while the other is in swing phase. The classification of the VGRF between normal and Parkinson patient using classical feature is something to pay attention as features are chosen. In addition, more advanced parameters (features) should be extracted from the signals .Conventionally, a classifier, such as a neural network with rbf or an SVM will be used in later studies where the input of the classifier will be more than one feature prior to their use. Therefore, higher classification accuracy is expected to be obtained. Classification of the 8 signals individually from one foot then using a fusion technique may increase the classification accuracy.</p></sec><sec id="s6"><title>Cite this paper</title><p>Rami Alkhatib,Mohamad Diab,Bassam Moslem,Christophe Corbier,Mohamed El Badaoui, (2015) Gait-Ground Reaction Force Sensors Selection Based on ROC Curve Evaluation. Journal of Computer and Communications,03,13-19. doi: 10.4236/jcc.2015.33003</p></sec></body><back><ref-list><title>References</title><ref id="scirp.54707-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Minamisawa, T., Sawahata, H., Takakura, K. and Yamaguchi, T. (2012) Characteristics of Temporal Fluctuation of the Vertical Ground Reaction Force during Quiet Stance in Parkinson’s Disease. Gait &amp; Posture, 35, 308-311.  
http://dx.doi.org/10.1016/j.gaitpost.2011.09.106</mixed-citation></ref><ref id="scirp.54707-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Worobets, J.T. and Stefanyshyn, D.J. (2005) Normalizing Vertical Ground Reaction Force Peals to Body Weight in Heel-Toe Running. Human Performance Laboratory, Faculty of Kinesiology, University of Calgary.</mixed-citation></ref><ref id="scirp.54707-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Jeffry, M. and Research, F. (2005) Gait in Parkinson’s Disease. National Institutes of Health, National Parkinson Foundation, and the Parkinson’s Disease Foundation. http://www.physionet.org/</mixed-citation></ref><ref id="scirp.54707-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Beauchet, O., Allali, G., Berrut, G., Hommet, C., Dubost, V. and Assal, F. (2008) Gait Analysis in Demented Subjects: Interests and Perspectives. Neuropsychiatric Disease and Treatment, 4, 155-160.  
http://dx.doi.org/10.2147/NDT.S2070</mixed-citation></ref><ref id="scirp.54707-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Alkhatib, R., Diab, M., Moslem, B., Corbier, C. and ElBadaoui, M. (2014) Nonstationary Analysis of Gait-Ground Reaction Force Signals. 20th Anual LAAS Conference, AlHadath, Lebanon.</mixed-citation></ref><ref id="scirp.54707-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Sabri, K., El Badaoui, M., Guillet, F., Belli, A., Millet, G. and Morin, J. (2010) Cyclostationary Modeling of Ground Reaction Force Signals. Signal Processing, 90, 1146-1152.</mixed-citation></ref></ref-list></back></article>