<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OALibJ</journal-id><journal-title-group><journal-title>Open Access Library Journal</journal-title></journal-title-group><issn pub-type="epub">2333-9705</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/oalib.1108225</article-id><article-id pub-id-type="publisher-id">OALibJ-114145</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject><subject> Business&amp;Economics</subject><subject> Chemistry&amp;Materials Science</subject><subject> Computer Science&amp;Communications</subject><subject> Earth&amp;Environmental Sciences</subject><subject> Engineering</subject><subject> Medicine&amp;Healthcare</subject><subject> Physics&amp;Mathematics</subject><subject> Social Sciences&amp;Humanities</subject></subj-group></article-categories><title-group><article-title>
 
 
  Nature Scene Signs Recognition by Affine Scale-Invariant Feature Transform
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xin</surname><given-names>Li</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Liuguang</surname><given-names>Song</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Yuqi</surname><given-names>Sun</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Guangxi Key Laboratory of Embedded Technology and Intelligent System, Department of Information Science and Engineering, Guilin University of Technology, Guilin, China</addr-line></aff><aff id="aff1"><addr-line>Department of Information Science and Engineering, Guilin University of Technology, Guilin, China</addr-line></aff><pub-date pub-type="epub"><day>01</day><month>12</month><year>2021</year></pub-date><volume>08</volume><issue>12</issue><fpage>1</fpage><lpage>7</lpage><history><date date-type="received"><day>25,</day>	<month>November</month>	<year>2021</year></date><date date-type="rev-recd"><day>21,</day>	<month>December</month>	<year>2021</year>	</date><date date-type="accepted"><day>24,</day>	<month>December</month>	<year>2021</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  In this paper, it is presented an approach to automatic detection and recognition of signs from natural scenes, and its application to a sign recognition. The natural scene signs recognition is affected by the light changing, fading, being distorted on the surface of t signs. In this paper, a method is proposed that combined the improved scale invariant feature transform (SIFT) algorithm which reduces the high dimension and high complexity with Affine Scale Invariant Feature Transformation (ASIFT) algorithm under the natural scene sign detection and recognition. Experiments have proved that the method improved match number and matching time.
 
</p></abstract><kwd-group><kwd>ASIFT Algorithm</kwd><kwd> SIFT Algorithm</kwd><kwd> Traffic Signs Recognition</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Automatic recognition of traffic signs on the road can prevent the traffic accidents, and it is also important to The Smart Car on safe driving. As traffic signs usually consist of specific shapes (circles, squares, and triangles) and colors (red, blue, and yellow), which have significant visual effects in road environments, traffic sign detection methods can be divided into color-based, shape-based, and color-based methods [<xref ref-type="bibr" rid="scirp.114145-ref1">1</xref>]. However, given traffic signs cannot be effectively recognized while the color, scale, rotation, and light conditions are changed.</p><p>Recently, feature-based method has been considered to be developed into the method based on local invariant features. These methods of local features is to use of the constantness of the image rotation, translation and scaling. Currently, the most advanced method is based on local invariant features SIFT, Harris-Affine, Hessian-Affine and MSER [<xref ref-type="bibr" rid="scirp.114145-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.114145-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.114145-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.114145-ref6">6</xref>] [<xref ref-type="bibr" rid="scirp.114145-ref7">7</xref>]. And SIFT is better than all other methods [<xref ref-type="bibr" rid="scirp.114145-ref8">8</xref>], but only four parameters are invariant for a 6 parameter affine transformation [<xref ref-type="bibr" rid="scirp.114145-ref2">2</xref>]. These algorithms achieved good results in a limited range, but are prone to a large error rate. In addition, some problems of SIFT algorithm result in too long time-consuming in the detection and recognition of road signs and make real-time great impact. This paper combines improved SIFT algorithm [<xref ref-type="bibr" rid="scirp.114145-ref4">4</xref>] with ASIFT algorithm [<xref ref-type="bibr" rid="scirp.114145-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.114145-ref9">9</xref>] to solve effectively recognition while the color, scale, rotation, and light conditions are varied.</p></sec><sec id="s2"><title>2. Algorithm Frame Work Based on ASIFT and Improved SIFT</title><p>This paper proposes a traffic signs recognition algorithm that fused ASIFT and improved SIFT. Simply combined ASIFT with SIFT, although this can ensure accuracy, the speed of detection is slower. Fusing ASIFT and Improved SIFT, both can ensure accuracy and relatively increase the detection rate of speed, making good balance between those. The framework of the algorithm is shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>.</p><p>In the first stage, construct affine region by simulating all images in different perspectives (scale λ, the rotation latitude angle ψ and longitude angle Φ). The next stage, detect the key point using improved SIFT. Each key point can be described by circle SIFT feature descriptor-vector 1 &#215; 64. The next stage, match candidate key point by calculating Euclidean distance. In forth stage we will</p><p>focus on the recognition of sign by choosing match numbers more than 3. These stages will be repeated again until no sign occurs.</p></sec><sec id="s3"><title>3. ASIFT Algorithm</title><p>ASIFT invariant region-wide radiation detection, this approach comes from the SIFT detection, in the addition of two new camera optical axis parameters based on SIFT detection, the rotation latitude angle ψ and longitude angle Φ, can simulate all images in different perspectives. The whole process is as follows: 1) Each image is transformed by simulating all possible affine distortions caused by the change of camera optical axis orientation from a frontal position. For example, the images undergo Φ-rotations followed by tilts and ψ -rotations. 2) These rotations and tilts are performed for a finite and small number of latitude and longitude angles. 3) All simulated images are compared by a similarity invariant matching algorithm SIFT. 4) After above three steps, the original query and search images are yielded. Based on it, the affine transforms SIFT is selected to yield matches. All radiation transformation model can be seen in <xref ref-type="fig" rid="fig2">Figure 2</xref>.</p><p><xref ref-type="fig" rid="fig2">Figure 2</xref> shows the distortion model of the local image in conditions of the camera from the u(x, y) to the Au(x, y) (the decomposition of A).</p></sec><sec id="s4"><title>4. Improved SIFT Algorithm</title><p>During the process of ASIFT algorithm computing, using SIFT algorithm in the input image for the detection of key points. The algorithm mainly consists of four steps [<xref ref-type="bibr" rid="scirp.114145-ref3">3</xref>] (see <xref ref-type="fig" rid="fig3">Figure 3</xref>).</p><p>The first stage, assign the direction value to the key point, is to ensure the rotation invariance of descriptors. If the descriptor itself has a very good anti-rotation capability, then third stage, determine the direction of key points can be omitted. Taking into account the image is rotated, the region around feature points will change, and circle with good rotation invariance, so this paper use circle construct SIFT feature descriptor.</p><p>The new descriptor itself has anti-rotation capability, do not rotate the local</p><p>area to obtain rotation invariant of the descriptors, and decreased the dimension of the original feature describes vector from 1 &#215; 128 to 1 &#215; 64, further reducing algorithm complexity.</p></sec><sec id="s5"><title>5. Experimental Result and Analysis</title><p>In the experiment, we find candidate matching features by Euclidean distance function. When ASIFT feature vectors of traffic signs images captured in a natural scene are generated, using the Euclidean distance of key point of feature vectors as the similarity determines measure of the key points in the two images. Taken in a critical point in a picture, and find out first two key points which are shortest Euclidean distance with image B by traversing search, in these two key points, if the nearest distance divided by the near distance is small to a certain threshold, the match successfully. That the probability of right matching is decided by the number of the nearest distance divided by the near distance (dim/dim-1). Euclidean distance is expressed as follows:</p><p>d ( p , q ) = ∑ i = 1 n ( p i − q i ) 2 (1)</p><p>where, p = (p<sub>1</sub>, p<sub>2</sub>, ・・・, p<sub>n</sub>) and q = (q<sub>1</sub>, q<sub>2</sub>, ・・・, q<sub>n</sub>) is the two European 128 dimensional description of vector points.</p><p>Decrease the proportion dim/dim-1, the number of match points will reduce, but more stable. Here in all the matches at a ratio of distance is greater than 0.8 was rejected. Although this step is loss 5% of the correct match, it can eliminate 90% of the errors match, which is large meaning to match stability.</p><p>Finally, the selection process is the least square method based on the candidate matching points. Adjacent pixels are already established after ASIFT feature matching pretreatment and have a small number of connection points. If the projection of a key point through these parameters lies within half the error range that was used for the parameters in the Hough transform bins, the key point match is voted.</p><p>If fewer than 3 points remain after discarding outliers for a bin, then the object match is rejected. The least-squares fitting is repeated until no more rejections take place. At least 3 votes are identified as the presence of an object.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Matching number of SIFT and ASIFT</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Number</th><th align="center" valign="middle" >SIFT</th><th align="center" valign="middle" >Improved ASIFT</th></tr></thead><tr><td align="center" valign="middle" >(a) condition</td><td align="center" valign="middle" >12</td><td align="center" valign="middle" >38</td></tr><tr><td align="center" valign="middle" >(b) condition</td><td align="center" valign="middle" >3</td><td align="center" valign="middle" >64</td></tr><tr><td align="center" valign="middle" >(c) condition</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >74</td></tr><tr><td align="center" valign="middle" >(d) condition</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >47</td></tr></tbody></table></table-wrap><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Running time of SIFT and ASIFT</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Running time</th><th align="center" valign="middle" >SIFT</th><th align="center" valign="middle" >Improved ASIFT</th><th align="center" valign="middle" >Improved Efficiency</th></tr></thead><tr><td align="center" valign="middle" >(a) condition</td><td align="center" valign="middle" >124.36</td><td align="center" valign="middle" >79.19</td><td align="center" valign="middle" >36.32%</td></tr><tr><td align="center" valign="middle" >(b) condition</td><td align="center" valign="middle" >168.41</td><td align="center" valign="middle" >98.26</td><td align="center" valign="middle" >41.65%</td></tr><tr><td align="center" valign="middle" >(c) condition</td><td align="center" valign="middle" >61.57</td><td align="center" valign="middle" >40.12</td><td align="center" valign="middle" >34.84%</td></tr><tr><td align="center" valign="middle" >(d) condition</td><td align="center" valign="middle" >110.84</td><td align="center" valign="middle" >67.55</td><td align="center" valign="middle" >39.06%</td></tr></tbody></table></table-wrap><p>The decision of whether to cluster a new training image with an existing model view is based on comparing e to a threshold T, T = 0.05 &#215; size.</p><p>To illustrate the superiority of the algorithm, we compared ASIFT algorithm with improved SIFT, and with normal SIFT, also with the SIFT algorithm separately.</p><p>In this case, about 12 to 20 images were taken of each object around at least a hemisphere of viewing directions, such as tilt varying from 1 to 5.8, namely latitude angle from 0˚ to 80˚, longitude angle from 0˚ to 180˚.</p><p>The experiment using the data under various environmental conditions shows the validity of the proposed technique. The signs for recognition are ACROSSING, BIKEWAY, MOTORWAY, PARKING and so on. These illustrations image is used for the template image. The size of these images is 640 &#215; 480. These data are obtained in good weather conditions during the day and night. <xref ref-type="fig" rid="fig4">Figure 4</xref> shows the image under the SIFT algorithm and ASIFT algorithm in several location of shots.</p><p>From <xref ref-type="table" rid="table1">Table 1</xref> matching number of ASIFT algorithm is more than three times that of SIFT algorithms.</p><p>From <xref ref-type="table" rid="table2">Table 2</xref> matching time is decreased from 36.32% to 41.65% in improved ASIFT algorithm. Also efficiency is improved.</p></sec><sec id="s6"><title>6. Conclusions</title><p>From the experiments above and many other experiments, the following conclusions can be obtained.</p><p>1) When the maximum tilting value t<sub>max</sub> = 5.8, ASIFT algorithm is better than the other algorithms in matching.</p><p>2) Improved ASIFT algorithm under premise of ensuring its performance can improve the operation speed.</p><p>3) ASIFT algorithm was better than other algorithms in correct matching, and improved ASIFT algorithm has more real time. High recognition performance is obtained in a real environment, thus clear that, in the actual environment, this approach is effective.</p></sec><sec id="s7"><title>Acknowledgements</title><p>This research work was supported by Guangxi Key Laboratory Fund of Embedded Technology and Intelligent System (Guilin University of Technology) under Grant No.2020-2-6.</p></sec><sec id="s8"><title>Conflicts of Interest</title><p>The author declares no conflicts of interest.</p></sec><sec id="s9"><title>Cite this paper</title><p>Li, X., Song, L.G. and Sun, Y.Q. (2021) Nature Scene Signs Recognition by Affine Scale-Invariant Feature Transform. Open Access Library Journal, 8: e8225. https://doi.org/10.4236/oalib.1108225</p></sec></body><back><ref-list><title>References</title><ref id="scirp.114145-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Madani, A. and Yusof, R. (2017) Traffic Sign Recognition Based on Color, Shape, and Pictogram Classification Using Support Vector. Neural Computing &amp; Applications, 30, 2807-2817. https://doi.org/10.1007/s00521-017-2887-x</mixed-citation></ref><ref id="scirp.114145-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Alexeev, B.G. (2021) Nonlocal Physics in the Wave Function Terminology. Journal of Applied Mathematics and Physics, 9, 2889-2908.  
https://doi.org/10.4236/jamp.2021.911183</mixed-citation></ref><ref id="scirp.114145-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Lowe, D.G. (2005) Distinctive Image Features from Scale-Invariant Keypoint. International Journal of Computer Vision, 60, 91-110.  
https://doi.org/10.1023/B:VISI.0000029664.99615.94</mixed-citation></ref><ref id="scirp.114145-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, C., Gong, Z. and Sun, L. (2008) Application of Improved SIFT Features in Image Matching. Computer Engineering and Applications, 44, 95-97.</mixed-citation></ref><ref id="scirp.114145-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Morel, J.-M. and Yu , G. (2009) A New Framework for Fully Affine Invariant Image Comparison. SIAM Journal on Imaging Sciences, 2, 438-469.  
https://doi.org/10.1137/080732730</mixed-citation></ref><ref id="scirp.114145-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Lowe, D.G. (1999) Object Recognition from Local Scale Invariant Features. International Conference on Computer Vision, Corfu, September 1999, 1150-1157.  
https://doi.org/10.1109/ICCV.1999.790410</mixed-citation></ref><ref id="scirp.114145-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T. and Van Gool, L. (2005) A Comparison of Affine Region Detectors. International Journal of Computer Vision, 65, 43-72.  
https://doi.org/10.1007/s11263-005-3848-x</mixed-citation></ref><ref id="scirp.114145-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Takaki, M. and Fujiyoshi, H. (2009) Traffic Sign Recognition Using SIFT Features. IEEJ Transactions on Electronics Information and Systems, 129, 824-831.  
https://doi.org/10.1541/ieejeiss.129.824</mixed-citation></ref><ref id="scirp.114145-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Li, X., Uchimura, K. and Hu, Z. (2009) Traffic Sign Recognition Using Affine Scale-Invariant Feature Transform. IEICE Technical Report, ITS 2009-12.</mixed-citation></ref></ref-list></back></article>