<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JILSA</journal-id><journal-title-group><journal-title>Journal of Intelligent Learning Systems and Applications</journal-title></journal-title-group><issn pub-type="epub">2150-8402</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jilsa.2015.72005</article-id><article-id pub-id-type="publisher-id">JILSA-55866</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  An Online Malicious Spam Email Detection System Using Resource Allocating Network with Locality Sensitive Hashing
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>iti-Hajar-Aminah</surname><given-names>Ali</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Seiichi</surname><given-names>Ozawa</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Junji</surname><given-names>Nakazato</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Tao</surname><given-names>Ban</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Jumpei</surname><given-names>Shimamura</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff3"><addr-line>Clwit Inc., Tokyo, Japan</addr-line></aff><aff id="aff2"><addr-line>National Institute of Information and Communications Technology (NICT), Tokyo, Japan</addr-line></aff><aff id="aff1"><addr-line>Graduate School of Engineering, Kobe University, Kobe, Japan</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>aminahh@uthm.edu.my(IA)</email>;<email>ozawasei@kobe-u.ac.jp(SO)</email>;<email>nakazato@nict.go.jp(JN)</email>;<email>bantao@nict.go.jp(TB)</email>;<email>shimamura@clwit.co.jp(JS)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>15</day><month>04</month><year>2015</year></pub-date><volume>07</volume><issue>02</issue><fpage>42</fpage><lpage>57</lpage><history><date date-type="received"><day>25</day>	<month>February</month>	<year>2015</year></date><date date-type="rev-recd"><day>accepted</day>	<month>20</month>	<year>April</year>	</date><date date-type="accepted"><day>22</day>	<month>April</month>	<year>2015</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  In this paper, we propose a new online system that can quickly detect malicious spam emails and adapt to the changes in the email contents and the Uniform Resource Locator (URL) links leading to malicious websites by updating the system daily. We introduce an autonomous function for a server to generate training examples, in which double-bounce emails are automatically collected and their class labels are given by a crawler-type software to analyze the website maliciousness called SPIKE. In general, since spammers use botnets to spread numerous malicious emails within a short time, such distributed spam emails often have the same or similar contents. Therefore, it is not necessary for all spam emails to be learned. To adapt to new malicious campaigns quickly, only new types of spam emails should be selected for learning and this can be realized by introducing an active learning scheme into a classifier model. For this purpose, we adopt Resource Allocating Network with Locality Sensitive Hashing (RAN-LSH) as a classifier model with a data selection function. In RAN-LSH, the same or similar spam emails that have already been learned are quickly searched for a hash table in Locally Sensitive Hashing (LSH), in which the matched similar emails located in “well-learned” are discarded without being used as training data. To analyze email contents, we adopt the Bag of Words (BoW) approach and generate feature vectors whose attributes are transformed based on the normalized term frequency-inverse document frequency (TF-IDF). We use a data set of double-bounce spam emails collected at National Institute of Information and Communications Technology (NICT) in Japan from March 1st, 2013 until May 10th, 2013 to evaluate the performance of the proposed system. The results confirm that the proposed spam email detection system has capability of detecting with high detection rate.
 
</p></abstract><kwd-group><kwd>Malicious Spam Email Detection System</kwd><kwd> Incremental Learning</kwd><kwd> Resource Allocating Network</kwd><kwd>  Locality Sensitive Hashing</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Emails have become one of the most frequently used methods for cyber attacks. The most worrying email-based attack is Targeted Malicious Email (TME) [<xref ref-type="bibr" rid="scirp.55866-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.55866-ref2">2</xref>] . In TME, attackers send malicious emails to certain people targeted in an organization, such as executives of large companies, high-ranking government personnel, military officials and even famous researchers, in order for the attackers to obtain valuable confidential information and latest research of the targeted people. In TME, an email often has an attachment with malicious codes that can be installed automatically upon opening without the victims realizing it. In some cases, the victims’ computer will become the back door for the attackers who in turn have the authority to enter the network of the targeted persons and thus steal confidential information.</p><p>Another typical email-based cyber attack is the malicious spam email attack, which aims to spread numerous emails with Uniform Resource Locator (URL) links leading to malicious websites. Previously, malicious codes were sent through the attachment of such spam emails. However, many successful filters have been developed to detect malicious attachments. Thus, attackers are now turning to malicious spam campaigns that attack using the links attached in the emails. According to the Symantec annual report in 2014 [<xref ref-type="bibr" rid="scirp.55866-ref3">3</xref>] , about 87 percent of scanned spam messages contained at least one URL hyperlink. Moreover, recent findings by Symantec [<xref ref-type="bibr" rid="scirp.55866-ref4">4</xref>] show a sharp rise of emails containing malicious links, from 7% in October 2014 to 41% in the following months. Apart from that, currently, attackers also use more relevant email contents [<xref ref-type="bibr" rid="scirp.55866-ref1">1</xref>] that are specific to their victims’ line of work, besides addressing the name of the recipient in the email body to convince the victim that the email received is a normal email. For instance, a fake email notification regarding a conference or journal targeted towards a recipient with academic status, notifications regarding false documents such as telecommunication service bills, fax and voicemail in which the victims are given a link to get more information [<xref ref-type="bibr" rid="scirp.55866-ref4">4</xref>] . This technique is called Social Engineering [<xref ref-type="bibr" rid="scirp.55866-ref5">5</xref>] , which Hadnagy [<xref ref-type="bibr" rid="scirp.55866-ref6">6</xref>] defines as “The Art of Human Hacking”. It becomes difficult for normal users to distinguish not only between non-malicious and malicious spam emails but also spam email from normal emails.</p><p>The objective of this paper is to detect the malicious spam emails so that general users can be protected from being re-directed to malicious websites. For this purpose, we propose an autonomous online system for detecting malicious spam emails. In general, it is not easy to collect spam emails from individual persons because it is not usually permitted to access personal email spools. Therefore in the proposed system, we collect double-bounce spam emails that are delivered to unknown users. From the collected spam emails data, a classifier model is used to learn and classify the malicious spam emails. The updated connection weights of the classifier model are sent to a user’s mailer software to improve the malicious spam email detection ability. Jungsuk [<xref ref-type="bibr" rid="scirp.55866-ref7">7</xref>] points out that the live period of malicious URLs is often very short, usually within a few days; thus, it is expected that introducing incremental learning to malicious spam email detection will be effective. The system can learn from the recent spam emails so that the spam email detection system is always up to date. On the other hand, spammers often use botnets to spread spam emails. For example, a botnet called Rustock which consists of approximately 1 million infected computers that networked together, is capable of sending up to 30 billion spam emails every day [<xref ref-type="bibr" rid="scirp.55866-ref8">8</xref>] . Since the distribution of such spam emails is done in a short time, we assume that the spam emails have the same or similar contents in general [<xref ref-type="bibr" rid="scirp.55866-ref9">9</xref>] . Hence, we adopt the Locality Sensitive Hashing (LSH) [<xref ref-type="bibr" rid="scirp.55866-ref10">10</xref>] - [<xref ref-type="bibr" rid="scirp.55866-ref14">14</xref>] to quickly select important training data to be learned. For this purpose, we adopt Resource Allocating Network with Locality Sensitive Hashing (RAN-LSH) as a classifier model in the proposed detection system. This model has the following two important properties: 1) the learning is carried out incrementally; and 2) only data within an untrained region are selected and learned even when a large amount of data is given.</p><p>This paper is organized as follows. Section 2 gives a brief explanation of RAN-LSH. The proposed system for detecting malicious spam emails is presented in Section 3. In Section 4, the performance of the spam email detection system is evaluated for a set of 20,448 double-bounce emails collected from 1st March, 2013 to 10th May, 2013. Finally, conclusions and future work are addressed in Section 5.</p></sec><sec id="s2"><title>2. Resource Allocating Network with Locally Sensitive Hashing (RAN-LSH)<sup>1</sup></title><p><xref ref-type="fig" rid="fig1">Figure 1</xref> demonstrates the overall architecture of the spam email detection system. In this section, we give a brief explanation of the RAN-LSH classifier [<xref ref-type="bibr" rid="scirp.55866-ref15">15</xref>] illustrated in <xref ref-type="fig" rid="fig1">Figure 1</xref>(c). RAN-LSH is the extended model of the Resource Allocating Network (RAN) [<xref ref-type="bibr" rid="scirp.55866-ref16">16</xref>] , where LSH is adopted to select essential training data and Radial Basis Function (RBF) bases for fast learning. There are three main components in RAN-LSH: hash table, data selection and classifier.</p><p>Algorithm 1 shows the overall learning procedures of RAN-LSH. During the initial learning phase (Lines 1 - 5), initial training data are used to obtain the most suitable values of the following two important parameters: RBF width <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x5.png" xlink:type="simple"/></inline-formula> and the number of partitions<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x6.png" xlink:type="simple"/></inline-formula>. In addition, initial data are also used to obtain an initial hash table and initial structure of the classifier. After that, the incremental learning is carried out whenever training data are given to learn (Lines 7 - 17). In LSH, similar data are allocated in the same hash entry with a high probability. Therefore, the number of hash entries determines the granularity of input space representation, and too large number of hash entries would result in both high computational and memory costs in the data selection. Therefore, it is important to design the hash functions such that a suitable number of hash entries are created.</p><p>In RAN-LSH, we adopt Principal Component Analysis (PCA) to generate a proper number of hash functions by controlling the threshold of the accumulation ratio<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x7.png" xlink:type="simple"/></inline-formula>. Accumulation ratio <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x8.png" xlink:type="simple"/></inline-formula> is the ratio of input components in the approximated subspace over those in the whole input space [<xref ref-type="bibr" rid="scirp.55866-ref17">17</xref>] . Giving a proper value of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x9.png" xlink:type="simple"/></inline-formula> based on a tolerant approximation error, a proper number of hash functions is automatically determined by selecting the number of partitions <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x10.png" xlink:type="simple"/></inline-formula> via the cross-validation.</p><p>Let <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x12.png" xlink:type="simple"/></inline-formula> be the subspace dimensions obtained by PCA. Then, the following linear transformation is considered to define hash values in LSH:</p><disp-formula id="scirp.55866-formula519"><label>(1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x13.png"  xlink:type="simple"/></disp-formula><p>where<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x14.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x15.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x16.png" xlink:type="simple"/></inline-formula> are the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x17.png" xlink:type="simple"/></inline-formula>-dimensional projection vector, the matrix of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x18.png" xlink:type="simple"/></inline-formula> eigenvectors, an <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x19.png" xlink:type="simple"/></inline-formula>-dimensional input vector, respectively. Each projection vectors <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x20.png" xlink:type="simple"/></inline-formula> is then divi-</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Network structure of the proposed autonomous malicious spam email detection system</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x21.png"/></fig><disp-formula id="scirp.55866-formula520"><graphic  xlink:href="http://html.scirp.org/file/2-9601296x22.png"  xlink:type="simple"/></disp-formula><p>ded into <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x23.png" xlink:type="simple"/></inline-formula> partitions with equal size. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x23.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x24.png" xlink:type="simple"/></inline-formula>controls the number of eigenvectors<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x23.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x25.png" xlink:type="simple"/></inline-formula>. To obtain a hash code, data is first projected on all the eigenvectors <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x23.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x25.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x26.png" xlink:type="simple"/></inline-formula> and the hash code is obtained by combining the encoded values for the projections. When a large <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x23.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x24.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x25.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x26.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x27.png" xlink:type="simple"/></inline-formula> is adopted, the number of eigenvectors tends to become large and this would cause elongation of the length of a hash code.</p><p>As shown in Algorithm 1 (see Lines 7 - 17), the incremental learning of RAN-LSH is carried out not only for RAN-LSH classifier but also for the hash table. Let us briefly explain the learning procedures in the following subsections.</p><sec id="s2_1"><title>2.1. Updating Hash Table</title><p>Algorithm 2 illustrates the steps to create and update the hash table which is used in RAN-LSH learning algorithm (Lines 4, 8 and 17 in Algorithm 1). Each subregion is allocated to an entry in a hash table, where each entry is composed of five items: hash value<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula>, prototype<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x29.png" xlink:type="simple"/></inline-formula>, margin flag<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x29.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x30.png" xlink:type="simple"/></inline-formula>, outlier flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x29.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x30.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x31.png" xlink:type="simple"/></inline-formula> and the occurrence frequency <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x29.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x30.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x31.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x32.png" xlink:type="simple"/></inline-formula> (<xref ref-type="fig" rid="fig1">Figure 1</xref>: top right). The index or a hash value is used as a key to find a matched entry <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x28.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x29.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x30.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x31.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x32.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x33.png" xlink:type="simple"/></inline-formula> of a similar item which has been registered previously in hash table (see the first condition in</p><p>Line 6). Hash values <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x34.png" xlink:type="simple"/></inline-formula> are a set of hash functions<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x34.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x35.png" xlink:type="simple"/></inline-formula>, which are given as follow:</p><disp-formula id="scirp.55866-formula521"><label>(2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x36.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula> are the upper and lower values of projections <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x39.png" xlink:type="simple"/></inline-formula> on the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x39.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x40.png" xlink:type="simple"/></inline-formula>th eigenvector<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x39.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x40.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x41.png" xlink:type="simple"/></inline-formula>, respectively. Here, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x39.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x40.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x41.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x42.png" xlink:type="simple"/></inline-formula>is the number of partitions which determines the granularity for a projection<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x37.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x38.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x39.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x40.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x41.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x42.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x43.png" xlink:type="simple"/></inline-formula>.</p><p>The next item is prototype<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x44.png" xlink:type="simple"/></inline-formula>. A prototype is the mean vector of all data allocated to each entry (Lines 7 and 11) which is calculated as follows:</p><disp-formula id="scirp.55866-formula522"><label>(3)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x45.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.55866-formula523"><graphic  xlink:href="http://html.scirp.org/file/2-9601296x46.png"  xlink:type="simple"/></disp-formula><p>where <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula> are the previous prototype which is registered in the entry <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula> and training data that belong to <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula>th subset of unique hash values<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x51.png" xlink:type="simple"/></inline-formula>, respectively. Meanwhile, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x51.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x52.png" xlink:type="simple"/></inline-formula>is the previous occurrence frequency of the entry <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x51.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x52.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x53.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x51.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x52.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x53.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x54.png" xlink:type="simple"/></inline-formula> is the number of training data that has a similar hash value. <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x47.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x48.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x49.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x50.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x51.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x52.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x53.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x54.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x55.png" xlink:type="simple"/></inline-formula>is regarded as the representative point of the subregion.</p><p>The third item is the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x56.png" xlink:type="simple"/></inline-formula> and it is calculated as follows (Line 12):</p><disp-formula id="scirp.55866-formula524"><label>(4)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x57.png"  xlink:type="simple"/></disp-formula><p>where the output margin <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x58.png" xlink:type="simple"/></inline-formula> is given by subtracting the second largest network output <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x59.png" xlink:type="simple"/></inline-formula> from the largest network output <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x58.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x59.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x60.png" xlink:type="simple"/></inline-formula> as follows:</p><disp-formula id="scirp.55866-formula525"><label>(5)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x61.png"  xlink:type="simple"/></disp-formula><p>The fourth item is the outlier flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x62.png" xlink:type="simple"/></inline-formula> which is determined as follows (Line 12):</p><disp-formula id="scirp.55866-formula526"><label>(6)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x63.png"  xlink:type="simple"/></disp-formula><p>The last item is the occurrence frequency <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x64.png" xlink:type="simple"/></inline-formula> of similar data in an entry. Whenever a new training data is assigned to an entry, the occurrence frequency is increased by one. The details of margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x64.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x65.png" xlink:type="simple"/></inline-formula> and outlier</p><p>flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x66.png" xlink:type="simple"/></inline-formula> are discussed in the following Section 2.2 and 3.4, respectively.</p></sec><sec id="s2_2"><title>2.2. Data Selection and RBF Bases Selection Using LSH</title><p>When a large number of data are given simultaneously under an incremental learning environment, it is impor- tant to learn only essential data in a classifier model. Obviously, this is because the learning must be completed as quickly as possible; otherwise, the next data may be given before the learning is completed. In RAN-LSH, the data selection is conducted by using LSH. First, all <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x67.png" xlink:type="simple"/></inline-formula> training data in a given training set are projected to <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x67.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x68.png" xlink:type="simple"/></inline-formula> eigenvectors. Then, for each training data, the projection value is encoded into a hash code whose granularity is determined by the number of partitions<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x67.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x68.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x69.png" xlink:type="simple"/></inline-formula>, and the obtained hash codes are transformed into a hash value.</p><p>If a matched entry with the same hash value is found and the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x70.png" xlink:type="simple"/></inline-formula> is “1”, it means the classifier is well trained (Line 6 in Algorithm 2). Then, the mean vector is calculated and the occurrence frequency <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x71.png" xlink:type="simple"/></inline-formula> is incremented by<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x71.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x72.png" xlink:type="simple"/></inline-formula>. Otherwise (Line 10 in Algorithm 2), the output margin <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x70.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x71.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x72.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x73.png" xlink:type="simple"/></inline-formula> in Equation (5) is calculated</p><p>and the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula> and outlier flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x75.png" xlink:type="simple"/></inline-formula> are updated by Equations (4) and (6), respectively. Note that the training data associated with the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x75.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x76.png" xlink:type="simple"/></inline-formula> are eliminated from the training set (see Algorithm 1 Lines 12 - 14). On the contrary, if the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x75.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x76.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x77.png" xlink:type="simple"/></inline-formula> is “0”, it means a given data should be trained. After the learning phase, the margin flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x75.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x76.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x77.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x78.png" xlink:type="simple"/></inline-formula> would be updated. Nevertheless, updating the margin flag of every prototype in the hash table would increase the learning time. As mentioned before, the prototype with <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x74.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x75.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x76.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x77.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x78.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x79.png" xlink:type="simple"/></inline-formula></p><p>means the classifier is “well-trained” around the prototype. Thus, this prototype does not need to be updated. Meanwhile, prototype with <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x80.png" xlink:type="simple"/></inline-formula> should be updated because there would probably be regions that have become “well-trained” after the learning phase (Line 17 in Algorithm 1 and Line 12 in Algorithm 2).</p><p>LSH is also used to find RBF bases near to the training data (Lines 8 - 13 in Algorithm 3). In RAN-LSH, only the connection weights connected to the selected RBF bases are updated in the following procedures. Firstly, the hash values of RBF bases <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x81.png" xlink:type="simple"/></inline-formula> are retrieved from a hash table. Next, the LSH distance <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x81.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x82.png" xlink:type="simple"/></inline-formula> for each <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x81.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x82.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x83.png" xlink:type="simple"/></inline-formula>th RBFs is calculated as follows:</p><disp-formula id="scirp.55866-formula527"><graphic  xlink:href="http://html.scirp.org/file/2-9601296x84.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.55866-formula528"><label>(7)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x85.png"  xlink:type="simple"/></disp-formula><p>Then, only RBF bases whose LSH distance is less than a threshold <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x86.png" xlink:type="simple"/></inline-formula> are selected for a learning purpose. This is because it is considered that if the LSH distance is large, the RBF output would become very small and the weight update could be negligible. Finally, the selected RBF bases are used to solve the linear equation in <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x86.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x87.png" xlink:type="simple"/></inline-formula> [<xref ref-type="bibr" rid="scirp.55866-ref18">18</xref>] .</p></sec><sec id="s2_3"><title>2.3. RAN Classifier</title><p>Let the number of inputs, RBF units, and outputs be<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x89.png" xlink:type="simple"/></inline-formula>, and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x89.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x90.png" xlink:type="simple"/></inline-formula>, respectively. RBF outputs <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x89.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x90.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x91.png" xlink:type="simple"/></inline-formula> and the network outputs <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x89.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x90.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x91.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x92.png" xlink:type="simple"/></inline-formula> of inputs <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x88.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x89.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x90.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x91.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x92.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x93.png" xlink:type="simple"/></inline-formula> are calculated as follows:</p><disp-formula id="scirp.55866-formula529"><label>(8)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x94.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.55866-formula530"><label>(9)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x95.png"  xlink:type="simple"/></disp-formula><p>where<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x99.png" xlink:type="simple"/></inline-formula> are the center of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x99.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x100.png" xlink:type="simple"/></inline-formula>th RBF unit, the variance of the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x99.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x100.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x101.png" xlink:type="simple"/></inline-formula>th RBF unit, the connection weight from the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x99.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x100.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x101.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x102.png" xlink:type="simple"/></inline-formula>th RBF unit to the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x96.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x97.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x98.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x99.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x100.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x101.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x102.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x103.png" xlink:type="simple"/></inline-formula>th RBF unit and the bias, respectively.</p><p>Algorithm 3 shows the learning algorithm of RAN classifier. In RAN-LSH, RBF centers are not trained but selected based on the output error. If the output error is large, it indicates that a new RBF unit should be added (Lines 5 and 22). As mentioned above, only connection weights for active RBF units are updated (Lines 15 - 17).</p></sec></sec><sec id="s3"><title>3. The Proposed Malicious Spam Email Detection System</title><p><xref ref-type="fig" rid="fig1">Figure 1</xref> illustrates the architecture of the proposed autonomous online malicious spam email detection system which is composed of three components: 1) autonomous spam email collection system; 2) text processing and feature transformation; and 3) RAN-LSH classifier embedded with the data selection and outlier detection mechanisms.</p><p>As mentioned in Section 2.2, learning all the given data is not a good strategy under incremental learning environments because the learning may not be completed before a new data set is given [<xref ref-type="bibr" rid="scirp.55866-ref19">19</xref>] . To enhance the adaptibility to dynamic environments, the learning should be carried out with essential data that are selected in an online fashion. There are two types of essential data for a learning purpose. The first type is the data located close to a class boundary [<xref ref-type="bibr" rid="scirp.55866-ref20">20</xref>] , while the other is the data located outside of the learned region (i.e., outlier). In order to ensure fast and accurate learning, the data selection mechanism should be introduced into a classifier model to find such essential data from a given chunk of data.</p><p>The first type of essential data has been discussed in Section 2.2. On the other hand, the second type of essential data are selected by the outlier detection. This type of essential data selection is introduced into the previous RAN-LSH classifier. The outlier detection relies on the output margin and the number of occurrence of similar data in the input space which are represented by outlier flag<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x104.png" xlink:type="simple"/></inline-formula>.</p><p>In the following subsection, we explain the details of the three components of the autonomous online malicious spam email detection system, as well as the autonomous labeling system.</p><sec id="s3_1"><title>3.1. Autonomous Spam Email Collection System</title><p><xref ref-type="fig" rid="fig1">Figure 1</xref>(a) illustrates the process of obtaining double bounce emails. Let us consider a case that a spammer sends a large number of emails. In many cases, almost all emails will reach existing users. However, it is very likely that some email addresses are no longer in use for some reason. Therefore, the email server would return such emails with unknown addresses to the sender. If the spammer intends to send a malicious email, it is also likely that the spammer has faked the originating address and such emails would be re-sent to the receiver. This type of unreachable error email is called “double-bounce email” [<xref ref-type="bibr" rid="scirp.55866-ref7">7</xref>] and they are usually disposed of by the email server on the receiver’s side. We utilize this mechanism of generating double-bounce emails to collect malicious spam emails automatically.</p></sec><sec id="s3_2"><title>3.2. Autonomous Labeling System</title><p>To use double-bounce emails as training data under the supervised learning, we would need their class labels. Needless to say, spammers try to conceal their malicious intention; therefore, it is not easy to determine the maliciousness from the collected double-bounce emails. The only way to identify the maliciousness is by click- ing the URLs. Evidently, this is very dangerous for general users; therefore, we use a crawling-type web mali- ciousness analyzer called SPIKE, which was developed by the National Institute of Information and Communi- cations Technology (NICT) in Japan.</p><p><xref ref-type="fig" rid="fig2">Figure 2</xref> illustrates how the maliciousness of URLs in a spam email is analyzed in SPIKE [<xref ref-type="bibr" rid="scirp.55866-ref21">21</xref>] . The URL links in the email are first extracted from a double-bounce email and SPIKE downloads the html file and attached materials (e.g., java scripts, pdf, doc files) in the entrance page. It then continues to find other URLs in the downloaded pages again. This process is conducted recursively by crawling the linked websites, and all the downloaded materials are analyzed. Emails that are only link to a normal webpage with non-malicious contents are considered as non-malicious spam emails (i.e., all contents of Webpage 1 - 6 in <xref ref-type="fig" rid="fig2">Figure 2</xref> are normal), whereas the emails with at least one suspicious content (i.e., one of Webpage 1 - 6 in <xref ref-type="fig" rid="fig2">Figure 2</xref> is malicious) are identified as malicious spam emails.</p></sec><sec id="s3_3"><title>3.3. Text Processing and Feature Transformation</title><p>In order for the classifier to carry out the classification task effectively, the classifier requires instances as the input instead of the raw spam emails for the learning purpose. The instances consist of informative features with a fixed-length which are extracted from the emails. Thus, appropriate pre-processing steps are required so that the arbitrary data of text messages are transformed into features with numerical features. <xref ref-type="fig" rid="fig1">Figure 1</xref>(b) demons- trates the pre-processing module of the spam email detection system. Feature extraction of spam emails involves tokenizing and lemmatizing the documents into bag-of-words (BoW). Tokenization breaks the sentences in the emails into pieces of words and removes frequent words called stop words such as “the”, “which”, “are”, etc. Besides filtering out stop words, lemmatization also reduces the number of words in BoW by transforming redundant words that end with “ing”, “ed” and “s” into their root word (e.g., “learned” to “learn”).</p><p>The BoW features usually consist thousands or millions of feature vectors. In general, only some features are informative and are able to differentiate different classes. Therefore, feature selection is carried out to select the most informative features in order to reduce the number of dimensions and avoid the computational complexity. Firstly, the initial training data are transformed into feature vectors with term frequency-inverse document fre-</p><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> Example of web crawler and content analysis using SPIKE</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x105.png"/></fig><p>quency (TF-IDF) feature representation. Next, linear <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x106.png" xlink:type="simple"/></inline-formula> Support Vector Machines (SVM) is used as a feature selection strategy that requires two steps; training using linear SVM [<xref ref-type="bibr" rid="scirp.55866-ref22">22</xref>] and eliminating features with low weights [<xref ref-type="bibr" rid="scirp.55866-ref23">23</xref>] . SVM is able to find a decision boundary by optimizing the following objective function:</p><disp-formula id="scirp.55866-formula531"><label>(10)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x107.png"  xlink:type="simple"/></disp-formula><p>which maximizes the margin <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x108.png" xlink:type="simple"/></inline-formula> of hyperplane <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x108.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x109.png" xlink:type="simple"/></inline-formula> between two classes <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x108.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x109.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x110.png" xlink:type="simple"/></inline-formula> and contains</p><p>only a minimum training error <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula> (i.e., training data located above the support vectors which belong to the class of the training data). The parameter <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x112.png" xlink:type="simple"/></inline-formula> controls the trade off between margin maximization and errors of the SVM on training data, where a larger <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x112.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x113.png" xlink:type="simple"/></inline-formula> corresponds to a higher penalty to errors. The weights <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x112.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x113.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x114.png" xlink:type="simple"/></inline-formula> obtained is used to select <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x112.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x113.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x114.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x115.png" xlink:type="simple"/></inline-formula> number of features by choosing the highest <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x111.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x112.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x113.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x114.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x115.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x116.png" xlink:type="simple"/></inline-formula>-rank weights.</p><p>To represent the selected features of initial training data and the remaining training data, the normalized TF-IDF [<xref ref-type="bibr" rid="scirp.55866-ref24">24</xref>] is used to measure the importance of a word to a document (i.e., document refers to the spam email) in the collection of documents given by the following equation:</p><disp-formula id="scirp.55866-formula532"><label>(11)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/2-9601296x117.png"  xlink:type="simple"/></disp-formula><p>where<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x119.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x119.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x120.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x119.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x120.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x121.png" xlink:type="simple"/></inline-formula> are the frequency of term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x119.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x120.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x121.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x122.png" xlink:type="simple"/></inline-formula> in a document<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x118.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x119.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x120.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x121.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x122.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x123.png" xlink:type="simple"/></inline-formula>, the total</p><p>frequencies of all terms in document<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula>, the total number of document in corpus and number of documents which have term<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula>, respectively. The normalized term frequency (TF) is used to provide a balanced value to all documents that have a different number of words. If term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula> appears frequently in a document <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula> and seldom occurs in other documents in<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula>, the value of TF-IDF would be high where both TF and inverse document frequency (IDF) obtain high values. This indicates that term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula> is important to document<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula>. Otherwise, if either the occurrence of term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula> is low in document <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x132.png" xlink:type="simple"/></inline-formula> or term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x132.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x133.png" xlink:type="simple"/></inline-formula> is always appears in other documents in<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x132.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x133.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x134.png" xlink:type="simple"/></inline-formula>, this would indicate that term <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x132.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x133.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x134.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x135.png" xlink:type="simple"/></inline-formula> is not important to document <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x124.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x125.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x126.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x127.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x128.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x129.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x130.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x131.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x132.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x133.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x134.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x135.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x136.png" xlink:type="simple"/></inline-formula> where the value of TF-IDF is low or “0”.</p><p>After going through the entire procedure above, these data are used as the input to the classifier model. The details of the classifier model are discussed in the previous Section 2.2.</p></sec><sec id="s3_4"><title>3.4. Outlier Detection</title><p>Although SPIKE can judge the maliciousness of spam emails, the analysis takes time, from a few minutes to even longer than ten minutes. Therefore, it is difficult to check all the collected double-bounce emails by SPIKE in real time. We introduce the outlier detection mechanism into RAN-LSH in order to reduce the number of spam emails to be checked by SPIKE. That is, only a new type of unknown spam emails (i.e., outlier) should be selected and sent to SPIKE for labeling. For this purpose, we propose a spam email detection system by combining RAN-LSH classifier [<xref ref-type="bibr" rid="scirp.55866-ref15">15</xref>] and SPIKE, so that the learning time is accelerated compared to when using SPIKE alone. In this study, we detect an outlier based on the output margin<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula>, outlier threshold <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula> and the occurrence frequency threshold<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x139.png" xlink:type="simple"/></inline-formula>. The data with low output margins are considered as unknown emails for the current classifier and thus should be categorized as outlier. In addition, the number of similar data in each entry <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x139.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x140.png" xlink:type="simple"/></inline-formula> is also important to decide whether the data is outlier or not. We assume that the data that do not frequently occurred (i.e., data allocated to an entry with small<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x139.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x140.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x141.png" xlink:type="simple"/></inline-formula>) can also be categorized as outlier although the output margins are slightly higher than<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x139.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x140.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x141.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x142.png" xlink:type="simple"/></inline-formula>. The outlier flag <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x137.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x138.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x139.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x140.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x141.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x142.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x143.png" xlink:type="simple"/></inline-formula> is calculated using Equation (6). The algorithm of the outlier detection is summarized in Algorithm 4.</p></sec></sec><sec id="s4"><title>4. Performance Evaluation</title><sec id="s4_1"><title>4.1. Experimental Setup</title><p>The detection performance is evaluated under incremental learning settings to study the following effects: 1) the effect of threshold parameters and 2) the effectiveness of daily updates.</p><disp-formula id="scirp.55866-formula533"><graphic  xlink:href="http://html.scirp.org/file/2-9601296x144.png"  xlink:type="simple"/></disp-formula><p>In the former experiment, we investigate the effects of the following three threshold parameters to the perfor- mance: threshold of accumulation ratio<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x145.png" xlink:type="simple"/></inline-formula>, that of output margin<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x145.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x146.png" xlink:type="simple"/></inline-formula>, and that of tolerant distance<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x145.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x146.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x147.png" xlink:type="simple"/></inline-formula>. In the latter experiment, we study the effect of incremental learning through comparison with the batch learning scheme. <xref ref-type="fig" rid="fig3">Figure 3</xref> illustrates how labeled spam emails are trained in (a) batch learning scheme and (b) incremen- tal learning scheme. In the batch learning scheme, we adopt the conventional RBF network (RBFN) (i.e., RBFN usually used as batch learning [<xref ref-type="bibr" rid="scirp.55866-ref25">25</xref>] ) as a classifier and a sliding window is introduced to define a data set to be trained every day. In this experiment, the time-window size is preliminarily determined as 12 days via the cross- validation using the spam emails collected during a different period. Therefore, as seen in <xref ref-type="fig" rid="fig3">Figure 3</xref>(a), the first learning stage is carried out on Day 12 using a set of spam emails collected from Day 1 to Day 12, and the data set from Day 13 is used to test the performance. Then, the time-window is shifted by one day at the second learning stage; that is, a set of spam emails collected from Day 2 to Day 13 is used for training, and the data set from Day 14 is used to test. Note that RBFN is retrained with a set of 12-day spam emails at every learning stage in a batch mode. On the other hand, in the incremental learning scheme, batch learning is first applied to an initial data set, which is composed of spam emails collected in the first 12 days. After that, this initial detection model is updated daily using one day training data and the system is tested using the next day data. A set of spam emails collected is forwarded to SPIKE to get the labels on maliciousness, and the pairs of a spam email and its class label are used as a training set on the next day. The number of collected spam emails is different every day. Their maximum, minimum, and average numbers are 756, 26, and 207, respectively.</p><p>In this detection system, three parameters are determined empirically. The parameters are accumulation ratio<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula>, output margin <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula> and tolerant distance<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula>. In the following experiment, the parameters are set to:<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula>, and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula>. Meanwhile, the error threshold <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula> is set to 0.5. The other parameters, RBF width<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x155.png" xlink:type="simple"/></inline-formula>, the number of partitions<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x155.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x156.png" xlink:type="simple"/></inline-formula>, errors penalty of SVM<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x155.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x156.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x157.png" xlink:type="simple"/></inline-formula>, outlier threshold<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x155.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x156.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x157.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x158.png" xlink:type="simple"/></inline-formula>, occurrence frequency threshold <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x148.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x149.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x150.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x151.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x152.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x153.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x154.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x155.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x156.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x157.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x158.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x159.png" xlink:type="simple"/></inline-formula> and time-window size are determined in the initial learning phase through the cross-validation. There are 20,448 double bounce emails with 8334 malicious spam emails and 12,114 non-malicious spam emails used in this study that were collected from 1st March 2013 to 10th May 2013.</p><p>It is important for the proposed system to correctly classify not only positive examples (malicious spam emails) but also negative ones (non-malicious spam emails). Therefore, in this study, three evaluations are used which are: recall rate, precision rate and F1 measure rate. The actual class labels are the class labels given by SPIKE, whilst the prediction labels are obtained by the detection system. The recall rate and the precision rate measure the ability of the detection system to classify the malicious spam emails (positive samples) correctly which takes into account different types of error. On the other hand, the F1 measure is the harmonic mean of recall and precision rate. The recall rate considers type II error (i.e., a malicious spam email is wrongly classi- fied as non-malicious spam email), whereas the precision rate considers type I error (i.e., a non-malicious spam email is wrongly classified as malicious spam email). If the malicious spam email detection system obtained a low recall rate, the users are exposed to the danger of malware infection because some users may click an URL that leads to malicious websites. Such a misclassification must be avoided in any cases. In the second situation, there is low risk of the malware attack. The system only gives strict conditions where most of the non-malicious spam emails are categorized as malicious spam emails. Therefore, to design a good malicious spam email detec- tion system, it is crucial to have at least high percentage of recall rate to reduce the risk of malware attack.</p><fig-group id="fig3"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Comparison of learning scheme between batch learning and incremental learning. (a) Batch learning; (b) Incremental learning.</title></caption><fig id ="fig3_1"><label>(b)</label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x160.png"/></fig><fig id ="fig3_2"><label></label><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x161.png"/></fig></fig-group><p>However, if the malicious spam email detection system is able to obtain high rate of both recall and precision, we can say that the developed system is nearly a perfect detection system.</p></sec><sec id="s4_2"><title>4.2. Effects of Threshold Parameters</title><p>First, let us examine the threshold parameters and their effect to the detection system. Here, we study the influence of<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula> so that the parameters are optimized to ensure fast learning property of the detection system while having low misclassification rate. The first parameter is the threshold of accumulation ratio<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula>. If <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x166.png" xlink:type="simple"/></inline-formula> is set to a large value, an eigenspace to define hash functions has high-dimensions. Thus, the length of a hash code becomes long, resulting in the enlargement of a hash table. Therefore the searching of similar data would require a longer time since there are many hash values registered in hash table. The next parameter is the output margin threshold<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x166.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x167.png" xlink:type="simple"/></inline-formula>. This parameter controls the amount of selected data to be learned by RAN-LSH. As the value of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x166.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x167.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x168.png" xlink:type="simple"/></inline-formula> is set to a higher value, the representation of the “not well-learned” region would become wider and the number of selected data is increased in the incremental learning phase, resulting in slower learning. The third important parameter is the tolerant distance <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x166.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x167.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x168.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x169.png" xlink:type="simple"/></inline-formula> which determines the distance of near RBF bases. By updating the weights of only near RBF bases (i.e., using small value of<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x162.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x163.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x164.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x165.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x166.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x167.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x168.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x169.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x170.png" xlink:type="simple"/></inline-formula>), the time needed to solve the linear equation using Singular Value Decomposition (SVD) is shorten, thus the learning time would be accelerated.</p><p>To determine an appropriate value of each parameter, the cross-validation is performed for the initial training set, and the obtained parameter values are fixed over the incremental phase. <xref ref-type="table" rid="table1">Table 1</xref>(a) and <xref ref-type="table" rid="table1">Table 1</xref>(b) show the F1 measure and the learning time, respectively, using several combination values of the accumulation ratio <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula> and output margin<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula>. As seen in <xref ref-type="table" rid="table1">Table 1</xref>(a), the highest F1 measure is obtained when <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x173.png" xlink:type="simple"/></inline-formula> and<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x173.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x174.png" xlink:type="simple"/></inline-formula>. For output margin<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x173.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x174.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x175.png" xlink:type="simple"/></inline-formula>, F1 measure does not differ much from the F1 measure with output margin<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x173.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x174.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x175.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x176.png" xlink:type="simple"/></inline-formula>. This result is not surprising because high value of output margins represents that the data are “well-learned” or correctly classified. By adding these data, it is expected that the classification rate would not improve although the number of selected data and RBF bases created are increased. As a result for learning using more training data shown by a higher<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x171.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x172.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x173.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x174.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x175.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x176.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x177.png" xlink:type="simple"/></inline-formula>, the learning time would also increase. We assume that the data which are located on the border of different classes should have network outputs in the range of “0.4” to “0.6”.</p><table-wrap-group id="1"><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> The evaluation using several values of accumulation ratio <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x178.png" xlink:type="simple"/></inline-formula> and output margin threshold <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x178.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x179.png" xlink:type="simple"/></inline-formula> for the spam email detection system. The performance measures are: (a) the F1 measure [%], and (b) initial learning time [sec.]</title></caption><table-wrap id="1_1"><caption><title> (b)</title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x180.png" xlink:type="simple"/></inline-formula></th><th align="center" valign="middle"  colspan="5"  ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x181.png" xlink:type="simple"/></inline-formula></th></tr></thead><tr><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.1</td><td align="center" valign="middle" >0.2</td><td align="center" valign="middle" >0.3</td><td align="center" valign="middle" >0.4</td></tr><tr><td align="center" valign="middle" >0.5</td><td align="center" valign="middle" >82.8</td><td align="center" valign="middle" >87.0</td><td align="center" valign="middle" >88.4</td><td align="center" valign="middle" >88.2</td><td align="center" valign="middle" >88.1</td></tr><tr><td align="center" valign="middle" >0.7</td><td align="center" valign="middle" >86.6</td><td align="center" valign="middle" >87.3</td><td align="center" valign="middle" >90.5</td><td align="center" valign="middle" >87.8</td><td align="center" valign="middle" >88.2</td></tr><tr><td align="center" valign="middle" >0.9</td><td align="center" valign="middle" >87.0</td><td align="center" valign="middle" >87.7</td><td align="center" valign="middle" >90.8</td><td align="center" valign="middle" >88.0</td><td align="center" valign="middle" >87.9</td></tr></tbody></table></table-wrap><table-wrap id="1_2"><caption><title></title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x182.png" xlink:type="simple"/></inline-formula></th><th align="center" valign="middle"  colspan="5"  ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x183.png" xlink:type="simple"/></inline-formula></th></tr></thead><tr><td align="center" valign="middle" >0.05</td><td align="center" valign="middle" >0.1</td><td align="center" valign="middle" >0.2</td><td align="center" valign="middle" >0.3</td><td align="center" valign="middle" >0.4</td></tr><tr><td align="center" valign="middle" >0.5</td><td align="center" valign="middle" >126.5</td><td align="center" valign="middle" >128.6</td><td align="center" valign="middle" >130.1</td><td align="center" valign="middle" >130.3</td><td align="center" valign="middle" >131.2</td></tr><tr><td align="center" valign="middle" >0.7</td><td align="center" valign="middle" >128.0</td><td align="center" valign="middle" >129.7</td><td align="center" valign="middle" >130.3</td><td align="center" valign="middle" >130.9</td><td align="center" valign="middle" >132.6</td></tr><tr><td align="center" valign="middle" >0.9</td><td align="center" valign="middle" >128.6</td><td align="center" valign="middle" >129.8</td><td align="center" valign="middle" >132.1</td><td align="center" valign="middle" >132.7</td><td align="center" valign="middle" >139.8</td></tr></tbody></table></table-wrap></table-wrap-group><p>Therefore, the output margins <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x184.png" xlink:type="simple"/></inline-formula> can be estimated to be between “0” to “0.2” where the data with output margins in these range are important to be learned to reduce the misclassification rate.</p><p>On the other hand, <xref ref-type="table" rid="table2">Table 2</xref> demonstrates the suitable value of the tolerant distance <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula> using appropriate value of <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula> and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula> obtained previously which are “0.9” and “0.2”, respectively. Tolerant distance <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula> also give influence to the classification rate and the speed of the detection system by controlling the distance which defines the area of near RBF bases. As we can see, the suitable value for the tolerant distance is 2. It means that only the RBF centers that differ from the given training data at two projection vector <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula> are used to update the weight. If the <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula> is too small, it indicates that the area of selected RBF bases is not enough to approximate the weights correctly. Whereas for <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula> that is too large, it would be similar to the approach of updating weights using all RBF bases. Thus, the size of RBF outputs <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula> in <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula> would be bigger and therefore, the decomposition steps using SVD would require a longer time. Even though the results show the evaluation performance during initial learning, we expect a similar result from the incremental learning phase. It is because parameter<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x195.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x195.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x196.png" xlink:type="simple"/></inline-formula> are also required during the incremental learning phase. For the next experiment, we set the value of<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x195.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x196.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x197.png" xlink:type="simple"/></inline-formula>, <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x195.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x196.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x197.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x198.png" xlink:type="simple"/></inline-formula>and <inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x185.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x186.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x187.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x188.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x189.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x190.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x191.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x192.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x193.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x194.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x195.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x196.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x197.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x198.png" xlink:type="simple"/></inline-formula><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x199.png" xlink:type="simple"/></inline-formula> to be “0.2”, “0.9” and “2”, respectively.</p></sec><sec id="s4_3"><title>4.3. Effectiveness of Incremental Learning</title><p>All learning parts in the detection system including pre-processing and classifier module are very crucial which give effect to the performance result. In this experiment, we compare the performance of the proposed online detection system with different learning scheme and classifier model to see the competency of the proposed method. <xref ref-type="fig" rid="fig4">Figure 4</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref> show the recall rate and precision rate for the detection system with the follow- ing three combinations of classifiers and learning schemes: RBFN (batch learning), RAN (incremental learning), and RAN-LSH (incremental learning) (see <xref ref-type="fig" rid="fig3">Figure 3</xref>). The batch learning is carried out using 12-days of training data and it is retrained incrementally. While for the incremental learning, the classifier is updated incrementally using 1-day of training data. As seen in <xref ref-type="fig" rid="fig4">Figure 4</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref>, the proposed one-pass learning of the detection system is capable to learn and carry out the classification task effectively since our proposed system obtained almost the same classification rate as the memory-based learning approach (i.e., batch learning). In fact, our proposed method does not need large memory size to store the training data compared to the memory-based learning. In this study, 12-days length of window size is used for the batch learning to learn incrementally, whereas for incremental learning, only 1-day data set is used as training data. Besides that, we also compare the performance of conventional classifier RAN using the same incremental learning scheme. Our previous study in [<xref ref-type="bibr" rid="scirp.55866-ref15">15</xref>] shows that RAN-LSH can learn fast. As seen in <xref ref-type="fig" rid="fig4">Figure 4</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref>, our detection system obtains a comparable result against the conventional classifier model.</p><fig id="fig4"  position="float"><label><xref ref-type="fig" rid="fig4">Figure 4</xref></label><caption><title> Transitions of recall rates in the malicious spam email detection system with three learning schemes</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x200.png"/></fig><fig id="fig5"  position="float"><label><xref ref-type="fig" rid="fig5">Figure 5</xref></label><caption><title> Transitions of precision in the malicious spam email detection system with three learning schemes</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/2-9601296x201.png"/></fig><table-wrap id="table2" ><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> The performance using different values of tolerant distance<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x202.png" xlink:type="simple"/></inline-formula></title></caption><table><tbody><thead><tr><th align="center" valign="middle"  rowspan="2"  >Evaluation</th><th align="center" valign="middle"  colspan="4"  ><inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/2-9601296x203.png" xlink:type="simple"/></inline-formula></th></tr></thead><tr><td align="center" valign="middle" >0</td><td align="center" valign="middle" >1</td><td align="center" valign="middle" >2</td><td align="center" valign="middle" >5</td></tr><tr><td align="center" valign="middle" >F1 measure [%]</td><td align="center" valign="middle" >88.5</td><td align="center" valign="middle" >88.8</td><td align="center" valign="middle" >89.3</td><td align="center" valign="middle" >89.3</td></tr><tr><td align="center" valign="middle" >Initial learning time [sec.]</td><td align="center" valign="middle" >138.2</td><td align="center" valign="middle" >139.1</td><td align="center" valign="middle" >140.1</td><td align="center" valign="middle" >144.9</td></tr></tbody></table></table-wrap></sec><sec id="s4_4"><title>4.4. Overall Performance of Malicious Spam Email Detection System</title><p>The overall performance is evaluated by averaging over the whole incremental learning phase. The recall rate, precision rate and F1-measure for the three learning models are summarized in <xref ref-type="table" rid="table3">Table 3</xref>. From <xref ref-type="table" rid="table3">Table 3</xref>, we can see that the proposed detection system can learn 482 times faster than the conventional RBFN model, and can learn 46 times faster than RAN. This is because our detection system can find a set of similar data in a given training set very quickly using LSH; thus, only a data set falling in an untrained region are selected to learn and the others are discarded. In addition, only near RBF centers are used to update the connection weights. The recall rate and F1 measure of RBFN are higher than those of the other incremental models because this model keeps a large number of data for training; where this requires high computational costs and large memory. Since</p><table-wrap id="table3" ><label><xref ref-type="table" rid="table3">Table 3</xref></label><caption><title> Overall performance of malicious spam email detection system</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Performance evaluation</th><th align="center" valign="middle" >RBFN batch</th><th align="center" valign="middle" >RAN incremental</th><th align="center" valign="middle" >RAN-LSH incremental</th></tr></thead><tr><td align="center" valign="middle" >Recall Rate [%]</td><td align="center" valign="middle" >94.6</td><td align="center" valign="middle" >89.4</td><td align="center" valign="middle" >90.7</td></tr><tr><td align="center" valign="middle" >Precision Rate [%]</td><td align="center" valign="middle" >90.7</td><td align="center" valign="middle" >90.8</td><td align="center" valign="middle" >87.2</td></tr><tr><td align="center" valign="middle" >F1 Measure [%]</td><td align="center" valign="middle" >92.2</td><td align="center" valign="middle" >89.3</td><td align="center" valign="middle" >87.7</td></tr><tr><td align="center" valign="middle" >Learning Time [sec.]</td><td align="center" valign="middle" >4532.7</td><td align="center" valign="middle" >432.0</td><td align="center" valign="middle" >9.4</td></tr></tbody></table></table-wrap><p>the classification rate of recall rate, precision rate and F1 measure do not differ much from the other model, we can conclude that the proposed system is able to update efficiently and able to give class label of the incoming emails within a short time.</p></sec></sec><sec id="s5"><title>5. Conclusions</title><p>We have proposed a malicious spam email detection system using BoW features, where the classifier adopts LSH to select essential data and near RBF bases. We use two types of essential data: 1) the data located close to a class boundary; and 2) the data located outside of the learned region (i.e., outlier). The proposed scheme provides desirable learning characteristics as an autonomous malicious spam email detection system and able to adapt to new trends of malicious emails quickly. In addition, our detection system is quite fast compared with SPIKE which often needs a long time to complete the maliciousness analysis. By using the proposed system, it is possible to give proper alerts to users quickly based on up to date information. Since the learning is quite fast and the detection performance is comparable to the conventional models, we can conclude that the proposed system is suitable to be implemented in an email client software on the user side.</p><p>Currently, the proposed detection system has no pruning function for RBF bases. Therefore, as the learning is continued for a long time, the number of RBF could be increased excessively, and this causes longer learning time. Then, in the worst scenario, the learning may not converge before new training data are given. To avoid such a disastrous situation, a proper number of RBF bases should always be maintained by introducing an online pruning mechanism into RAN-LSH. Besides that, our detection system uses selected features from initial learning training data. As our future work, we intend to construct an adaptive hash table to adapt to the changes of feature vectors from the recent BoW without forgetting the previous knowledge. It is expected that the detection system would be more stable and robust to the new malicious spam email attacks.</p></sec><sec id="s6"><title>Acknowledgements</title><p>This work is partially supported by the Ministry of Education, Science, Sports and Culture, Grant-in-Aid for Scientific Research (C) 24500173, the University of Tun Hussein Onn Malaysia (UTHM) and the Ministry of Education Malaysia (KPM).</p></sec><sec id="s7"><title>NOTES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.55866-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Vuong, T.P. and Gan, D. (2012) A Targeted Malicious Email (TME) Attack Tool. 6th International Conference on Cybercrime, Forensics, Education and Training (CFET), Christ Church Canterbury.</mixed-citation></ref><ref id="scirp.55866-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Nagarjuna, B.V.R.R. and Sujatha, V. (2013) An Innovative Approach for Detecting Targeted Malicious E-Mail. International Journal of Application or Innovation in Engineering &amp; Management (IJAIEM), 2, 422-428.</mixed-citation></ref><ref id="scirp.55866-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Symantec Corporation (2014) Internet Security Threat Report 2014, Vol. 19, 1-98.  http://www.symantec.com/content/en/us/enterprise/other_resources/bistr_main_report_v19_212 
91018.en-us.pdf</mixed-citation></ref><ref id="scirp.55866-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Hurcombe, J. (2014) Malicious Links: Spammers Change Malware Delivery Tactics.http://www.symantec.com/connect/blogs/malicious-links-spammers-change-malware-delivery-tactics</mixed-citation></ref><ref id="scirp.55866-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Amin, R.M. (2011) Detecting Targeted Malicious Email through Supervised Classification of Persistent Threat and Recipient Oriented Features. Ph.D. Dissertation, Dept. Eng. and Applied Sciences, George Washington University, Washington.http://www.researchgate.net/publication/224265677_Detecting_Targeted_Malicious_Email_Using_ 
Persistent_Threat_and_Recipient_Oriented_Features</mixed-citation></ref><ref id="scirp.55866-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Hadnagy, C. (2011) Social Engineering: The Art of Human Hacking. Wiley, Indianapolis.</mixed-citation></ref><ref id="scirp.55866-ref7"><label>7</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Jungsuk</surname><given-names> S. </given-names></name>,<etal>et al</etal>. (<year>2011</year>)<article-title>Clustering and Feature Selection Methods for Analyzing Spam Based Attacks</article-title><source> Journal of the National Institute of Information and Communications Technology</source><volume> 58</volume>,<fpage> 35</fpage>-<lpage>50</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.55866-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Criddle, L. What Are Bots, Botnets and Zombies?http://www.webroot.com/za/en/home/resources/tips/pc-security/security-what-are-bots-botnets- 
and-zombies</mixed-citation></ref><ref id="scirp.55866-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Nazirova, S. (2011) Survey on Spam Filtering Techniques. Communications and Network, 3, 153-160. http://www.scirp.org/journal/PaperInformation.aspx?PaperID=6769#.VPkYAzWlilN http://dx.doi.org/10.4236/cn.2011.33019</mixed-citation></ref><ref id="scirp.55866-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Datar, M., Immorlica, N., Indyk, P. and Mirrokni, V.S. (2004) Locality-Sensitive Hashing Scheme Based on p-Stable Distributions. Proceedings of Symposium on Computational Geometry (SoCG'04), 253-262. http://dl.acm.org/citation.cfm?id=997857 http://dx.doi.org/10.1145/997817.997857</mixed-citation></ref><ref id="scirp.55866-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Andoni, A. and Indyk, P. (2008) Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions. Communications of the ACM, 51, 117-122. http://dl.acm.org/citation.cfm?id=1327494 http://dx.doi.org/10.1145/1327452.1327494</mixed-citation></ref><ref id="scirp.55866-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Gu, X., Zhang, Y., Zhang, L., Zhang, D. and Li, J. (2013) An Improved Method of Locality Sensitive Hashing for Indexing Large-Scale and High-Dimensional Features. Signal Processing, 93, 2244-2255. http://dl.acm.org/citation.cfm?id=2464367 http://dx.doi.org/10.1016/j.sigpro.2012.07.014</mixed-citation></ref><ref id="scirp.55866-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Lee, K.M. and Lee, K.M. (2012) Similar Pair Identification Using Locality-Sensitive Hashing Technique. Proceedings of Joint 6th International Conference on Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2117-2119. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6505385 http://dx.doi.org/10.1109/SCIS-ISIS.2012.6505385</mixed-citation></ref><ref id="scirp.55866-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Shen, H., Li, T., Li, Z. and Ching, F. (2008) Locality Sensitive Hashing Based Searching Scheme for a Massive Database. Proceedings of IEEE Southeastcon’08, 123-128. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4494271</mixed-citation></ref><ref id="scirp.55866-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Ali, S.H.A., Fukase, K. and Ozawa, S. (2013) A Neural Network Model for Large-Scale Stream Data Learning Using Locally Sensitive Hashing. Neural Information Processing Lecture Notes in Computer Science, 369-376. http://link.springer.com/chapter/10.1007%2F978-3-642-42054-2_46</mixed-citation></ref><ref id="scirp.55866-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Platt, J. (1991) A Resource-Allocating Network for Function Interpolation. Neural Computation, 3, 213-225. http://sci2s.ugr.es/keel/pdf/algorithm/articulo/plat1991.pdf http://dx.doi.org/10.1162/neco.1991.3.2.213</mixed-citation></ref><ref id="scirp.55866-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Ozawa, S., Pang, S. and Kasabov, N. (2008) Incremental Learning of Chunk Data for Online Pattern Classification Systems. IEEE Transactions on Neural Networks, 19, 1061-1074. http://www.lib.kobe-u.ac.jp/repository/90001005.pdf http://dx.doi.org/10.1109/TNN.2007.2000059</mixed-citation></ref><ref id="scirp.55866-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Haykin, S. (1999) Neural Networks: A Comprehensive Foundation. Prentice Hall, Upper Saddle River.</mixed-citation></ref><ref id="scirp.55866-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Langley, P. (1994) Selection of Relevant Features in Machine Learning. Proceedings of the AAAI Fall Symposium on Relevance, New Orleans, 4-6 November 1994, 140-144.</mixed-citation></ref><ref id="scirp.55866-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">Oyang, Y.J., Hwang, S.C., Ou, Y.Y., Chen, C.Y. and Chen, Z.W. (2005) Data Classification with Radial Basis Function Networks Based on a Novel Kernel Density Estimation Algorithm. IEEE Transactions on Neural Networks, 16, 225-236. http://dx.doi.org/10.1109/TNN.2004.836229 
http://ieeexplore.ieee.org/xpl/abstractAuthors.jsp?tp=&amp;arnumber=1388471&amp;url=http%3A% 
2F%2Fieeexplore.ieee.org%2Fiel5%2F72%2F30214%2F01388471.pdf%3Farnumber% 
3D1388471</mixed-citation></ref><ref id="scirp.55866-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Dai, Y., Tada, S., Ban, T., Nakazato, J., Shimamura, J. and Ozawa, S. (2014) Detecting Malicious Spam Mails: An Online Machine Learning Approach. Neural Information Processing Lecture Notes in Computer Science, 8836, 365-372. http://link.springer.com/chapter/10.1007%2F978-3-319-12643-2_45</mixed-citation></ref><ref id="scirp.55866-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Cortes, C. and Vapnik, V. (1995) Support-Vector Networks. Machine Learning, 20, 273-297. http://link.springer.com/article/10.1023%2FA%3A1022627411411 http://dx.doi.org/10.1007/BF00994018</mixed-citation></ref><ref id="scirp.55866-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Brank, J., Grobelnik, M., Milic-Frayling, N. and Mladenic, D. (2002) Feature Selection Using Linear Support Vector Machines. Proceedings of the 3rd International Conference on Data Mining Methods and Databases for Engineering, Finance, and Other Fields, Bologna, Italy, 25-27 September 2002, 84-89.</mixed-citation></ref><ref id="scirp.55866-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Robertson, S.E., Walker, S., Jones, S., Hancock-Beaulieu, M.M. and Gatford, M. (1996) Okapi at TREC-3. Proceedings of the Third NIST Text Retrieval Conference (TREC3), NIST Special Publication 500-225, Washington DC, 109-126.</mixed-citation></ref><ref id="scirp.55866-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Ozawa, S., Tabuchi, T., Nakasaka, S. and Roy, A. (2010) An Autonomous Incremental Learning Algorithm for Radial Basis Function Networks. Journal of Intelligent Learning Systems and Appli-cations, 2, 179-189. http://www.scirp.org/journal/PaperInformation.aspx?PaperID=3333#.VPkOYTWlilM http://dx.doi.org/10.4236/jilsa.2010.24021</mixed-citation></ref></ref-list></back></article>