<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">IJIS</journal-id><journal-title-group><journal-title>International Journal of Intelligence Science</journal-title></journal-title-group><issn pub-type="epub">2163-0283</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ijis.2013.31A006</article-id><article-id pub-id-type="publisher-id">IJIS-29547</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Computer Science&amp;Communications</subject></subj-group></article-categories><title-group><article-title>
 
 
  Missing Values Imputation Based on Iterative Learning
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>uaxiong</surname><given-names>Li</given-names></name><xref ref-type="aff" rid="aff1"><sub>1</sub></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff1"><label>1</label><addr-line>School of Management and Engineering, Nanjing University, Nanjing, China</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>huaxiongli@nju.edu.cn</email></corresp></author-notes><pub-date pub-type="epub"><day>29</day><month>03</month><year>2013</year></pub-date><volume>03</volume><issue>01</issue><fpage>50</fpage><lpage>55</lpage><history><date date-type="received"><day>November</day>	<month>3,</month>	<year>2012</year></date><date date-type="rev-recd"><day>January</day>	<month>17,</month>	<year>2013</year>	</date><date date-type="accepted"><day>January</day>	<month>27,</month>	<year>2013</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
   
   Databases for machine learning and data mining often have missing values. How to develop effective method for missing values imputation is a crucial important problem in the field of machine learning and data mining. In this paper, several methods for dealing with missing values in incomplete data are reviewed, and a new method for missing values imputation based on iterative learning is proposed. The proposed method is based on a basic assumption: There exist cause-effect connections among condition attribute values, and the missing values can be induced from known values. In the process of missing values imputation, a part of missing values are filled in at first and converted to known values, which are used for the next step of missing values imputation. The iterative learning process will go on until an incomplete data is entirely converted to a complete data. The paper also presents an example to illustrate the framework of iterative learning for missing values imputation. 
  
 
</p></abstract><kwd-group><kwd>Incomplete Data; Missing Values Imputation; Iterative Learning; Intension; Extension</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Many data mining and machine learning methods have been well developed based on the assumption that the attribute values of each object in the universe is known [1,2]. However, real-world data often contain missing values either because some values are lost or because the cost of acquiring them are too high. How to properly learning knowledge from such incomplete data has become a crucial problem in the field of machine learning and data mining. It is necessary to develop new methods that can be used to effectively learn rules from incomplete data. Such problem has been widely faced in literature [3-12].</p><p>The simplest method to solve this problem is to delete the objects with missing values, then use the rest of the objects as training set to the successive learning process. Such methods may result in the decrease of information for learning. Another similar method is to ignore the objects with missing values in some certain phase of learning. For example in algorithm C4.5 [<xref ref-type="bibr" rid="scirp.29547-ref3">3</xref>], cases with unknown values are ignored while computing the information content, and the information gain for an attribute X is then multiplied by the fraction of cases for which the value of X is known. Although ignoring missing values may be of efficiency when there are not so many missing values, it will be of invalidation when the number of missing values is very large.</p><p>Another important category to deal with missing values focus on filling in missing values so that incomplete data can be converted into complete data, which are also called missing values imputation. Such a conversion is conducted before the main process of rule induction, therefore it is a kind of preprocessing. After such preprocessing, those proposed methods for inducing rules from complete data can be used. Among all the methods to fill in missing values, several main approaches are frequently mentioned in literature. The simplest method is to fill in missing values with “most common attribute value”. For example algorithm CN2 [<xref ref-type="bibr" rid="scirp.29547-ref4">4</xref>], proposed by Clark, P. and Niblett, T. in 1989, uses this strategy. A improved version of this method uses “concept most common attribute value” to fill in missing values, i.e., the attribute values selected to filled in missing values are restricted to the same concept [<xref ref-type="bibr" rid="scirp.29547-ref5">5</xref>]. Grzymala-Busse, J. W. proposes “assigning all possible values of the attribute restricted to the given concept” in 1991 [<xref ref-type="bibr" rid="scirp.29547-ref6">6</xref>], which is effective when the number of missing values is not so large. However, it may result in high cost of computation since all possible values are considered when there exist many missing values. Ghahramani, Z. and Jordan, M. I. present a framework based on maximum likelihood density estimation for learning from incomplete data in 1994 [<xref ref-type="bibr" rid="scirp.29547-ref7">7</xref>]. They use mixture models for the density estimates and make two distinct appeals to the Expectation Maximization principle in deriving a learning algorithm. EM is used both for the estimation of mixture components and for coping with missing data. Moreover, in rough set theory, many researchers extend the equivalence relation to other binary relations such as similarity relation, tolerance relation, dominance relation in order to deal with missing values [8-11]. Such extensions are based on certain assumptions: missing values are considered equal to some known values. Thus they can be regarded as another kind method to fill in missing values.</p><p>There are still two fundamental important problems in these existing methods for filling in missing methods. One is the gradual use of the known values and the filledin missing values in the whole process. Most methods fill in all missing values before learning, i.e., the filling-in process is one-off finished without considering the use of filled-in values for the next step of filling in other missing values. After filling in some missing values in the universe, information will increase. It is a reasonable strategy to fill in other missing values by the utilization of the filled-in values since the filled-in values bring new available information, and this process can recursively continue until a certain terminate conditions are satisfied.</p><p>The other problem is the strategy of filling in missing values. In the existing methods, missing values imputation and learning process are separately treated. There are many missing values imputation methods as well as many learning theories. The connection of these two category processes are not sufficiently discussed. In fact, we may take the missing values imputation problem as a special case of learning process. The missing values can be regarded as a learning target, and the data with known values present a train data sets. Therefore, we may establish the connections between the missing values imputation and learning process. Many learning theories can be introduce to missing values imputation, which presents a new view on missing values imputation. The objective of this paper is to propose a new framework for missing values imputation by introducing gradually iterative learning approach.</p></sec><sec id="s2"><title>2. Learning-Based Missing Values Imputation</title><p>Traditionally, filling in missing values is usually considered as a technique method to fix up the original data for rule induction. For an information table, condition attributes and decision attributes are treated distinctly as different kinds of attributes according to their different roles in the rule induction. It is assumed that there exist some certain relationship between the condition attribute values and decision attribute values. Such relationship can be denoted as a rule:</p><p>if (<img src="4-1680060\afc410ad-587f-46c8-b6fd-840dc2925cc7.jpg" />and <img src="4-1680060\32ea66f5-16e9-4d0a-9477-8f8c16c9f600.jpg" /> and <img src="4-1680060\72d27834-38a2-4a9a-a0cd-be2538dc6884.jpg" /> and<img src="4-1680060\ff6d9976-8564-444f-bc9e-4e05f5abefe3.jpg" />) then <img src="4-1680060\1354b2db-0fe9-4231-b110-aa639cd93d30.jpg" />where <img src="4-1680060\273b296c-ff4f-4ab0-ad0b-0594f15f86ee.jpg" /> are condition attributes, <img src="4-1680060\5e57d248-00b5-441f-ba63-1398fc132772.jpg" /> are values of condition attributes, and <img src="4-1680060\6e9b34ed-fcf3-4b2d-80a2-08fb4b3d0051.jpg" /> is decision attribute, <img src="4-1680060\3a685988-5cbb-4dfc-8f3c-e2983770688d.jpg" />is the value of decision attribute. A main task of machine learning and data mining is to find out such rules that can properly express the relationship between condition attribute values and decision attribute values. As incomplete data is concerned, rule induction is difficult to fulfill since some attribute values are missing. By filling in missing values with some certain approaches, rule induction may be easily fulfilled as incomplete data have been converted to complete data.</p><p>From another perspective, filling in missing values can also be concerned as a learning process if we treat missing values as a special unknown decision output that need to learn from the known values. The goal of this kind of learning is to induce rules for filling in missing values. Suppose <img src="4-1680060\d9e69317-a6f2-4ea2-b776-fc91dd6b5477.jpg" /> is an attribute with missing values, we may consider filling in missing values of <img src="4-1680060\c3038879-7e59-4685-959d-1eb37df72654.jpg" /> as a process of learning such a set of rules as:</p><p>if (<img src="4-1680060\16b531b9-94ca-47d8-94c1-7864c29ba30e.jpg" />and <img src="4-1680060\63349a29-43d2-4419-85dc-2edc3c869f42.jpg" /> and <img src="4-1680060\f2dfefd0-c816-4c3e-ba65-d8392cfd57d0.jpg" /> and <img src="4-1680060\135e5871-f137-45ba-94f9-e6ae2e1dbd01.jpg" /> and<img src="4-1680060\c30eab9a-baae-4942-a665-1bc64e9f0350.jpg" />) and <img src="4-1680060\60240cf0-6420-40a7-80c1-ad99e8d7cf74.jpg" /> and <img src="4-1680060\41d10744-77aa-4bfa-af9a-30476d945bd5.jpg" /> then<img src="4-1680060\c7d1b7b5-c7b8-46e4-ade5-b25c865dc3b7.jpg" />if (<img src="4-1680060\ba69410e-6415-404c-a813-b6d510883538.jpg" />and <img src="4-1680060\7d1127a0-6905-4273-bc65-fc3f1352a60c.jpg" /> and <img src="4-1680060\185efcae-51a3-4767-bf8d-80d5489adabd.jpg" /> and <img src="4-1680060\62a2248b-0c79-43a8-bf43-36b8f31e949a.jpg" /> and<img src="4-1680060\d30133f0-40c3-400f-a35a-4bd1bbacef00.jpg" />) and <img src="4-1680060\49697336-302f-4a91-b541-97528e965997.jpg" /> and <img src="4-1680060\a61deaf8-20af-4f9d-865b-3336e1573fa9.jpg" /> then<img src="4-1680060\74cbb753-3c2f-4d35-af32-3b19abffbe70.jpg" />, <img src="4-1680060\7aa04798-ecfe-4e46-8109-4e511da8e33e.jpg" /></p><p>if (<img src="4-1680060\cc38f11f-e8f7-4614-ab15-64287607c6fd.jpg" />and <img src="4-1680060\7144da74-88de-4e37-8f04-bee63aad5266.jpg" /> and <img src="4-1680060\1d94b1d4-70a9-4c28-a20e-e78727a7eebd.jpg" /> and <img src="4-1680060\86b61f4f-8af9-469b-91a4-ae92b601952b.jpg" /> and<img src="4-1680060\83e63b06-65c9-4390-9ac8-6977e5157fb7.jpg" />) and <img src="4-1680060\01fe736d-c244-4eed-9b43-5b6e0f510024.jpg" /> and <img src="4-1680060\a4f57d65-e2d5-46b2-bfc1-7b8e82644be4.jpg" /> then<img src="4-1680060\99cb0ad5-a5fb-4c10-b894-74be6218625c.jpg" />.</p><p>It is a reasonable assumption that there may exist some connections among condition attribute values. Such a viewpoint provide us a new learning-oriented understanding on filling in missing values. Many appropriate methods of machine learning can be used. In next section, we will propose a gradual iterative learning method for filling in missing values.</p></sec><sec id="s3"><title>3. Iterative Learning for Missing Values Imputation</title><p>Many psychologist believe that the cognitive process of human being is a gradual learning process. At the beginning of learning process, those simple elementary knowledge is acquired at first, which is the foundation of the next step learning. With the accumulating of elementary knowledge, cognitive ability may get stronger so that more complex concept may be learnt in the next step of learning process. Such a gradually iterative learning process of human being can be used in filling in missing values. At the beginning of filling in missing values, there may be only few missing values that can be filled in with high confidence since initially there often exist many missing values and the usable information is scarce. After a partial process of filling in missing values, some missing values have been converted to known values since they have been filled in with some certain values, thus known values increase and add more usable information, which may be available for the next step of filling in remained missing values. Based on the refined data, some missing values that can not be certainly filled in previously may get more confidence on some certain values. This reiteration will go on so that an incomplete data may be gradually converted to complete data.</p><p>Such iterative learning approach is similar to some other existing learning methods such as semi-supervised learning, a machine learning technique proposed by Shahshahani and Landgrebe [<xref ref-type="bibr" rid="scirp.29547-ref13">13</xref>]. Semi-supervised learning focus on learning from both labeled and unlabeled data— a small amount of labeled data with a large amount of unlabeled data. It falls between unsupervised learning and supervised learning. Many machine learning researchers have found that unlabeled data, when used in conjunction with a small amount of labeled data, can produce considerable improvement in learning accuracy. With iteratively labeling those unlabeled objects according to refined data, the performance of learning continually increases.</p><p>By using iterative learning method, we propose a new learning approach of filling in missing values. The main idea can be described as follows. At the beginning of the learning process, some of missing values are filled in based on learning from existing known values, thus the number of known values increase and the remained missing values can be filled in by learning from the refined set of known values. This process will go on until a certain condition is satisfied.</p><p>A high-level algorithm for filling in missing values using iterative learning approach is proposed as follows:</p><p>Input: an information table <img src="4-1680060\f37d5470-b18a-4072-86b2-bb90638206bd.jpg" /> with known values set <img src="4-1680060\1df0f3ae-9619-4ea7-98e4-e34328e0b583.jpg" /> and missing values set <img src="4-1680060\c4811135-7fc6-4581-965d-f269bab03a3c.jpg" /></p><p>Output: a refined information table <img src="4-1680060\7d9de13b-1d75-4242-b800-63d29b6115c5.jpg" /> with known values set <img src="4-1680060\eb536fdd-3d0c-4b2e-a40a-80bca8194444.jpg" /> and missing values set <img src="4-1680060\31d25a07-5110-4fc3-9909-f0c269f53b36.jpg" /></p><p>Set <img src="4-1680060\9446ca23-7115-4384-9b12-30630ea14a63.jpg" /></p><p>Repeat the following operations Select a subset <img src="4-1680060\b77395b4-b92f-451d-8837-6452aaaf5041.jpg" /> of<img src="4-1680060\934e35b6-bb3b-4cdf-b670-ef65f9892967.jpg" />;</p><p>Fill in <img src="4-1680060\6cd3bc6a-e596-4720-9d75-10b698321753.jpg" /> based on learning from<img src="4-1680060\ab17cd7c-65c3-4cf4-bec4-0881241d1d58.jpg" />;</p><p>Replace <img src="4-1680060\941f3129-ce60-434b-913b-0c60f9399fca.jpg" /> by filled-in values set<img src="4-1680060\b6879ac8-ea78-48c6-9dd6-0ec03440943c.jpg" />;</p><p><img src="4-1680060\9c7f0da4-1b12-40a9-a710-456ec27d476e.jpg" />;</p><p><img src="4-1680060\753927f1-1b16-43cd-95d1-bfc0b22c1b9a.jpg" />;</p><p>Until a termination condition is satisfied Return <img src="4-1680060\35cdd183-871c-49ec-b7df-c9ff4c283426.jpg" /></p><p>In the high-level algorithm mentioned above, known values set and missing values set will be refined in each of many learning iterations. The key point of this algorithm is to decide the strategy of filling in missing values:</p><p>Select a subset <img src="4-1680060\485673d5-c194-40e7-a8b5-9c3c5131ec00.jpg" /> of<img src="4-1680060\4352ccf4-e3b8-4bce-8d1f-6945bff7ff08.jpg" />;</p><p>Fill in <img src="4-1680060\813f2e60-b3f9-4e9e-90a6-d43e0ad51c9a.jpg" /> based on learning from<img src="4-1680060\9cd36da2-8d38-4c9c-8981-c6ab4f3ecae4.jpg" />;</p><p>Many methods of machine learning can be used. Here we adopt concept formation and learning as a solution.</p></sec><sec id="s4"><title>4. Concept Formation in Incomplete Information Table</title><p>In general, a concept can be defined by a pair of intension and extension, where the intension of a concept is given by a set of properties and the extension of a concept normally defined with respect to a particular set of examples [14-17]. Considering a complete information table such as <xref ref-type="table" rid="table1">Table 1</xref> [<xref ref-type="bibr" rid="scirp.29547-ref14">14</xref>], we may express the conception as this pattern.</p><p>Let <img src="4-1680060\41775d84-4a02-4562-9f8c-ba6dae08a355.jpg" /> be a finite set of attributes or features. For each attribute<img src="4-1680060\6be9937a-7e33-4298-b150-94ffefb9801b.jpg" />, we associate it with a set of values or labels<img src="4-1680060\1d053a01-c4bd-4727-adcd-832f53022ca7.jpg" />. Let <img src="4-1680060\ccf5cb0e-fe45-4e2a-87e6-c3f283127f30.jpg" /> be a set of universe whose elements are called objects. For each<img src="4-1680060\30f51e42-4ebe-4f1f-9969-97288e6f636b.jpg" />, there is a mapping <img src="4-1680060\947c22f7-bd42-49a7-8bd8-a4bea49a2909.jpg" /> connecting elements of <img src="4-1680060\49a7c5be-237a-49b4-a3c4-b578db953529.jpg" /> and elements of<img src="4-1680060\18aa5c64-fa9a-4ce9-a573-61046c8ba028.jpg" />, and the value of an object <img src="4-1680060\a67134dc-5cd2-4996-bb03-186f4895459e.jpg" /> on an attribute <img src="4-1680060\4e6a1ec6-3e56-4df1-8558-13083b5f43c9.jpg" /> is denoted by<img src="4-1680060\4a169398-c840-4b1e-8feb-72a814e4a754.jpg" />. Commonly in complete information table, where the attribute values of each object are known, it is assumed that the mapping <img src="4-1680060\1fe8c99f-b9a6-45d7-8590-8f3be1026312.jpg" /> is single-valued.</p><p>For an incomplete information table, there exist objects whose attribute values are multiple-valued or unknown. In this case, the value of an object <img src="4-1680060\7b32016d-a33a-4d4c-a098-46e1edcb0b89.jpg" /> on an attribute <img src="4-1680060\ca07152f-6fe2-4326-9dc9-1f607f012dea.jpg" /> i.e. <img src="4-1680060\64995760-e3a1-4311-b15f-e923e41c2c77.jpg" />can be expressed by *, namely<img src="4-1680060\5cec9647-9191-4615-8574-1e40d38c1c16.jpg" />. An example of an incomplete table is presented in <xref ref-type="table" rid="table2">Table 2</xref>.</p><p><xref ref-type="table" rid="table1">Table 1</xref>. An complete information table.</p><p><img src="4-1680060\993adf49-04c1-4c1a-9c9e-d0e9fa2f2701.jpg" /></p><p><xref ref-type="table" rid="table2">Table 2</xref>. Original incomplete information table.</p><p><img src="4-1680060\df90c0b5-bf92-4bc7-a0be-761acd740cd1.jpg" /></p><p>In this paper, we adopt the decision logic language <img src="4-1680060\72a9bacf-3c01-4e05-8ade-77283f1bff6f.jpg" /> used and studied by Pawlak to formally define intensions of concepts [<xref ref-type="bibr" rid="scirp.29547-ref14">14</xref>]. In an incomplete information system, an atomic formula is given by<img src="4-1680060\d3542ad7-7f35-48e8-94fd-864ccc819035.jpg" />, where<img src="4-1680060\4d44ba57-66af-4b3a-a86d-de224f134a70.jpg" />, and it should be stressed that the unknown attribute value (null value) i.e. “<sub>*</sub>” is not included in<img src="4-1680060\b8e56575-ac73-4089-bbbc-b05316110872.jpg" />. Therefore, in an atomic formula, <img src="4-1680060\fb3efeb6-c4f2-4448-b755-740f4033589d.jpg" />is illegal. For each atomic formula<img src="4-1680060\a80a12b8-04b8-4d17-b741-7d047caccfc9.jpg" />, an object <img src="4-1680060\61451915-256c-443c-9179-a6edadee66d0.jpg" /> satisfies it if<img src="4-1680060\b2d5696f-9030-4f3d-a353-aa7cbf1b42bd.jpg" />, written<img src="4-1680060\d147d1cc-b0a9-4236-b5e8-c1fbd1790e8d.jpg" />. Otherwise, it does not satisfy <img src="4-1680060\4ceb6d3a-81d1-45ac-a81c-46a12caca79c.jpg" /> or <img src="4-1680060\f721e4e7-0303-4185-9e6b-4ee9aa3915a3.jpg" /> and is written<img src="4-1680060\8612a626-3574-4db2-8506-84b60fde6341.jpg" />. From atomic formulas, we can construct other formulas by applying the logic connectives <img src="4-1680060\ac178700-c261-4c3b-8ebd-8884440cb024.jpg" /> and<img src="4-1680060\a1609af5-05b6-4ee7-a7b9-92f702dafb1d.jpg" />. The satisfiability of any formula is defined as follows:</p><p>1) <img src="4-1680060\52c523e9-e59f-42aa-a3b5-a8ad4683422d.jpg" />iff not<img src="4-1680060\5fed9172-e6b5-4b84-8702-b12af2441949.jpg" />2) <img src="4-1680060\0530c006-f6cd-436f-81a6-c7d4d701de5a.jpg" />iff <img src="4-1680060\19759701-782f-4697-8261-1ec7f069175b.jpg" /> and<img src="4-1680060\dd61bc6b-251f-4cb8-b192-59a702ec6746.jpg" />3) <img src="4-1680060\a3f69c79-6fd6-41bf-96e6-e714185f4bee.jpg" />iff <img src="4-1680060\060f2649-7aca-43ca-93d1-603d68c3269b.jpg" /> or<img src="4-1680060\eaf62c4b-c334-42b4-82ec-62eb5dff40ec.jpg" />4) <img src="4-1680060\9e26aeac-d974-4c38-a511-cdffe77da32e.jpg" />iff<img src="4-1680060\fd4f84ff-ccbe-4308-8066-f0cdcf913253.jpg" />5)<img src="4-1680060\ea1ad4e8-116b-4cc2-a190-32d94d836fb8.jpg" />. iff <img src="4-1680060\5838648f-03f2-4726-bdbe-60ed9c59bd36.jpg" /> and<img src="4-1680060\b1a743fd-4616-46a7-98d3-0c31a86d4ea5.jpg" />.</p><p>Traditionally, a basic formula <img src="4-1680060\f494a6fe-cfcd-471b-bfdf-15bcc08aad0e.jpg" /> can represent a intension of a concept, and the extension of the concept can be represented by the elements which satisfy the formula, denoted by<img src="4-1680060\7c9a8498-cb84-43f0-840b-2d8453c9faf8.jpg" />. When considering the missing attribute values, all elements with the value of the attribute <img src="4-1680060\528637bf-6036-48a3-a93a-0e130acfcc68.jpg" /> is are not included. Namely, <img src="4-1680060\e6f034b7-c3f9-4dd2-86d7-d133cf05ccd4.jpg" />only contains the elements whose value of attribute <img src="4-1680060\68010f2c-92fe-4e1b-bca6-bf84775293d5.jpg" /> is exactly<img src="4-1680060\3a218d81-bed9-4bce-8067-d64479c9d8b0.jpg" />. In <xref ref-type="table" rid="table2">Table 2</xref>, for example:</p><p><img src="4-1680060\7a0fec03-1a81-45e8-995f-3504615c510c.jpg" /></p><p>Moreover, following results can be got by applying the logic connectives:</p><p><img src="4-1680060\3615f1c6-57fe-41f9-8012-76383fde7df2.jpg" /></p></sec><sec id="s5"><title>5. Iterative Imputation Using Concept Formation</title><p>Let us take <xref ref-type="table" rid="table2">Table 2</xref> as an example to describe the process of iterative learning for filling in missing values using concept formation. Suppose we’ll learn the rules for filling in missing values of attribute <img src="4-1680060\dedd29d5-d5cb-4c2b-a926-878bea0f8a10.jpg" /> at first. Training set used to learn such rules for filling in missing values of attribute <img src="4-1680060\36f9d471-89dd-482c-b897-bb0fc3eded54.jpg" /> is restricted to <img src="4-1680060\721940d2-7501-4685-ba9b-b05a6168d3dc.jpg" />. Then we list all related concepts:</p><p><img src="4-1680060\01845203-48d0-4fc2-a71a-66c972fc3635.jpg" /></p><p>We can find <img src="4-1680060\d6569dd7-7959-4f24-b1c6-6ab4116793d7.jpg" /> and</p><p><img src="4-1680060\b415dbdc-ca10-48a0-9e6a-8ce7e8f0cf27.jpg" />thus a set of rules is inducted as:</p><p><img src="4-1680060\061ff862-90f4-4882-bb04-728fb396f45b.jpg" /></p><p>then we fill in <img src="4-1680060\b71a8d99-387c-454d-a172-ec0340bc8c03.jpg" /> and <img src="4-1680060\bb4f1b88-2a45-4d6e-8fbe-3797a462473f.jpg" /> with<img src="4-1680060\481ee060-512d-4bab-b2cf-4b2554c823e6.jpg" />. Therefore information table has been converted to <xref ref-type="table" rid="table3">Table 3</xref>.</p><p>We’ll use the refined data to learn rule for filling in missing values of attribute <img src="4-1680060\17997093-655c-4204-9820-43b39a804edc.jpg" /> in the next step. Training set for learning is restricted to<img src="4-1680060\2687ea55-1eb5-4dcc-b325-2a947a0b0582.jpg" />. By using the similar approach mentioned above, we can find</p><p><img src="4-1680060\80f3b325-6ede-4af3-8594-7b3774f54272.jpg" /></p><p>and the corresponding rule is:</p><p><img src="4-1680060\bd565535-10ca-4a2e-b78c-a5f3b5022542.jpg" /></p><p>then <img src="4-1680060\d6fb4703-2bc9-4d2c-80b4-8a5da89e90f4.jpg" /> is filled in with<img src="4-1680060\0db104b8-f02a-49bc-a28f-80c33eba9133.jpg" />. Therefore the information table is converted to <xref ref-type="table" rid="table4">Table 4</xref>.</p><p>Similarly, <img src="4-1680060\d04b7abd-8d69-435b-9ba6-d551976f547c.jpg" />can be filled in with <img src="4-1680060\a46c07b4-1568-4dbb-86ed-77162087b7ae.jpg" /> on the basis of<img src="4-1680060\4b620355-612e-448c-9a3c-b33310946144.jpg" />, i.e.,</p><p><img src="4-1680060\c0cd876c-c210-4045-b10e-32d471be6a62.jpg" /></p><p><xref ref-type="table" rid="table3">Table 3</xref>. Missing values imputation: Headache.</p><p><img src="4-1680060\36b31bed-45a7-458f-a25d-5b40fca0af21.jpg" /></p><p><xref ref-type="table" rid="table4">Table 4</xref>. Missing values imputation: Temperature.</p><p><img src="4-1680060\fc8334d0-97f6-4444-85b6-a46fd44aa45f.jpg" /></p><p><xref ref-type="table" rid="table5">Table 5</xref> presents the final repaired data. We may find that the incomplete data has been converted to a complete data after four-step filling in missing values based on iterative learning, then the complete information table can be used for traditional rule induction based on any exist learning methods.</p><p>Remark: In the process of iterative learning for missing values imputation, the selection order of the missing values for imputation is a fundamental problem, which affects the final imputation result. In order to present a high quality imputation result, we should fill in the missing values with the highest confidence in each phase of imputation, that is, we prefer to fill in the missing values associated with a higher confidence rule, which leads to a more reliable imputation result.</p><p>It is a basic assumption that there exist some certain relationship among the different attributes. Therefore, in the missing values imputation process, the iterative learning method is used to discover the connection between the known values and the missing values. It should be noted that many machine learning approaches besides concept learning can also be used to discover the relationship between the known values and missing values. In the future work, we will further study on the missing values select order for imputation and present detailed experimental analysis on the proposed method. In addition, many other machine learning methods for iterative missing values imputation will be further investigated in the future work.</p><p><xref ref-type="table" rid="table5">Table 5</xref>. Missing values imputation: Nausea.</p><p><img src="4-1680060\c477f531-9291-44d9-a1d6-3e86759ff9dd.jpg" /></p></sec><sec id="s6"><title>6. Conclusion</title><p>In this paper, several methods for dealing with missing values in incomplete data are reviewed, and a new method for missing values imputation based on iterative learning is proposed. At the beginning of missing values imputation, there may be only few missing values that can be filled in with high confidence. After some missing values are filled in, they are converted to known values, then known values increase and more usable information is introduced, which may be available for the next step of missing values imputation. Based on the refined data, some missing values that can not be certainly filled in previously may get more confidence on some certain values. This reiteration will go on so that an incomplete data may be gradually converted to complete data. The paper present a theoretic framework of missing values imputation as well as a practical solution.</p></sec><sec id="s7"><title>7. Acknowledgements</title><p>This research is supported by the National Natural Science Foundation of China (71201076), and the Natural Science Foundation of Jiangsu, China (BK2011564).</p></sec><sec id="s8"><title>REFERENCES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.29547-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">T. M. Mitchell, “Generalization as Search,” Artificial Intelligence, Vol. 18, No. 2, 1982, pp. 203-226. 
doi:10.1016/0004-3702(82)90040-6</mixed-citation></ref><ref id="scirp.29547-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">T. M. Mitchell, “Machine Learning,” McGraw-Hill, New York, 1997.</mixed-citation></ref><ref id="scirp.29547-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">J. R. Quinlan, “C4.5: Programs for Machine Learning,” Morgan Kaufmann, San Mateo, 1993.</mixed-citation></ref><ref id="scirp.29547-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">P. Clark and T. Niblett, “The CN2 Induction Algorithm,” Machine Learning, Vol. 3, No. 4, 1989, pp. 261-283. 
doi:10.1007/BF00116835</mixed-citation></ref><ref id="scirp.29547-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">J. W. Grzymala-Busse and W. J. Grzymala-Busse, “An Experimental Comparison of Three Rough Set Approaches to Missing Attribute Values,” In: J. Peters, A. Skowron, I. Duntsch, J. Grzymala-Busse, E. Orlowska and L. Polkowski, Eds. LNCS Transactions on Rough Sets VI, Springer, Berlin, 2007, pp. 31-50.</mixed-citation></ref><ref id="scirp.29547-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">J. W. Grzymala-Busse, “On the Unknown Attribute Values in Learning from Examples,” In: Z. Ras and M. Zemankova, Eds., Proceedings of 6th International Symposium on Methodologies for Intelligent Systems, Springer, Berlin, 1991, pp. 368-377.</mixed-citation></ref><ref id="scirp.29547-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Z. Ghahramani and M. I. Jordan, “Supervised Learning from Incomplete Data via an EM Approach,” In: J. D. Cowan, G. Tesauro and J. Alspector, Eds., Advances in Neural Information Processing Systems, Morgan Kaufmann, San Mateo, 1994, pp. 120-127.</mixed-citation></ref><ref id="scirp.29547-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">S. Greco, B. Matarazzo and R. Slowinski, “Handling Missing Values in Rough Set Analysis of Multi-Attribute and Multi-Criteria Decision Problems,” In: N. Zhong, A. Skowron and S. Ohsuga, Eds., Proceedings of 7th International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing, Springer, Berlin, 1999, pp. 146-157.</mixed-citation></ref><ref id="scirp.29547-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">M. Kryszkiewicz, “Rough Set Approach to Incomplete Information Systems, Information Sciences, Vol. 112, No. 1-4, 1998, pp. 39-49. 
doi:10.1016/S0020-0255(98)10019-1</mixed-citation></ref><ref id="scirp.29547-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">M. Kryszkiewicz, “Rules in Incomplete Information Systems,” Information Sciences, Vol. 113, No. 3-4, 1999, pp. 271-292. doi:10.1016/S0020-0255(98)10065-8</mixed-citation></ref><ref id="scirp.29547-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">J. Stefanowski and A. Tsoukiàs, “On the Extension of Rough Sets under Incomplete Information,” International Journal of Intelligent System, Vol. 16, No. 1, 2000, pp. 29-38.</mixed-citation></ref><ref id="scirp.29547-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">H. X. Li, Y. Y. Yao, X. Z. Zhou and B. Huang, “Two-Phase Rule Induction from Incomplete Data,” In: G. Wang, T. Li, J. Grzymala-Busse, D. Miao, A. Skowron and Y. Yao, Eds., Proceedings of 3rd International Conference on Rough Sets and Knowledge Technology, Springer, Berlin, pp. 47-54.</mixed-citation></ref><ref id="scirp.29547-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">B. Shahshahani and D. Landgrebe, “The Effect of Unlabeled Samples in Reducing the Small Sample Size Problem and Mitigating the Hughes Phenomenon,” IEEE Transactions on Geoscience and Remote Sensing, Vol. 32, No. 5, 1994, pp. 1087-1095. doi:10.1109/36.312897</mixed-citation></ref><ref id="scirp.29547-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Y. Y. Yao, “Concept Formation and Learning: A Cognitive Informatics Perspective,” In: C. Chan, W. Kinsner, Y. Wang and D. Miller, Eds., Proceedings of 3rd IEEE International Conference on Cognitive Informatics, IEEE CS Press, New York, 2004, pp. 42-51.</mixed-citation></ref><ref id="scirp.29547-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Y. Y. Yao and N. Zhong, “An Analysis of Quantitative Measures Associated with Rules,” In: X. Wu, K. Ramamohanarao and K. Korb, Eds., Proceedings of 2nd Pacific-Asia Conference on Knowledge Discovery and Data Mining, Springer, Berlin, 1999, pp. 479-488.</mixed-citation></ref><ref id="scirp.29547-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Y. Zhao, Y. Y. Yao and J. T. Yao, “Level-Wise Construction of Decision Trees for Classification,” International Journal of Software Engineering and Knowledge Engineering, Vol. 16, No. 1, 2006, pp. 103-123. 
doi:10.1142/S0218194006002690</mixed-citation></ref><ref id="scirp.29547-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">J. T. Yao and Y. Y. Yao, “Induction of Classification Rules by Granular Computing,” In: J. Alpigini, J. Peters, A. Skowron and N. Zhong, Eds., Proceedings of 3rd International Conference on Rough Sets and Current Trends in Computing, Springer, Berlin, 2002, pp. 331-338. 
doi:10.1007/3-540-45813-1_43</mixed-citation></ref></ref-list></back></article>