<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AJCM</journal-id><journal-title-group><journal-title>American Journal of Computational Mathematics</journal-title></journal-title-group><issn pub-type="epub">2161-1203</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/ajcm.2015.52017</article-id><article-id pub-id-type="publisher-id">AJCM-57418</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  A Survey on Semantic Similarity Measures between Concepts in Health Domain
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>bdelhakeem</surname><given-names>M. B. Abdelrahman</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ahmad</surname><given-names>Kayed</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>CS Department, IT Faculty, Middle East University (MEU), Amman, Jordan</addr-line></aff><aff id="aff1"><addr-line>College of Post Graduate Studies, Sudan University of Science and Technology, Khartoum, Sudan</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>ambashir@imamu.edu.sa(BMBA)</email>;<email>drkayed@ymail.com(AK)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>13</day><month>05</month><year>2015</year></pub-date><volume>05</volume><issue>02</issue><fpage>204</fpage><lpage>214</lpage><history><date date-type="received"><day>15</day>	<month>April</month>	<year>2015</year></date><date date-type="rev-recd"><day>accepted</day>	<month>21</month>	<year>June</year>	</date><date date-type="accepted"><day>26</day>	<month>June</month>	<year>2015</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  The similarity between biomedical terms/concepts is a very important task for biomedical information extraction and knowledge discovery. The measures and tests are tools used to define how to measure the goodness of ontology or its resources. The semantic similarity measuring techniques can be classified into three classes: first, measuring semantic similarity using ontology/ taxonomy; second, using training corpora and information content and third, combination between them. Some of the semantic similarity measures are based on the path length between the concept nodes as well as the depth of the 
  <em>LCS</em> node in the ontology tree or hierarchy, and these measures assign high similarity when the two concepts are in the lower level of the hierarchy. However, most of the semantic similarity measures can be adopted to be used in health domain (Biomedical Domain). Many experiments have been conducted to check the applicability of these measures. In this paper, we investigate to measure semantic similarity between two concepts within single ontology or multiple ontologies in UMLS Metathesaurus (MeSH, SNOMED-CT, ICD), and compare my results to human experts score by correlation coefficient.
 
</p></abstract><kwd-group><kwd>Biomedical Ontology</kwd><kwd> Semantic Similarity</kwd><kwd> Biomedical Concept</kwd><kwd> Unified Medical Language System (UMLS)</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>Ontology is test bed of semantic web, capturing knowledge about certain area via providing relevant concept and relation between them. Quality metrics are essential to evaluate the quality. Metrics are based on structure and semantic level. At the present, the ontology evaluation is based only on structural metrics, which has not been very appropriate in providing desired results.</p><p>Semantic similarity measures are widely used in Natural Language Processing. We show how six existing domain-independent measures can be adapted to the biomedical domain. Semantic similarity techniques are becoming important components in most intelligent knowledge-based and Semantic Information Retrieval (SIR) systems [<xref ref-type="bibr" rid="scirp.57418-ref1">1</xref>] .</p><p>The classification of these includes semantic similarity measures for single ontology and for multiple ontologies, and the classification is based on how the semantic similarity measure is quantified [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] .</p><p>Ahmad kayed et al. [<xref ref-type="bibr" rid="scirp.57418-ref3">3</xref>] : proposed various methodologies to evaluate ontologies: most of them belong to one of the following categories:</p><p>o Coverage techniques between an ontology and a domain of knowledge that the ontology is created for.</p><p>o Human experts who try to assess how well the ontology meets a set of predefined criteria, standards, and requirements.</p><p>o Using the ontology in the context of an application to evaluate its effectiveness. The use of the system may detect weakness or strength points in the ontology.</p><p>o Comparing the ontology with other ontologies in the same domain.</p><p>o Studying ontology’s relationships considering some measures.</p><p>o Comparing the formal representation of the ontology with other ontologies formal representations, criterions, or measures.</p><p>Measures and tests are provided to define how we can measure the “goodness” of ontology or its resources. Many experiments have been conducted to check the applicability of these measures [<xref ref-type="bibr" rid="scirp.57418-ref3">3</xref>] .</p><p>General English ontology based structure similarity measures can be adopted to be used into the biomedical domain within UMLS. New approach for measuring semantic similarity between biomedical concepts using multiple ontologies is proposed by Al-Mubaid and Nguyen [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] . They proposed new ontology structure based technique for measuring semantic similarity between single ontology and multiple ontologies in the biomedical domain within the frame work of Unified Medical Subject Language System (UMLS). Their proposed measure is based on three features [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] : first, cross modified path length between two concepts; second, new features of common specificity of concepts in the ontology; third, local ontology granularity of ontology cluster.</p></sec><sec id="s2"><title>2. Health Domain Ontologies</title><p>Most of the semantic similarity work in the biomedical domain uses only ontology (e.g. MeSH, SOMED-CT) for computing the similarity between the biomedical terms [<xref ref-type="bibr" rid="scirp.57418-ref5">5</xref>] . However, in this work we use ICD-10 ontology as primary source to computing the similarity between concepts in biomedical domain.</p><p>New ontologies in biology and medicine continue to increase as the need for them arises. Some of the most well-studied and prominent examples are presented here.</p><p>1) MeSH: Stands for Medical Subject Headings, is the one of source vocabularies used in UMLS. Its includes about 15 high level categories, and each category is divided into sub categories and assign a letter: A for Anatomy, B for organisms, C for diseases, and so on [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref6">6</xref>] .</p><p>2) SNOMED-CT: Stand for Systemized Nomenclature of Medicine Clinical Term, was included in UMLS in May 2004 [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] . is a comprehensive clinical Ontology maintained by the International Health Terminology Standards Development Organization (IHTSDO) [<xref ref-type="bibr" rid="scirp.57418-ref8">8</xref>] .</p><p>3) FMA: Stand for Foundational Model of Anatomy: One of the most coherently structured ontologies in biomedicine is the Foundational Model of Anatomy, domain ontology of the classes and relationships that pertain to the structural organization of the human body [<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] .</p><p>4) GALEN: Stand for Generalized Architecture for Languages, Encyclopedias and Nomenclatures (GALEN), is Common Reference Model. The goal of the GALEN project is to provide re-usable terminology resources for clinical systems [<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref9">9</xref>] .</p><p>5) Medical Entities Dictionary: The Medical Entities Dictionary (MED) is a concept-oriented terminology developed and used in Columbia University and the New York Presbyterian Hospital (NYPH) [<xref ref-type="bibr" rid="scirp.57418-ref10">10</xref>] . It currently contains approximately 97,000 concepts organized into a semantic network of frame-based term descriptions, encompassing those terms used in laboratory, pharmacy, radiology, and billing systems. It includes knowledge about synonyms, taxonomic and other types of relations, and mappings to other terminologies.</p><p>6) National Cancer Institute Thesaurus. The NCI Thesaurus is a description logic-based terminology that is a component of the US National Cancer Institute (NCI) Bioinformatics ca CORE distribution [<xref ref-type="bibr" rid="scirp.57418-ref9">9</xref>] .</p><p>7) Unified Medical Language System: Many studies evaluating the usefulness of the UMLS as a terminology and knowledge resource for tasks ranging from terminology translation to domain ontology construction have been published in recent years [<xref ref-type="bibr" rid="scirp.57418-ref9">9</xref>] .</p><p>8) International Classification of Diseases (ICD): The newest edition (ICD-10) is divided into 21 chapters: (Infections, Neoplasm, Blood Diseases, Endocrine Diseases, etc.), and denote about 14,000 classes of diseases and related problems. The first character of the ICD code is a letter, and each letter is associated with a particular chapter, except for the letter D, which is used in both Chapter II, Neoplasm, and Chapter III, Diseases of the blood and blood-forming organs and certain disorders involving the immune mechanism, and the letter H, which is used in both Chapter VII, Diseases of the eye and adnexa and Chapter VIII, Diseases of the ear and mastoid process. Four chapters (Chapters I, II, XIX and XX) use more than one letter in the first position of their codes. Each chapter contains sufficient three-character categories to cover its content; not all available codes are used, allowing space for future revision and expansion. Chapters I-XVII relate to diseases and other morbid conditions, and Chapter XIX to injuries, poisoning and certain other consequences of external causes. The remaining chapters complete the range of subject matter nowadays included in diagnostic data. Chapter XVIII covers Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified. Chapter XX, External causes of morbidity and mortality, was traditionally used to classify causes of injury and poisoning, but, since the Ninth Revision, has also provided for any recorded external cause of diseases and other morbid conditions. Finally, Chapter XXI, Factors influencing health status and contact with health services, is intended for the classification of data explaining the reason for contact with health-care services of a person not currently sick, or the circumstances in which the patient is receiving care at that particular time or otherwise having some bearing on that person’s care. The chapters are subdivided into homogeneous “blocks” of three-character categories. most of the three character categories are subdivided by means of a fourth, numeric character after a decimal point, allowing up to 10 subcategories [<xref ref-type="bibr" rid="scirp.57418-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref11">11</xref>] .</p></sec><sec id="s3"><title>3. Semantic Similarity Measures Challenges in the Health Domain</title><p>Most of the semantic similarity work in the biomedical domain uses only ontology (e.g. MeSH, SOMED-CT) for computing the similarity between the biomedical terms.</p><p>Most of existing semantic similarity measures that used ontology structure as the primary source can’t measure the similarity between terms/concepts using single ontology or multiple ontologies in the biomedical domain within frame work Unified Medical Language System (UMLS).</p><p>Some of the semantic similarity measures have been adopted to biomedical field by incorporating domain information extracted from clinical data or medical ontologies.</p></sec><sec id="s4"><title>4. Semantic Similarity Measures Classification</title><p><xref ref-type="fig" rid="fig1">Figure 1</xref> and <xref ref-type="fig" rid="fig2">Figure 2</xref> [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] : illustrate the semantic similarity classification for single ontology and cross ontologies. To find Semantic similarity between two terms in ontology, by find shortest path length between them in the ontology (shortest path length) giving the length are is-a/part of. A number of approaches have been developed using ontology as primary information sources, and mostly applied in the general English domain using for example WordNet. However, most of the semantic similarity techniques such as general English ontology based structure similarity measures can be adopted to be used into the biomedical domain within UMLS framework.</p></sec><sec id="s5"><title>5. Semantic Similarity Measures for Single Ontology</title><p>In this paper, we focus only on these semantic similarity measures that used ontology as primary information source.</p><sec id="s5_1"><title>5.1. Ontology Structure-Based Similarity Measures</title><p>Most of the measures that are based on the structure of the ontology are actually based on: path length/distance (shortest path length) between the two concept nodes, and depth of concept nodes in the ontology/is-a hierarchy tree, e.g. some of the measures are based on WordNet include: path length, Wu &amp; palmer, leacock &amp; chodorow, and Li et al. [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] .</p><fig id="fig1"  position="float"><label><xref ref-type="fig" rid="fig1">Figure 1</xref></label><caption><title> Classification of semantic similarity measures for single ontology</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/14-1100432x5.png"/></fig><fig id="fig2"  position="float"><label><xref ref-type="fig" rid="fig2">Figure 2</xref></label><caption><title> Classification of semantic similarity measures for cross ontology</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/14-1100432x6.png"/></fig><sec id="s5_1_1"><title>5.1.1 Path Length Based Measures</title><p>The similarity measurement among concepts is based on the path distance separating the concepts. These measures compute similarity in terms of the shortest path between the target synsets (group of synonyms) in the taxonomy.</p><p>Rada et al. [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] in this measure the semantic distance is computed by counting the number of edges between concepts in the taxonomy. The experiments were conducted using MeSH (Medical Subject Headings-Biomedi- cal ontology) ontology. They assume two concept c<sub>1</sub>, c<sub>2</sub> as shortest path linking them (sp (c<sub>1</sub>, c<sub>2</sub>)) as estimate distance.</p><disp-formula id="scirp.57418-formula156"><label>(1)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x7.png"  xlink:type="simple"/></disp-formula><p><xref ref-type="fig" rid="fig3">Figure 3</xref> [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref13">13</xref>] : show the shortest path between two concepts a<sub>5</sub> and b<sub>1 </sub>is<inline-formula><inline-graphic xlink:href="http://html.scirp.org/file/14-1100432x8.png" xlink:type="simple"/></inline-formula>.</p><p>Also simple edge-counting measure proposed by Rada [<xref ref-type="bibr" rid="scirp.57418-ref14">14</xref>] :</p><disp-formula id="scirp.57418-formula157"><label>(2)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x9.png"  xlink:type="simple"/></disp-formula><p>where N<sub>1</sub> and N<sub>2</sub> are the minimum number of taxonomical links from c<sub>1</sub> to c<sub>2</sub> to their LCS, respectively.</p></sec><sec id="s5_1_2"><title>5.1.2. Depth Relative Measures</title><p>Depth relative measures are basically the shortest path approaches, but they consider the depth of the edges connecting the two concepts in the overall structure of the ontology to quantify similarity. It calculates the depth</p><fig id="fig3"  position="float"><label><xref ref-type="fig" rid="fig3">Figure 3</xref></label><caption><title> Hierarchy tree of concepts</title></caption><graphic mimetype="image"   position="float"  xlink:type="simple"  xlink:href="http://html.scirp.org/file/14-1100432x10.png"/></fig><p>from the root of the taxonomy to the target concept.</p><p>1) Wu and Palmer Similarity Measure [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] proposed a new method which define the semantic similarity measure between concepts C<sub>1</sub> and C<sub>2</sub> as</p><disp-formula id="scirp.57418-formula158"><label>(3)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x11.png"  xlink:type="simple"/></disp-formula><p>where N<sub>1</sub> is the length given as number of nodes in the path from C<sub>1</sub> to C<sub>3</sub> which is the least common super concept of C<sub>1</sub> and C<sub>2</sub>, and N<sub>2</sub> is the length given in number of nodes on a path from C<sub>2</sub> to C<sub>3</sub>. N<sub>3</sub> represents the global depth of the hierarchy and it serves as the scaling factor. For example: (LCS (G00.1, G00.3) = G00 and LCS (G00, G01) = G) of two concept nodes and N<sub>1</sub>, N<sub>2</sub> are the path lengths from each concept node to LCS, respectively.</p><p>2) Leacok and Chodorow [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] are proposed non linear adaptation of Rada’s distance:</p><disp-formula id="scirp.57418-formula159"><label>(4)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x12.png"  xlink:type="simple"/></disp-formula><p>max_depth is longest of the shortest path linking concept to concept, which subsumed all others. The Least Common Ancestor (LCA) of concept a<sub>5</sub> and b<sub>1</sub> is r in <xref ref-type="fig" rid="fig3">Figure 3</xref>.</p></sec></sec><sec id="s5_2"><title>5.2. Information Content-Based Similarity Measure</title><p>These measures use Information Content (IC) of concept nodes drive from ontology hierarchy structure and corpus statistics. Some of Information Content-based similarity measures in WordNet include: [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] .</p><sec id="s5_2_1"><title>5.2.1. Resnik Similarity Measure</title><p>Resnik [<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] the similarity between a pair of concepts (c<sub>1</sub> and c<sub>2</sub>) is estimated as the amount of taxonomical information they share. In a taxonomy, this information is represented by the least common subsumer of both terms (LCS (c<sub>1</sub>, c<sub>2</sub>)), which is the most specific taxonomical ancestor common to c<sub>1</sub> and c<sub>2</sub> in a given ontology. Formally:</p><disp-formula id="scirp.57418-formula160"><label>(5)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x13.png"  xlink:type="simple"/></disp-formula></sec><sec id="s5_2_2"><title>5.2.2. Lin Similarity Measure</title><p>This measure depends on the relation between information content (IC) of the LCS of two concepts and the sum of the information content of the individual concepts [<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref15">15</xref>] .</p><p>Formally:</p><disp-formula id="scirp.57418-formula161"><label>(6)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x14.png"  xlink:type="simple"/></disp-formula></sec><sec id="s5_2_3"><title>5.2.3. Jiang and Conrath Similarity Measure</title><p>Jiang and Conrath [<xref ref-type="bibr" rid="scirp.57418-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref16">16</xref>] define the measure as subtract the information content of the LCS from the sum of the information content of the individual concepts, formally:</p><disp-formula id="scirp.57418-formula162"><label>(7)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x15.png"  xlink:type="simple"/></disp-formula></sec></sec><sec id="s5_3"><title>5.3. Hybrid-Based Similarity Measures</title><p>Combine the above ideas [<xref ref-type="bibr" rid="scirp.57418-ref8">8</xref>] : term similarity is computed by matching synonyms, term neighborhoods and term features.</p></sec><sec id="s5_4"><title>5.4. Feature-Based Similarity Measure</title><p>Measure similarity between two terms as a function of their properties (e.g., their description or “glosses” in WordNet or “scope notes” in MeSH) or based on their relationships to other similar terms in the taxonomy [<xref ref-type="bibr" rid="scirp.57418-ref8">8</xref>] . Analyzing the amount of common and non-common knowledge features to estimate similarity between two concepts [<xref ref-type="bibr" rid="scirp.57418-ref17">17</xref>] .</p></sec></sec><sec id="s6"><title>6. Semantic Similarity Measures for Cross Ontology</title><p>In this case, the concepts for which similarity is to be assessed belong to two different ontologies. The secondary ontology is connected to the primary ontology through the common nodes. Two nodes in two ontologies are equivalent if they refer to the same concept.</p><sec id="s6_1"><title>6.1. Al-Mubaid and Nguyen Similarity Measure</title><p>They are combined both features, in one measure, i.e. they will take the specificity of concept node into account by utilizing the depth features of concepts. The LCS of two nodes determine the common specificity of two concept according to ontology structure only. Further more local densities such as link strength/weight also affect the similarity. One way for measuring local density is using IC of concept based on corpus statistics, and since there is no standard corpus in biomedical domain. They used only ontology based features as properties of semantic similarity. Their proposed method don’t required ontology essentially with formal semantic relation between terms, it can be applied onto any terminology structure, structure vocabulary or in general Direct Acyclic Graph (DAG) with length between the nodes.</p><p>In this measure they are put rules and assumptions which satisfied their proposed measure. They wont to combine all semantic features in one measure in an effective and logical way.</p><p>Rule 1: The semantic similarity scale system reflects the degree of similarity of pairs of concepts comparably in single ontology or in cross-ontology. This rule ensures that the mapping of one ontology (called secondary ontology) to another ontology (called primary ontology) does not deteriorate the similarity scale of the primary ontology [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] .</p><p>Rule 2: The semantic similarity must obey local ontology’s similarity rule as follow:</p><p>Rule 2.1: The shorter the distance between two concept nodes in the ontology, the more they are similar.</p><p>Rule 2.2: Lower level pairs of concept nodes are more similar than higher level pairs.</p><p>Rule 2.3: The maximum similarity is arises when the two concept nodes are the same node in the ontology.</p><p>Assumptions</p><p>1) They used logarithms (inverse of exponential for semantic distance). In rule 2.3, the semantic similarity reached higher similarity when the two concept nodes are in the same node regardless of any other features, hence, should used non linear approach to combine the features.</p><p>2) Non linear function is universal combination low of semantic similarity features.</p></sec><sec id="s6_2"><title>6.2. New Common Specificity Features</title><p>Proposed by [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] , they used path length and depth of concept nodes to improved performance. The least common subsumer (LCS) of two concepts node in the ontology is lowest node that connect pairs of concepts. It used to determine common specificity of two concept nodes in the cluster. So finding the depth of their LCS node and then scaling this depth by depth D of the cluster as follow:</p><disp-formula id="scirp.57418-formula163"><label>(8)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x16.png"  xlink:type="simple"/></disp-formula><p>where D is depth of the cluster. The smaller common specificity of two concept nodes means that they are more similar and share more information.</p><p>Single cluster similarity: Their proposed measure for single cluster is:</p><disp-formula id="scirp.57418-formula164"><label>(9)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x17.png"  xlink:type="simple"/></disp-formula><p>where α &gt; 0 and β &gt; 0, k constant and must be (k ≥ 1), and Cspec calculate in Equation (8).</p><p>Sem = 0 when depth = 1 regardless of (Cspec).</p></sec><sec id="s6_3"><title>6.3. Cross-Cluster Semantic Similarity</title><p>The cluster has largest depth is main cluster (primary cluster) and all remaining cluster is secondary.</p><p>Case 1 similarity within primary cluster:</p><p>When two concept nodes in the primary cluster, used Equation (9) to measure similarity.</p><p>Case 2 cross cluster similarity:</p><p>The LCS of two concept node is global root node, which belong to the two clusters, and one of the two cluster belong to the primary cluster while another belong to secondary cluster. Then the common specificity is given as follow:</p><disp-formula id="scirp.57418-formula165"><label>(10)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x18.png"  xlink:type="simple"/></disp-formula><p>where D<sub>primary </sub>is the depth of the primary cluster.<sup> </sup></p><p>Case 3: Similarity within a single secondary ontology: when two concept nodes are in single secondary cluster.</p><disp-formula id="scirp.57418-formula166"><label>(11)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x19.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.57418-formula167"><label>(12)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x20.png"  xlink:type="simple"/></disp-formula><disp-formula id="scirp.57418-formula168"><label>(13)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x21.png"  xlink:type="simple"/></disp-formula><p>where Path(C<sub>1</sub>, C<sub>2</sub>) secondary and CSpec(C<sub>1</sub>, C<sub>2</sub>) secondary are the Path and CSpec between C<sub>1</sub> &amp; C<sub>2</sub> in the secondary cluster.</p><p>Case 4 Similarity within multiple secondary ontology:</p><p>One of the secondary clusters acts temporarily as the primary cluster to calculate Cspec and path using cross- cluster approach as in case 2 above. Then semantic distance is computed using case 3.</p><p>Hisham Al-Mubaid &amp; Nguyen [<xref ref-type="bibr" rid="scirp.57418-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref18">18</xref>] proposed measure take the depth of their least common subsumer (LCS) and the distance of the shortest path between them. The higher similarity arises when the two concept are in the lower level of the hierarchy. Their similarity measure is:</p><disp-formula id="scirp.57418-formula169"><label>(14)</label><graphic position="anchor" xlink:href="http://html.scirp.org/file/14-1100432x22.png"  xlink:type="simple"/></disp-formula><p>where</p><p>L(c<sub>1</sub>, c<sub>2</sub>) is shortest distance between c1 and c<sub>2</sub>.</p><p>Depth L(c<sub>1</sub>, c<sub>2</sub>) is depth of L(c<sub>1</sub>, c<sub>2</sub>) using node counting.</p><p>L(c<sub>1</sub>, c<sub>2</sub>) lowest common subsumer of c<sub>1</sub> and c<sub>2</sub>.</p><p>D is maximum depth of the taxonomy.</p><p>The similarity equal 1, where two concept nodes are in the same cluster/ontology. The maximum value of this measure occurs when one of the concept is the left most leaf node, and the other concept is right leaf node in the tree.</p><p>Path distance between two concepts, when two pairs of two concepts have the same path distance, they have the same value of semantic similarity. In <xref ref-type="fig" rid="fig3">Figure 3</xref>, similarity (n<sub>1</sub>, n<sub>5</sub>) = similarity (n<sub>2</sub>, n<sub>4</sub>) but (n<sub>2</sub>, n<sub>4</sub>) share more information and attributes, so they are more similar than (n<sub>1</sub>, n<sub>5</sub>). In this measure the high numeric similarity result between (c<sub>1</sub>, c<sub>2</sub>) means the lower semantic similarity between two concept.</p><p>In the ICD tree, let us consider an example in ICD10 terminology. The category tree is “Inflammatory diseases of the central nervous system” and is assigned letter G in ICD10 terminology version 2015 [<xref ref-type="bibr" rid="scirp.57418-ref19">19</xref>] at the link (http://www.apps.who.int/classifications/icd10/browse/2015/en). This tree looks as follows:</p><p>Inflammatory diseases of the central nervous system [G]</p><disp-formula id="scirp.57418-formula170"><graphic  xlink:href="http://html.scirp.org/file/14-1100432x23.png"  xlink:type="simple"/></disp-formula><p>The similarity between “bacterial meningitis, not elsewhere classified [G00]” and “meningitis in bacterial diseases classified [G01]” is less similarity than the similarity between “Pneumococcal meningitis [G00.1]” and “Staphylococcal meningitis [G00.3]”. However, in this measure they take into account the depth of the LCS of two concepts, in path length and leacock &amp; chodorwo produce semantic similarity for two pairs [(G00, G01) and (G00.1, G00.3)] in sim (c<sub>1</sub>, c<sub>2</sub>) measure Equation (14) give high similarity in lower level in the ontology hierarchy ([G00.1, G00.3]).</p><p>Choi &amp; Kim (C.K.) [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref13">13</xref>] semantic similarity measure was developed to measure concept similarity in the Yahoo category tree. However, it doesn’t use LCS in the measure so like the case of the example above, it gives the same similarity value for the two pair (a<sub>1</sub>, b<sub>1</sub>) and (a<sub>2</sub>, a<sub>5</sub>), see <xref ref-type="table" rid="table1">Table 1</xref>. With this analysis, we see that Al-Mubaid &amp; Nyguan Measure can surpass the other measures.</p><p>The higher numeric similarity result between (G00, G01) means the lower semantic similarity between (G00, G01).</p></sec></sec><sec id="s7"><title>7. Semantic Similarity Measures in Health Domain</title><sec id="s7_1"><title>7.1. Rada et al.</title><p>Proposed semantic distance as a potential measure for semantic similarity between two concepts in MeSH, and</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Measure comparison</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Pair of Concepts</th><th align="center" valign="middle" >P. L</th><th align="center" valign="middle" >L. C</th><th align="center" valign="middle" >C. K</th><th align="center" valign="middle" >Al-Mubaid &amp; Nyguan Measure (Equation (14))</th></tr></thead><tr><td align="center" valign="middle" >G00-G01</td><td align="center" valign="middle" >0.33</td><td align="center" valign="middle" >2.08</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >4.32</td></tr><tr><td align="center" valign="middle" >G00.1-G00.3</td><td align="center" valign="middle" >0.33</td><td align="center" valign="middle" >2.08</td><td align="center" valign="middle" >0.91</td><td align="center" valign="middle" >4.17</td></tr></tbody></table></table-wrap><p>implemented the shortest path length measure, called CDist, based on the shortest distance between two concept nodes in the ontology. They evaluated CDist on UMLS Metathesaurus (MeSH, SNOMED, ICD9), and then compared the CDist similarity scores to human expert scores by correlation coefficients.</p></sec><sec id="s7_2"><title>7.2. Caviedes and Cimino</title><p>[<xref ref-type="bibr" rid="scirp.57418-ref7">7</xref>] implemented shortest path based measure, called CDist, based on the shortest distance between two concepts nodes in the ontology. They evaluated CDist on UMLS Metathesaurus (MeSH, SNOMED, ICD9), and then compared the CDist similarity scores to human expert scores by correlation coefficient.</p></sec><sec id="s7_3"><title>7.3. Pedersen et al.</title><p>[<xref ref-type="bibr" rid="scirp.57418-ref1">1</xref>] proposed semantic similarity and relatedness in the biomedicine domain, by applied a corpus-based context vector approach to measure similarity between concepts in SNOMED-CT. Their context vector approach is ontology-free but requires training text, for which, they used text data from Mayo Clinic corpus of medical notes.</p></sec></sec><sec id="s8"><title>8. Evaluation</title><sec id="s8_1"><title>8.1. Datasets</title><p>There are no standard human rating sets for semantic similarity in biomedical domain. Thus, Al-Mubaid and Nguyen [<xref ref-type="bibr" rid="scirp.57418-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref13">13</xref>] used dataset from Pedersen et al. [<xref ref-type="bibr" rid="scirp.57418-ref1">1</xref>] in <xref ref-type="table" rid="table2">Table 2</xref>, which was annotated by 3 physician and 9 medical index experts to evaluate their proposed measure in biomedical domain.</p><table-wrap-group id="2"><label><xref ref-type="table" rid="table2">Table 2</xref></label><caption><title> Dataset</title></caption><table-wrap id="2_1"><table><tbody><thead><tr><th align="center" valign="middle" ></th><th align="center" valign="middle" >Concept 1</th><th align="center" valign="middle" >Concept 2</th><th align="center" valign="middle" >Phys</th><th align="center" valign="middle" >Expert</th></tr></thead><tr><td align="center" valign="middle" >4</td><td align="center" valign="middle" >Renal failure</td><td align="center" valign="middle" >Kidney failure</td><td align="center" valign="middle" >4.0000</td><td align="center" valign="middle" >4.0000</td></tr><tr><td align="center" valign="middle" >5</td><td align="center" valign="middle" >Heart</td><td align="center" valign="middle" >Myocardium</td><td align="center" valign="middle" >3.3333</td><td align="center" valign="middle" >3.0000</td></tr><tr><td align="center" valign="middle" >1</td><td align="center" valign="middle" >Stroke</td><td align="center" valign="middle" >Infarct</td><td align="center" valign="middle" >3.0000</td><td align="center" valign="middle" >2.7778</td></tr><tr><td align="center" valign="middle" >7</td><td align="center" valign="middle" >Abortion</td><td align="center" valign="middle" >Miscarriage</td><td align="center" valign="middle" >3.0000</td><td align="center" valign="middle" >3.3333</td></tr><tr><td align="center" valign="middle" >9</td><td align="center" valign="middle" >Delusion</td><td align="center" valign="middle" >Schizophrenia</td><td align="center" valign="middle" >3.0000</td><td align="center" valign="middle" >2.2222</td></tr><tr><td align="center" valign="middle" >11</td><td align="center" valign="middle" >Congestive heart failure</td><td align="center" valign="middle" >Pulmonary edema</td><td align="center" valign="middle" >3.0000</td><td align="center" valign="middle" >1.4444</td></tr><tr><td align="center" valign="middle" >8</td><td align="center" valign="middle" >Metastasis</td><td align="center" valign="middle" >Adenocarcinoma</td><td align="center" valign="middle" >2.6667</td><td align="center" valign="middle" >1.7778</td></tr><tr><td align="center" valign="middle" >17</td><td align="center" valign="middle" >Calcification</td><td align="center" valign="middle" >Stenosis</td><td align="center" valign="middle" >2.6667</td><td align="center" valign="middle" >2.0000</td></tr><tr><td align="center" valign="middle" >10</td><td align="center" valign="middle" >Diarrhea</td><td align="center" valign="middle" >Stomach cramps</td><td align="center" valign="middle" >2.3333</td><td align="center" valign="middle" >1.3333</td></tr><tr><td align="center" valign="middle" >19</td><td align="center" valign="middle" >Mitral stenosis</td><td align="center" valign="middle" >Atrial fibrillation</td><td align="center" valign="middle" >2.3333</td><td align="center" valign="middle" >1.3333</td></tr><tr><td align="center" valign="middle" >20</td><td align="center" valign="middle" >Chronic obstructive pulmonary disease</td><td align="center" valign="middle" >Lung infiltrates</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.8889</td></tr><tr><td align="center" valign="middle" >2</td><td align="center" valign="middle" >Rheumatoid arthritis</td><td align="center" valign="middle" >Lupus</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.1111</td></tr><tr><td align="center" valign="middle" >3</td><td align="center" valign="middle" >Brain tumor</td><td align="center" valign="middle" >Intracranial hemorrhage</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.3333</td></tr><tr><td align="center" valign="middle" >15</td><td align="center" valign="middle" >Carpal tunnel Syndrome</td><td align="center" valign="middle" >Osteoarthritis</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.1111</td></tr><tr><td align="center" valign="middle" >18</td><td align="center" valign="middle" >Diabetes mellitus</td><td align="center" valign="middle" >Hypertension</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >27</td><td align="center" valign="middle" >Acne</td><td align="center" valign="middle" >Syringe</td><td align="center" valign="middle" >2.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >12</td><td align="center" valign="middle" >Antibiotic</td><td align="center" valign="middle" >Allergy</td><td align="center" valign="middle" >1.6667</td><td align="center" valign="middle" >1.2222</td></tr><tr><td align="center" valign="middle" >13</td><td align="center" valign="middle" >Cortisone</td><td align="center" valign="middle" >Total knee replacement</td><td align="center" valign="middle" >1.6667</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >14</td><td align="center" valign="middle" >Pulmonary embolus</td><td align="center" valign="middle" >Myocardial infarction</td><td align="center" valign="middle" >1.6667</td><td align="center" valign="middle" >1.2222</td></tr></tbody></table></table-wrap><table-wrap id="2_2"><table><tbody><thead><tr><th align="center" valign="middle" >16</th><th align="center" valign="middle" >Pulmonary fibrosis</th><th align="center" valign="middle" >Lung cancer</th><th align="center" valign="middle" >1.6667</th><th align="center" valign="middle" >1.4444</th></tr></thead><tr><td align="center" valign="middle" >6</td><td align="center" valign="middle" >Cholangiocarcinoma</td><td align="center" valign="middle" >Colonoscopy</td><td align="center" valign="middle" >1.3333</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >29</td><td align="center" valign="middle" >Lymphoid hyperplasia</td><td align="center" valign="middle" >Laryngeal cancer</td><td align="center" valign="middle" >1.3333</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >21</td><td align="center" valign="middle" >Multiple Sclerosis</td><td align="center" valign="middle" >Psychosis</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >22</td><td align="center" valign="middle" >Appendicitis</td><td align="center" valign="middle" >Osteoporosis</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >23</td><td align="center" valign="middle" >Rectal polyp</td><td align="center" valign="middle" >Aorta</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >24</td><td align="center" valign="middle" >Xerostomia</td><td align="center" valign="middle" >Alcoholic cirrhosis</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >25</td><td align="center" valign="middle" >Peptic ulcer disease</td><td align="center" valign="middle" >Myopia</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >26</td><td align="center" valign="middle" >Depression</td><td align="center" valign="middle" >Cellulitis</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >28</td><td align="center" valign="middle" >Varicose vein</td><td align="center" valign="middle" >Entire knee meniscus</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr><tr><td align="center" valign="middle" >30</td><td align="center" valign="middle" >Metastasis</td><td align="center" valign="middle" >Hyperlipidemia</td><td align="center" valign="middle" >1.0000</td><td align="center" valign="middle" >1.0000</td></tr></tbody></table></table-wrap></table-wrap-group></sec><sec id="s8_2"><title>8.2. Experiments and Results</title><p><xref ref-type="table" rid="table2">Table 2</xref>. Test set of 30 medical term pairs sorted in the order of the averaged physicians’ scores (taken from Pedersen et al. 2005 [<xref ref-type="bibr" rid="scirp.57418-ref1">1</xref>] ). Al-Mubaid and Nguyen [<xref ref-type="bibr" rid="scirp.57418-ref13">13</xref>] [<xref ref-type="bibr" rid="scirp.57418-ref18">18</xref>] find only 25 out of the 30 concept pairs in MeSH using MeSH browser version 2006.</p><p>Pedersen et al. [<xref ref-type="bibr" rid="scirp.57418-ref1">1</xref>] tested 29 out of the 30 concept pairs as one pair was not found in SNOMED-CT). The concept pairs in bold, in <xref ref-type="table" rid="table2">Table 2</xref>, are the ones that contains a term that was not found in MeSH and we did not include in their experiments. Some terms have more than one position in MeSH tree, for example, the term “Acne” has three different positions: C17.800.030.150, C17.800.271.125.200, and C17.800.794.111. They use in this case the minimum semantic similarity distance between the two concepts.</p></sec></sec><sec id="s9"><title>9. Conclusions &amp; Future Work</title><p>In this work, we discuss the basics of semantic similarity measures, the classification of single ontology similarity measures and cross ontologies similarity measures. We prepare a brief introduction of the various semantic similarity measures in health domain. However, from all the above, we can used SemDist as semantic similarity measures in the health domain.</p><p>In future work, we intend to explore the semantic similarity measures in health domain (ICD, MeSH, and SNOMED-CT) within UMLS frame work. We also prepare to implement a web-based user interface for all these semantic similarity measures and to make it available freely to researchers over the Internet. That will be much helpful for interested researchers in the field of bioinformatics text mining.</p></sec></body><back><ref-list><title>References</title><ref id="scirp.57418-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Pedersen T., Pakhomov, S.V.S., Patwardhan, S. and Chute, C.G. (2007) Measures of Semantic Similarity and Relatedness in the Biomedical Domain. Journal of Biomedical Informatics, 40, 288-299.  
http://dx.doi.org/10.1016/j.jbi.2006.06.004</mixed-citation></ref><ref id="scirp.57418-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Al-Mubaid, H. and Nguyen, H.A. (2009) Measuring Semantic Similarity between Biomedical concepts within Multiple Ontologies. IEEE Transactions on Systems, Man, and Cybernetics—Part C: Applications and Reviews, 39, 389-397.</mixed-citation></ref><ref id="scirp.57418-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Kayed, A., et al. (2013) Ontology Evaluation: Which Test to Use. 2013 5th International Conference on Computer Science and Information Technology (CSIT), Amman, 27-28 March 2013, 45-48.  
http://dx.doi.org/10.1109/csit.2013.6588756</mixed-citation></ref><ref id="scirp.57418-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Al-Mubaid, H. and Nguyen, H.A. (2006) A Cluster-Based Approach for Semantic Similarity in the Biomedical Domain. Proceedings of the 28th IEEE, EMBS Annual International Conference, New York, 30 Augugst-3 September 2006, 2713-2717. http://dx.doi.org/10.1109/iembs.2006.259235</mixed-citation></ref><ref id="scirp.57418-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Al-Mubaid, H. and Nguyen, H.A. (2006) Using MEDLINE as Standard Corpus for Measuring Semantic Similarity in the Biomedical Domain. 6th IEEE Symposium on BionInformatics and BioEngineering (BIBE’06), Arlington, 16-18 October 2006, 315-318. http://dx.doi.org/10.1109/bibe.2006.253295</mixed-citation></ref><ref id="scirp.57418-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Tartir, S. (2009) Ontology-Driven Question Answering and Ontology Quality Evaluation. PhD Thesis, University of Georgia, Athens.</mixed-citation></ref><ref id="scirp.57418-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Batet, M., Sánchez, D. and Valls, A. (2011) An Ontology-Based Measure to Compute Semantic Similarity in Biomedicine. Journal of Biomedical Informatics, 44, 118-125. http://dx.doi.org/10.1016/j.jbi.2010.09.002</mixed-citation></ref><ref id="scirp.57418-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Garla1, V.N. and Brandt, C. (2012) Semantic Similarity in the Biomedical Domain: An Evaluation across Knowledge Sources. Garla and Brandt BMC Bioinformatics, 13, 261. http://dx.doi.org/10.1186/1471-2105-13-261</mixed-citation></ref><ref id="scirp.57418-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Ivanovic, M. and Budimac, Z. (2014) An Overview of Ontologies and Data Resources in Medical Domains. Expert Systems with Applications, 41, 5158-5166. http://dx.doi.org/10.1016/j.eswa.2014.02.045</mixed-citation></ref><ref id="scirp.57418-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Nguyen, H. and Al-Mubaid, H. (2006) New Semantic Similarity Techniques of Concepts Applied in the Biomedical Domain and WordNet. MS Thesis, University of Houston Clear Lake, Houston.</mixed-citation></ref><ref id="scirp.57418-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">World Health Organization (2010) International Statistical Classification of Diseases and Related Health Problems. 10th Revision Edition, WHO, Geneva.</mixed-citation></ref><ref id="scirp.57418-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Elavarasi, S.A., Akilandeswari, J. and Menaga, K. (2014) A Survey on Semantic Similarity Measure. International Journal of Research in Advent Technology, 2, 389-398.</mixed-citation></ref><ref id="scirp.57418-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Batet, M. (2010) Ontology-Based Semantic Clustering. Ph.D. Thesis.</mixed-citation></ref><ref id="scirp.57418-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Sanchez, D., Solé-Ribalta, A., Batet, M. and Serratosa, F. (2012) Enabling Semantic Similarity Estimation across Mul-tiple Ontologies: An Evaluation in the Biomedical Domain. Journal of Biomedical Informatics, 45, 141-155. 
http://dx.doi.org/10.1016/j.jbi.2011.10.005</mixed-citation></ref><ref id="scirp.57418-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Hliaoutakis, A., Varelas, G., Voutsakis, E., Petrakis, E.G.M. and Milios, E. (2006) Information Retrieval by Semantic Similarity. International Journal on Semantic Web and Information Systems, 3, 55-73.</mixed-citation></ref><ref id="scirp.57418-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Sathya, D. and Uthayan, K.R. (2011) Proposal for Semantic Metric to Assess the Quality of Ontologies. Proceedings of the 2011 International Conference on Signal Processing, Communication, Computing and Networking Technologies, Thuckafay, 21-22 July 2011, 754-756. http://dx.doi.org/10.1109/ICSCCN.2011.6024651</mixed-citation></ref><ref id="scirp.57418-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Lin, D.K. (1998) An Information-Theoretic Definition of Similarity. Proceedings of the 15th International Conference on Machine Learning, Madison, 24-27 July 1998, 296-304.</mixed-citation></ref><ref id="scirp.57418-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Al-Mubaid, H. and Nguyen, H.A. (2006) New Ontology-Based Semantic Similarity Measure for the Biomedical Domain. Proceedings of the 2006 IEEE International Conference on Granular Computing, Atlanta, 10-12 May 2006, 623-628.</mixed-citation></ref><ref id="scirp.57418-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">http://apps.who.int/classifications/icd10/browse/2015/en#/VI</mixed-citation></ref></ref-list></back></article>