<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">AM</journal-id><journal-title-group><journal-title>Applied Mathematics</journal-title></journal-title-group><issn pub-type="epub">2152-7385</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/am.2012.312A289</article-id><article-id pub-id-type="publisher-id">AM-26033</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  A Note on Generalized Inverses of Distribution Function and Quantile Transformation
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>hangyong</surname><given-names>Feng</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Hongyue</surname><given-names>Wang</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Xin</surname><given-names>M. Tu</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Jeanne</surname><given-names>Kowalski</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Department of Biostatistics and Computational Biology University of Rochester, Rochester, USA</addr-line></aff><aff id="aff2"><addr-line>Department of Biostatistics &amp;amp; Bioinformatics, Rollins School of Public Health, 
Winship Cancer Institute, Atlanta, USA</addr-line></aff><author-notes><corresp id="cor1">* E-mail:<email>feng@bst.rochester.edu(HF)</email>;</corresp></author-notes><pub-date pub-type="epub"><day>28</day><month>12</month><year>2012</year></pub-date><volume>03</volume><issue>12</issue><fpage>2098</fpage><lpage>2100</lpage><history><date date-type="received"><day>September</day>	<month>7,</month>	<year>2012</year></date><date date-type="rev-recd"><day>October</day>	<month>7,</month>	<year>2012</year>	</date><date date-type="accepted"><day>October</day>	<month>14,</month>	<year>2012</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
   In this paper we study the relations of four possible generalized inverses of a general distribution functions and their right-continuity properties. We correct a right-continuity result of the generalized inverse used in statistical literature. We also prove the validity of a new generalized inverse which is always right-continuous. 
 
</p></abstract><kwd-group><kwd>Distribution Function; Quantile Function; Right Continuity</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>A distribution function defined on <img src="13-7401098\6d284024-8ead-4832-ae1b-b312891a8a05.jpg" /> is a map <img src="13-7401098\dd31dd82-5966-48c8-8b6d-ba1b17c14286.jpg" /> such that F is nondecreasing and right continuous (see for example [<xref ref-type="bibr" rid="scirp.26033-ref1">1</xref>], p. 22). It is called a probability distribution function (PDF) if</p><p><img src="13-7401098\0ffee6f9-ba93-46ec-b54f-f6682fb41555.jpg" />and<img src="13-7401098\2d58710c-7484-4441-986a-d2c6a95c8d2d.jpg" />. Suppose X is a random variable with distribution function F which is continuous, then <img src="13-7401098\7da996c7-44df-45fa-81bf-c0f9cfc81e21.jpg" /> has standard uniform distribution. Furthermore, if F is also strictly increasing with inverse<img src="13-7401098\8e2c0258-2a83-4e9d-b468-19a8fd65d0ac.jpg" />, and U is a standard uniform random variable, then <img src="13-7401098\5897bc3c-e321-41cf-a126-4770a23c3488.jpg" /> has distribution function F. This fact is the basis for generating random numbers given a distribution function. Hence if F is a PDF, <img src="13-7401098\7d0649d8-b19e-40bd-8ba5-24d062099df9.jpg" />is also called the quantile function of F [<xref ref-type="bibr" rid="scirp.26033-ref2">2</xref>].</p><p>The distribution function dose not, in general, have an inverse (in strict sense) as it may be not strictly increasing, for example, the PDF of a discrete random variable. In statistics, the empirical distribution function (EDF) from a random sample is a step function. If we want to (nonparametrically) estimate the population quantiles from the sample data, we need to find an appropriate ‘inverse’ of the EDF. Unfortunately there is no universally accepted definition of sample quantiles given the data. For example, Langford [<xref ref-type="bibr" rid="scirp.26033-ref3">3</xref>] compares many methods proposed in literatures to calculate quantiles from data and finds that none of them is uniformly better than others.</p><p>There are four meaningful ways to define a generalized inverse of a distribution function F as follows:</p><p><img src="13-7401098\b263a0e4-579e-4307-933b-297bc3cbd9b5.jpg" /></p><p>It’s easy to see that these methods are different in dealing with endpoints of flat parts if F is not strictly increasing. In fact, <img src="13-7401098\09025cd8-2d7a-4b4f-bd3d-9ac3528dc64b.jpg" />is the definition of generalized inverse in literatures, for example, [4-7]. It’s obvious that all these generalized inverse functions are nondecreasing and equal the inverse of F if F is strictly monotone increasing.</p><p>In this manuscript we study some properties of these four functions and their relations to the quantile transformation in probability theory. The quantile transformation is the theoretical basis for random number generation in simulation studies in statistics. In simulation studies we usually need to generate random samples from a given distribution. The general method is first to generate uniform random variables on <img src="13-7401098\2c21b680-512b-4840-9e94-a45d6bd22d99.jpg" /> and then to use the quantile transformation to transform the uniform random variables to the random sample we need. Our results shows that any one of the generalized inverses defined above will work as the quantile transformation.</p></sec><sec id="s2"><title>2. Relations between the Four Generalized Inverses</title><p>Four generalized inverse of a distribution function were introduced in last section. In this section we prove the following relations between them.</p><p>Theorem 1. The four generalized inverses of <img src="13-7401098\b4c60a27-a1fd-4aef-9500-2b7d5094b1b9.jpg" /> defined above satisfy</p><p><img src="13-7401098\1eff89c3-beb1-4782-950a-fda7604c24c4.jpg" /></p><p>Proof. For any<img src="13-7401098\e60d23a7-6226-487b-965f-b1da000ea120.jpg" />, if<img src="13-7401098\8e372998-6350-4145-b598-bd86548ac0a0.jpg" />, then</p><p><img src="13-7401098\fec653b4-5c84-4d1c-84c6-f4baddd708b4.jpg" />, therefore<img src="13-7401098\5789e6bd-ab3b-491d-b7b1-3d5a8bc55d6c.jpg" />, which means<img src="13-7401098\b46b2615-7a8e-4060-8588-ab2a483074bf.jpg" />. For any<img src="13-7401098\456486fb-d254-4c16-8496-fb4adfb7ba58.jpg" />, from the definition of<img src="13-7401098\3b44cb87-2ca9-4e6e-8364-c2340f153ae1.jpg" />, we have<img src="13-7401098\62e2bcd3-8f0e-4c31-a99f-7425dcbddcf3.jpg" />. Therefore<img src="13-7401098\b83d6f5e-87db-4c69-ac68-32b1d8a673a3.jpg" />, which means</p><p><img src="13-7401098\b729f283-871f-47f1-affa-c6dc69473f80.jpg" />. It’s obvious that<img src="13-7401098\a6cd71b5-9457-4a84-a695-89c601c10556.jpg" />. If<img src="13-7401098\114eb100-a77e-4d08-9586-343e750b9814.jpg" />, then<img src="13-7401098\fd466e9c-6f80-4c00-89db-6349e28960ce.jpg" />, therefore<img src="13-7401098\90f9cd35-4c01-4e76-b0f9-b926b46900b5.jpg" />, which means<img src="13-7401098\8b7b8887-fc02-431e-8725-7b1d29cdd28c.jpg" />. For any</p><p><img src="13-7401098\e5a1c35a-4ed6-4d77-9691-ae8f840f4065.jpg" />, from the definition of<img src="13-7401098\46e205e4-3876-490e-a8bc-1bf9f305204f.jpg" />, we have</p><p><img src="13-7401098\f7465510-6c0d-4e10-93a4-c06cf514ca26.jpg" />. Then<img src="13-7401098\2f759d4d-8171-4bec-a2a3-1cc8e2502731.jpg" />, which means<img src="13-7401098\8e037736-6a4d-4452-a7aa-12584b9b3711.jpg" />. □</p><p>Remark. Theorem 1 shows that there are actually only two distinct versions of the generalized inverse of <img src="13-7401098\4a3f2707-ab2c-40bf-aec3-f240ecb15beb.jpg" /> defined in Section 1. The generalized inverse <img src="13-7401098\c5e17312-1a7e-499a-bc3f-bd4f6e14673f.jpg" /> widely used in literature (e.g. [<xref ref-type="bibr" rid="scirp.26033-ref4">4</xref>], p. 113) is the smaller one. The asymptotic property of sample quantiles based on <img src="13-7401098\c52f0746-b7d0-42e8-afa1-6ba6e4313a22.jpg" /> have been studied extensively in statistical literatures (e.g. [<xref ref-type="bibr" rid="scirp.26033-ref4">4</xref>], p. 113). However, the asymptotic property of sample quantiles based on <img src="13-7401098\05f9f649-9f0c-4dab-bbc3-5017f773a929.jpg" /> has not been reported. It’s reasonably to conjecture that it should have the same asymptotic properties as that based on<img src="13-7401098\6f7abd3e-f8b0-431f-ae08-bd08d8477f4f.jpg" />.</p></sec><sec id="s3"><title>3. Right Continuity</title><p>The distribution function F is right continuous. We want to know if its generalized inverses are also right continuous. Here is the result for its two versions of generalized inverses.</p><p>Theorem 2. <img src="13-7401098\15736479-ada9-409d-aabf-29d058771082.jpg" />is right continuous. Generally <img src="13-7401098\9dadb983-a955-4df1-b3be-ff381ce8c5b7.jpg" /> is not right continuous.</p><p>Proof. We first prove that <img src="13-7401098\27541219-61aa-4302-b374-64f985f9bd3e.jpg" /> is right continuous by contradiction. Suppose not. Then there exist <img src="13-7401098\4a85af4c-9abb-46c0-95b2-d4874cf0dce6.jpg" /> and <img src="13-7401098\903e2bce-a392-4e8b-a396-4ad720f6c23a.jpg" /> such that</p><p><img src="13-7401098\d755cbc0-be70-4ae9-ba76-5b0bd6906952.jpg" /></p><p>Then<img src="13-7401098\9b01d1d8-2492-4d9b-89eb-165aa2ae576c.jpg" />. Hence there exist <img src="13-7401098\f138be36-6d1d-4005-b17b-ca6ee67646d6.jpg" /> such that<img src="13-7401098\e0042393-bb40-4816-9000-044848bf03bb.jpg" />. Therefore<img src="13-7401098\0b785035-d4c6-41c0-a030-798d7ea06073.jpg" />. A contradiction.</p><p>As for<img src="13-7401098\e9563927-c249-41e2-bb58-c7879840883b.jpg" />, let <img src="13-7401098\bba3d4d5-5b4c-43e6-8739-51a8de985cbc.jpg" /> be defined as</p><p><img src="13-7401098\6af1e7eb-3a4d-4b28-98d2-da37804644cb.jpg" /></p><p>Then for any<img src="13-7401098\575dba49-d026-451d-a120-f589a88581ab.jpg" />. □</p><p>Remark 1. <img src="13-7401098\3c441ac7-789e-4de4-8e51-42d3b307caf6.jpg" />is called the right-continuous version inverse of<img src="13-7401098\1ef8542c-daa8-47f5-9a5f-3c4c3ebf823c.jpg" />. The right-continuity property of both the distribution function and its quantile transform based on <img src="13-7401098\3cac8d33-e017-406b-b222-a6534b1088b2.jpg" /> shows a symmetric property between these two functions. Marshall and Olkin [<xref ref-type="bibr" rid="scirp.26033-ref8">8</xref>] gave an nice introduction to the generalized inverse of a distribution function and prove that <img src="13-7401098\9952d4d7-4179-4379-9814-b3cece304c31.jpg" /> was right continuous in a different way. However, they did not give the inequalities in our Theorem 1.</p><p>Remark 2. There are some mistakes in statistical literatures about the continuity properties of generalized inverse of distribution functions. For example, Andersen et al. (1993, p. 274) stated that <img src="13-7401098\89c1e776-a353-4095-8087-d5520c71404b.jpg" /> was the rightcontinuous inverse of F. According to our Theorem 2, their claim is incorrect.</p></sec><sec id="s4"><title>4. Generalized Inverse and Quantile Transformation</title><p>In this section we assume that F is a PDF. It’s well known that if U is uniformly distributed on<img src="13-7401098\cc8e11b9-9825-4fd0-a1b7-bbaea95a6329.jpg" />, then the random variable <img src="13-7401098\ee000294-5ba4-421b-9e34-10f7cb2f33e8.jpg" /> has distribution function F. Durrett [<xref ref-type="bibr" rid="scirp.26033-ref9">9</xref>] gives a nice proof. In his proof, he constructed a probability space<img src="13-7401098\e587a64b-7761-476c-9da4-8fb2a7368204.jpg" />, where <img src="13-7401098\56c18929-4984-4399-9e2d-223e72df1ffd.jpg" />, <img src="13-7401098\05db14c8-1158-49db-bc0d-8866dbc60bf7.jpg" />is the Borel <img src="13-7401098\0859b6e6-561d-4d1a-b4e3-c46aa9f2ae5b.jpg" />-field on<img src="13-7401098\957d03b8-d9e4-4ee9-865b-22700c707520.jpg" />, and P is the Lebesgue measure. For each x, define two sets</p><p><img src="13-7401098\9fa92218-da89-420c-ba48-cadfc36f5837.jpg" />and<img src="13-7401098\c5a9890c-956f-43f5-819a-2ae97adbacab.jpg" />. It’s easy to prove that<img src="13-7401098\34f491cd-d569-454c-a385-0819739cdd48.jpg" />. Then<img src="13-7401098\971e19b5-4e23-4b28-8143-20fe9558604a.jpg" />. We have similar result for<img src="13-7401098\0bbca0ca-ba55-4097-8666-14c39ab0108c.jpg" />.</p><p>Theorem 3. <img src="13-7401098\7009dd9b-03f8-4ad1-9838-a1c776ca02fc.jpg" />has distribution function<img src="13-7401098\3a00e0de-fcd8-4e40-a024-6b06600770d0.jpg" />.</p><p>Proof. Following the same idea of Durrett (2010), for any x, define<img src="13-7401098\dfca39fa-1bc1-4b58-8aad-69f547695af0.jpg" />.</p><p>It’s easy to prove that<img src="13-7401098\0d793130-35d8-4324-8b5b-5d1273a39e05.jpg" />. In general,<img src="13-7401098\f2b4cedf-c815-4660-b153-f2f8395dc4a4.jpg" />. However, if we define<img src="13-7401098\d5a9a1fa-d5f8-4b8d-9ea8-7fa2bc500fe2.jpg" />, then</p><p><img src="13-7401098\77b612b4-bfcf-4460-86ad-579b7bc76aa6.jpg" />. For if<img src="13-7401098\301618bc-a3a9-449a-a4b2-19072463fbc9.jpg" />, then<img src="13-7401098\abc09553-e431-47dd-9453-cd96f181037c.jpg" />, i.e.</p><p><img src="13-7401098\d969b973-d352-46b3-b47c-ad990a886462.jpg" />. As<img src="13-7401098\26b8afd1-c735-4f86-947d-03a4299d4789.jpg" />, we have</p><p><img src="13-7401098\21c8f19a-f5f6-4e9f-98de-7622a9d2001c.jpg" />. □</p></sec><sec id="s5"><title>5. Conclusion</title><p>In this paper we study the relations of four popular generalized inverses of a general distribution functions and their right-continuity properties. Our results indicate that the generalized inverse <img src="13-7401098\6bcb362f-0654-45d4-829c-9323a6c333ac.jpg" /> widely used in literature may be not right continuous. We also prove that for a PDF<img src="13-7401098\c638fe35-adb6-4934-899f-c3a3e0f52bd0.jpg" />, <img src="13-7401098\93a08971-57c5-4bc6-91ae-4027b0b020d0.jpg" />is a valid quantile transformation which has one more property (right continuity) than the quantile transformation <img src="13-7401098\fa08b143-71ee-4c6f-b39d-554aceab5b9d.jpg" /> which is currently used. One remaining problem is to show that the sample quantile based on <img src="13-7401098\accfd3fa-3dc0-4316-8171-95732ce70a1c.jpg" /> has the same asymptotic properties as that based on<img src="13-7401098\85941177-97e2-45dd-8602-76b04d0c345f.jpg" />. Since both <img src="13-7401098\7f20f170-252a-4668-a7fb-ae409263023c.jpg" /> and <img src="13-7401098\8ffd4b4d-a3d4-44b4-923a-7b11262cbe06.jpg" /> as reasonable generalized inverse of F, their average</p><p><img src="13-7401098\dd84a5b0-7627-49e1-a749-c9e9f1ae0531.jpg" />should also be a good candidate of generalized inverse. The properties of this now function deserves further exploration.</p></sec><sec id="s6"><title>6. Acknowledgements</title><p>This research was supported by grants 5U19AI056390- 05 and 3 UL1 RR024160-02S1 from the National Institutes of Health.</p></sec><sec id="s7"><title>REFERENCES</title></sec><sec id="s8"><title>NOTES</title></sec></body><back><ref-list><title>References</title><ref id="scirp.26033-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">R. B. Ash, “Probability and Measure Theory,” 2nd Edition, Academic Press, San Diego, 2000.</mixed-citation></ref><ref id="scirp.26033-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">A. W. Van der Vaart, “Asymptotic Statistics,” Cambridge University Press, New York, 1998.</mixed-citation></ref><ref id="scirp.26033-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">E. Langford, “Quartiles in Elementary Statistics,” Journal of Statistics Education, Vol. 14, No. 3, 2006. 
www.amstat.org/publications/jse/v14n3/langford.html</mixed-citation></ref><ref id="scirp.26033-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">P. K. Andersen, O Borgan, R. D. Gill and N. Keiding, “Statistical Models Based on Counting Processes,” Springer, New York, 1993.  
doi:10.1007/978-1-4612-4348-9</mixed-citation></ref><ref id="scirp.26033-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">S. I. Resnick, “Extreme Values, Regular Variation, and Point Processes,” Springer, New York, 1987.</mixed-citation></ref><ref id="scirp.26033-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">P. Embrechts, C. Klüppelberg and T. Mikosch, “Modeling Extremal Events for Insurance and Finance,” Springer, New York, 1997.</mixed-citation></ref><ref id="scirp.26033-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">A. J. McNeil, R. Frey and P. Embrechts, “Quantitative Risk Management: Concepts, Techniques, Tools,” Princeton University Press, Princeton, 2005.</mixed-citation></ref><ref id="scirp.26033-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">A. W. Marshall and I. Olkin, “Life Distributions,” Springer, New York, 2007.</mixed-citation></ref><ref id="scirp.26033-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">R. Durrett, “Probability: Theory and Examples,” 4th Edition, Cambridge University Press, New York, 2010. 
doi:10.1017/CBO9780511779398</mixed-citation></ref></ref-list></back></article>