Contents 1 Richness 2 Shannon index 2.1 Rényi entropy 3 Simpson index 3.1 Inverse Simpson index 3.2 Gini–Simpson index 4 Berger–Parker index 5 See also 6 References 7 Further reading 8 External links

Richness Main article: Species richness Richness R simply quantifies how many different types the dataset of interest contains. For example, species richness (usually noted S) of a dataset is the number of different species in the corresponding species list. Richness is a simple measure, so it has been a popular diversity index in ecology, where abundance data are often not available for the datasets of interest. Because richness does not take the abundances of the types into account, it is not the same thing as diversity, which does take abundances into account. However, if true diversity is calculated with q = 0, the effective number of types (0D) equals the actual number of types (R).[2][4]

Shannon index The Shannon index has been a popular diversity index in the ecological literature, where it is also known as Shannon's diversity index, the Shannon–Wiener index,[citation needed] the Shannon–Weaver index and the Shannon entropy. The measure was originally proposed by Claude Shannon to quantify the entropy (uncertainty or information content) in strings of text.[5] The idea is that the more different letters there are, and the more equal their proportional abundances in the string of interest, the more difficult it is to correctly predict which letter will be the next one in the string. The Shannon entropy quantifies the uncertainty (entropy or degree of surprise) associated with this prediction. It is most often calculated as follows: H ′ = − ∑ i = 1 R p i ln ⁡ p i {\displaystyle H'=-\sum _{i=1}^{R}p_{i}\ln p_{i}} where pi is the proportion of characters belonging to the ith type of letter in the string of interest. In ecology, pi is often the proportion of individuals belonging to the ith species in the dataset of interest. Then the Shannon entropy quantifies the uncertainty in predicting the species identity of an individual that is taken at random from the dataset. Although the equation is here written with natural logarithms, the base of the logarithm used when calculating the Shannon entropy can be chosen freely. Shannon himself discussed logarithm bases 2, 10 and e, and these have since become the most popular bases in applications that use the Shannon entropy. Each log base corresponds to a different measurement unit, which have been called binary digits (bits), decimal digits (decits) and natural digits (nats) for the bases 2, 10 and e, respectively. Comparing Shannon entropy values that were originally calculated with different log bases requires converting them to the same log base: change from the base a to base b is obtained with multiplication by logba.[5] It has been shown that the Shannon index is based on the weighted geometric mean of the proportional abundances of the types, and that it equals the logarithm of true diversity as calculated with q = 1:[3] H ′ = − ∑ i = 1 R p i ln ⁡ p i = − ∑ i = 1 R ln ⁡ p i p i {\displaystyle H'=-\sum _{i=1}^{R}p_{i}\ln p_{i}=-\sum _{i=1}^{R}\ln p_{i}^{p_{i}}} This can also be written H ′ = − ( ln ⁡ p 1 p 1 + ln ⁡ p 2 p 2 + ln ⁡ p 3 p 3 + ⋯ + ln ⁡ p R p R ) {\displaystyle H'=-(\ln p_{1}^{p_{1}}+\ln p_{2}^{p_{2}}+\ln p_{3}^{p_{3}}+\cdots +\ln p_{R}^{p_{R}})} which equals H ′ = − ln ⁡ p 1 p 1 p 2 p 2 p 3 p 3 ⋯ p R p R = ln ⁡ ( 1 p 1 p 1 p 2 p 2 p 3 p 3 ⋯ p R p R ) = ln ⁡ ( 1 ∏ i = 1 R p i p i ) {\displaystyle H'=-\ln p_{1}^{p_{1}}p_{2}^{p_{2}}p_{3}^{p_{3}}\cdots p_{R}^{p_{R}}=\ln \left({1 \over p_{1}^{p_{1}}p_{2}^{p_{2}}p_{3}^{p_{3}}\cdots p_{R}^{p_{R}}}\right)=\ln \left({1 \over {\prod _{i=1}^{R}p_{i}^{p_{i}}}}\right)} Since the sum of the pi values equals unity by definition, the denominator equals the weighted geometric mean of the pi values, with the pi values themselves being used as the weights (exponents in the equation). The term within the parentheses hence equals true diversity 1D, and H' equals ln(1D).[1][3][4] When all types in the dataset of interest are equally common, all pi values equal 1 / R, and the Shannon index hence takes the value ln(R). The more unequal the abundances of the types, the larger the weighted geometric mean of the pi values, and the smaller the corresponding Shannon entropy. If practically all abundance is concentrated to one type, and the other types are very rare (even if there are many of them), Shannon entropy approaches zero. When there is only one type in the dataset, Shannon entropy exactly equals zero (there is no uncertainty in predicting the type of the next randomly chosen entity). Rényi entropy The Rényi entropy is a generalization of the Shannon entropy to other values of q than unity. It can be expressed: q H = 1 1 − q ln ⁡ ( ∑ i = 1 R p i q ) {\displaystyle {}^{q}H={\frac {1}{1-q}}\;\ln \left(\sum _{i=1}^{R}p_{i}^{q}\right)} which equals q H = ln ⁡ ( 1 ∑ i = 1 R p i p i q − 1 q − 1 ) = ln ⁡ ( q D ) {\displaystyle {}^{q}H=\ln \left({1 \over {\sqrt[{q-1}]{\sum _{i=1}^{R}p_{i}p_{i}^{q-1}}}}\right)=\ln({}^{q}\!D)} This means that taking the logarithm of true diversity based on any value of q gives the Rényi entropy corresponding to the same value of q.

Simpson index The Simpson index was introduced in 1949 by Edward H. Simpson to measure the degree of concentration when individuals are classified into types.[6] The same index was rediscovered by Orris C. Herfindahl in 1950.[7] The square root of the index had already been introduced in 1945 by the economist Albert O. Hirschman.[8] As a result, the same measure is usually known as the Simpson index in ecology, and as the Herfindahl index or the Herfindahl–Hirschman index (HHI) in economics. The measure equals the probability that two entities taken at random from the dataset of interest represent the same type.[6] It equals: λ = ∑ i = 1 R p i 2 {\displaystyle \lambda =\sum _{i=1}^{R}p_{i}^{2}} , where R is richness (the total number of types in the dataset). This equation is also equal to the weighted arithmetic mean of the proportional abundances pi of the types of interest, with the proportional abundances themselves being used as the weights.[1] Proportional abundances are by definition constrained to values between zero and unity, but it is a weighted arithmetic mean, hence λ ≥ 1/R, which is reached when all types are equally abundant. By comparing the equation used to calculate λ with the equations used to calculate true diversity, it can be seen that 1/λ equals 2D, i.e. true diversity as calculated with q = 2. The original Simpson's index hence equals the corresponding basic sum.[2] The interpretation of λ as the probability that two entities taken at random from the dataset of interest represent the same type assumes that the first entity is replaced to the dataset before taking the second entity. If the dataset is very large, sampling without replacement gives approximately the same result, but in small datasets the difference can be substantial. If the dataset is small, and sampling without replacement is assumed, the probability of obtaining the same type with both random draws is: ℓ = ∑ i = 1 R n i ( n i − 1 ) N ( N − 1 ) {\displaystyle \ell ={\frac {\sum _{i=1}^{R}n_{i}(n_{i}-1)}{N(N-1)}}} where ni is the number of entities belonging to the ith type and N is the total number of entities in the dataset.[6] This form of the Simpson index is also known as the Hunter–Gaston index in microbiology.[9] Since mean proportional abundance of the types increases with decreasing number of types and increasing abundance of the most abundant type, λ obtains small values in datasets of high diversity and large values in datasets of low diversity. This is counterintuitive behavior for a diversity index, so often such transformations of λ that increase with increasing diversity have been used instead. The most popular of such indices have been the inverse Simpson index (1/λ) and the Gini–Simpson index (1 − λ).[1][2] Both of these have also been called the Simpson index in the ecological literature, so care is needed to avoid accidentally comparing the different indices as if they were the same. Inverse Simpson index The inverse Simpson index equals: 1 λ = 1 ∑ i = 1 R p i 2 = 2 D {\displaystyle {\frac {1}{\lambda }}={1 \over \sum _{i=1}^{R}p_{i}^{2}}={}^{2}D} This simply equals true diversity of order 2, i.e. the effective number of types that is obtained when the weighted arithmetic mean is used to quantify average proportional abundance of types in the dataset of interest. The index is also as a measure of the effective number of parties. Gini–Simpson index The original Simpson index λ equals the probability that two entities taken at random from the dataset of interest (with replacement) represent the same type. Its transformation 1 − λ therefore equals the probability that the two entities represent different types. This measure is also known in ecology as the probability of interspecific encounter (PIE)[10] and the Gini–Simpson index.[2] It can be expressed as a transformation of true diversity of order 2: 1 − λ = 1 − ∑ i = 1 R p i 2 = 1 − 1 2 D {\displaystyle 1-\lambda =1-\sum _{i=1}^{R}p_{i}^{2}=1-{\frac {1}{{}^{2}D}}} The Gibbs–Martin index of sociology, psychology and management studies,[11] which is also known as the Blau index, is the same measure as the Gini–Simpson index.

Berger–Parker index The Berger–Parker[12] index equals the maximum pi value in the dataset, i.e. the proportional abundance of the most abundant type. This corresponds to the weighted generalized mean of the pi values when q approaches infinity, and hence equals the inverse of true diversity of order infinity (1/∞D).

See also Alpha diversity Beta diversity Cultural diversity Effective number of parties, a diversity index applied to political parties Gamma diversity Isolation index Measurement of biodiversity Qualitative variation Relative abundance Species diversity Species richness

References ^ a b c d e Hill, M. O. (1973). "Diversity and evenness: a unifying notation and its consequences". Ecology. 54: 427–432. doi:10.2307/1934352.  ^ a b c d e f g Jost, L (2006). "Entropy and diversity". Oikos. 113: 363–375. doi:10.1111/j.2006.0030-1299.14714.x.  ^ a b c d e Tuomisto, H (2010). "A diversity of beta diversities: straightening up a concept gone awry. Part 1. Defining beta diversity as a function of alpha and gamma diversity". Ecography. 33: 2–22. doi:10.1111/j.1600-0587.2009.05880.x.  ^ a b c d e Tuomisto, H (2010). "A consistent terminology for quantifying species diversity? Yes, it does exist". Oecologia. 4: 853–860. doi:10.1007/s00442-010-1812-0.  ^ a b Shannon, C. E. (1948) A mathematical theory of communication. The Bell System Technical Journal, 27, 379–423 and 623–656. ^ a b c Simpson, E. H. (1949). "Measurement of diversity". Nature. 163: 688. doi:10.1038/163688a0.  ^ Herfindahl, O. C. (1950) Concentration in the U.S. Steel Industry. Unpublished doctoral dissertation, Columbia University. ^ Hirschman, A. O. (1945) National power and the structure of foreign trade. Berkeley. ^ Hunter, PR; Gaston, MA (1988). "Numerical index of the discriminatory ability of typing systems: an application of Simpson's index of diversity". J Clin Microbiol. 26 (11): 2465–2466. PMC 266921 . PMID 3069867.  ^ Hurlbert, S.H. (1971). "The nonconcept of species diversity: A critique and alternative parameters". Ecology. 52: 577–586. doi:10.2307/1934145.  ^ Gibbs, Jack P.; William T. Martin (1962). "Urbanization, technology and the division of labor". American Sociological Review. 27: 667–677. doi:10.2307/2089624. JSTOR 2089624.  ^ Berger, Wolfgang H.; Parker, Frances L. (June 1970). "Diversity of Planktonic Foraminifera in Deep-Sea Sediments". Science. 168 (3937): 1345–1347. doi:10.1126/science.168.3937.1345. PMID 17731043.

Further reading Colinvaux, Paul A. (1973). Introduction to Ecology. Wiley. ISBN 0-471-16498-4.  Cover, Thomas M.; Thomas, Joy A. (1991). Elements of Information Theory. Wiley. ISBN 0-471-06259-6.  See chapter 5 for an elaboration of coding procedures described informally above. Chao, A.; Shen, T-J. (2003). "Nonparametric estimation of Shannon's index of diversity when there are unseen species in sample" (PDF). Environmental and Ecological Statistics. 10 (4): 429–443. doi:10.1023/A:1026096204727. Archived from the original (PDF) on 2007-08-12.

External links Simpson's Diversity index Diversity indices gives some examples of estimates of Simpson's index for real ecosystems. Retrieved from "https://en.wikipedia.org/w/index.php?title=Diversity_index&oldid=813242661" Categories: Measurement of biodiversityIndex numbersSummary statistics for categorical dataHidden categories: All articles with unsourced statementsArticles with unsourced statements from December 2015Pages using div col without cols and colwidth parametersPages using Columns-list with deprecated parameters