Show simple item record

dc.contributor.advisorWilson, Simonen
dc.contributor.authorAl-Ghamdi, Asmaaen
dc.date.accessioned2021-02-25T14:37:46Z
dc.date.available2021-02-25T14:37:46Z
dc.date.issued2021en
dc.date.submitted2021en
dc.identifier.citationAl-Ghamdi, Asmaa, An Integrated Framework for Estimating the Number of Classes with Application for Species Estimation, Trinity College Dublin.School of Computer Science & Statistics, 2021en
dc.identifier.otherYen
dc.descriptionAPPROVEDen
dc.description.abstractThe two most common approaches for estimating the number of distinct classes within a population are either to use sampling data directly with combinatorial arguments or to extrapolate historical discovery data. However, in the former case, such detailed sampling data is often unavailable, while the latter approach makes assumptions on the form of parametric curves used to fit the discovery data, that are often lacking in theoretical justification. Instead, we propose an integrated transdisciplinary framework that dissolves the boundaries between the above two approaches. This is achieved by directly describing the samplingdiscovery process in parallel with describing a co-variate latent e↵ort process, where we have historical discovery data for the former process and some proxy data for the latent process. The linkage between these two processes allows one to form data on sampling records by forcing some constraints on how many samples were taken over time. Due to the nature of the constrained data, many inference techniques become infeasible. However, simulation-based methods such as Approximate Bayesian Computation remain available. Our proposed approach is demonstrated and analysed through many simulation experiments, and finally applied in the ecology field to estimate the number of species as an example of the number of classes problem.en
dc.publisherTrinity College Dublin. School of Computer Science & Statistics. Discipline of Statisticsen
dc.rightsYen
dc.subjectReliability, Species Estimation, Latent effort Process, Approximate Bayesian Computationen
dc.titleAn Integrated Framework for Estimating the Number of Classes with Application for Species Estimationen
dc.typeThesisen
dc.relation.referencesThe fit of the Poisson-inverse Gaussian distribution to the several well-known data sets examined here suggests that it may be an appropriate model for species abundance data under specific conditions but will not always be the clear choiceen
dc.relation.referencesThe main finding is that both the method of computing the marginal likelihood as the expectation of the likelihood under the prior, as the method of computing the harmonic mean of likelihood values when sampling from the posterior, are not trustworthy....en
dc.relation.referencesThe code to implement the logistic function approach of [64] is written in C. This is a pure curve-fitting approach and only uses the discovery times as its input. It was applied to the CoL data. This data set was too large to be handled by the code, which was solved by dividing the yearly discovery counts by 40 and rounding to an integer, then fitting this smaller data set, obtaining an estimate of the number of species remaining to be discovered and multiplying that back by 40. The WoRMS data had a similar problem and that was solved similarly, but by dividing the number of discoveries per year by 10.en
dc.type.supercollectionthesis_dissertationsen
dc.type.supercollectionrefereed_publicationsen
dc.type.qualificationlevelDoctoralen
dc.identifier.peoplefinderurlhttps://tcdlocalportal.tcd.ie/pls/EnterApex/f?p=800:71:0::::P71_USERNAME:AALGHAMDen
dc.identifier.rssinternalid224453en
dc.rights.ecaccessrightsopenAccess
dc.contributor.sponsorScholarship from King Abdulaziz University, Saudi Arabiaen
dc.identifier.urihttp://hdl.handle.net/2262/95385


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record