Fuzzy heaping mechanism for heaped count data with imprecision
- Authors
- Jung, Hye-Young; Choi, Heawon; Park, Taesung
- Issue Date
- 2018
- Publisher
- Springer Verlag
- Keywords
- Cigarettes smoked per day; Fuzzy heaping model; Genetic variants; Heaped data; Heaping; Imprecision; Reported data
- Citation
- Soft Computing, v.22, no.14, pp.4585 - 4594
- Indexed
- SCIE; SCOPUS
- Journal Title
- Soft Computing
- Volume
- 22
- Number
- 14
- Start Page
- 4585
- End Page
- 4594
- URI
- https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/7878
- DOI
- 10.1007/s00500-017-2641-4
- ISSN
- 1432-7643
- Abstract
- In genetic association studies, the traits of interest may sometimes be collected from the reported data. Since subjects report exact responses and/or rounded responses, the histogram of data frequently exhibits spikes at particular values. This phenomenon, known as heaping, can cause difficulties in performing the association test via standard modeling approaches. Recently, several models have been proposed to identify the true unobservable underlying distribution from heaped data. However, all of these methods depend on probabilistic assumptions regarding the heaping mechanism. Unfortunately, probabilistic models cannot represent heaped data effectively, because heaping can be caused by imprecisely reported values. This type of imprecision is different from probabilistic uncertainty, which is described well by a probabilistic model. In this paper, we propose a fuzzy heaping model to identify genetic variants for the heaped count data. Our fuzzy model uses a mixture of likelihood functions for precisely and imprecisely reported data, treating heaped data as imprecise data represented by fuzzy sets. Moreover, since reported count data may include excess zeros, as well as heaped data, we extend our fuzzy heaping model to handle excess zeros. Through simulation studies, we show that the proposed fuzzy heaping model controls type I errors effectively and has great power to identify causal variants. We illustrate the proposed fuzzy heaping model through a study of the identification of genetic variants associated with the number of cigarettes smoked per day. © 2017, Springer-Verlag Berlin Heidelberg.
- Appears in Collections
- COLLEGE OF SCIENCE AND CONVERGENCE TECHNOLOGY > ERICA Department of Mathematical Data Science > 1. Journal Articles