An application of machine learning for the identification of adolescent smoking risk factors
- Author(s): Sophia J. Chung, PhD, MSN, RN; Youngji Li
- Sigma Affiliation: Gamma
Methods: The 2015 Korean Youth Risk Behaviors Web-based Survey (KYRBS) was the data source for this study. The KYRBS is an annual, nationwide survey conducted in South Korea to examine health behaviors, including cigarette smoking, individual hygiene, and alcohol consumption. The 2015 KYRBS data were collected via self-report questionnaires completed by 68,043 students in grades 7 through 12 at 800 randomly selected schools in South Korea. For this study, we used the 5,123 surveys in which the smoking-related items had been completed. This study followed the machine-learning pipeline developed by Fayyad (1996) and Yoon (2015). To reduce the "curse of dimensionality," in which a large number of interrelated variables in a large dataset interferes with the accuracy of the machine-learning model, we selected clinically meaningful features based on the conceptual framework for adolescent risk behaviors (Jessor, 1991). We then applied three machine-learning algorithms available in Weka (i.e., J48, Naïve Bayes, and Logistic Regression) to build a predictive model of smoking behavior among the adolescents represented in the KYRBS dataset. The final model was selected based not only on the accuracy of the predictive model but also on the F-measure, which is calculated from precision and recall.
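The two selection criteria named in the Methods, accuracy and the F-measure, can be sketched in a few lines of Python. This is only an illustration of the standard definitions, not the authors' Weka workflow. The class name `ConfusionCounts`, the helper functions, and the example counts are all hypothetical and do not come from the KYRBS analysis.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Binary confusion-matrix counts, treating 'smoker' as the positive class."""
    tp: int  # smokers predicted as smokers
    fp: int  # non-smokers predicted as smokers
    fn: int  # smokers predicted as non-smokers
    tn: int  # non-smokers predicted as non-smokers


def accuracy(c: ConfusionCounts) -> float:
    """Share of all cases that were classified correctly."""
    total = c.tp + c.fp + c.fn + c.tn
    return (c.tp + c.tn) / total if total else 0.0


def precision(c: ConfusionCounts) -> float:
    """Share of predicted smokers who really are smokers."""
    predicted_positive = c.tp + c.fp
    return c.tp / predicted_positive if predicted_positive else 0.0


def recall(c: ConfusionCounts) -> float:
    """Share of actual smokers the model identified."""
    actual_positive = c.tp + c.fn
    return c.tp / actual_positive if actual_positive else 0.0


def f_measure(c: ConfusionCounts) -> float:
    """Harmonic mean of precision and recall (the F1 score)."""
    p, r = precision(c), recall(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Hypothetical counts for illustration only; not taken from the study.
example = ConfusionCounts(tp=40, fp=10, fn=20, tn=30)
print(f"accuracy={accuracy(example):.3f}  "
      f"precision={precision(example):.3f}  "
      f"recall={recall(example):.3f}  "
      f"F-measure={f_measure(example):.3f}")
```

Reporting the F-measure alongside accuracy is useful when one class is much rarer than the other, because accuracy alone can look high even when few positive cases are detected.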
Results: Through the feature selection process, we classified 40 features into three predictive categories. Among the three machine-learning algorithms we applied, the Logistic Regression algorithm demonstrated the highest level of accuracy (i.e., 84.0% of adolescent smokers were correctly classified; F-measure = 0.795). In this model, grade (-0.06) and alcohol consumption (-0.56) were the top two features with the highest coefficients. In other words, being a middle school student and never having drunk alcohol were highly associated with smoking behavior.
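As a reading aid for the coefficients reported above, a logistic regression coefficient can be exponentiated to give an odds ratio. The sketch below shows only that arithmetic. The abstract does not state how the features were coded, which outcome class the coefficients refer to, or whether the inputs were standardized, so the direction and size of any effect cannot be read off from this calculation alone. The function name `odds_ratio` is hypothetical.

```python
import math


def odds_ratio(coefficient: float) -> float:
    """Multiplicative change in the odds for a one-unit increase in a feature."""
    return math.exp(coefficient)


# Coefficients as reported in the Results; feature coding is NOT given in the
# abstract, so these odds ratios are arithmetic illustrations only.
reported_coefficients = {"grade": -0.06, "alcohol consumption": -0.56}

for feature, coef in reported_coefficients.items():
    print(f"{feature}: coefficient {coef:+.2f} -> odds ratio {odds_ratio(coef):.3f}")
```

Under these assumptions, the reported values of -0.06 and -0.56 correspond to odds ratios of roughly 0.94 and 0.57 per one-unit increase in the respective feature.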
Conclusion: Our study demonstrates that a machine-learning approach is effective in identifying behavioral predictors from a large, complex dataset, in this case the behavioral predictors associated with smoking in the KYRBS. However, our results were inconsistent with those reported in the literature. Previous studies showed that increasing grade and previous alcohol consumption were associated with adolescents' smoking behaviors (Mendol, 2013; Talip, 2015). Further study of the association between smoking behaviors and alcohol consumption among Korean adolescents is needed. Although this study had some limitations (e.g., the KYRBS data are cross-sectional), our machine-learning approach shows promise, and subsequent research using longitudinal data can take into account the trends of association implicit in creating a predictive model.
Event Theme: Influencing Global Health Through the Advancement of Nursing Scholarship
Items submitted to a conference/event were evaluated/peer-reviewed at the time of abstract submission to the event. No other peer-review was provided prior to submission to the Henderson Repository.
Type | Poster |
Acquisition | Proxy-submission |
Review Type | Abstract Review Only: Reviewed by Event Host |
Format | Text-based Document |
Evidence Level | N/A |
Keywords | Adolescents; Cigarette Smoking; Machine Learning |
Name | 28th International Nursing Research Congress |
Host | Sigma Theta Tau International |
Location | Dublin, Ireland |
Date | 2017 |
All rights reserved by the author(s) and/or publisher(s) listed in this item record unless relinquished in whole or part by a rights notation or a Creative Commons License present in this item record.
All permission requests should be directed accordingly and not to the Sigma Repository.
All submitting authors or publishers have affirmed that when using material in their work where they do not own copyright, they have obtained permission of the copyright holder prior to submission and the rights holder has been acknowledged as necessary.
Related items
Showing items related by title, author, creator and subjects.
- Modifiable cardiovascular risk factors in the early adolescent period
  Greenwalt, Julia A. The purpose of this study is to describe patterns and relationships among the modifiable cardiovascular risk factors of smoking behavior, overweight, physical inactivity and poor dietary behaviors within a ninth grade ...
- Obesity-Related Behaviors of Korean Female Adolescents in Their Classroom-Based Peer Networks
  Chung, Sophia (2016-03-21). Session presented on Sunday, November 8, 2015: Purpose: The purpose of this pilot study is to examine obesity-related behaviors of female Korean adolescents within a classroom-based peer network. Design: A complete social ...
- Factors associated with intermittent and light smoking among Korean adolescents
  Park, In Sook; Ra, Jin Suk (2017-06-05). Purpose: Smoking in adolescence is a risk factor of lung cancer and death from cardio-cerebral vascular disease in adulthood; in the short term, it can also be associated with adolescents’ poor psychological health ...
- The relationships between healthy literacy and self-care of diabetic management in children with diabetes mellitus
  Yang, Yi-Ling; Huang, Li-Chi; Wang, Chung-Hsing; Lee, Jo-Hua. This study will explore the relationships between healthy literacy and self-care of diabetic management in adolescent children with diabetes mellitus, including Type I and Type II.
- The application of reference group theory in Chinese adolescent smoking
  Tsai, Han-Yi; Tsai, Tzu-I (2017-09-26). In Taiwan, smoking prevalence rate has substantially declined in adults but not in adolescents. Reference group theory, modified Q methodology and card sorting were adapted to this study design so as to explore the main ...