Skip to main content

ORIGINAL ARTICLE

Int J Public Health, 19 January 2023
This article is part of the Special Issue PUBLIC HEALTH AND PRIMARY CARE, IS 1+1=1? View all 47 articles

Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Haihong Liu,Haihong Liu1,2Xiaolei Zhang,Xiaolei Zhang3,4Haining Liu,,
Haining Liu2,5,6*Sheau Tsuey Chong,
Sheau Tsuey Chong1,7*
  • 1Centre for Research in Psychology and Human Well-being, Faculty of Social Sciences and Humanities, Universiti Kebangsaan Malaysia, Bangi, Malaysia
  • 2Department of Psychology, Chengde Medical University, Chengde, China
  • 3Department of Biomedical Engineering, Chengde Medical University, Chengde, China
  • 4Faculty of Engineering, Universiti Putra Malaysia, Serdang, Malaysia
  • 5Hebei Key Laboratory of Nerve Injury and Repair, Chengde Medical University, Chengde, China
  • 6Hebei International Research Center of Medical Engineering, Chengde Medical University, Chengde, China
  • 7Counselling Psychology Programme, Secretariat of Postgraduate Studies, Faculty of Social Sciences and Humanities, Universiti Kebangsaan Malaysia, Bangi, Malaysia

Objective: To explore the predictive value of machine learning in cognitive impairment, and identify important factors for cognitive impairment.

Methods: A total of 2,326 middle-aged and elderly people completed questionnaire, and physical examination evaluation at baseline, Year 2, and Year 4 follow-ups. A random forest machine learning (ML) model was used to predict the cognitive impairment at Year 2 and Year 4 longitudinally. Based on Year 4 cross-sectional data, the same method was applied to establish a prediction model and verify its longitudinal prediction accuracy for cognitive impairment. Meanwhile, the ability of random forest and traditional logistic regression model to longitudinally predict 2-year and 4-year cognitive impairment was compared.

Results: Random forest models showed high accuracy for all outcomes at Year 2, Year 4, and cross-sectional Year 4 [AUC = 0.81, 0.79, 0.80] compared with logistic regression [AUC = 0.61, 0.62, 0.70]. Baseline physical examination (e.g., BMI, Blood pressure), biomarkers (e.g., cholesterol), functioning (e.g., functional limitations), demography (e.g., age), and emotional status (e.g., depression) characteristics were identified as the top ten important predictors of cognitive impairment.

Conclusion: ML algorithms could enhance the prediction of cognitive impairment among the middle-aged and older Chinese for 4 years and identify essential risk markers.

Introduction

With the current rapidly aging global population, the burden of dementia in low-income countries is expected to increase dramatically in the coming decades [1]. Currently, 47.47 million people worldwide have been diagnosed with dementia but by 2050, this number is expected to double [2]. To identify the population with the highest risk of dementia, focusing on the early stages of the pathological process is a viable strategy for prevention. Cognitive impairment is characterized by decreased memory, attention and language, and deterioration in other cognitive functions, including mild cognitive impairment and dementia [3]. At present, neuropsychological assessment is an important method for screening and diagnosing cognitive impairment. For example, neuropsychological examinations such as Mini-mental State Examination (MMSE) and Montreal Cognitive Assessment (MoCA) are useful evaluation tests for cognitive function [5, 6]. Considering the shortage of community professionals and time, telephone interview for cognitive status (TICS) with fewer items has demonstrated its value as an effective screening tool for cognitive impairment in the community compared with other methods such as MMSE [4]. This method complements commonly used cognitive function evaluation tools, by identifying potential risk factors for cognitive impairment. Early screening of individual cognitive impairment is crucial to preventing cognitive decline, and progression to dementia through immediate effective treatment and management strategies.

At present, a fundamental strategy to prevent or minimize cognitive decline is through early detection of the risk factors for cognitive impairment, which benefits preventive intervention [5]. Therefore, the prediction of cognitive impairment plays an important role in mitigating and preventing cognitive impairment. However, predicting cognitive impairment is a challenging process. Few empirical studies have compared the predictors of cognitive impairment, and these methods mainly used meta-analysis methods and traditional regression methods. For example, a meta-analysis reported in the Lancet in 2020, revealed that approximately 35% of dementia was contributed by nine factors, including previous education, high blood pressure, middle-aged obesity, and late-life depression [5]. Researchers have used traditional regression statistical methods to identify the common predictors of cognitive impairment outcomes including demographic characteristics and general health information. However, these studies did not point out the importance ranking of influencing factors.

Notably, in these studies, the most commonly used analysis method was regression-based inferential statistics. Yet, the predictors produced by these surveys were insensitive [6]. Several issues associated with conventional statistics limit their robust prediction of complex neurodegenerative processes. Traditional regression-related approaches can only accommodate a restricted number of predictors and cannot process the complex multi-class characteristic variables [7]. In addition, these methods are based on linear assumptions and may not be able to effectively manage more complex patterns including non-linear and higher dimensional [8].

Emerging computing methods using machine learning can optimize the prediction of cognitive impairment, to overcome the shortcomings of traditional methods. Machine learning(ML) has been used for clinical classification and prediction based on extracted high-dimensional features from data [9]. The random forest is a typical ML technique with high predictive performance and robustness as regards to its accuracy and ease of implementation [10, 11]. This method has a high level of predictive ability. It creates multiple decision trees by implementing random sampling in the same data set, combining them and finally predicting the target variable [12, 13]. Importantly, random forests also have excellent predictive ability to discover the correlation between explanatory variables and diseases, while preventing over-fitting when multiple explanatory variables are applied to the model [14, 15]. Compared with other methods, random forest demonstrated the highest accuracy compared with other methods in predicting cognitive impairment [16, 17].

In recent years, machine learning, has been increasingly used in research to predict cognitive impairment. These studies mainly have two characteristics. First, most of these studies use expensive sample data sources, such as MRI, PET and other medical imaging methods [1820]. Second, predictor variables in some studies that predict cognitive impairment or other diseases, are only self-reported variables [16, 21, 22].

Cognitive impairment is an age-related condition caused by Alzheimer’s disease, vascular dementia, mixed dementia or other related types of dementias with no cure [23, 24]. Therefore, ML technology has high potential value in evaluation of risk factors of cognitive impairment. Conventional regression-based approaches have been used to effectively identify key risk factors, including demographic (e.g., education, venerable age, gender) [25], physical condition (e.g., body mass index (BMI), hearing difficulty) [26, 27], lifestyle activities (e.g., smoking, drinking and instrumental activities of daily living (IADL) [28], poor psychological wellbeing (e.g., depression) [2931], and chronic diseases (e.g., chronic lower back pain) [32]. However, few studies have examined the effect of these independent risk factors on cognitive impairment among older Chinese using a prospective design.

Given the potentially severe consequences of cognitive impairment, an improvement in the predictions for middle-age and older people is important. To solve this problem and fill gaps in the previous literature, this study aims to examine the predictive power of the ML model for cognitive impairment using China Health and Retirement Longitudinal Survey database. In addition, the study compared the prediction accuracy of random forest in ML with traditional inferential statistical method (logistic regression). First, a large representative elderly sample, which consisted of middle-aged and elderly people, participated in a 3-year survey. Second, random forest and logistic regression were used to longitudinally predict cognitive impairment and identify ten most important predictors, including biological factors and psychological at 2-year and 4-year follow-ups from 44 baseline predictors. These baseline predictors included demographic (e.g., age, education level), health status and functioning (e.g., physical functions, biomarkers), emotional status (e.g., depression), lifestyle and behavior (e.g., smoking, drinking, sleeping habits). Finally, a 4-year follow-up cross-sectional data was applied to construct a model to verify the accuracy of the longitudinal prediction model for previous 2 years.

It was assumed that the random forest model of ML could predict long-term cognitive impairment. In addition, compared with the logistic regression model, the random forest model could improve the prediction of long-term cognitive impairment outcomes. ML model could also screen out the risk factors with the most significant impact on cognitive impairment. Overall, the study results provide insights into the practicability of this innovative computational method, which has potential diagnostic value in cognitive decline. More importantly, the model provided a ranking of predictors of cognitive impairment which is invaluable for identifying the risk various factors (simple and easily available variables) of cognitive impairment in daily life, for effective prevention and intervention to promote healthy aging [23].

Methods

Dataset and Participants

The data for this study were obtained from the CHARLS from 2011 to 2015. This longitudinal survey covered 450 villages or communities in 150 counties/districts, of which 52.67% comprised rural areas, and 47.33% was urban areas [33]. The CHARLS survey aims to build high-quality public databases of individuals and families of middle-aged and older persons aged 45 and above across the country. The national baseline survey was conducted in 2011, whereas the second and third data surveys were carried out in 2013and 2015, respectively. A total of 19,817 respondents were involved in the 2011 baseline survey. In 2013, a total of 18,605 respondents participated in the assessment of cognitive ability. In 2015, a total of 21,095 respondents participated in all the same surveys as the baseline survey. First, the 2011, 2013, and 2015 data were merged according to the principle of ID and household ID matching, and a total of 4,043 participants participated in the survey from 2011 to 2015. Then, missed follow-up data and interviewees answered by others, and variables missing >20% of patient data were excluded from the analysis. Finally, respondents who participated in all three surveys and without the characteristics in the aforementioned exclusion criteria were included in the analysis. The final sample size was 2,326. (The data preprocessing process is shown in Supplementary Figure S1). Specifically, basic information module, health status and functioning module, physical examination and blood-based biomarkers data from 2011 to 2015, and cognitive module data from 2013 were used in this study. The study protocol of the CHARLS was approved by the Peking University Biomedical Ethics Committee, which conformed to the standards set by the latest revision of the Declaration of Helsinki (IRB00001052-11015) (http://charls.ccer.edu.cn/charls/, https://opendata.pku.edu.cn/dataverse/CHARLS).

Patient and Public Involvement

In this study, we used data from open database CHARLS, which is a nationally longitudinal survey. Therefore, no direct patient was involved and contacted.

Cognitive Function

To measure the cognitive status of the research population, several measurements of the telephone interview of cognitive status (TICS-10) in the CHARLS data were used [34, 35]. These included date, week, and season among others (orientation and attention), which scored with 5 points; 100 minus 7 calculation series scored with 5 points; the recall, delayed word recall and episodic memory for 10 words was scored with 20 points; and the drawing of two repeated five-sided graphs (visual spatial abilities) was scored with 1 point. The total cognitive function score was 31 points. In general, a higher score indicated better cognitive function of middle-aged and elderly people.

Based on the previous studies, the results of cognitive classification and baseline demographics (e.g., age, education level, marital status, type of residence), health status and functioning (ADL, IADL, functional limitations, life expectancy, eyes, hearing, oral cavity, pain, physical examination, and blood indicators), emotional status (depression CES-D), lifestyle and behavior (sleeping, physical activity, social interaction, smoking, and drinking) were related. Therefore, 44 baseline features in the model were selected to reflect the previously determined predictors in the dataset.

Demographics

Demographic variables included gender, age, education (no formal education illiterate, does not finish primary school but capable of reading or writing, sishu, elementary school, middle school, vocational school, two/three-year college/associate degree, four-year college/bachelor’s degree, Post-graduate, Master’s degree), household registration (agricultural household registration, non-agricultural household registration and unified residence household registration), marital status (being married and not being married) among others. Six options were present in the questionnaire regarding marital status, namely “married and living with spouse,” “married but not living with a spouse for a temporary period due to work and other reasons,” “separated,” “divorced,” “widowed,” and “never married.” The first two types of marital status were defined as “being married,” while the remaining two types were defined as “not being married.”

Health Status and Functioning

In this section, to assess health status and functioning, predictive variables mainly related to health and function were used, including eyesight (close) and eyesight (distant objects), hearing problem, tooth loss, chronic disease (participants self-reported whether they had a chronic disease diagnosed by a doctor), pain (are you often troubled with body pains?), physical functions (height, weight, BMI, and respiratory function, blood pressure), blood indicators, activities of daily living (ADL) and instrumental activities of daily living (IADL) [36]. ADL includes dressing, bathing, eating, getting into and out of bed, using the toilet, controlling urination and defecation and IADL includes doing household chores, preparing hot meals, shopping, money management, taking medicine). Are there any difficulties in these daily routines? There are 4 options for all questions: 1. No difficulty, 2. Difficulty but achievable, 3. Difficulty and need help, 4. Unable to complete. A higher score indicates lower quality of acting.

Functional limitations, included “running 1 km,” “walking 100 m,” “sedentary standing up,” “climbing stairs,” and “picking up coins” and nine questions regarding the difficulty of nine basic activities, with a total score range within 0–27 points. A higher score indicated deteriorating body function Following that life expectancy was assessed through the possibility of an individual’s assumption about living until the expected age. Accordingly, the questions comprised a rating of 1–5, which indicated from “nearly impossible” to “very certain.”

Venous blood samples (biomarkers) comprised: high-sensitivity C-reactive protein (hsCRP), glycosylated hemoglobin (HbA1c), total cholesterol, high density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, triglycerides, glucose, blood urea nitrogen (BUN), creatinine, uric acid, and cystatin C [37, 38].

Emotional Status

The CHARLS database employed the Center for Epidemiologic Studies Depression Scale-10 (CESD-10) to investigate the depression risk among middle-aged and elderly people, with depression scores ranging from 0 to 30 points. Subsequently, higher score indicated higher susceptibility to depression. A score of 10 points or higher indicates high risk of depression [39].

Lifestyle and Behavior

This study identified lifestyle and behavior as the main factors affecting cognitive function. The other factors identified included sleeping habits (night sleep time and nap time), eating habits (number of meals a day), smoking, drinking, physical activity (amount of exercise per person), energy expenditure (total number of high and low activities of each person * weight), and social interaction (entertainment activities, service activities, other activities, and whether to participate in social activities).

Outcome Variables

During the 2013 and 2015 follow-ups, the outcome variables of the survey were consistent with the cognitive function test in 2011 variables such as date, week, and season (orientation and attention), and scored 5 points; 100 minus 7 calculation series, scored 5 points, recall and delayed recall word recall episodic memory 10 words, scored 20 points, and drawing two repeated five-sided graphs (visual spatial abilities) scored 1 point. The total cognitive score was 31 points. When the total score of participants exceeded 1 standard deviation lower than the standard of the corresponding age group, the participants were classified as cognitively impaired. Meanwhile, other participants were defined as having normal cognition [34, 40].

Performance Evaluation and Data Analyses

All data were analyzed using SPSS version 26.0 version and Python version 3.8. The random forest and logistic regression model were employed to predict the cognitive function of middle-aged and elderly people in 2013 and 2015. Subsequently, 2,326 data sets were divided into a training set (70%, N = 1,628) and test set (30%, N = 698). Some missing values were imputed using the method of nearest neighbor imputation. A flow chart of the random forest data analysis process is shown in Supplementary Figure S1. The parameters set of the random model were as follows: maximum depth of the forest = 6, and maximum number of leaves = 90(to alleviate over-fitting). The learning rate was set to 0.001, and the training evaluation index was 100 iterations of AUC training. When the number of iterations exceeded 5 times, no further increase in the AUC value was observed, and the training was stopped to prevent over-fitting. To validate the model, the 10-fold cross-validation method was used.

In the logistic regression model, the input data were standardized in order to speed up the gradient descent to find the optimal solution. The regularization parameter was selected as “L2,” the number of cross-validation was set to 10 folds, the loss function was optimized by the second derivative matrix of the loss function, the regularization coefficient was set to 20 equal parts from −2 to 2, and the error range of the iteration termination criterion was 0.01.

Descriptive Analysis

Results were expressed as the mean (± standard deviation) of continuous variables or the percentage of subjects in categorical variables. The AUC value in the Receiver Operator Characteristic curve (ROC) was the area under the ROC curve, which reflected the performance of the model. The AUC >0.9 was considered very good, 0.8–0.90 was considered good, 0.7–0.8 was regarded as fair, and <0.7 was regarded as poor [41].

Results

Sample Characteristics

The baseline characteristics of this study in 2011 are presented in Table 1. In the first year (baseline, 2011), there were 2,326 participants, including 1,318 people aged 45–59 (middle-aged) and 1,008 people aged 60 and above (elderly people). The average score of cognitive function was 12.94 (±5.95). The specific score for each baseline or variable was as follows: demographic (5), health status (24), functioning (3), emotional status (2), and lifestyle and behavior (10).

TABLE 1
www.frontiersin.org

TABLE 1. Baseline sample characteristics (China, 2011).

Model Performance

Table 2 shows the comparison of AUC between random forest and logistic regression. The results show that random forest performs better than logistic regression. Table 3 illustrates the model performance indicators of random forest models. The results for the 2nd and 4th year follow-up model indicators show that the AUC of the 3 ML models (model a1, model a2, model b) were 0.81, 0.79, and 0.80, respectively (see Supplementary Figures S2–S4). The models fit was good and fair.

TABLE 2
www.frontiersin.org

TABLE 2. AUC of random forest and logistic regression (China, 2022).

TABLE 3
www.frontiersin.org

TABLE 3. Model performance (China, 2022).

Predictor Variables Importance

Table 4 presents the predictor variable ranking results by importance of 10 feature selection methods that were performed on the dataset. The cognitive classification prediction model for 2013 ranked the variables from the most important to the lest important as follows: “education,” “triglycerides,” “age,” “BMI,” “LDL cholesterol,” “uric acid,” “functional limitations,” “pulse,” “HDL cholesterol,” and “life expectancy.” For 2015, the prediction of cognitive classification model’s variable ranking was in the following order: “BMI,” “depression,” “pulse,” “systolic,” “education,” “total cholesterol,” “blood urea nitrogen (BUN),” “HDL,” “cholesterol,” “Uric acid,” and “HsCRP.” The important factors of the forecast model for the verification of horizontal data were organized in the following order: “BMI,” “pulse,” “depression,” “breath,” “creatinine,” “age,” “night sleep time,” “education,” “triglycerides,” and “total cholesterol.”

TABLE 4
www.frontiersin.org

TABLE 4. Importance of variables of predictive models and validated predictive models (China,2022).

Discussion

Advantages of Machine Learning Methods

With the advances in the electronic age, the application of machine learning to clinical disease diagnosis, differential diagnosis, and disease prediction is increasing [42, 43]. This study provides a new method of machine learning to predict cognitive impairment in middle-aged and elderly people. The method of mutual verification of the combination of longitudinal and cross-sectional data was adopted to improve the effectiveness of the model. Good results were achieved, with the best model producing an AUC of 0.81 [41] compared with logistic regression [22, 42].

Indeed, logistic regression was previously considered a standard method for binary classification. Compared with machine learning methods, logistic regression is limited by assumptions of normality and linear relationships and may not evaluate the non-linear and complex relationships between physiological and social data. As a non-parametric technique, random forest in machine learning can overcome the shortcomings of under-fitting in traditional regression methods, and at the same time prevent over-fitting [11, 44]. It is thus considered a more flexible method for assessing complex interactions between variables. This study also corroborates previous findings that random forest is more suitable for the application of high-dimensional variables and large-scale data [45]. At present, it has become an alternative standard classification method to logistic regression [12]. Couronné et al. used 243 real data sets to conduct systematic large-scale comparative study, which showed that the average prediction performance of random forest is better than that of logistic regression [46]. In view of the fact that there are many variable dimensions involved in this study, the sample size was large. Predictive models for cognitive impairment in middle-aged and elderly people are lacking. Therefore, the present study established a predictive model of cognitive impairment in middle-aged and elderly people using the random forest method. The results indicated that the use of sociodemographic characteristics, health status and functioning, and emotional state could accurately predict cognitive impairment.

Impact of Demographic Variables on Cognitive Impairment

From the results, the demographic variables of age and education level are important predictors, which is similar to previous research results [22, 47]. Previous studies showed that the prevalence of dementia was higher among the people aged 65 years old or older [47]. Education level could reduce the risk of cognitive impairment and dementia [48, 49]. Compared with people with no education, less education was associated with a lower risk of cognitive impairment. Notably, education contributes to cognitive reserve [50, 51].

Impact of Health Status on Cognitive Impairment

Blood predictors including non-invasive markers were used in the current research to predict cognitive impairment. Non-invasive markers help predict patients with normal cognitive status or cognitive impairment, which may also contribute to better preventive measures [52, 53]. The results of this study indicated that Biomarkers in health status are important risk factors for predicting cognitive impairment. This finding is consistent with previous studies. We also found that uric acid, HsCRP, creatinine, LDL cholesterol, HDL cholesterol, total cholesterol, triglycerides, and BUN in venous blood samples were important predictors of the cognitive impairment. This finding is consistent with previous studies showing that low uric acid is a risk factor for cognitive impairment. When at an appropriate level, uric acid could reduce the occurrence and development of cognitive impairment [54]. Decreased creatinine concentration may indicate the occurrence of cognitive decline [55].

The results of epidemiological studies indicated the presence of a preliminary correlation between inflammation and cognitive impairment [56]. Noble, Manly [57] reported that the elderly with higher C-reactive protein levels are at a higher risk of memory impairment. Therefore, evidence supports the important role of this biomarker as a vascular risk factor for cognitive decline [58]. Furthermore, a 31-year longitudinal study concluded that the HsCRP changes during the middle age could reflect the underlying process of aging-related cognitive decline [59]. Longitudinal studies also found that higher baseline Triglycerides and LDL-C concentrations were associated with a higher rate of cognitive decline, but the effect of Triglycerides was not significant [60, 61]. Overall, our findings were slightly different from those of previous studies. Notably, HDL cholesterol and triglycerides showed an important predictive effect on cognitive impairment. The latest meta-analysis experiments have concluded that triglycerides potentially affect cognition [6264].

For the indicators of physical examination, the findings of this study were in line with the previous findings that physiology is correlated with cognitive function was consistent. A useful indicator of physical health status is BMI, which screens for human weight. Furthermore, BMI ranked higher in terms of the important features for both regardless of it being the longitudinal prediction model and horizontal verification model. A longitudinal study conducted in South Korea found that obesity or weight loss in the later stages of life did not affect the risk of cognitive impairment [65]. Another study found that obesity at the middle age was an important predictor of the development of cognitive impairment in the later life phase [66]. In the latest research, it was proposed that being underweight was possibly an important risk factor for cognitive impairment among the elderly in China [26]. The results of a machine learning study also show that BMI ranked among the top 10 predictors of mild cognitive impairment (MCI) [21, 67]. Therefore, interventions for cognitive function among the elderly should target weight management.

The importance of blood pressure was identified. Similarly, previous studies highlighted a positive association between elevated diastolic or systolic blood pressure and the risk of cognitive impairment [68]. This condition played a crucial role in the guidance of routine clinical practice. To illustrate, effective control of blood pressure could reduce the risk of cognitive impairment, which was in line with previous research results [69]. Furthermore, in the present study systolic pressure was an important predictor of cognitive impairment, and has been established to cause cerebrovascular diseases and subsequently reduce cognitive ability [70, 71]. Pulse is also an important predictor. Previous studies also that showed the combination of higher pulse speed and age contributed to a gradual decrease in cognitive ability [72].

In a meta-analysis, a negative correlation was found between pulse wave velocity and cognition, particularly executive function, memory, and overall cognition. However, this association was independent of demographic, clinical, and evaluation characteristics [73]. In all horizontally verified prediction models, the feature of the breath (vital capacity) function was ranked fourth. Previous studies have also verified the relationship between lung function and cognitive function. A recent systematic review reported that although some research has shown a correlation between lung function and cognition, the result of the current study indicated several limitations and used a single measurement method [74]. Similarly, the present study, used only one type of expiratory volume as an indicator of lung function. Thus, further study is required to unravel the complex relationship among these factors.

Functional limitations have been proved as important predictors of cognitive impairment in previous studies. Multiple horizontal and longitudinal studies have concluded that poorer dysfunction is associated with more severe cognitive impairment [75, 76]. In recent years, functional limitations have been added to the cognitive impairment screening test [77, 78]. This proves that the negative effect of functional limitation in cognitive dysfunction is adequate.

Effect of Depression on Cognitive Impairment

The present study demonstrated that depression is an important predictor of cognitive impairment. In the predictive model, depression was ranked first in importance. Furthermore, the negative effect harm of depression has been proven in multiple studies, including accelerating individual cognitive impairment [2931, 79, 80]. Depressive symptoms reflect the individual’s emotional state, and the 2-year predictive model demonstrated that life expectancy is also an important factor in the predictive model. Our results showed that life expectancy is one of the important factors affecting cognitive impairment. In fact, life expectancy refers to the possibility for middle-aged and elderly people to envision living until the expected age is reached, a higher life expectancy reflects a positive emotional state. Previous research found that positive emotions are more likely to protect against cognitive decline, whereas negative emotions are associated with a higher risk of mild cognitive impairment and dementia [81]. Therefore, relevant literature has suggested that dementia interventions based on established positive psychology principles could help elderly people cope with their diseases [82, 83]. The concept of improving cognitive function through psychosocial interventions is now also gaining acceptance [84].

Influence of Lifestyle and Behavior on Cognitive Function

In this study, night time sleep, as in lifestyle and behavior factor, played an important role in the horizontal data prediction model. Previous studies demonstrated that poor sleep quality was associated with poor cognitive function [85, 86]. As a solution, behavioral intervention to improve sleep could effectively improve the cognitive ability of patients with cognitive impairment [86], proving that night time sleep is important for the cognitive function of middle-aged and elderly people.

Conclusion

In conclusion, this study has proved the accuracy of ML using vertical and cross-sectional data. Importantly, this study added the measure of non-invasive marker blood. This survey has proved the utility of ML in long-term predictions of persistent cognitive impairment and identifications risk markers, supporting testing of new ML algorithms to predict disease progression. Furthermore, the availability of these variables suggests their potential use in screening for future cognitive impairment among the elderly in the community. The risk factors discovered could also be used by clinical staff to treat cognitive impairment and develop intervention programs. More importantly, the predictors of cognitive impairment found in this study through predictive models can be subjects of public education to reduce their effect on cognitive impairment in daily life. This is of great significance for preventing the deterioration of the quality of life of middle-aged and elderly people and promoting the healthy aging of society.

Limitation

There are some limitations in this study. First, this study only used the method of random forest in comparison with logistic regression. There is a lack of comparison with other methods, such as the support vector machine method. Although the advantages of random forest have been demonstrated in other studies, further comparison with other methods is needed. Second, although our model screened the important risk factors for cognitive impairment, the specific relationship between these risk factors and cognitive impairment, whether being positive or negative, remained unclear. Therefore, further research is required to elucidate the value of these variables in cognitive impairment. Third, the measure of cognitive impairment in this study was self-reported by division respondents rather than derived from clinical judgment. While this is a good way for the community to implement measurements, there is a risk of pseudocognitive impairment. Therefore, future research should incorporate clinical reporting to refine the findings.

Ethics Statement

The studies involving human participants were reviewed and approved by the Peking University Biomedical Ethics Committee (IRB00001052-11015). The patients/participants provided their written informed consent to participate in this study.

Author Contributions

HhL was responsible for the collection of data. XZ conducted the statistical analysis. HhL, HnL, XZ, and SC interpreted the data. HhL and HnL wrote the manuscript and manuscript preparation. HnL was responsible for data management. Overall, all authors provided suggestions during the preparation of the manuscript and approved the final version submitted for publication.

Funding

This work was funded by the Fundamental Research Funds for Chengde Medical University (KY202113), the Humanities and Social Science Research Project of Hebei Education Department (SQ2023137), the Hebei Natural Science Foundation (C2022406010), the Open Project of Hebei Key Laboratory of Nerve Injury and Repair (NJKF202105) and University-Level Scientific Research Project in CDMC (202307).

Conflict of Interest

The authors declare that they do not have any conflicts of interest.

Acknowledgments

We thank all participants in the China Health and Retirement Longitudinal Study. We thank the CHARLS research team for providing the data. We acknowledge the contribution of Technology Innovation Guidance Project Science and Technology Work Conference of Hebei Provincial Department of Science and Technology supported this research.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.ssph-journal.org/articles/10.3389/ijph.2023.1605322/full#supplementary-material

Supplementary Figure S1 | Flowchart of data preprocessing and random forest data analysis experiment (China, 2022).

Supplementary Figure S2 | Receiver operating characteristic curve of 2-year (2013). The area under the receiver operating characteristics curve (AUC) is 0.81 (China, 2022).

Supplementary Figure S3 | Receiver operating characteristic curve of 4-year (2015). The area under the receiver operating characteristics curve (AUC) is 0.79 (China, 2022).

Supplementary Figure S4 | Receiver operating characteristic curve of cross-sectional verification data (2015). The area under the receiver operating characteristics curve (AUC) is 0.80 (China, 2022).

Abbreviations

ADL, activities of daily living; CHARLS, China health and retirement longitudinal study; IADL, instrumental activities of daily living; ML, machine learning; TICS, telephone interview of cognitive status.

References

1. Prince, M, Guerchet, M, and Prina, M. Policy Brief for Heads of Government: The Global Impact of Dementia 2013-2050. London: Alzheimer's Disease International (2013). p. 14.

Google Scholar

2. Prince, MJ, Wimo, A, Guerchet, MM, Ali, GC, Wu, YT, and Prina, M. World Alzheimer Report 2015 - the Global Impact of Dementia. London: Alzheimer's Disease International (2015). p. 89.

Google Scholar

3. Patnode, CD, Perdue, LA, Rossom, RC, Rushkin, MC, Redmond, N, Thomas, RG, et al. U.S. Preventive Services Task Force Evidence Syntheses, Formerly Systematic Evidence Reviews. Rockville, MD, USA: Agency for Healthcare Research and Quality (2020).

Google Scholar

4. Espeland, MA, Rapp, SR, Katula, JA, Andrews, LA, Felton, D, Gaussoin, SA, et al. Telephone Interview for Cognitive Status (TICS) Screening for Clinical Trials of Physical Activity and Cognitive Training: The Seniors Health and Activity Research Program Pilot (SHARP-P) Study. Int J Geriatr Psychiatry (2011) 26(2):135–43. doi:10.1002/gps.2503

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Livingston, G, Huntley, J, Sommerlad, A, Ames, D, Ballard, C, Banerjee, S, et al. Dementia Prevention, Intervention, and Care: 2020 Report of the Lancet Commission. The Lancet (2020) 396(10248):413–46. doi:10.1016/s0140-6736(20)30367-6

CrossRef Full Text | Google Scholar

6. Walsh, CG, Ribeiro, JD, and Franklin, JC. Predicting Suicide Attempts in Adolescents with Longitudinal Clinical Data and Machine Learning. J Child Psychol Psychiatry (2018) 59(12):1261–70. doi:10.1111/jcpp.12916

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Burke, TA, Ammerman, BA, and Jacobucci, R. The Use of Machine Learning in the Study of Suicidal and Non-suicidal Self-Injurious Thoughts and Behaviors: A Systematic Review. J Affect Disord (2019) 245:869–84. doi:10.1016/j.jad.2018.11.073

PubMed Abstract | CrossRef Full Text | Google Scholar

8. van Eeden, WA, Luo, C, van Hemert, AM, Carlier, IVE, Penninx, BW, Wardenaar, KJ, et al. Predicting the 9-year Course of Mood and Anxiety Disorders with Automated Machine Learning: A Comparison between Auto-Sklearn, Naïve Bayes Classifier, and Traditional Logistic Regression. Psychiatry Res (2021) 299:113823. doi:10.1016/j.psychres.2021.113823

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Koppe, G, Meyer-Lindenberg, A, and Durstewitz, D. Deep Learning for Small and Big Data in Psychiatry. Neuropsychopharmacology (2021) 46(1):176–90. doi:10.1038/s41386-020-0767-z

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Futoma, J, Morris, J, and Lucas, J. A Comparison of Models for Predicting Early Hospital Readmissions. J Biomed Inform (2015) 56:229–38. doi:10.1016/j.jbi.2015.05.016

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Kessler, RC, van Loo, HM, Wardenaar, KJ, Bossarte, RM, Brenner, LA, Cai, T, et al. Testing a Machine-Learning Algorithm to Predict the Persistence and Severity of Major Depressive Disorder from Baseline Self-Reports. Mol Psychiatry (2016) 21(10):1366–71. doi:10.1038/mp.2015.198

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Breiman, L. Random Forests. Machine Learn (2001) 45(1):5–32. doi:10.1023/a:1010933404324

CrossRef Full Text | Google Scholar

13. Fathima, AS, and Manimeglai, D. Analysis of Significant Factors for Dengue Infection Prognosis Using the Random forest Classifier. Int J Adv Comput Sci Appl (2015) 6(2):240–5.

Google Scholar

14. Lunetta, KL, Hayward, LB, Segal, J, and Van Eerdewegh, P. Screening Large-Scale Association Study Data: Exploiting Interactions Using Random Forests. BMC Genet (2004) 5:32. doi:10.1186/1471-2156-5-32

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Kuroki, Y. Risk Factors for Suicidal Behaviors Among Filipino Americans: A Data Mining Approach. Am J Orthopsychiatry (2015) 85(1):34–42. doi:10.1037/ort0000018

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Byeon, H. A Prediction Model for Mild Cognitive Impairment Using Random Forests. J Int J Adv Comput Sci App (2015) 6(12):8–12. doi:10.14569/ijacsa.2015.061202

CrossRef Full Text | Google Scholar

17. Velazquez, M, and Lee, YAlzheimer’s Disease Neuroimaging Initiative. Random forest Model for Feature-Based Alzheimer's Disease Conversion Prediction from Early Mild Cognitive Impairment Subjects. PLoS One (2021) 16(4):e0244773. doi:10.1371/journal.pone.0244773

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Anwar, SM, Majid, M, Qayyum, A, Awais, M, Alnowami, M, and Khan, MK. Medical Image Analysis Using Convolutional Neural Networks: A Review. J Med Syst (2018) 42(11):226. doi:10.1007/s10916-018-1088-1

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Lebedev, AV, Westman, E, Van Westen, GJ, Kramberger, MG, Lundervold, A, Aarsland, D, et al. Random forest Ensembles for Detection and Prediction of Alzheimer's Disease with a Good Between-Cohort Robustness. Neuroimage Clin (2014) 6:115–25. doi:10.1016/j.nicl.2014.08.023

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Li, H, Liu, Y, Gong, P, Zhang, C, and Ye, JAlzheimers Disease Neuroimaging Initiative. Hierarchical Interactions Model for Predicting Mild Cognitive Impairment (MCI) to Alzheimer's Disease (AD) Conversion. PLoS One (2014) 9(1):e82450. doi:10.1371/journal.pone.0082450

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Gomez-Ramirez, J, Avila-Villanueva, M, and Fernandez-Blazquez, MA. Selecting the Most Important Self-Assessed Features for Predicting Conversion to Mild Cognitive Impairment with Random forest and Permutation-Based Methods. Sci Rep (2020) 10(1):20630. doi:10.1038/s41598-020-77296-4

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Na, KS. Prediction of Future Cognitive Impairment Among the Community Elderly: A Machine-Learning Based Approach. Sci Rep (2019) 9(1):3335. doi:10.1038/s41598-019-39478-7

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Abd-El Mohsen, SA, Algameel, MM, Hawash, M, Abd Elrahman, S, and Wafik, W. Predicting Cognitive Impairment Among Geriatric Patients at Asir central Hospital, saudi arabia. Saudi J Biol Sci (2021) 28(10):5781–5. doi:10.1016/j.sjbs.2021.06.023

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Costanza, A, Xekardaki, A, Kovari, E, Gold, G, Bouras, C, and Giannakopoulos, P. Microvascular burden and Alzheimer-type Lesions across the Age Spectrum. J Alzheimers Dis (2012) 32(3):643–52. doi:10.3233/JAD-2012-120835

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Perez, A, Manning, KJ, Powell, W, and Barry, LC. Cognitive Impairment in Older Incarcerated Males: Education and Race Considerations. Am J Geriatr Psychiatry (2021) 29(10):1062–73. doi:10.1016/j.jagp.2021.05.014

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Ren, Z, Li, Y, Li, X, Shi, H, Zhao, H, He, M, et al. Associations of Body Mass index, Waist Circumference and Waist-To-Height Ratio with Cognitive Impairment Among Chinese Older Adults: Based on the Clhls. J Affect Disord (2021) 295:463–70. doi:10.1016/j.jad.2021.08.093

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Chen, L. Leisure Activities and Psychological Wellbeing Reduce the Risk of Cognitive Impairment Among Older Adults with Hearing Difficulty: A Longitudinal Study in china. Maturitas (2021) 148:7–13. doi:10.1016/j.maturitas.2021.03.011

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Katayama, O, Lee, S, Bae, S, Makino, K, Shinkai, Y, Chiba, I, et al. Relationship between Instrumental Activities of Daily Living Performance and Incidence of Mild Cognitive Impairment Among Older Adults: A 48-month Follow-Up Study. Arch Gerontol Geriatr (2021) 94:104034. doi:10.1016/j.archger.2020.104034

CrossRef Full Text | Google Scholar

29. Fang, F, Gao, Y, Schulz, PE, Selvaraj, S, and Zhang, Y. Brain Controllability Distinctiveness between Depression and Cognitive Impairment. J Affect Disord (2021) 294:847–56. doi:10.1016/j.jad.2021.07.106

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Costanza, A, Amerio, A, Aguglia, A, Escelsior, A, Serafini, G, Berardelli, I, et al. When Sick Brain and Hopelessness Meet: Some Aspects of Suicidality in the Neurological Patient. CNS Neurol Disord Drug Targets (2020) 19(4):257–63. doi:10.2174/1871527319666200611130804

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Costanza, A, Baertschi, M, Weber, K, Canuto, A, and Canuto, A. Neurological Diseases and Suicide: From Neurobiology to Hopelessness. Rev Med Suisse (2015) 11(461):402–5.

PubMed Abstract | Google Scholar

32. Lu, H, Liu, J, Gu, G, Li, X, Yin, S, and Cui, D. Nonlinear Phase Synchronization Analysis of Eeg Signals in Amnesic Mild Cognitive Impairment with Type 2 Diabetes Mellitus. Neuroscience (2021) 472:25–34. doi:10.1016/j.neuroscience.2021.07.022

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Zhao, Y, Hu, Y, Smith, JP, Strauss, J, and Yang, G. Cohort Profile: The china Health and Retirement Longitudinal Study (Charls). Int J Epidemiol (2014) 43(1):61–8. doi:10.1093/ije/dys203

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Chen, C, Park, J, Wu, C, Xue, Q, Agogo, G, Han, L, et al. Cognitive Frailty in Relation to Adverse Health Outcomes Independent of Multimorbidity: Results from the china Health and Retirement Longitudinal Study. Aging (2020) 12(22):23129–45. doi:10.18632/aging.104078

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Pangman, VC, Sloan, J, and Guse, L. An Examination of Psychometric Properties of the Mini-Mental State Examination and the Standardized Mini-Mental State Examination: Implications for Clinical Practice. Appl Nurs Res (2000) 13(4):209–13. doi:10.1053/apnr.2000.9231

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Troyer, AK. Activities of Daily Living (ADL). In: JS Kreutzer, J DeLuca, and B Caplan, editors. Encyclopedia of Clinical Neuropsychology. New York, NY: Springer New York (2011). p. 28–30.

CrossRef Full Text | Google Scholar

37. Crimmins, EM, and Seeman, T. Integrating Biology into Demographic Research on Health and Aging (With a Focus on the Macarthur Study of Successful Aging). Cells and Surveys: Should Biological Measures Be Included in Social Science Research? Washington, DC, USA: US National Academies Press (2001).

Google Scholar

38. Seeman, C. Supplement: Aging, Health, and Public Policy || Integrating Biology into the Study of Health Disparities. Popul Develop Rev (2004) 30:89–107.

Google Scholar

39. Williams, MW, Li, CY, and Hay, CC. Validation of the 10-item center for Epidemiologic Studies Depression Scale post Stroke. J Stroke Cerebrovasc Dis (2020) 29(12):105334. doi:10.1016/j.jstrokecerebrovasdis.2020.105334

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Jak, AJ, Bondi, MW, Delano-Wood, L, Wierenga, C, Corey-Bloom, J, Salmon, DP, et al. Quantification of Five Neuropsychological Approaches to Defining Mild Cognitive Impairment. Am J Geriatr Psychiatry (2009) 17(5):368–75. doi:10.1097/JGP.0b013e31819431d5

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Šimundić, A-M. Measures of Diagnostic Accuracy: Basic Definitions. EJIFCC (2009) 19(4):203–11.

PubMed Abstract | Google Scholar

42. Haynos, AF, Wang, SB, Lipson, S, Peterson, CB, Mitchell, JE, Halmi, KA, et al. Machine Learning Enhances Prediction of Illness Course: A Longitudinal Study in Eating Disorders. Psychol Med (2020) 51:1392–402. doi:10.1017/S0033291720000227

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Nagaraj, S, and Duong, TQ. Deep Learning and Risk Score Classification of Mild Cognitive Impairment and Alzheimer’s Disease. J Alzheimer's Dis (2021) 80(3):1079–90. doi:10.3233/jad-201438

CrossRef Full Text | Google Scholar

44. An, C, Lim, H, Kim, DW, Chang, JH, Choi, YJ, and Kim, SW. Machine Learning Prediction for Mortality of Patients Diagnosed with Covid-19: A Nationwide Korean Cohort Study. Sci Rep (2020) 10(1):18716. doi:10.1038/s41598-020-75767-2

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Maroco, J, Silva, D, Rodrigues, A, Guerreiro, M, Santana, I, and de Mendonça, A. Data Mining Methods in the Prediction of Dementia: A Real-Data Comparison of the Accuracy, Sensitivity and Specificity of Linear Discriminant Analysis, Logistic Regression, Neural Networks, Support Vector Machines, Classification Trees and Random Forests. BMC Res Notes (2011) 4(1):299. doi:10.1186/1756-0500-4-299

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Couronne, R, Probst, P, and Boulesteix, AL. Random forest versus Logistic Regression: A Large-Scale Benchmark experiment. BMC Bioinformatics (2018) 19(1):270. doi:10.1186/s12859-018-2264-5

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Yang, L, Jin, X, Yan, J, Jin, Y, Xu, S, Xu, Y, et al. Comparison of Prevalence and Associated Risk Factors of Cognitive Function Status Among Elderly between Nursing Homes and Common Communities of china: A Strobe-Compliant Observational Study. Medicine (2019) 98(49):e18248. doi:10.1097/MD.0000000000018248

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Sattler, C, Toro, P, Schonknecht, P, and Schroder, J. Cognitive Activity, Education and Socioeconomic Status as Preventive Factors for Mild Cognitive Impairment and Alzheimer's Disease. Psychiatry Res (2012) 196(1):90–5. doi:10.1016/j.psychres.2011.11.012

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Vanoh, D, Shahar, S, Razali, R, Manaf, ZA, and Hamid, TA. Influence of Gender Disparity in Predicting Occurrence of Successful Aging, Usual Aging and Mild Cognitive Impairment. J Int J Gerontol (2019) 13(3):207–11. doi:10.6890/IJGE.201909_13(3).0005

CrossRef Full Text | Google Scholar

50. Farfel, JM, Nitrini, R, Suemoto, CK, Grinberg, LT, Ferretti, RE, Leite, RE, et al. Very Low Levels of Education and Cognitive reserve: A Clinicopathologic Study. Neurology (2013) 81(7):650–7. doi:10.1212/WNL.0b013e3182a08f1b

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Roe, CM, Xiong, C, Miller, JP, and Morris, JC. Education and Alzheimer Disease without Dementia: Support for the Cognitive reserve Hypothesis. Neurology (2007) 68(3):223–8. doi:10.1212/01.wnl.0000251303.50459.8a

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Getsios, D, Migliaccio-Walle, K, and Caro, JJ. Nice Cost-Effectiveness Appraisal of Cholinesterase Inhibitors. Pharmacoeconomics (2007) 25(12):997–1006. doi:10.2165/00019053-200725120-00003

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Jimenez-Balado, J, Maisterra, O, and Delgado, P. Non-invasive Markers of Vascular Disease: An Opportunity for Early Diagnosis of Cognitive Impairment. Atherosclerosis (2020) 312:101–3. doi:10.1016/j.atherosclerosis.2020.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Xue, L, Liu, Y, Xue, H, Xue, J, Sun, K, Wu, L, et al. Low Uric Acid Is a Risk Factor in Mild Cognitive Impairment. Neuropsychiatr Dis Treat (2017) 13:2363–7. doi:10.2147/NDT.S145812

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Johansona, CE, Stopa, EG, Daiello, L, De la Monte, SM, Keane, M, and Ott, BR. Disrupted Blood-Csf Barrier to Urea and Creatinine in Mild Cognitive Impairment and Alzheimer's Disease. J Alzheimer's Dis (2018) 08(2):435. doi:10.4172/2161-0460.1000435

CrossRef Full Text | Google Scholar

56. Gorelick, PB. Role of Inflammation in Cognitive Impairment: Results of Observational Epidemiological Studies and Clinical Trials. Ann N Y Acad Sci (2010) 1207:155–62. doi:10.1111/j.1749-6632.2010.05726.x

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Noble, JM, Manly, JJ, Schupf, N, Tang, MX, Mayeux, R, and Luchsinger, JA. Association of C-Reactive Protein with Cognitive Impairment. Arch Neurol (2010) 67(1):87–92. doi:10.1001/archneurol.2009.308

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Ooi, TC, Meramat, A, Rajab, NF, Shahar, S, Ismail, IS, Azam, AA, et al. Intermittent Fasting Enhanced the Cognitive Function in Older Adults with Mild Cognitive Impairment by Inducing Biochemical and Metabolic Changes: A 3-year Progressive Study. Nutrients (2020) 12(9):2644. doi:10.3390/nu12092644

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Laurin, D, David Curb, J, Masaki, KH, White, LR, and Launer, LJ. Midlife C-Reactive Protein and Risk of Cognitive Decline: A 31-year Follow-Up. Neurobiol Aging (2009) 30(11):1724–7. doi:10.1016/j.neurobiolaging.2008.01.008

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Ma, C, Yin, Z, Zhu, P, Luo, J, Shi, X, and Gao, X. Blood Cholesterol in Late-Life and Cognitive Decline: A Longitudinal Study of the Chinese Elderly. Mol Neurodegener (2017) 12(1):24. doi:10.1186/s13024-017-0167-y

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Parthasarathy, V, Frazier, DT, Bettcher, BM, Jastrzab, L, Chao, L, Reed, B, et al. Triglycerides Are Negatively Correlated with Cognitive Function in Nondemented Aging Adults. Neuropsychology (2017) 31(6):682–8. doi:10.1037/neu0000335

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Avgerinos, KI, Egan, JM, Mattson, MP, and Kapogiannis, D. Medium Chain Triglycerides Induce Mild Ketosis and May Improve Cognition in Alzheimer's Disease. A Systematic Review and Meta-Analysis of Human Studies. Ageing Res Rev (2020) 58:101001. doi:10.1016/j.arr.2019.101001

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Dimache, AM, Salaru, DL, Sascau, R, and Statescu, C. The Role of High Triglycerides Level in Predicting Cognitive Impairment: A Review of Current Evidence. Nutrients (2021) 13(6):2118. doi:10.3390/nu13062118

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Pondugula, SR, Majrashi, M, Almaghrabi, M, Abbott, K, Govindarajulu, M, Ramesh, S, et al. Predictable Hematological Markers Associated with Cognitive Decline in Valid Rodent Models of Cognitive Impairment. Toxicol Mech Methods (2020) 30(6):454–61. doi:10.1080/15376516.2020.1760984

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Noh, HM, Han, J, Kim, YJ, Jung, JH, Roh, YK, and Song, HJ. Sex Differences in the Relationship between Cognitive Impairment and Overweight or Obesity in Late Life: A 3-year Prospective Study. Medicine (2019) 98(9):e14736. doi:10.1097/MD.0000000000014736

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Xu, WL, Atti, AR, Gatz, M, Pedersen, NL, Johansson, B, and Fratiglioni, L. Midlife Overweight and Obesity Increase Late-Life Dementia Risk: A Population-Based Twin Study. Neurology (2011) 76(18):1568–74. doi:10.1212/WNL.0b013e3182190d09

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Aschwanden, D, Aichele, S, Ghisletta, P, Terracciano, A, Kliegel, M, Sutin, AR, et al. Predicting Cognitive Impairment and Dementia: A Machine Learning Approach. J Alzheimers Dis (2020) 75(3):717–28. doi:10.3233/JAD-190967

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Ravaglia, G, Forti, P, Maioli, F, Martelli, M, Servadei, L, Brunetti, N, et al. Conversion of Mild Cognitive Impairment to Dementia: Predictive Role of Mild Cognitive Impairment Subtypes and Vascular Risk Factors. Dement Geriatr Cogn Disord (2006) 21(1):51–8. doi:10.1159/000089515

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Xia, C, Vonder, M, Sidorenkov, G, Oudkerk, M, de Groot, JC, van der Harst, P, et al. The Relationship of Coronary Artery Calcium and Clinical Coronary Artery Disease with Cognitive Function: A Systematic Review and Meta-Analysis. J Atheroscler Thromb (2020) 27(9):934–58. doi:10.5551/jat.52928

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Streit, S, Poortvliet, RKE, Elzen, W, Blom, JW, and Gussekloo, J. Systolic Blood Pressure and Cognitive Decline in Older Adults with Hypertension. Ann Fam Med (2019) 17(2):100–7. doi:10.1370/afm.2367

PubMed Abstract | CrossRef Full Text | Google Scholar

71. de Montgolfier, O, Pouliot, P, Gillis, MA, Ferland, G, Lesage, F, Thorin-Trescases, N, et al. Systolic Hypertension-Induced Neurovascular Unit Disruption Magnifies Vascular Cognitive Impairment in Middle-Age Atherosclerotic ldlr(-/-):Hapob(+/+) Mice. Geroscience (2019) 41(5):511–32. doi:10.1007/s11357-019-00070-6

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Elias, MF, Robbins, MA, Budge, MM, Abhayaratna, WP, Dore, GA, and Elias, PK. Arterial Pulse Wave Velocity and Cognition with Advancing Age. Hypertension (2009) 53(4):668–73. doi:10.1161/HYPERTENSIONAHA.108.126342

PubMed Abstract | CrossRef Full Text | Google Scholar

73. Alvarez-Bueno, C, Cunha, PG, Martinez-Vizcaino, V, Pozuelo-Carrascosa, DP, Visier-Alfonso, ME, Jimenez-Lopez, E, et al. Arterial Stiffness and Cognition Among Adults: A Systematic Review and Meta-Analysis of Observational and Longitudinal Studies. J Am Heart Assoc (2020) 9(5):e014621. doi:10.1161/JAHA.119.014621

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Duggan, EC, Graham, RB, Piccinin, AM, Jenkins, ND, Clouston, S, Muniz-Terrera, G, et al. Systematic Review of Pulmonary Function and Cognition in Aging. J Gerontol B Psychol Sci Soc Sci (2020) 75(5):937–52. doi:10.1093/geronb/gby128

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Lee, HH, Hong, CT, Wu, D, Chi, WC, Yen, CF, Liao, HF, et al. Association between Ambulatory Status and Functional Disability in Elderly People with Dementia. Int J Environ Res Public Health (2019) 16(12):2168. doi:10.3390/ijerph16122168

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Farias, ST, Lau, K, Harvey, D, Denny, KG, Barba, C, and Mefford, AN. Early Functional Limitations in Cognitively normal Older Adults Predict Diagnostic Conversion to Mild Cognitive Impairment. J Am Geriatr Soc (2017) 65(6):1152–8. doi:10.1111/jgs.14835

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Roebuck-Spencer, TM, Glen, T, Puente, AE, Denney, RL, Ruff, RM, Hostetter, G, et al. Cognitive Screening Tests versus Comprehensive Neuropsychological Test Batteries: A National Academy of Neuropsychology Education Paper†. Arch Clin Neuropsychol (2017) 32(4):491–8. doi:10.1093/arclin/acx021

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Md Fadzil, NH, Shahar, S, Rajikan, R, Singh, DKA, Mat Ludin, AF, Subramaniam, P, et al. A Scoping Review for Usage of Telerehabilitation Among Older Adults with Mild Cognitive Impairment or Cognitive Frailty. Int J Environ Res Public Health (2022) 19(7):4000. doi:10.3390/ijerph19074000

PubMed Abstract | CrossRef Full Text | Google Scholar

79. Liew, TM. Depression, Subjective Cognitive Decline, and the Risk of Neurocognitive Disorders. Alzheimers Res Ther (2019) 11(1):70. doi:10.1186/s13195-019-0527-7

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Zlatar, ZZ, Muniz, MC, Espinoza, SG, Gratianne, R, Gollan, TH, Galasko, D, et al. Subjective Cognitive Decline, Objective Cognition, and Depression in Older Hispanics Screened for Memory Impairment. J Alzheimers Dis (2018) 63(3):949–56. doi:10.3233/jad-170865

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Dos Santos, SB, Rocha, GP, Fernandez, LL, de Padua, AC, and Reppold, CT. Association of Lower Spiritual Well-Being, Social Support, Self-Esteem, Subjective Well-Being, Optimism and hope Scores with Mild Cognitive Impairment and Mild Dementia. Front Psychol (2018) 9:371. doi:10.3389/fpsyg.2018.00371

PubMed Abstract | CrossRef Full Text | Google Scholar

82. Korthauer, LE, Goveas, J, Espeland, MA, Shumaker, SA, Garcia, KR, Tindle, H, et al. Negative Affect Is Associated with Higher Risk of Incident Cognitive Impairment in Nondepressed Postmenopausal Women. J Gerontol A Biol Sci Med Sci (2018) 73(4):506–12. doi:10.1093/gerona/glx175

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Adam, D, Ramli, A, and Shahar, S. Effectiveness of a Combined Dance and Relaxation Intervention on Reducing Anxiety and Depression and Improving Quality of Life Among the Cognitively Impaired Elderly. Sultan Qaboos Univ Med J (2016) 16(1):e47–53. doi:10.18295/squmj.2016.16.01.009

PubMed Abstract | CrossRef Full Text | Google Scholar

84. Mohd Safien, A, Ibrahim, N, Subramaniam, P, Shahar, S, Din, NC, Ismail, A, et al. Randomized Controlled Trials of a Psychosocial Intervention for Improving the Cognitive Function Among Older Adults: A Scoping Review. Gerontol Geriatr Med (2021) 7:23337214211025167. doi:10.1177/23337214211025167

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Dzierzewski, JM, Dautovich, N, and Ravyts, S. Sleep and Cognition in Older Adults. Sleep Med Clin (2018) 13(1):93–106. doi:10.1016/j.jsmc.2017.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Bubu, OM, Andrade, AG, Umasabor-Bubu, OQ, Hogan, MM, Turner, AD, de Leon, MJ, et al. Obstructive Sleep Apnea, Cognition and Alzheimer's Disease: A Systematic Review Integrating Three Decades of Multidisciplinary Research. Sleep Med Rev (2020) 50:101250. doi:10.1016/j.smrv.2019.101250

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: longitudinal study, machine learning, random forest, middle-aged and older Chinese, cognitive impairment, dementia

Citation: Liu H, Zhang X, Liu H and Chong ST (2023) Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study. Int J Public Health 68:1605322. doi: 10.3389/ijph.2023.1605322

Received: 15 August 2022; Accepted: 09 January 2023;
Published: 19 January 2023.

Edited by:

Gabriel Gulis, University of Southern Denmark, Denmark

Reviewed by:

Alessandra Costanza, University of Geneva, Switzerland

Copyright © 2023 Liu, Zhang, Liu and Chong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Haining Liu, liuhn0401@sina.com; Sheau Tsuey Chong, stchong@ukm.edu.my

This Original Article is part of the IJPH Special Issue “Public Health and Primary Care, is 1+1=1?”

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.