Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (21)

Search Parameters:
Keywords = common data model (CDM)

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
20 pages, 2946 KiB  
Article
Machine-Learning Parsimonious Prediction Model for Diagnostic Screening of Severe Hematological Adverse Events in Cancer Patients Treated with PD-1/PD-L1 Inhibitors: Retrospective Observational Study by Using the Common Data Model
by Seok Jun Park, Seungwon Yang, Suhyun Lee, Sung Hwan Joo, Taemin Park, Dong Hyun Kim, Hyeonji Kim, Soyun Park, Jung-Tae Kim, Won Gun Kwack, Sung Wook Kang, Yun-Kyoung Song, Jae Myung Cha, Sang Youl Rhee and Eun Kyoung Chung
Diagnostics 2025, 15(2), 226; https://doi.org/10.3390/diagnostics15020226 - 20 Jan 2025
Viewed by 564
Abstract
Background/Objectives: Earlier detection of severe immune-related hematological adverse events (irHAEs) in cancer patients treated with a PD-1 or PD-L1 inhibitor is critical to improving treatment outcomes. The study aimed to develop a simple machine learning (ML) model for predicting irHAEs associated with [...] Read more.
Background/Objectives: Earlier detection of severe immune-related hematological adverse events (irHAEs) in cancer patients treated with a PD-1 or PD-L1 inhibitor is critical to improving treatment outcomes. The study aimed to develop a simple machine learning (ML) model for predicting irHAEs associated with PD-1/PD-L1 inhibitors. Methods: We utilized the Observational Medical Outcomes Partnership–Common Data Model based on electronic medical records from a tertiary (KHMC) and a secondary (KHNMC) hospital in South Korea. Severe irHAEs were defined as Grades 3–5 by the Common Terminology Criteria for Adverse Events (version 5.0). The predictive model was developed using the KHMC dataset, and then cross-validated against an independent cohort (KHNMC). The full ML models were then simplified by selecting critical features based on the feature importance values (FIVs). Results: Overall, 397 and 255 patients were included in the primary (KHMC) and cross-validation (KHNMC) cohort, respectively. Among the tested ML algorithms, random forest achieved the highest accuracy (area under the receiver operating characteristic curve [AUROC] 0.88 for both cohorts). Parsimonious models reduced to 50% FIVs of the full models showed comparable performance to the full models (AUROC 0.83–0.86, p > 0.05). The KHMC and KHNMC parsimonious models shared common predictive features including furosemide, oxygen gas, piperacillin/tazobactam, and acetylcysteine. Conclusions: Considering the simplicity and adequate predictive performance, our simplified ML models might be easily implemented in clinical practice with broad applicability. Our model might enhance early diagnostic screening of irHAEs induced by PD-1/PD-L1 inhibitors, contributing to minimizing the risk of severe irHAEs and improving the effectiveness of cancer immunotherapy. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

13 pages, 1447 KiB  
Article
Impact of Uric Acid Levels on Mortality and Cardiovascular Outcomes in Relation to Kidney Function
by Young-Eun Kwon, Shin-Young Ahn, Gang-Jee Ko, Young-Joo Kwon and Ji-Eun Kim
J. Clin. Med. 2025, 14(1), 20; https://doi.org/10.3390/jcm14010020 - 24 Dec 2024
Viewed by 588
Abstract
Background: Uric acid levels are linked to cardiovascular outcomes and mortality, especially in chronic kidney disease (CKD). However, their impact across varying kidney function remains unclear. Methods: We conducted a retrospective cohort study using the Observational Medical Outcomes Partnership Common Data [...] Read more.
Background: Uric acid levels are linked to cardiovascular outcomes and mortality, especially in chronic kidney disease (CKD). However, their impact across varying kidney function remains unclear. Methods: We conducted a retrospective cohort study using the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) database from a single center. Adult patients with at least one serum uric acid measurement between 2002 and 2021 were included and categorized by estimated glomerular filtration rate (eGFR): normal kidney function (≥90 mL/min/1.73 m2), mild dysfunction (60–89 mL/min/1.73 m2), moderate dysfunction (30–59 mL/min/1.73 m2), and advanced dysfunction (<30 mL/min/1.73 m2). The primary outcome was all-cause mortality with secondary outcomes being myocardial infarction (MI) and heart failure (HF). Results: A total of 242,793 participants were analyzed. Uric acid levels showed a U-shaped association with all-cause mortality in advanced kidney dysfunction, where both low (<3 mg/dL) and high (>10 mg/dL) levels increased mortality risk. In mild kidney dysfunction, lower uric acid levels were linked to better survival. HF risk increased linearly with higher uric acid, particularly in normal kidney function, while no significant association was found between uric acid and MI in any group. Conclusions: Uric acid levels are associated with mortality in a U-shaped pattern for advanced kidney dysfunction, while lower levels appear protective in mild dysfunction. These findings suggest the need for personalized uric acid management in CKD patients based on their kidney function. Full article
(This article belongs to the Section Nephrology & Urology)
Show Figures

Figure 1

15 pages, 3351 KiB  
Article
Trends in Antidiabetic Drug Use and Safety of Metformin in Diabetic Patients with Varying Degrees of Chronic Kidney Disease from 2010 to 2021 in Korea: Retrospective Cohort Study Using the Common Data Model
by Sung Hwan Joo, Seungwon Yang, Suhyun Lee, Seok Jun Park, Taemin Park, Sang Youl Rhee, Jae Myung Cha, Sandy Jeong Rhie, Hyeon Seok Hwang, Yang Gyun Kim and Eun Kyoung Chung
Pharmaceuticals 2024, 17(10), 1369; https://doi.org/10.3390/ph17101369 - 14 Oct 2024
Viewed by 1329
Abstract
Background/Objectives: This study aimed to investigate trends in antidiabetic drug use and assess the risk of metformin-associated lactic acidosis (MALA) in patients with chronic kidney disease (CKD). Methods: A retrospective observational analysis based on the common data model was conducted using electronic medical [...] Read more.
Background/Objectives: This study aimed to investigate trends in antidiabetic drug use and assess the risk of metformin-associated lactic acidosis (MALA) in patients with chronic kidney disease (CKD). Methods: A retrospective observational analysis based on the common data model was conducted using electronic medical records from 2010 to 2021. The patients included were aged ≥18, diagnosed with CKD and type 2 diabetes, and had received antidiabetic medications for ≥30 days. MALA was defined as pH ≤ 7.35 and arterial lactate ≥4 mmol/L. Results: A total of 8318 patients were included, with 6185 in CKD stages 1–2 and 2133 in stages 3a–5. Metformin monotherapy was the most prescribed regimen, except in stage 5 CKD. As CKD progressed, metformin use significantly declined; insulin and meglitinides were most frequently prescribed in end-stage renal disease. Over the study period, the use of SGLT2 inhibitors (13.3%) and DPP-4 inhibitors (24.5%) increased significantly, while sulfonylurea use decreased (p < 0.05). Metformin use remained stable in earlier CKD stages but significantly decreased in stage 3b or worse. The incidence rate (IR) of MALA was 1.22 per 1000 patient-years, with a significantly increased IR in stage 4 or worse CKD (p < 0.001). Conclusions: Metformin was the most prescribed antidiabetic drug in CKD patients in Korea with a low risk of MALA. Antidiabetic drug use patterns varied across CKD stages, with a notable decline in metformin use in advanced CKD and a rise in SGLT2 inhibitor prescriptions, underscoring the need for further optimized therapy. Full article
(This article belongs to the Section Pharmacology)
Show Figures

Figure 1

14 pages, 755 KiB  
Article
Transforming a Large-Scale Prostate Cancer Outcomes Dataset to the OMOP Common Data Model—Experiences from a Scientific Data Holder’s Perspective
by Nora Tabea Sibert, Johannes Soff, Sebastiano La Ferla, Maria Quaranta, Andreas Kremer and Christoph Kowalski
Cancers 2024, 16(11), 2069; https://doi.org/10.3390/cancers16112069 - 30 May 2024
Viewed by 1277
Abstract
To enhance international and joint research collaborations in prostate cancer research, data from different sources should use a common data model (CDM) that enables researchers to share their analysis scripts and merge results. The OMOP CDM maintained by OHDSI is such a data [...] Read more.
To enhance international and joint research collaborations in prostate cancer research, data from different sources should use a common data model (CDM) that enables researchers to share their analysis scripts and merge results. The OMOP CDM maintained by OHDSI is such a data model developed for a federated data analysis with partners from different institutions that want to jointly investigate research questions using clinical care data. The German Cancer Society as the scientific lead of the Prostate Cancer Outcomes (PCO) study gathers data from prostate cancer care including routine oncological care data and survey data (incl. patient-reported outcomes) and uses a common data specification (called OncoBox Research Prostate) for this purpose. To further enhance research collaborations outside the PCO study, the purpose of this article is to describe the process of transferring the PCO study data to the internationally well-established OMOP CDM. This process was carried out together with an IT company that specialised in supporting research institutions to transfer their data to OMOP CDM. Of n = 49,692 prostate cancer cases with 318 data fields each, n = 392 had to be excluded during the OMOPing process, and n = 247 of the data fields could be mapped to OMOP CDM. The resulting PostgreSQL database with OMOPed PCO study data is now ready to use within larger research collaborations such as the EU-funded EHDEN and OPTIMA consortium. Full article
Show Figures

Figure 1

14 pages, 1369 KiB  
Article
Symptoms and Conditions in Children and Adults up to 90 Days after SARS-CoV-2 Infection: A Retrospective Observational Study Utilizing the Common Data Model
by Minjung Han, Taehee Chang, Hae-ryoung Chun, Suyoung Jo, Yeongchang Jo, Dong Han Yu, Sooyoung Yoo and Sung-il Cho
J. Clin. Med. 2024, 13(10), 2911; https://doi.org/10.3390/jcm13102911 - 15 May 2024
Viewed by 1460
Abstract
Background/Objectives: There have been widespread reports of persistent symptoms in both children and adults after SARS-CoV-2 infection, giving rise to debates on whether it should be regarded as a separate clinical entity from other postviral syndromes. This study aimed to characterize the clinical [...] Read more.
Background/Objectives: There have been widespread reports of persistent symptoms in both children and adults after SARS-CoV-2 infection, giving rise to debates on whether it should be regarded as a separate clinical entity from other postviral syndromes. This study aimed to characterize the clinical presentation of post-acute symptoms and conditions in the Korean pediatric and adult populations. Methods: A retrospective analysis was performed using a national, population-based database, which was encoded using the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). We compared individuals diagnosed with SARS-CoV-2 to those diagnosed with influenza, focusing on the risk of developing prespecified symptoms and conditions commonly associated with the post-acute sequelae of COVID-19. Results: Propensity score matching yielded 1,656 adult and 343 pediatric SARS-CoV-2 and influenza pairs. Ninety days after diagnosis, no symptoms were found to have elevated risk in either adults or children when compared with influenza controls. Conversely, at 1 day after diagnosis, adults with SARS-CoV-2 exhibited a significantly higher risk of developing abnormal liver function tests, cardiorespiratory symptoms, constipation, cough, thrombophlebitis/thromboembolism, and pneumonia. In contrast, children diagnosed with SARS-CoV-2 did not show an increased risk for any symptoms during either acute or post-acute phases. Conclusions: In the acute phase after infection, SARS-CoV-2 is associated with an elevated risk of certain symptoms in adults. The risk of developing post-acute COVID-19 sequelae is not significantly different from that of having postviral symptoms in children in both the acute and post-acute phases, and in adults in the post-acute phase. These observations warrant further validation through studies, including the severity of initial illness, vaccination status, and variant types. Full article
(This article belongs to the Section Infectious Diseases)
Show Figures

Figure 1

17 pages, 1604 KiB  
Article
Explanatory Cognitive Diagnosis Models Incorporating Item Features
by Manqian Liao, Hong Jiao and Qiwei He
Viewed by 1986
Abstract
Item quality is crucial to psychometric analyses for cognitive diagnosis. In cognitive diagnosis models (CDMs), item quality is often quantified in terms of item parameters (e.g., guessing and slipping parameters). Calibrating the item parameters with only item response data, as a common practice, [...] Read more.
Item quality is crucial to psychometric analyses for cognitive diagnosis. In cognitive diagnosis models (CDMs), item quality is often quantified in terms of item parameters (e.g., guessing and slipping parameters). Calibrating the item parameters with only item response data, as a common practice, could result in challenges in identifying the cause of low-quality items (e.g., the correct answer is easy to be guessed) or devising an effective plan to improve the item quality. To resolve these challenges, we propose the item explanatory CDMs where the CDM item parameters are explained with item features such that item features can serve as an additional source of information for item parameters. The utility of the proposed models is demonstrated with the Trends in International Mathematics and Science Study (TIMSS)-released items and response data: around 20 item linguistic features were extracted from the item stem with natural language processing techniques, and the item feature engineering process is elaborated in the paper. The proposed models are used to examine the relationships between the guessing/slipping item parameters of the higher-order DINA model and eight of the item features. The findings from a follow-up simulation study are presented, which corroborate the validity of the inferences drawn from the empirical data analysis. Finally, future research directions are discussed. Full article
(This article belongs to the Topic Psychometric Methods: Theory and Practice)
Show Figures

Figure 1

15 pages, 1639 KiB  
Article
Mapping the Oncological Basis Dataset to the Standardized Vocabularies of a Common Data Model: A Feasibility Study
by Jasmin Carus, Leona Trübe, Philip Szczepanski, Sylvia Nürnberg, Hanna Hees, Stefan Bartels, Alice Nennecke, Frank Ückert and Christopher Gundler
Cancers 2023, 15(16), 4059; https://doi.org/10.3390/cancers15164059 - 11 Aug 2023
Cited by 2 | Viewed by 1944
Abstract
In their joint effort against cancer, all involved parties within the German healthcare system are obligated to report diagnostics, treatments, progression, and follow-up information for tumor patients to the respective cancer registries. Given the federal structure of Germany, the oncological basis dataset (oBDS) [...] Read more.
In their joint effort against cancer, all involved parties within the German healthcare system are obligated to report diagnostics, treatments, progression, and follow-up information for tumor patients to the respective cancer registries. Given the federal structure of Germany, the oncological basis dataset (oBDS) operates as the legally required national standard for oncological reporting. Unfortunately, the usage of various documentation software solutions leads to semantic and technical heterogeneity of the data, complicating the establishment of research networks and collective data analysis. Within this feasibility study, we evaluated the transferability of all oBDS characteristics to the standardized vocabularies, a metadata repository of the observational medical outcomes partnership (OMOP) common data model (CDM). A total of 17,844 oBDS expressions were mapped automatically or manually to standardized concepts of the OMOP CDM. In a second step, we converted real patient data retrieved from the Hamburg Cancer Registry to the new terminologies. Given our pipeline, we transformed 1773.373 cancer-related data elements to the OMOP CDM. The mapping of the oBDS to the standardized vocabularies of the OMOP CDM promotes the semantic interoperability of oncological data in Germany. Moreover, it allows the participation in network studies of the observational health data sciences and informatics under the usage of federated analysis beyond the level of individual countries. Full article
(This article belongs to the Special Issue The Use of Real World (RW) Data in Oncology)
Show Figures

Figure 1

10 pages, 2961 KiB  
Article
A Multicenter Cohort Study on the Association between Metformin Use and Hearing Loss in Patients with Type 2 Diabetes Mellitus Using a Common Data Model
by Minjin Kim, Dong Heun Park, Hangseok Choi, Insik Song, Kang Hyeon Lim, Hee Soo Yoon, Yoon Chan Rah and June Choi
J. Clin. Med. 2023, 12(9), 3145; https://doi.org/10.3390/jcm12093145 - 27 Apr 2023
Cited by 3 | Viewed by 3355
Abstract
We attempted to explore the association between metformin use and hearing loss in in a large-scale study. This retrospective multicenter cohort study assessed the data of patients with type 2 diabetes mellitus (DM) aged over 40 years using the Observational Health Data Science [...] Read more.
We attempted to explore the association between metformin use and hearing loss in in a large-scale study. This retrospective multicenter cohort study assessed the data of patients with type 2 diabetes mellitus (DM) aged over 40 years using the Observational Health Data Science and Informatics open-source software and the Common Data Model database from 1 January 2002 to 31 December 2019. Each participant was selected using the ICD-10-CM diagnosis code E11 for type 2 DM with sensorineural hearing loss. The participants were divided into metformin and non-metformin users. The outcome measure was the first occurrence of hearing loss after the diagnosis of DM as measured by the CDM cohort study. A total of 80,596 patients, including 46,152 metformin users and 34,444 non-metformin users from three hospitals were assessed. After calibration, we compared the risk of hearing loss using Kaplan–Meier curves, and found significant differences between the groups. The calibrated hazard ratio in the three hospitals (0.79 [95% confidence interval, 0.57–1.12]) was summarized. These findings suggest that the probability of hearing loss-free survival in the metformin user group is higher than that in the non-metformin user group. Full article
(This article belongs to the Section Otolaryngology)
Show Figures

Figure 1

9 pages, 1010 KiB  
Article
Risk Assessment of Postoperative Pneumonia in Cancer Patients Using a Common Data Model
by Yong Hoon Lee, Do-Hoon Kim, Jisun Kim and Jaetae Lee
Cancers 2022, 14(23), 5988; https://doi.org/10.3390/cancers14235988 - 4 Dec 2022
Cited by 3 | Viewed by 2116
Abstract
The incidence of postoperative pneumonia (POP) in patients with cancer is high, but its incidence following major cancer surgeries is unclear. Therefore, we investigated the incidence and risk factors of POP after surgery in patients with the five most common cancers in Korea [...] Read more.
The incidence of postoperative pneumonia (POP) in patients with cancer is high, but its incidence following major cancer surgeries is unclear. Therefore, we investigated the incidence and risk factors of POP after surgery in patients with the five most common cancers in Korea using a common data model (CDM). Patients aged >19 years who underwent gastric, colon, liver, lung, or breast cancer surgery between January 2011 and December 2020 were included, excluding patients who underwent chemotherapy or radiotherapy. Pneumonia was defined as a pneumonia diagnosis code in patients hospitalized postoperatively. Gastric, colon, lung, breast, and liver cancers were noted in 4004 (47.4%), 622 (7.4%), 2022 (24%), 958 (11.3%), and 839 (9.9%) of 8445 patients, respectively. The cumulative POP incidence was 3.1% (n = 262), with the highest incidence in lung cancer (n = 91, 4.5%), followed by gastric (n = 133, 3.3%), colon (n = 19, 3.1%), liver (n = 14, 1.7%), and breast (n = 5, 0.5%) cancers. In multivariable analysis, older age, male sex, history of chronic pulmonary disease, mood disorder, and cerebrovascular disease were POP predictors. The cumulative POP incidence in the five cancers using the CDM was approximately 3%. Older age, male sex, chronic pulmonary disease, mood disorder, and cerebrovascular disease were POP risk factors in patients with cancer. Full article
Show Figures

Figure 1

11 pages, 3106 KiB  
Review
OMOP CDM Can Facilitate Data-Driven Studies for Cancer Prediction: A Systematic Review
by Najia Ahmadi, Yuan Peng, Markus Wolfien, Michéle Zoch and Martin Sedlmayr
Int. J. Mol. Sci. 2022, 23(19), 11834; https://doi.org/10.3390/ijms231911834 - 5 Oct 2022
Cited by 26 | Viewed by 5349
Abstract
The current generation of sequencing technologies has led to significant advances in identifying novel disease-associated mutations and generated large amounts of data in a high-throughput manner. Such data in conjunction with clinical routine data are proven to be highly useful in deriving population-level [...] Read more.
The current generation of sequencing technologies has led to significant advances in identifying novel disease-associated mutations and generated large amounts of data in a high-throughput manner. Such data in conjunction with clinical routine data are proven to be highly useful in deriving population-level and patient-level predictions, especially in the field of cancer precision medicine. However, data harmonization across multiple national and international clinical sites is an essential step for the assessment of events and outcomes associated with patients, which is currently not adequately addressed. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) is an internationally established research data repository introduced by the Observational Health Data Science and Informatics (OHDSI) community to overcome this issue. To address the needs of cancer research, the genomic vocabulary extension was introduced in 2020 to support the standardization of subsequent data analysis. In this review, we evaluate the current potential of the OMOP CDM to be applicable in cancer prediction and how comprehensively the genomic vocabulary extension of the OMOP can serve current needs of AI-based predictions. For this, we systematically screened the literature for articles that use the OMOP CDM in predictive analyses in cancer and investigated the underlying predictive models/tools. Interestingly, we found 248 articles, of which most use the OMOP for harmonizing their data, but only 5 make use of predictive algorithms on OMOP-based data and fulfill our criteria. The studies present multicentric investigations, in which the OMOP played an essential role in discovering and optimizing machine learning (ML)-based models. Ultimately, the use of the OMOP CDM leads to standardized data-driven studies for multiple clinical sites and enables a more solid basis utilizing, e.g., ML models that can be reused and combined in early prediction, diagnosis, and improvement of personalized cancer care and biomarker discovery. Full article
(This article belongs to the Special Issue Artificial Intelligence in Biomarker Discovery)
Show Figures

Figure 1

19 pages, 5659 KiB  
Article
Secure Access Control Realization Based on Self-Sovereign Identity for Cloud CDM
by Yunhee Kang and Young B. Park
Appl. Sci. 2022, 12(19), 9833; https://doi.org/10.3390/app12199833 - 29 Sep 2022
Cited by 1 | Viewed by 1921
Abstract
Public healthcare has transformed from treatment to preventive care and disease management. The Common Data Model (CDM) provides a standard data structure defined to utilize hospitals’ data. Digital identity takes a significant role as the body of information about an individual used by [...] Read more.
Public healthcare has transformed from treatment to preventive care and disease management. The Common Data Model (CDM) provides a standard data structure defined to utilize hospitals’ data. Digital identity takes a significant role as the body of information about an individual used by computer systems to identify and establish trust among organizations. The CDM research network, composed of users handling medical information, has several digital identities associated with their activity. A high central authority cost can be reduced by Distributed Ledger Technology (DLT). It enables users to control their identities independently of a third party. To preserve the privacy of researchers in clinical studies, secure identification is the main concern of identifying the researcher and its agents. To do so, they should pose a legally verifiable credential in the cloud CDM. By presenting the proof represented by the capability that the user has, each identity has access control that is linked to an authentication credential that the cloud CDM can verify. Assurance in one’s identity is confirmed by asserting claims with the identity and its capability, providing its verifiable credential to the authentication entity in the cloud CDM. This paper describes the user-centric claim-based identity operation model based on use cases to handle researcher identity in the cloud CDM. In this model, credentials are designed as a capability and presented to them to access SPs in the cloud CDM. To provide well-controlled access control in the cloud CDM, we build and prototype a capability based CDM management system. Full article
(This article belongs to the Special Issue Complex IoT Applications and Blockchain)
Show Figures

Figure 1

12 pages, 1317 KiB  
Article
Decreasing Incidence of Gastric Cancer with Increasing Time after Helicobacter pylori Treatment: A Nationwide Population-Based Cohort Study
by Taewan Kim, Seung In Seo, Kyung Joo Lee, Chan Hyuk Park, Tae Jun Kim, Jinseob Kim and Woon Geon Shin
Antibiotics 2022, 11(8), 1052; https://doi.org/10.3390/antibiotics11081052 - 3 Aug 2022
Cited by 4 | Viewed by 2320
Abstract
Background: Treatment of Helicobacter pylori (HP) has been shown to reduce the risk of gastric cancer (GC) development. However, previous studies have focused on patients at high risk of GC. This study aimed to assess the effect of HP treatment on the incidence [...] Read more.
Background: Treatment of Helicobacter pylori (HP) has been shown to reduce the risk of gastric cancer (GC) development. However, previous studies have focused on patients at high risk of GC. This study aimed to assess the effect of HP treatment on the incidence of GC in the general population. Materials and Methods: Medical records were obtained from the Common Data Model-converted sample Cohort of the National Health Insurance Service of Korea (NHIS-CDM). The target cohort included those who had been prescribed HP treatment and the comparator cohort included those who had not. The association between HP treatment and the risk of GC development was assessed using the Cox proportional hazard model. The incidences of GC according to the period after HP treatment in different age groups were analyzed using proportional trend tests. Results: After large-scale 1:4 propensity score matching, 2735 and 5328 individuals were included in the target and comparator cohorts, respectively. During the median follow-up of 6.5 years, the GC incidence was lower in the HP treatment cohort than in the comparator cohort, but this was statistically insignificant (hazard ratio [HR]: 0.76; 95% confidence interval [CI]: 0.50–1.13; p-value = 0.19). This trend was also observed among the older age (≥65 years, HR: 0.87; 95% CI: 0.44–1.68; p-value = 0.69) and male cohorts (HR: 0.82; 95% CI: 0.51–1.27; p-value = 0.38). Among 58,684 individuals who were treated for HP from the whole NHIS-CDM cohort, the incidence of GC consistently decreased over time and showed a marked decrease with increasing age (p for trend < 0.05). Conclusions: In all age groups of the general population, HP treatment could be recommended to reduce the risk of GC. Full article
(This article belongs to the Section The Global Need for Effective Antibiotics)
Show Figures

Figure 1

25 pages, 3717 KiB  
Article
Dealing with Missing Responses in Cognitive Diagnostic Modeling
by Shenghai Dai and Dubravka Svetina Valdivia
Psych 2022, 4(2), 318-342; https://doi.org/10.3390/psych4020028 - 14 Jun 2022
Cited by 2 | Viewed by 3227
Abstract
Missing data are a common problem in educational assessment settings. In the implementation of cognitive diagnostic models (CDMs), the presence and/or inappropriate treatment of missingness may yield biased parameter estimates and diagnostic information. Using simulated data, this study evaluates ten approaches for handling [...] Read more.
Missing data are a common problem in educational assessment settings. In the implementation of cognitive diagnostic models (CDMs), the presence and/or inappropriate treatment of missingness may yield biased parameter estimates and diagnostic information. Using simulated data, this study evaluates ten approaches for handling missing data in a commonly applied CDM (the deterministic inputs, noisy “and” gate (DINA) model): treating missing data as incorrect (IN), person mean (PM) imputation, item mean (IM) imputation, two-way (TW) imputation, response function (RF) imputation, logistic regression (LR), expectation-maximization (EM) imputation, full information maximum likelihood (FIML) estimation, predictive mean matching (PMM), and random imputation (RI). Specifically, the current study investigates how the estimation accuracy of item parameters and examinees’ attribute profiles from DINA are impacted by the presence of missing data and the selection of missing data methods across conditions. While no single method was found to be superior to other methods across all conditions, the results suggest the use of FIML, PMM, LR, and EM in recovering item parameters. The selected methods, except for PM, performed similarly across conditions regarding attribute classification accuracy. Recommendations for the treatment of missing responses for CDMs are provided. Limitations and future directions are discussed. Full article
(This article belongs to the Special Issue Computational Aspects and Software in Psychometrics II)
Show Figures

Figure 1

15 pages, 10055 KiB  
Article
Mapping Cancer Registry Data to the Episode Domain of the Observational Medical Outcomes Partnership Model (OMOP)
by Jasmin Carus, Sylvia Nürnberg, Frank Ückert, Catarina Schlüter and Stefan Bartels
Appl. Sci. 2022, 12(8), 4010; https://doi.org/10.3390/app12084010 - 15 Apr 2022
Cited by 9 | Viewed by 3132
Abstract
A great challenge in the use of standardized cancer registry data is deriving reliable, evidence-based results from large amounts of data. A solution could be its mapping to a common data model such as OMOP, which represents knowledge in a unified semantic base, [...] Read more.
A great challenge in the use of standardized cancer registry data is deriving reliable, evidence-based results from large amounts of data. A solution could be its mapping to a common data model such as OMOP, which represents knowledge in a unified semantic base, enabling decentralized analysis. The recently released Episode Domain of the OMOP CDM allows episodic modelling of a patient’ disease and treatment phases. In this study, we mapped oncology registry data to the Episode Domain. A total of 184,718 Episodes could be implemented, with the Concept of Cancer Drug Treatment most frequently. Additionally, source data were mapped to new terminologies as part of the release. It was possible to map ≈ 73.8% of the source data to the respective OMOP standard. Best mapping was achieved in the Procedure Domain with 98.7%. To evaluate the implementation, the survival probabilities of the CDM and source system were calculated (n = 2756/2902, median OAS = 82.2/91.1 months, 95% Cl = 77.4–89.5/84.4–100.9). In conclusion, the new release of the CDM increased its applicability, especially in observational cancer research. Regarding the mapping, a higher score could be achieved if terminologies which are frequently used in Europe are included in the Standardized Vocabulary Metadata Repository. Full article
(This article belongs to the Special Issue Data Science for Medical Informatics)
Show Figures

Figure 1

13 pages, 3226 KiB  
Article
Establishment of the Optimal Common Data Model Environment for EMR Data Considering the Computing Resources of Medical Institutions
by Tong Min Kim, Taehoon Ko, Yoon-sik Yang, Sang Jun Park, In-Young Choi and Dong-Jin Chang
Appl. Sci. 2021, 11(24), 12056; https://doi.org/10.3390/app112412056 - 17 Dec 2021
Cited by 1 | Viewed by 2799
Abstract
Electronic medical record (EMR) data vary between institutions. These data should be converted into a common data model (CDM) for multi-institutional joint research. To build the CDM, it is essential to integrate the EMR data of each hospital and load it according to [...] Read more.
Electronic medical record (EMR) data vary between institutions. These data should be converted into a common data model (CDM) for multi-institutional joint research. To build the CDM, it is essential to integrate the EMR data of each hospital and load it according to the CDM model, considering the computing resources of each hospital. Accordingly, this study attempts to share experiences and recommend computing resource-allocation designs. Here, two types of servers were defined: combined and separated servers. In addition, three database (DB) setting types were selected: desktop application (DA), online transaction processing (OLTP), and data warehouse (DW). Scale, TPS, average latency, 90th percentile latency, and maximum latency were compared across various settings. Virtual memory (vmstat) and disk input/output (disk) statuses were also described. Transactions per second (TPS) decreased as the scale increased in all DB types; however, the average, 90th percentile and maximum latencies exhibited no tendency according to scale. When compared with the maximum number of clients (DA client = 5, OLTP clients = 20, DW clients = 10), the TPS, average latency, 90th percentile latency, and maximum latency values were highest in the order of OLTP, DW, and DA. In vmstat, the amount of memory used for the page cache field and free memory currently available for DA, OLTP, and DW were large compared to other fields. In the disk, DA, OLTP, and DW all recorded the largest value in the average size of write requests, followed by the largest number of write requests per second. In summary, this study presents recommendations for configuring CDM settings. The configuration must be tuned carefully, considering the hospital’s resources and environment, and the size of the database must consider concurrent client connections, architecture, and connections. Full article
(This article belongs to the Special Issue New Trends in Medical Informatics II)
Show Figures

Figure 1

Back to TopTop