Clinical Trials Logo

Clinical Trial Summary

Using traditional machine learning classifiers, this study targets on comparing bag-of-words, word2cec and roberta on automated ICD coding related to cardiovascular diseases in Chinese corpus.


Clinical Trial Description

ICD coding is quite important as it serves as basis for a wide range of economic and academic applications. Currently, manual coding is mainly adopted, which faces several limits like being time-consuming and prone to error, and this makes automated ICD coding via machine learning a hot research topic. As an inevitable phase during machine learning, feature engineering plays a crucially important role in leading to promising coding performance. Although have reached enlightening conclusions, existing studies lacked comparison of different feature engineering methods. Finding out what methods under what circumstances perform better can be quite helpful in promoting practical applications of automated coding. The investigators will implement this study based on inpatient' data collected from electronic medical records from Fuwai Hospital, the world's largest medical center for cardiovascular disease. Bag-of-words, word2cec and roberta will be respectively used to extracted features from training data. Then code-wise logistic regression classifiers and support vector machine classifiers will be trained to auto-assign codes. Afterwards, performances of the models on test data will be evaluated. ;


Study Design


Related Conditions & MeSH terms


NCT number NCT04849195
Study type Observational
Source China National Center for Cardiovascular Diseases
Contact
Status Active, not recruiting
Phase
Start date March 1, 2021
Completion date April 2021

See also
  Status Clinical Trial Phase
Recruiting NCT05654272 - Development of CIRC Technologies
Recruiting NCT05650307 - CV Imaging of Metabolic Interventions
Recruiting NCT04515303 - Digital Intervention Participation in DASH
Completed NCT04056208 - Pistachios Blood Sugar Control, Heart and Gut Health Phase 2
Recruiting NCT04417387 - The Genetics and Vascular Health Check Study (GENVASC) Aims to Help Determine Whether Gathering Genetic Information Can Improve the Prediction of Risk of Coronary Artery Disease (CAD)
Not yet recruiting NCT06032572 - Evaluation of the Safety and Effectiveness of the VRS100 System in PCI (ESSENCE) N/A
Recruiting NCT04514445 - The BRAVE Study- The Identification of Genetic Variants Associated With Bicuspid Aortic Valve Using a Combination of Case-control and Family-based Approaches.
Enrolling by invitation NCT04253054 - Chinese Multi-provincial Cohort Study-Beijing Project
Completed NCT03273972 - INvestigating the Lowest Threshold of Vascular bENefits From LDL Lowering With a PCSK9 InhibiTor in healthY Volunteers N/A
Completed NCT03680638 - The Effect of Antioxidants on Skin Blood Flow During Local Heating Phase 1
Recruiting NCT04843891 - Evaluation of PET Probe [64]Cu-Macrin in Cardiovascular Disease, Cancer and Sarcoidosis. Phase 1
Completed NCT04083872 - Clinical Study to Investigate the Pharmacokinetic Profiles and Safety of Highdose CKD-385 in Healthy Volunteers(Fasting) Phase 1
Completed NCT04083846 - Clinical Study to Investigate the Pharmacokinetic Profiles and Safety of High-dose CKD-385 in Healthy Volunteers(Fed) Phase 1
Completed NCT03466333 - Postnatal Enalapril to Improve Cardiovascular fUnction Following Preterm Pre-eclampsia Phase 2
Completed NCT03693365 - Fluid Responsiveness Tested by the Effective Pulmonary Blood Flow During a Positive End-expiratory Trial
Completed NCT03619148 - The Incidence of Respiratory Symptoms Associated With the Use of HFNO N/A
Completed NCT04082585 - Total Health Improvement Program Research Project
Completed NCT05132998 - Impact of a Comprehensive Cardiac Rehabilitation Program Framework Among High Cardiovascular Risk Cancer Survivors N/A
Completed NCT05067114 - Solutions for Atrial Fibrillation Edvocacy (SAFE)
Completed NCT04098172 - Evaluate the Performance and Safety of Comet Pressure Guidewire in the Measurement of FFR N/A