REVIEW ARTICLE


Data Analysis and Mapping of Research Interest in Clinical Trials of Tuberculosis by Text Mining Platform of Artificial Intelligence using Open-source Tool Orange Canvas



Swayamprakash Patel1, *, Ashish Patel1, Umang Shah1, Mehul Patel1, Nilay Solanki1, Mruduka Patel2, Suchita Patel3
1 Ramanbhai Patel College of Pharmacy, Charotar University of Science and Technology (CHARUSAT), CHARUSAT Campus, Changa 388421, India;
2 Department of Clinical Research and Development, Meteroic Biopharmaceuticals, Ahmedabad, India;
3 Department of Information Technology, Institute of Science and Technology for Advanced Studies and Research, Vallabh Vidyanagar, India


Article Metrics

CrossRef Citations:
1
Total Statistics:

Full-Text HTML Views: 558
Abstract HTML Views: 124
PDF Downloads: 128
Total Views/Downloads: 902
Unique Statistics:

Full-Text HTML Views: 299
Abstract HTML Views: 95
PDF Downloads: 103
Total Views/Downloads: 568



Creative Commons License
Copyright: 2022 Bentham Science Publishers

Correspondence: Address correspondence to this author at the Ramanbhai Patel College of Pharmacy, Charotar University of Science and Technology (CHARUSAT), CHARUSAT Campus, Changa 388421, India; E-mail: swayamprakash.patel@gmail.com


Abstract

Background: Reading every clinical trial for any disease is tedious, as is determining the current progress, especially when the number of clinical trials is huge. The Text Mining Platform of Artificial Intelligence (AI) can help to simplify the task.

Methods: A large pool of tuberculosis clinical trials has been searched through the International Clinical Trial Registry Platform (ICTRP) and used as a textual dataset. The exported dataset of 1635 clinical studies, in a comma-separated format, is preprocessed for data analysis and text mining. Data preparation, corpus generation, text preprocessing, and finally, cluster analysis were carried out using the text-mining widget of the open-source machine learning tool. The hierarchical cluster analysis was used for mapping research interests in tuberculosis clinical trials.

Conclusion: The data mining of the exported dataset of tuberculosis clinical trials uncovered interesting facts in terms of numbers. Text mining presented a total of 41 hierarchical clusters that were further mapped in twenty-five (25) different research interests among tuberculosis clinical trials. A novel technique for the rapid and practical review of major clinical trials is demonstrated. As an open-source and GUI-based tool is used for work, any researcher with working knowledge of text mining may also use this technique for other clinical trials.

Keywords: Text mining, data analysis, hierarchical cluster analysis, tuberculosis, clinical trials, ICTRP, AI.