Synchronization of Machine Learning into Electronic Health Records

Meet N. Gandhi; Eshan Vatsa; Nitin S. Choubey

Call for Paper

October Edition

IJCA solicits high quality original research papers for the upcoming October edition of the journal. The last date of research paper submission is 22 September 2025

Submit your paper

Know more

The week's pick

Real-Time Video Transmission using Gaussian Minimum Shift Keying (GMSK) on GNU Radio and USRP for Radiation Monitoring Applications in Nuclear Reactors

Nabiha Ben Abid Abdalla M. Khattab Hani A.M. Harb Chokri Souani

Random Articles

Reseach Article

Synchronization of Machine Learning into Electronic Health Records

by Meet N. Gandhi, Eshan Vatsa, Nitin S. Choubey

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 177 - Number 26

Year of Publication: 2019

Authors: Meet N. Gandhi, Eshan Vatsa, Nitin S. Choubey

10.5120/ijca2019919751

Meet N. Gandhi, Eshan Vatsa, Nitin S. Choubey . Synchronization of Machine Learning into Electronic Health Records. International Journal of Computer Applications. 177, 26 ( Dec 2019), 40-47. DOI=10.5120/ijca2019919751

@article{ 10.5120/ijca2019919751,

author = { Meet N. Gandhi, Eshan Vatsa, Nitin S. Choubey },

title = { Synchronization of Machine Learning into Electronic Health Records },

journal = { International Journal of Computer Applications },

issue_date = { Dec 2019 },

volume = { 177 },

number = { 26 },

month = { Dec },

year = { 2019 },

issn = { 0975-8887 },

pages = { 40-47 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume177/number26/31065-2019919751/ },

doi = { 10.5120/ijca2019919751 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:47:01.620344+05:30

%A Meet N. Gandhi

%A Eshan Vatsa

%A Nitin S. Choubey

%T Synchronization of Machine Learning into Electronic Health Records

%J International Journal of Computer Applications

%@ 0975-8887

%V 177

%N 26

%P 40-47

%D 2019

%I Foundation of Computer Science (FCS), NY, USA

Abstract

The introduction of EHR (Electronic Health Record), in the medical field has been under discussion for a while but due to a very low acceptance rate of this technology by physicians, it has proven to be a risky gamble in the successful implementation of EHR. EHR uses data accumulated on the subject’s health to determine tests required, health analysis and real-time records to help the physician provide more accurate and detailed analysis on the subject. Due to Health Information Technology for Economic and Clinical Health (HITECH) there has been an increase in the amount of data accumulation by EHR. The data has great potential because of the large archive of information across the globe, but due to the random collection of data, it has resulted in the development of an unstructured record which has resulted to difficulty in transactions [1]. Even though there has been a large collection of data around the globe, the major issue has been making use of this data in a logical manner for purposeful implementation. The intention behind this paper is how to proceed with the implementation of machine learning in EHR along with its steps in order to analyze the data [2, 3] so that one can understand the pattern generated by the data provided. There are several machine learning algorithms for the interpretation of this data, but not all data are compactable with all the algorithms, thus in this paper the method of data gathering to applying machine learning algorithms on the data is explained and various ways to perform different steps are also discussed in detail.

References

Ohio-Machado L. 2011. Realizing the full potential of electronic health records: the role of natural language processing. J. Am. Med. Inform. Assoc. 18, 539 (doi:10.1136/amiajnl-2011-000501) [PMCfree article] [PubMed]
BruijnBd, Cherry C, Kiritchenko S, Martin J, Zhu X. 2011. Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. J. Am. Med. Inform. Assoc. 18, 557–562. (doi:10.1136/amiajnl-2011-000150) [PMC free article] [PubMed]
Opportunities and obstacles for deep learning in biology and medicine_doi: 10.1098/rsif.2017.0387
HIPAA act is available here:https://www.hhs.gov/sites/default/files/ocr/privacy/hipaa/understanding/coveredentities/De-identification/hhs_deid_guidance.pdf)
Opportunities and challenges in developing risk prediction models with electronic health record data: a systematic reviewJ Am Med Inform Assoc. 2017 Jan; 24(1): 198–208.Published online 2016 May 17. doi: 10.1093/jamia/ocw042
Future of electronic health records: implications for decision support. Rothman B, Leonard JC,Vigoda MM Mt Sinai J Med. 2012 Nov-Dec; 79(6):757-68. [PubMed] [Ref list]
Prediction of coronary heart disease using risk factor categories. Wilson PW, D'Agostino RB, Levy D, Belanger AM, Silbershatz H, Kannel WB Circulation. 1998 May 12; 97(18):1837-47.[PubMed] [Ref list]
Hersh WR, Weiner MG, Embi PJ, et al. Caveats for the use of operational electronic health record data in comparative effectiveness research. Med Care. 2013;51(8 Suppl3):S30–S37. [PMC free article][PubMed]
Knowledge Acquisition for Electronic Health Records on clouddoi.org/10.1016/j.procs.2017.08.031
Automatic de-identification of textual documents in the electronic health record: a review of recent research doi:10.1186/1471-2288-10-70
Gardner J, Xiong L: HIDE: An Integrated System for Health Information De-identification. Proceedings of the 2008 21st IEEE International Symposium on Computer-Based Medical Systems 2008, 254-9.
Aramaki E, et al: Automatic Deidentification by using Sentence Features and Label Consistency. i2b2 Workshop on Challenges in Natural Language Processing for Clinical Data, Washington, DC 2006.
R. M. B. A. Beckwith, U. J. Balis, and F. Kuo. Development and evaluation of an open source software tool for deidentification of pathology reports. B
Beckwith BA, et al: Development and evaluation of an open source software tool for deidentification of pathology reports. BMC Med Inform DecisMak 2006, 12.
Morrison FP, et al: Repurposing the clinical record: can an existing natural language processing system de-identify clinical notes? J Am Med Inform Assoc 2009, 16(1):37-9
Friedlin FJ, McDonald CJ: A software tool for removing patient identifying information from clinical documents. J Am Med Inform Assoc 2008, 15(5):601-10.
Hadoop Development Available: https://metadesignsolutions.com/hadoop-development
Thomas SM, et al: A successful technique for removing names in pathology reports using an augmented search and replace method. Proc AMIA Symp 2002, 777-81.
Taira RK, Bui AA, Kangarloo H: Identification of patient name referenceswithin medical documents using semantic selectional restrictions. Proc AMIA Symp 2002, 757-61.
Using your electronic medical record for research: a primer for avoiding pitfalls https://doi.org/10.1093/fampra/cmp068
Uzuner O, et al: A de-identifier for medical discharge summaries. ArtifIntell Med. 2008, 42 (1): 13-35. 10.1016/j.artmed.2007.10.001.
Wimmer H, Powell LM. A comparison of open source tools for sentiment analysis. 2015;1–9. Available:http://fotiad.is/blog/sentiment-analysis-comparison/.
Jovic, A, Brkic K, Bogunovic N. An overview of free software tools for general data mining. Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2014 37th International Convention on.IEEE. 2014: 1112–1117.
Herschel G, Linden A, Kart L. Magic quadrant for advanced analytics platforms. Available:http://www.gartner.com/technology/reprints.do?id=1-2A881DN&ct=150219&st=sb.
Landset S, Khoshgoftaar TM, Richter AN, Hasanin T. A survey of open source tools for machine learning with big data in the Hadoop ecosystem. J Big Data [Internet]. Springer International Publishing; 2015;2(1):24. Available:http://www.journalofbigdata.com/content/2/1/24.
Fayyad, Piatetsky-Shapiro, Smyth Communications of the ACM,1996.
Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters. Commun ACM [Internet]. 2008;51(1):1–13. Available:http://www.usenix.org/events/osdi04/tech/full_papers/dean/dean_html/.
ApacheHadoop. Available:http://hadoop.apache.org/.
ApacheMahout. Available:http://mahout.apache.org/.
Zaharia M, Chowdhury M, Das T, Dave A. Fast and interactive analytics over Hadoop data with Spark. USENIX Login. 2012;37(4):45–51.
Landset S, Khoshgoftaar TM, Richter AN, Hasanin T. A survey of open source tools for machine learning with big data in the Hadoop ecosystem. J Big Data [Internet]. Springer International Publishing; 2015;2(1):24. Available:http://www.journalofbigdata.com/content/2/1/24.
https://www.hindawi.com/journals/jhe/2018/4302425/
https://onlinelibrary.wiley.com/doi/full/10.1111/acem.12876
Data Mining: Concepts and Techniques by Jiawei Han and MichelineKamber.
DatasetAvailable:https://www.kaggle.com/asaumya/healthcare-data
Big data mining using Apache Spark Available: https://insidebigdata.com/2014/10/27/data-science-101-mining-big-data-apache-spark/
MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat
A micropartitioning technique for massive data analysis using MapReduce S. Mohanapriya ; P. Natesan.https://www.icanotes.com/2019/04/16/a-history-of-ehr-through-the-years/

Index Terms

Computer Science

Information Sciences

Keywords

EHR machine learning data extraction data mining tools analysis of data Naïve Bayes classifier