Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques

Rupali Sunil Wagh

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Characterization of Angular Error in Magnetic Head Tracking

July

2013

Design and Implementation of a Wireless Gesture Controlled Robotic Arm with Vision

October

2013

A Survey on Security in Medical Image Communication

September

2011

Application Specific Cache Simulation Analysis for Application Specific Instructionset Processor

March

2014

Reseach Article

Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques

by Rupali Sunil Wagh

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 66 - Number 23

Year of Publication: 2013

Authors: Rupali Sunil Wagh

10.5120/11258-6501

Rupali Sunil Wagh . Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques. International Journal of Computer Applications. 66, 23 ( March 2013), 32-34. DOI=10.5120/11258-6501

@article{ 10.5120/11258-6501,

author = { Rupali Sunil Wagh },

title = { Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques },

journal = { International Journal of Computer Applications },

issue_date = { March 2013 },

volume = { 66 },

number = { 23 },

month = { March },

year = { 2013 },

issn = { 0975-8887 },

pages = { 32-34 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume66/number23/11258-6501/ },

doi = { 10.5120/11258-6501 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:23:14.183454+05:30

%A Rupali Sunil Wagh

%T Knowledge Discovery from Legal Documents Dataset using Text Mining Techniques

%J International Journal of Computer Applications

%@ 0975-8887

%V 66

%N 23

%P 32-34

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Last few decades have witnessed exponential increase in the use of IT which has resulted into large amount of data being generated, stored and searched. Data may be highly structured stored as records of a DBMS, or may be totally unstructured like blog posts or plain text documents. With the abundance of information being available as text documents, the issue of retrieval of knowledge from such unstructured dataset is posing new challenges to the research community. Legal document analysis is one domain which generates and uses text information in semi structured as well as unstructured form. The process of legal reasoning and decision making is heavily dependent on information stored in text documents. Text Mining (TM) is defined as the process of extracting useful information from text data. Legal text documents are stored using natural languages. For efficient analysis of such documents, text mining, a specialized branch of machine learning can be suitably used. Text mining – which "mines text", is heavily associated with natural language processing and Information Retrieval. TM techniques can be used for extracting relevant knowledge from stored legal documents. The extracted knowledge is used to simplify the preparation of case base, facilitate in decision making and legal reasoning or for automatic identification of legal arguments. Research in the fields of information extraction, natural language processing, artificial intelligence and expert system has augmented text mining process for enhancing the knowledge discovery process in this domain. This paper proposes a study which is aimed at grouping of legal documents based on the contents without taking any external input using unsupervised text mining techniques.

References

Kong Yanqing and Guoliang Shi Guoliang, Advances in Theories and Applications of Text Mining. The 1st International Conference on Information Science and Engineering (ICISE2009)
K. A Vidhya and Aghila G, "Text Mining Process, Techniques and Tools : an Overview", International Journal of Information Technology and Knowledge Management,July-December 2010, Volume 2, No. 2, pp. 613-622
Merkl Dieter and Schweighofer Erich "En Route to Data Mining in Legal Text Corpora: Clustering, Neural Computation, and International Treaties", 0-8186-8147-0/97 IEEE 1997
Cheng Tin Tin, Leonard Cua Jeffrey, Davies Tan Mark, Gerard Yao Kenneth and EditaRoxas Rachel. Information Extraction from Legal Documents, 2009 Eighth International Symposium on Natural Language Processing, 2009 IEEE
Joshi Sachindra, Deshpande, Prasad M and Hampp Thomas. Improving the Efficiency of Legal E-Discovery. 2011 Annual SRII Global Conference, DOI 10. 1109/SRII. 2011. 97
Ismael Hasan, Javier Parapar, Roi Blanco. Segmentation of legislative documents using a domain-specific lexicon. 19th International Conference on Database and Expert Systems Application, DOI 10. 1109/DEXA. 2008. 45
Palmirani Monica and BrighiRaffaella. Metadata for the Legal Domain. Proceedings of the 14th International Workshop on Database and Expert Systems Applications (DEXA'03)
Dozier Christopher and Jackson Peter, "Mining text for expert witnesses",2005 IEEE
Roitblat Herbert L. , Kershaw Anne and Oot Patrick. "Document Categorization in Legal Electronic Discovery: Computer Classification vs. Manual Review",JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 61(1):1–11, 2010
S´anchez D. , Mart´?n-Bautista M. J. , Blanco I. , C. Justicia de la Torre. Text Knowledge Mining: An Alternative to Text Data Mining. 2008 IEEE International Conference on Data Mining Workshops
John Atkinson-Abutridy, Chris Mellish, and Stuart Aitken, "Combining Information Extraction With Genetic Algorithm for Text Mining", IEEE INTELLIGENT SYSTEMS
Li Yaxiong, Zhang Jianqiang, Dan Hu. Text Clustering Based on Domain Ontology and Latent Semantic Analysis. 2010 International Conference on Asian Language Processing, DOI 10. 1109/IALP. 2010. 55
MircoSperetta and Susan Gauch. Using Text Mining to Enrich the Vocabulary of Domain Ontologies. 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, DOI 10. 1109/WIIAT. 2008. 288

Index Terms

Computer Science

Information Sciences

Keywords

Text mining legal document search legal databases unsupervised learning