| With the development of the information society, the legal system has also entered a period of informatization reform. The Supreme Court has proposed building a "smart court" in keeping with the frontier of the times. Data validity and professionalism are the premise and benchmark of legal informatization. Traditional data collection is statistics-oriented and relies mainly on manual input, so the resulting data is not objective. The writing norms of court documents also differ from one another. The key problem is therefore how to convert subjective text descriptions written to different standards into unified structured data. The big data case parsing system was created to solve these problems. The system extracts information from legal documents and converts it into semi-structured data that computers can process, enabling the calculation, analysis, and storage of judicial big data. I was responsible for implementing four modules of the system. The legal article analysis module parses legal documents to build the collection of legal articles and analyzes the legal references in judgment documents. The character information analysis module identifies the people named in documents and analyzes their relevant information. The label information statistics module calculates and analyzes the information that affects sentencing results. The judge work evaluation module extracts and analyzes information about judges' work. The big data case parsing system is implemented in Java, and the whole project adopts the Spring Boot framework. The system also uses the ElasticSearch database, which balances data storage capacity and search performance and supports both the storage of massive unstructured document data and distributed search with real-time analysis. Since the system was launched, the number of legal documents it has analyzed and collected has reached the order of 100,000, and the volume of judgment documents has reached the order of tens of millions. |