
xmLeges Marker converts a legislative text into an XML-NIR file, which makes it useful for migrating legacy content.
Automatic detection and tagging of the document's formal structure gives a fast, reliable conversion of the text to a specific XML standard and reduces the burden of manual conversion.
Features
- XML conversion of different types of documents (laws, decrees, etc.)
- Management of specific provisions: amendments, notes, etc.
- Partition numbering control; highlighting of uncertain predictions; automatic management of partition IDs
- Support for different document formats (txt, doc, html)
xmLegesMarker is available for:
- Document tagging Web service
- xmLegesEditor service
The engine is written in C/C++ and is organised as follows:
- A Non-deterministic Finite Automaton (NFA) is used for the body of a law text: states correspond to the elements of the DTD-NIR, and transitions between states are determined by the formal rules governing the division into sections. This part is implemented with FLEX;
- A probabilistic automaton (HMM: Hidden Markov Model) is used for every other division (headings, opening and closing formulas, preface): the nodes are the words, and the states associated with each node are sub-elements of the division.
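
The two components above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the engine's C/C++/FLEX code. Everything in it is an assumption made for illustration: the transition table, the symbol names (ART, NUM, LET, CAPO), the coarse word classes, and all probabilities are invented, and only a handful of NIR element names (articolato, capo, articolo, comma, el, en, tipoDoc, dataDoc, numDoc) are used as state labels. The first part simulates an NFA by tracking the set of states reachable after each structural marker; the second part uses Viterbi decoding to assign one header sub-element to each word.

```python
import math

# --- Part 1: NFA over structural markers (hypothetical table, not the real grammar) ---
# Keys are (state, symbol); values are the sets of possible next states.
NFA = {
    ("articolato", "CAPO"): {"capo"},
    ("articolato", "ART"): {"articolo"},
    ("capo", "ART"): {"articolo"},
    ("articolo", "NUM"): {"comma"},
    ("comma", "NUM"): {"comma", "en"},  # "2." may open a new comma or a numbered item
    ("comma", "LET"): {"el"},
    ("el", "LET"): {"el"},
    ("el", "NUM"): {"comma", "en"},
    ("en", "NUM"): {"comma", "en"},
    ("comma", "ART"): {"articolo"},
    ("el", "ART"): {"articolo"},
    ("en", "ART"): {"articolo"},
}

def nfa_run(symbols, start="articolato"):
    """Return the set of states reachable after each input symbol."""
    current, trace = {start}, []
    for sym in symbols:
        current = set().union(*(NFA.get((s, sym), set()) for s in current))
        trace.append(current)
    return trace

# --- Part 2: HMM over header words (all probabilities are made up) ---
STATES = ("tipoDoc", "dataDoc", "numDoc")
START = {"tipoDoc": 0.8, "dataDoc": 0.1, "numDoc": 0.1}
TRANS = {
    "tipoDoc": {"tipoDoc": 0.5, "dataDoc": 0.4, "numDoc": 0.1},
    "dataDoc": {"tipoDoc": 0.05, "dataDoc": 0.6, "numDoc": 0.35},
    "numDoc": {"tipoDoc": 0.05, "dataDoc": 0.05, "numDoc": 0.9},
}
EMIT = {
    "tipoDoc": {"alpha": 0.9, "digit": 0.05, "marker": 0.05},
    "dataDoc": {"alpha": 0.4, "digit": 0.55, "marker": 0.05},
    "numDoc": {"alpha": 0.05, "digit": 0.3, "marker": 0.65},
}

def word_class(word):
    """Map a word to a coarse observation class (a simplification of this sketch)."""
    if word.isdigit():
        return "digit"
    if word.lower() in {"n", "n."}:
        return "marker"
    return "alpha"

def viterbi(words):
    """Return the most likely state for each word, using log-probabilities."""
    if not words:
        return []
    obs = [word_class(w) for w in words]
    score = {s: math.log(START[s]) + math.log(EMIT[s][obs[0]]) for s in STATES}
    back = []  # back[t][s] is the best predecessor of state s at step t + 1
    for o in obs[1:]:
        prev, score, ptr = score, {}, {}
        for s in STATES:
            best = max(STATES, key=lambda p: prev[p] + math.log(TRANS[p][s]))
            score[s] = prev[best] + math.log(TRANS[best][s]) + math.log(EMIT[s][o])
            ptr[s] = best
        back.append(ptr)
    state = max(STATES, key=score.get)
    path = [state]
    for ptr in reversed(back):  # follow the back-pointers from the last word
        state = ptr[state]
        path.append(state)
    return path[::-1]

if __name__ == "__main__":
    print(nfa_run(["ART", "NUM", "NUM"]))
    print(viterbi("Legge 7 agosto 1990 n. 241".split()))
```

With these invented numbers, the header "Legge 7 agosto 1990 n. 241" is tagged as tipoDoc for the first word, dataDoc for the next three, and numDoc for the last two. In the NFA run, the second NUM marker leaves two candidate states open (comma and en); this is the kind of ambiguity a tool could report as an uncertain prediction. A real model would estimate its probabilities from annotated documents rather than hard-code them.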