
## XMLCorrector

**XMLCorrector** is an implementation of an algorithm that corrects an XML document with respect to schema constraints expressed as a DTD. Namely, given a well-formed XML document *t* seen as a tree, a schema *S*, and a non-negative threshold *th*, the algorithm finds every tree *t'* valid with respect to *S* such that the edit distance between *t* and *t'* is no greater than *th*. The algorithm is based on a recursive exploration of the finite-state automata representing the structural constraints imposed by the schema, as well as on the construction of an edit distance matrix storing the edit sequences that lead to candidate corrected trees.
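To give an intuition for the edit distance matrix and the threshold test, here is a minimal sketch in Java. It is not XMLCorrector's code and deliberately simplifies the problem: instead of trees and schema automata, it compares two flat sequences of child element names with unit-cost insertions, deletions and relabelings. The class and method names (`EditDistanceSketch`, `distance`, `withinThreshold`) and the sample element names are hypothetical and chosen only for illustration.

```java
import java.util.Arrays;
import java.util.List;

/** Illustrative sketch only; not part of the XMLCorrector distribution. */
final class EditDistanceSketch {

    /**
     * Classic edit distance between two sequences of element names,
     * assuming unit cost for insertion, deletion and relabeling.
     */
    static int distance(List<String> source, List<String> target) {
        int n = source.size();
        int m = target.size();
        // d[i][j] = cost of turning the first i source labels
        // into the first j target labels.
        int[][] d = new int[n + 1][m + 1];
        for (int i = 0; i <= n; i++) d[i][0] = i; // delete i labels
        for (int j = 0; j <= m; j++) d[0][j] = j; // insert j labels
        for (int i = 1; i <= n; i++) {
            for (int j = 1; j <= m; j++) {
                int relabel = source.get(i - 1).equals(target.get(j - 1)) ? 0 : 1;
                d[i][j] = Math.min(d[i - 1][j - 1] + relabel,
                          Math.min(d[i - 1][j] + 1,    // deletion
                                   d[i][j - 1] + 1));  // insertion
            }
        }
        return d[n][m];
    }

    /** True if the candidate lies within the threshold th of the source. */
    static boolean withinThreshold(List<String> source, List<String> target, int th) {
        return distance(source, target) <= th;
    }

    public static void main(String[] args) {
        // Hypothetical child sequences of some element.
        List<String> children = Arrays.asList("title", "author", "year");
        List<String> candidate = Arrays.asList("title", "author", "publisher", "year");
        System.out.println(distance(children, candidate));           // prints 1
        System.out.println(withinThreshold(children, candidate, 1)); // prints true
    }
}
```

In this simplified setting, a candidate is kept only if its distance does not exceed *th*. The algorithm described above goes further: it works on whole trees and generates the candidates itself by exploring the automata derived from the DTD.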

XMLCorrector has been developed by:

- Joshua AMAVI, Université d'Orléans, LIFO, France
- Alexande BOREL
- Béatrice BOUCHOU, Université François Rabelais Tours, LI, France
- Agata SAVARY, Université François Rabelais Tours, LI, France

XMLCorrector is free software. It can be used under the terms of the GNU LGPL v3 license.

The current distribution package is available for download. The package consists of:

- executable .jar file,
- executable .jar files of the open source libraries cited below,
- Java source code,
- user's guide,
- JavaDoc technical documentation,
- examples of DTDs and trees to be corrected,
- the GNU LGPL v3 license (and the accompanying GNU GPL v3 license).

The algorithm has been described in the following publications:

- AMAVI J., BOUCHOU B., SAVARY A., "On Correcting XML Documents With Respect to a Schema", submitted to *The Computer Journal*. Also available as a technical report.
- BOUCHOU B., CHERIAT A., HALFELD FERRARI M., SAVARY A. (2006), "XML Document Correction: Incremental Approach Activated by Schema Validation", in Proceedings of IDEAS 2006, Delhi, India, December 11-14, 2006.
- BOUCHOU B., CHERIAT A., HALFELD FERRARI M., SAVARY A. (2006), "Integrating Correction into Incremental Validation", in Proceedings of BDA 2006, Lille, October 17-20, 2006.

XMLCorrector includes the following open source libraries:

- dk.brics.automaton v1.6 under the BSD 2-Clause license, developed by Anders Møller et al.
- jdom v1.1 under an open source license derived from the BSD 3-Clause License, developed by the JDOM Project
- bounce v0.18 under the BSD 3-Clause License, developed by Edwin Dankert
- com.wutka.dtd.DTDParser v1.21 under the GNU Lesser General Public License v3, developed by Mark Wutka
