Krextor, the KWARC RDF Extractor, is an extensible XSLT-based framework for extracting RDF from XML, supporting multiple input languages as well as multiple output RDF notations. Krextor provides convenience templates that try to do “the right thing”™ in many common cases, so as to reduce the need for writing repetitive code by hand. The Publications provide further background on the design, requirements, and use cases behind Krextor.


The extracted RDF graph will in most cases be an outline of the semantic structure of an XML document, abstracting from the concrete syntax. It makes the knowledge contained in XML documents easier to exchange and interlink on the semantic web. Many tools support querying RDF with languages like SPARQL. If the extracted RDF is backed by an expressive ontology, a reasoner can be used to infer additional knowledge from it.
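To illustrate what "an outline of the semantic structure" can look like, here is a minimal sketch in Python. It is not Krextor code and does not use Krextor's XSLT templates; it only mimics the general idea using the standard library. The element names, the `id` and `title` attributes, the `http://example.org/` URIs, and the `hasPart` property are all assumptions made up for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical URIs, chosen only for this illustration.
BASE = "http://example.org/doc"
VOCAB = "http://example.org/vocab#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
DC_TITLE = "http://purl.org/dc/terms/title"


def extract_triples(xml_text):
    """Return (subject, predicate, (kind, value)) triples for elements with an id."""
    root = ET.fromstring(xml_text)
    triples = []

    def visit(element, parent_uri):
        ident = element.get("id")
        # Elements without an id do not become resources; their children
        # are attached to the nearest identified ancestor instead.
        uri = f"{BASE}#{ident}" if ident else parent_uri
        if ident:
            triples.append((uri, RDF_TYPE, ("uri", VOCAB + element.tag)))
            if parent_uri:
                triples.append((parent_uri, VOCAB + "hasPart", ("uri", uri)))
            title = element.get("title")
            if title:
                triples.append((uri, DC_TITLE, ("literal", title)))
        for child in element:
            visit(child, uri)

    visit(root, None)
    return triples


def to_ntriples(triples):
    """Serialize the triples as N-Triples, one statement per line."""
    lines = []
    for subject, predicate, (kind, value) in triples:
        if kind == "uri":
            obj = f"<{value}>"
        else:
            escaped = (value.replace("\\", "\\\\")
                            .replace('"', '\\"')
                            .replace("\n", "\\n"))
            obj = f'"{escaped}"'
        lines.append(f"<{subject}> <{predicate}> {obj} .")
    return "\n".join(lines)


SAMPLE = """<document id="d1" title="Notes">
  <section id="s1" title="Intro"><para>Hello</para></section>
</document>"""

if __name__ == "__main__":
    print(to_ntriples(extract_triples(SAMPLE)))
```

The `para` element has no `id`, so it contributes nothing to the graph: the output keeps the document outline and drops the concrete syntax.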

Supported Formats

Krextor comes with a number of extraction and output modules. Support for additional formats is easy to add. Please let us know if you have written an extraction or output module, test case, or documentation that you would like us to include in the default Krextor distribution.

Input Formats (Extraction Modules; varying stability)

The following input formats are already supported. Others are easy to add: copy an existing extraction module to get started.

Output Formats (all stable)


See Usage

Source code documentation

(generated using XSLTdoc)

External documentation

Last modified on 01/10/12 09:17:58