<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.6/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.mediawiki.org/xml/export-0.6/ http://www.mediawiki.org/xml/export-0.6.xsd" version="0.6" xml:lang="en">
  <siteinfo>
    <sitename>WandoraWiki</sitename>
    <base>http://wandora.org/wiki/Main_Page</base>
    <generator>MediaWiki 1.19.1</generator>
    <case>first-letter</case>
    <namespaces>
      <namespace key="-2" case="first-letter">Media</namespace>
      <namespace key="-1" case="first-letter">Special</namespace>
      <namespace key="0" case="first-letter" />
      <namespace key="1" case="first-letter">Talk</namespace>
      <namespace key="2" case="first-letter">User</namespace>
      <namespace key="3" case="first-letter">User talk</namespace>
      <namespace key="4" case="first-letter">WandoraWiki</namespace>
      <namespace key="5" case="first-letter">WandoraWiki talk</namespace>
      <namespace key="6" case="first-letter">File</namespace>
      <namespace key="7" case="first-letter">File talk</namespace>
      <namespace key="8" case="first-letter">MediaWiki</namespace>
      <namespace key="9" case="first-letter">MediaWiki talk</namespace>
      <namespace key="10" case="first-letter">Template</namespace>
      <namespace key="11" case="first-letter">Template talk</namespace>
      <namespace key="12" case="first-letter">Help</namespace>
      <namespace key="13" case="first-letter">Help talk</namespace>
      <namespace key="14" case="first-letter">Category</namespace>
      <namespace key="15" case="first-letter">Category talk</namespace>
    </namespaces>
  </siteinfo>
  <page>
    <title>MediaWiki extractor</title>
    <ns>0</ns>
    <id>1628</id>
      <sha1>5n736b0yz62o9wbvpcpm9zuq1nkfsdj</sha1>
    <restrictions>edit=sysop:move=sysop</restrictions>
    <revision>
      <id>7554</id>
      <timestamp>2010-01-09T17:54:43Z</timestamp>
      <contributor>
        <username>Akivela</username>
        <id>3</id>
      </contributor>
      <comment>/* Postprocessing MediaWiki extracted topics */</comment>
      <text xml:space="preserve" bytes="1819">Wandora's [http://www.mediawiki.org/wiki/MediaWiki MediaWiki] extractor allows you to gather topics and associations from various large knowledge repositories such as [http://www.wikipedia.org Wikipedia]. The extractor can't handle HTML version of MediaWiki page but requires the XML exported page. MediaWiki extractor reads the XML dump of MediaWiki page and creates a topic for the page. Page content is attached to the topic as a text data occurrence. The extractor is started with '''File &gt; Extract &gt; Wiki &gt; [[MediaWikiExtractor|MediaWiki extractor]]'''. You can extract data from local XML files or directly from MediaWiki site using export URL of the page. For example the export URL of this page is

 http://www.wandora.net/wandora/wiki/index.php?title=Special:Export/MediaWiki_extractor

'''Note: Wandora or Wandora authors have no rights to give you any permission to use any content of any MediaWiki site. Wandora provides you nothing but a technology to create topic maps from MediaWiki pages. You should carefully read the content license of the MediaWiki site before using the extractor.'''

== Postprocessing MediaWiki extracted topics ==

The MediaWiki extractor does not process the content of extracted pages. However, it is possible to create associations out of page content using another tool in Wandora. Context menu has a tool called '''Topics &gt; Associations &gt; [[FindAssociationInOccurrence|Find associations in occurrence...]]''' that can be used to extract associations out of text data. The tool requires type and scope of processed occurrence, topic's role in new associations, and a regular expression used to recognize extracted topics in text data.

== See also ==

Wandora contains also separate [[Wikipedia extractor]] that is a graphical front end for MediaWiki extractor described here.</text>
    </revision>
  </page>
</mediawiki>
