User:Kipcool/importing
From OmegaWiki
How does it work
The idea is to use TBX, an XML standard for sharing terminological data. TBX and OmegaWiki are different but close. To share OmegaWiki data with other people, we will need OW <-> TBX conversion anyway, so to import the Wiktionaries we should first:
- create a TBX -> OmegaWiki import script.
- create a Wiktionary -> TBX conversion script. That is what the script below does.
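To illustrate the target of the Wiktionary -> TBX step, here is a minimal sketch in Python (not the Perl script from this page) that builds one TBX-style term entry. The element names (termEntry, descrip, langSet, tig, term, termNote) follow the usual TBX layout as I understand it. The function name make_term_entry, the id "dm1" and the sample data are assumptions made up for the example.

```python
import xml.etree.ElementTree as ET

# ElementTree serializes this namespace with the reserved "xml" prefix.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def make_term_entry(entry_id, definition, terms):
    """Build one TBX-style <termEntry> element (hypothetical helper).

    terms is a list of (language_code, term, part_of_speech) tuples.
    One <langSet> is created per language, one <tig> per term.
    """
    entry = ET.Element("termEntry", {"id": entry_id})
    descrip = ET.SubElement(entry, "descrip", {"type": "definition"})
    descrip.text = definition

    lang_sets = {}
    for lang, term, pos in terms:
        if lang not in lang_sets:
            lang_sets[lang] = ET.SubElement(entry, "langSet", {XML_LANG: lang})
        tig = ET.SubElement(lang_sets[lang], "tig")
        ET.SubElement(tig, "term").text = term
        if pos:
            note = ET.SubElement(tig, "termNote", {"type": "partOfSpeech"})
            note.text = pos
    return entry


# Sample data invented for illustration only.
entry = make_term_entry(
    "dm1",
    "Mammifère domestique de la famille des canidés.",
    [("fr", "chien", "noun"), ("en", "dog", "noun"), ("en", "hound", "noun")],
)
tbx_fragment = ET.tostring(entry, encoding="unicode")
print(tbx_fragment)
```

One definition per termEntry mirrors the "definitions = definedMeaning" mapping listed further down; a full TBX file would also need the surrounding header and body elements, which are left out here.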
List
- af: should be possible (Manie can help [1])
- de: should be ok
- en: should be ok
- es: needs a better formatting
- fo: ? (User:Spacebirdy can help)
- fr: should be ok
- is: (User:Spacebirdy can help)
- it: should be ok (User:SabineCretella)
- nah: should be possible (User:Piolinfax, User:Spacebirdy can help)
- nl: should be ok (User:GerardM can help)
- pt: should be possible (User:Spacebirdy can help)
- scn: should be ok (User:SabineCretella)
- sr: like de: = should be ok (Roter Baron)
- sv: should be ok (sanna and \Mike)
- tr: should be possible (90%) (User:Ugur Basak can help)
- vi: like nl: = should be ok (User:David)
- zh: maybe (User:Shibo77 can help)
Importing fr:
- done:
- definitions = definedMeaning
- language
- title of the article
- part of speech: noun, verb, adjective, ...
- translations
- matched with the corresponding meaning when in the form [[word]] (2)
- '''the definition''' followed by a list of translations is still to be done...
- todo:
- part of speech: phrase (noun phrase, verb phrase, ...)
- synonyms
- antonyms
- examples
- permalink
- pronunciation
- etymology
- field/domain: {{term|...}}
- images
- see also (wikipedia...)
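The translation matching mentioned above (a link in the form [[word]] (2), where the number points to a definition) can be sketched as follows. This is a Python illustration, not the Perl script itself. The exact shape of a translation line ("* {{en}} : [[dog]] (1), [[hound]] (2)") is an assumption about the French Wiktionary markup, and parse_translation_line is a hypothetical name.

```python
import re

# Assumed line shape: "* {{en}} : [[dog]] (1), [[hound]] (2)"
LANG_RE = re.compile(r"^\*\s*\{\{(?P<lang>[a-z-]+)\}\}\s*:\s*(?P<rest>.*)$")

# A wiki link, optionally piped, optionally followed by "(n)".
LINK_RE = re.compile(
    r"\[\[(?P<word>[^\]|]+)(?:\|[^\]]*)?\]\](?:\s*\((?P<sense>\d+)\))?"
)


def parse_translation_line(line):
    """Return (language, word, sense_number) tuples for one line.

    sense_number is None when the link carries no "(n)" suffix, so the
    caller cannot tell which definition the translation belongs to.
    """
    match = LANG_RE.match(line.strip())
    if match is None:
        return []
    results = []
    for link in LINK_RE.finditer(match.group("rest")):
        sense = link.group("sense")
        results.append(
            (match.group("lang"), link.group("word"),
             int(sense) if sense else None)
        )
    return results


print(parse_translation_line("* {{en}} : [[dog]] (1), [[hound]] (2)"))
```

Translations that come back with a sense number of None correspond to the case that cannot be attached to a specific definition automatically.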
Result
- Usage: "wiktfr2TBX.pl frtest.xml > TBXtest.xml"
- Input: frtest.xml, the first 5000 lines of the French Wiktionary dump
- Processing: the Perl script wiktfr2TBX.pl
- Output: a TBX file (TBXtest.xml)

