Package: interlineaR 1.0
interlineaR: Importing Interlinearized Corpora and Dictionaries as Produced by Descriptive Linguistics Software
Interlinearized glossed texts (IGT) are used in descriptive linguistics to represent the morphological analysis of a text through a morpheme-by-morpheme gloss. 'InterlineaR' provides a set of functions that target several popular IGT formats ('SIL Toolbox', 'EMELD XML') and turn an IGT into a set of data frames following a relational model (the tables represent the different linguistic units: texts, sentences, words, morphemes). The same pieces of software ('SIL FLEx', 'SIL Toolbox') typically produce dictionaries of the morphemes used in the glosses. 'InterlineaR' also provides a function for turning the LIFT XML dictionary format into a set of data frames following a relational model, representing the dictionary entries, the sense(s) attached to each entry, the example(s) attached to each sense, etc.
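As a rough illustration of the relational model described above, the sketch below builds toy sentence, word, and morpheme tables in base R and joins them on their identifier columns. It does not call 'interlineaR'; the table layout, column names (`sentence_id`, `word_id`, `gloss`, etc.), and English toy data are assumptions made for the example, not the package's actual output.

```r
# Toy relational tables; every column name and value here is an
# illustrative assumption, not the layout produced by interlineaR.
sentences <- data.frame(
  sentence_id = c(1L, 2L),
  text_id     = c(1L, 1L),
  translation = c("Dogs sleep.", "Cats walked."),
  stringsAsFactors = FALSE
)

words <- data.frame(
  word_id     = 1:4,
  sentence_id = c(1L, 1L, 2L, 2L),
  form        = c("dogs", "sleep", "cats", "walked"),
  stringsAsFactors = FALSE
)

morphemes <- data.frame(
  morpheme_id = 1:7,
  word_id     = c(1L, 1L, 2L, 3L, 3L, 4L, 4L),
  form        = c("dog", "-s", "sleep", "cat", "-s", "walk", "-ed"),
  gloss       = c("dog", "PL", "sleep", "cat", "PL", "walk", "PST"),
  stringsAsFactors = FALSE
)

# Join morphemes to their words, then to their sentences.
# Both tables have a 'form' column, so suffixes disambiguate them.
morph_words <- merge(morphemes, words, by = "word_id",
                     suffixes = c(".morpheme", ".word"))
full <- merge(morph_words, sentences, by = "sentence_id")

# Example queries on the joined table.
morphemes_per_sentence <- table(full$sentence_id)
plural_words <- unique(full$form.word[full$gloss == "PL"])
```

Keeping one table per linguistic unit means each level can be filtered or counted on its own, and the joins are only paid for when a query crosses levels (for example, finding the words that contain a plural morpheme).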
Authors:
interlineaR_1.0.tar.gz
interlineaR_1.0.zip (r-4.5), interlineaR_1.0.zip (r-4.4), interlineaR_1.0.zip (r-4.3)
interlineaR_1.0.tgz (r-4.4-any), interlineaR_1.0.tgz (r-4.3-any)
interlineaR_1.0.tar.gz (r-4.5-noble), interlineaR_1.0.tar.gz (r-4.4-noble)
interlineaR_1.0.tgz (r-4.4-emscripten), interlineaR_1.0.tgz (r-4.3-emscripten)
interlineaR.pdf | interlineaR.html
interlineaR/json (API)
# Install 'interlineaR' in R:
install.packages('interlineaR', repos = c('https://sylvainloiseau.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/sylvainloiseau/interlinear/issues
- vatlongos - A corpus of 10 texts of the Vatlongos (vtk) language
corpus-linguistics, descriptive-linguistics, dictionaries, interlinear-gloss
Last updated 7 years ago from:34d0b1de57. Checks: OK: 7. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Oct 30 2024 |
R-4.5-win | OK | Oct 30 2024 |
R-4.5-linux | OK | Oct 30 2024 |
R-4.4-win | OK | Oct 30 2024 |
R-4.4-mac | OK | Oct 30 2024 |
R-4.3-win | OK | Oct 30 2024 |
R-4.3-mac | OK | Oct 30 2024 |
Exports: available.entry.fields, available.example.fields, available.relation.fields, available.sense.fields, read.emeld, read.lift, read.pangloss, read.toolbox
Dependencies: cli, glue, lifecycle, magrittr, plyr, Rcpp, reshape2, rlang, stringi, stringr, vctrs, xml2
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Importing interlinearized corpora and dictionaries as produced by descriptive linguistics software | interlineaR-package interlineaR |
Information about the structure of the LIFT XML format in order to easily generate XPath expressions and extract information. | entry.fields.spec example.fields.spec lift-format relation.fields.spec sense.fields.spec |
List of the available pieces of information for each entry (i.e. columns in the entry table) | available.entry.fields available.example.fields available.relation.fields available.sense.fields lift.specification |
Read an EMELD XML document containing an interlinearized corpus. | read.emeld |
Parse a dictionary in XML LIFT (Lexicon Interchange FormaT) vocabulary and turn it into a set of data.frame | read.lift |
Read a file in the format used in the pangloss collection | read.pangloss |
Parse a Toolbox (SIL) text file | read.toolbox |
A corpus of 10 texts of the Vatlongos (vtk) language | vatlongos |
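
The help pages above mention XPath expressions over LIFT XML and a set of data frames for entries and senses. The following is a minimal sketch of that general idea using the 'xml2' package (listed among the dependencies) on a tiny hand-written LIFT-like fragment. It is not the implementation of `read.lift`; the XML sample, its invented vernacular forms, and the column names `entry_id`, `sense_id`, and `gloss` are assumptions for illustration only.

```r
library(xml2)

# A tiny hand-written LIFT-like fragment (invented toy data).
lift_xml <- '<lift version="0.13">
  <entry id="e1">
    <lexical-unit><form lang="und"><text>pata</text></form></lexical-unit>
    <sense id="s1"><gloss lang="en"><text>stone</text></gloss></sense>
    <sense id="s2"><gloss lang="en"><text>mountain</text></gloss></sense>
  </entry>
  <entry id="e2">
    <lexical-unit><form lang="und"><text>kimo</text></form></lexical-unit>
    <sense id="s3"><gloss lang="en"><text>water</text></gloss></sense>
  </entry>
</lift>'

doc <- read_xml(lift_xml)
entries <- xml_find_all(doc, "/lift/entry")

# One row per entry: its id and the text of its first lexical-unit form.
entry_table <- data.frame(
  entry_id = xml_attr(entries, "id"),
  form     = xml_text(xml_find_first(entries, "lexical-unit/form/text")),
  stringsAsFactors = FALSE
)

# One row per sense, carrying the id of the entry it belongs to
# (a foreign key back to entry_table).
sense_table <- do.call(rbind, lapply(entries, function(e) {
  senses <- xml_find_all(e, "sense")
  data.frame(
    entry_id = rep(xml_attr(e, "id"), length(senses)),
    sense_id = xml_attr(senses, "id"),
    gloss    = xml_text(xml_find_first(senses, "gloss/text")),
    stringsAsFactors = FALSE
  )
}))

# Join senses back to their entries.
dictionary <- merge(entry_table, sense_table, by = "entry_id")
```

Building the sense table entry by entry keeps the `entry_id` key aligned with each sense even when an entry has several senses, which is what makes the later `merge()` reliable.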