Package: interlineaR 1.0

interlineaR: Importing Interlinearized Corpora and Dictionaries as Produced by Descriptive Linguistics Software

Interlinearized glossed texts (IGT) are used in descriptive linguistics for representing a morphological analysis of a text through a morpheme-by-morpheme gloss. 'InterlineaR' provide a set of functions that targets several popular formats of IGT ('SIL Toolbox', 'EMELD XML') and that turns an IGT into a set of data frames following a relational model (the tables represent the different linguistic units: texts, sentences, word, morphems). The same pieces of software ('SIL FLEX', 'SIL Toolbox') typically produce dictionaries of the morphemes used in the glosses. 'InterlineaR' provide a function for turning the LIFT XML dictionary format into a set of data frames following a relational model in order to represent the dictionary entries, the sense(s) attached to the entries, the example(s) attached to senses, etc.

Authors:Sylvain Loiseau [aut, cre]

interlineaR_1.0.tar.gz
interlineaR_1.0.zip(r-4.7)interlineaR_1.0.zip(r-4.6)interlineaR_1.0.zip(r-4.5)
interlineaR_1.0.tgz(r-4.6-any)interlineaR_1.0.tgz(r-4.5-any)
interlineaR_1.0.tar.gz(r-4.7-any)interlineaR_1.0.tar.gz(r-4.6-any)
interlineaR_1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION
card.svg |card.png
interlineaR/json (API)

# Install 'interlineaR' in R:
install.packages('interlineaR', repos = c('https://sylvainloiseau.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/sylvainloiseau/interlinear/issues

Datasets:
  • vatlongos - A corpus of 10 texts of the Vatlongos (vtk) language

On CRAN:

Conda:

corpus-linguisticsdescriptive-linguisticsdictionariesinterlinear-gloss

4.75 score 4 stars 14 scripts 250 downloads 8 exports 12 dependencies

Last updated from:34d0b1de57. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK128
source / vignettesOK186
linux-release-x86_64OK119
macos-release-arm64OK211
macos-oldrel-arm64OK207
windows-develOK81
windows-releaseOK80
windows-oldrelOK74
wasm-releaseOK113

Exports:available.entry.fieldsavailable.example.fieldsavailable.relation.fieldsavailable.sense.fieldsread.emeldread.liftread.panglossread.toolbox

Dependencies:cligluelifecyclemagrittrplyrRcppreshape2rlangstringistringrvctrsxml2

interlineaR: Importing interlinearized corpora and dictionaries
Introduction | Reading EMELD XML interlinear corpus | Turning the EMELD XML document into a set of data frames | Contructing data set combining information from several data frame | Reading Toolbox interlinear corpus | Reading LIFT XML dictionary

Last update: 2018-04-26
Started: 2018-02-22

Case studies
A corpus of the Tuwari language | Frequency of the part of speech in the lexicon | Fréquency of the part of speech in the corpora | Are Affixe types specialized in some parts of speech ? | Frequencies of the meaning of a polysemous lexems

Last update: 2018-03-30
Started: 2018-02-27