Difference between revisions of "Wiki gloss plug-in"

From Mi'gmaq Wiki
Jump to navigation Jump to search
 
(11 intermediate revisions by the same user not shown)
Line 65: Line 65:
  
 
== Spacing ==
 
== Spacing ==
The number of spaces inserted between words in the GL element can be adjusted by using the parameter ''glmargin''.  Thus, we can render the same content with different spacings (for readability, for example), as follows:
+
The number of spaces inserted between words in the GL element can be adjusted by using the parameter ''margin''.  Thus, we can render the same content with different spacings (for readability, for example), as follows:
  
Here margin = 2 (this is the default value):
+
<pre>
 +
<gl margin=2 id="ex4">
 +
\gll Mali newti-al-asi-t     
 +
Mali one-around-move-3sg 
 +
\trans Mali wanders around alone.
 +
</gl>
 +
</pre>
  
<gl glmargin=2>
+
This produces <glr id="ex4"/>, where the margin = 2 (this is the default value):
 +
<gl margin=2 id="ex4">
 
\gll Mali newti-al-asi-t       
 
\gll Mali newti-al-asi-t       
 
Mali one-around-move-3sg   
 
Mali one-around-move-3sg   
Line 75: Line 82:
 
</gl>
 
</gl>
  
Here margin =5:
+
In <glr id="ex5"/> the margin = 5:
 
+
<gl margin=5 id="ex5">
 
 
<gl glmargin = 5>
 
 
\gll Mali newti-al-asi-t       
 
\gll Mali newti-al-asi-t       
 
Mali one-around-move-3sg   
 
Mali one-around-move-3sg   
Line 92: Line 97:
 
</gl>
 
</gl>
  
== Known issues ==
+
== Blank space and grammaticality marks ==
  
=== Unicode string handling ===
+
As in Latex gloss environments, curly brackets can be inserted in the gloss line in order to line up with unglossed text, such as the square brackets in <glr id="ex6"/>
While the GL extension does handle unicode characters, there are some instances where it will incorrectly align strings containings rich unicode characters.  Problematic characters include
+
<gl id="ex6">
<pre>
+
\gll Mali magwtgp [ lpa'tuj-ewei wenju'su'n ] 
  * ɪ́
+
Mali eat.3sg.past {} boy-POSS apple {} 
</pre>
+
\trans Mali ate the boy's apple.
When using these characters, the phrase and gloss may not perfectly line up.
 
 
 
= Examples =
 
 
 
Eventually, this page will provide documentation on the GLL Extension.  For now, I'm using it to simply illustrate my progress with the extension.  If you look at this page in the Edit view, you'll see how the ''gl'' tag environment is being used to produce the results below.
 
 
 
Remaining features/issues:
 
<ul>
 
  <li> Write better documentation
 
  <li> Add error checking
 
</ul>
 
 
 
== The basic GL block ==
 
 
 
<gl id="test_link">
 
ego sum laetus\\
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
 
</gl>
 
</gl>
  
== Changing spacing in the GL block ==
+
which was created with the following input:
The ''gllmargin'' option changes the number of spaces that are used to separate individual words in the the \gll section.  The default is 2 spaces.  This next block also makes use of the empty token in the gloss.
+
<pre>
<gl gllmargin="5">
+
<gl id="ex6">
\gll ego sum laetus\\
+
\gll Mali magwtgp [ lpa'tuj-ewei wenju'su'n ] 
{} VERB-1 ADJ\\
+
Mali eat.3SG.PST {} boy-POSS apple {} 
\trans I am happy
+
\trans Mali ate the boy's apple.
 
</gl>
 
</gl>
 +
</pre>
  
== More blocks ==
+
The same can be used to mark examples as ungrammatical or infelicitous, as in <glr id="ex7"/>
Here are a couple more blocks simply to illustrate numbering:
+
<gl id="ex7">
<gl id="test_link2">
+
\gll * jinm  ala  mesgilg 
\gll ego sum laetus\\
+
{} man that  big     
PRO VERB-1 ADJ\\
+
\trans That man is big.
\trans I am happy
 
 
</gl>
 
</gl>
  
<gl id="test_link3">
+
created with:
\gll ego sum laetus\\
+
<pre>
PRONOUN VERB-1 ADJ\\
+
<gl id="ex7">
\trans I am happy, this is test link 2
+
\gll * jinm  ala  mesgilg 
 +
{} man that  big     
 +
\trans That man is big.
 
</gl>
 
</gl>
 +
</pre>
  
Let me tell you something interesting about example <glr id="ex1"/>. What if I try to tell you something about <glr id="test_link2"/>.... or maybe <glr id="test_link3"/>?
+
== Known issues ==
  
<gl id="ex1">
+
=== Unicode string handling ===
\gll Jessica escribe ejemplo-s\\
+
While the GL extension does handle unicode characters, there are some instances where it will incorrectly align strings containings rich unicode charactersWhen using these characters, the phrase and gloss may not perfectly line up.
Jessica writes example-PL\\
 
\trans Jessica writes examples.
 
</gl>
 
 
 
<gl>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
== Restart numbering ==
 
The ''renum'' option makes the numbering restart at 1.  This next block should be numbered 1.
 
<gl gllmargin="5" renum="True">
 
ego sum laetus
 
\gll ego sum laetus
 
PRONOUN VERB-1 ADJ
 
\trans I am happy
 
</gl>
 
 
 
== Cross-referencing GL blocks ==
 
Using the <pre><glr id="...">[text]</glr></pre> tag, one can create a linked reference to an existing gl tagIf the text is omitted, then the number of the referenced tag is used. Otherwise, the text itself is linked.
 
 
 
<gl>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
<gl>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
  
<gl>
+
=== Formatting within GL blocks ===
\gll ego sum laetus\\
+
Currently, it is impossible to use special formatting (e.g. boldface, italics) inside a GL block.
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
<gl>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
<gl id="test_link3">
 
This is another linked block.
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
<gl>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
== Setting font size ==
 
The size of the font itself can be set using the ''font_size'' argument.
 
 
 
<gl fontsize=15>
 
\gll ego sum laetus\\
 
PRO VERB-1 ADJ\\
 
\trans I am happy
 
</gl>
 
 
 
== Handling unicode characters ==
 
 
 
<gl>
 
\gll úla úndʒi mɛmɪ́ski:{k,g}ax ulá ɪktɪk\\
 
this his.head is.bigger.than this other\\
 
\trans this head is bigger than the other one
 
</gl>
 

Latest revision as of 19:28, 19 February 2012

Overview

The Gloss extension provides a wiki markup element for linguistic annotations, developed for McGill Field Methods classes by Derek Ruths. The Gloss extension automatically aligns text with glosses, as well as provides auto-numbering and hyper-referencing. It closely follows the form and features of the Latex-based glossing tools, allowing the wiki source text to be easily pasted into Latex documents. This page provides both formal documentation of the extension as well as numerous examples of how the tool is used.

Installing the extension

To install the extension, you must have access to the includes directory of your MediaWiki installation. The installation is a two step procedure.

1. Copy the gloss.php file into the includes directory of your MediaWiki installation

2. Open the LocalSettings.php file in the root directory of your MediaWiki installation. Just below the line

require_once( "$IP/includes/DefaultSettings.php" );

add the line

require_once( "$IP/includes/gloss.php" );

You should now be able to use the Gloss extension in your wiki pages.

Using the extension

The GL element

The Gloss extension allows a wiki editor to automatically align text for interlinear glossing. In this extension, one phrase-gloss-translation entry is entered inside an xml-formatted element

<gl ...arguments...>
\gll ...text...
...gloss...
\trans ...translation...
</gl>

A simple example would be

<gl id="ex1">
\gll ela'l-ul magasan-g 
bring-1>2  store-LOC 
\trans I'm bringing you to the store.
</gl>

which produces the following block:

  1. ela'l-ul   magasan-g    
    bring-1>2  store-LOC    
    'I'm bringing you to the store.'

A fourth line can be added above the parsed line:

  1. ela'lul magasang
    ela'l-ul magasan-g bring-1>2 store-LOC
    'I'm bringing you to the store.'

Numbering and references

Notice that the example blocks above were automatically numbered. This number is inserted and auto-increments. As a result, the next block will receive the number 3.

  1. Piel  amalga-t   
    Piel  dance-3SG  
    'Piel dances.'

Beyond giving some structure to the blocks themselves, you can easily create hyperlinked references to specific GL elements by using the GL reference element

<glr id="...gl element id..."/>

. So, we can reference the example above using

<glr id="ex3"/>

which generates this: (2).

You can reset the numbering for a given GL element by using the GL element header:

<gl renum="true">
...
</gl>

Spacing

The number of spaces inserted between words in the GL element can be adjusted by using the parameter margin. Thus, we can render the same content with different spacings (for readability, for example), as follows:

<gl margin=2 id="ex4">
\gll Mali newti-al-asi-t       
Mali one-around-move-3sg  
\trans Mali wanders around alone.
</gl>

This produces (4), where the margin = 2 (this is the default value):

  1. Mali  newti-al-asi-t         
    Mali  one-around-move-3sg    
    'Mali wanders around alone.'

In (5) the margin = 5:

  1. Mali     newti-al-asi-t               
    Mali     one-around-move-3sg          
    'Mali wanders around alone.'

Font size

The size of the font used inside of the GL element can be set using the parameter fontsize. By default, the fontsize is 15. Here is the same content as above, rendered in font size 12.

  1. Piel  wenaqwie-t    
    Piel  jump-3SG      
    'Piel jumps'

Blank space and grammaticality marks

As in Latex gloss environments, curly brackets can be inserted in the gloss line in order to line up with unglossed text, such as the square brackets in (7)

  1. Mali  magwtgp       [  lpa'tuj-ewei  wenju'su'n  ]    
    Mali  eat.3sg.past     boy-POSS      apple            
    'Mali ate the boy's apple.'

which was created with the following input:

<gl id="ex6">
\gll Mali magwtgp [ lpa'tuj-ewei wenju'su'n ]  
Mali eat.3SG.PST {} boy-POSS apple {}   
\trans Mali ate the boy's apple.
</gl>

The same can be used to mark examples as ungrammatical or infelicitous, as in (8)

  1. *  jinm  ala   mesgilg    
       man   that  big        
    'That man is big.'

created with:

<gl id="ex7">
\gll * jinm  ala  mesgilg  
{} man that  big      
\trans That man is big.
</gl>

Known issues

Unicode string handling

While the GL extension does handle unicode characters, there are some instances where it will incorrectly align strings containings rich unicode characters. When using these characters, the phrase and gloss may not perfectly line up.

Formatting within GL blocks

Currently, it is impossible to use special formatting (e.g. boldface, italics) inside a GL block.