Nixi ntsikoo sa'ma yakui ka

Vertical Tabs

Reader
<?xml version="1.0" encoding="UTF-8"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml" schematypens="http://relaxng.org/ns/structure/1.0"?>
<?xml-model href="http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng" type="application/xml"
	schematypens="http://purl.oclc.org/dsdl/schematron"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0" xml:id="L098-tok">
      <teiHeader>
         <fileDesc>
            <titleStmt>
               <title xml:lang="mix"><seg xml:id="L098-00-01"><seg xml:id="L098-00-01-01">Nixi</seg> <seg xml:id="L098-00-01-02">ntsikoo</seg> <seg xml:id="L098-00-01-03">sa'ma</seg> <seg xml:id="L098-00-01-04">yakui</seg> <seg xml:id="L098-00-01-05">ka</seg></seg></title>
               <title xml:lang="esp"><seg xml:id="L098-00-02"><seg xml:id="L098-00-02-01">Como</seg> <seg xml:id="L098-00-02-02">el</seg> <seg xml:id="L098-00-02-03">armadillo</seg> <seg xml:id="L098-00-02-04">obtuvo</seg> <seg xml:id="L098-00-02-05">su</seg> <seg xml:id="L098-00-02-06">caparazón</seg></seg></title>         
               <author>María Gómez Hernández</author>
               <editor>Gisela Beckmann</editor>
               <editor>María M. Nieves</editor>
               <respStmt>
                  <resp>TEI Encoding</resp>
                  <name>Jack T. Bowers</name>  
               </respStmt>
            </titleStmt>
            <publicationStmt>
               <p>This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of: archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix). Encoded documents used with the permission of original authors.</p>
            </publicationStmt>
            <sourceDesc>
               <bibl xml:id="bibl.L098"><title>Nixi ntsikoo sa'ma yakui ka</title> <author>Gómez Hernández, María</author>; <editor>Beckmann, Gisela</editor>; <editor>Nieves, María M.</editor>.<date>2007</date>. <edition>(2nd ed.)</edition>. <publisher>Instituto Lingüístico de Verano, A.C.</publisher> <pubPlace>Tlalpan, D.F., México</pubPlace> Obtained from:<ptr target="http://www-01.sil.org/mexico/mixteca/mixtepec/L098-Armadillo-mix.htm"/></bibl>
            </sourceDesc>
         </fileDesc>
         <profileDesc>
            <textDesc>
               <channel mode="w">@mode:'w' means that the primary means of utilising this document is by reading it's written form; </channel>
               <constitution type="single"/>
               <derivation type="original">this document is labeled "original" because the XML-TEI markup of it's contents have not made any alterations to the original content;</derivation>
               <domain type="armadillo"/>
               <factuality type="fiction"/>
               <interaction type="none"/>
               <preparedness type="NA">not applicable</preparedness>
               <purpose type="entertain"/>
            </textDesc>
            <abstract>
               <p xml:lang="esp">Un armadillo se puso a tejer ropa para cubrirse. Un día se fue a una fiesta con un tlacuache, se emborrachó y se ensució. Es por eso que los armadillos son de color gris.</p>
            </abstract>
         </profileDesc>
         <encodingDesc>
            <p>
               <list>
                  <item><label>segment</label>
                     <elementSpec ident="seg">
                        <desc>In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate:
                           <list>
                              <item>i) the entire sentences from original document (first level);</item>
                              <item>ii) the individual lexical items contained within the sentences (second level);</item>
                           </list>                  
                        </desc>
                     </elementSpec>
                  </item>
                  <item><label>character</label>
                     <elementSpec ident="c">
                        <desc>Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element;
                           In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated:<eg> a sentence that begins with: "¿"
                              (and ends with: "?") can be immediately recognized as sentence type "question/interrogative";</eg>  
                        </desc>
                     </elementSpec>
                  </item>
                  <item><label>division</label>
                     <elementSpec ident="div">
                        <desc>Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting</desc>
                     </elementSpec>
                  </item>
               </list>
            </p>
            <projectDesc>
               <p>project description to come<date  when-iso="2014-08-30"/></p>
            </projectDesc>
            <classDecl>
               <taxonomy xml:id="tax.sil-mix">
                  <category xml:id="pedagogical">
                     <catDesc>PEDAGOGICAL</catDesc>
                     <category xml:id="pedagogical-inter">
                        <catDesc>PEDAGOGICAL:INTERACTIVE</catDesc>
                     </category>
                     <category xml:id="pedagogical-ref">
                        <catDesc>PEDAGOGICAL:REFERENCE</catDesc>
                     </category>
                  </category>
                  <category xml:id="fiction">
                     <catDesc>FICTION</catDesc>
                     <category xml:id="fiction-fantasy">
                        <catDesc>FICTION:FANTASY</catDesc>
                     </category>
                     <category xml:id="fiction-realistic">
                        <catDesc>FICTION:REALISTIC</catDesc>
                     </category>
                  </category>
                  <category xml:id="folklore">
                     <catDesc>FOLKLORE</catDesc>
                  </category>
               </taxonomy>
            </classDecl>
            <projectDesc>
               <p>
                  This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety.
                  The encoding of this portion of the MIX Language resources can be divided into the following components:
                  <list>
                     <item>i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties;</item>
                     
                     <item>ii) Annotation and glossing of lexical information and structures in documents; tasks include;
                        
                        <list>
                           <item>glossing of orthographical lexical items;</item>
                           <item>assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.);</item>
                           <item>linking corresponding bilingual items (Spanish-Mixtec) within each document;</item>
                           <item>entry of analyzed, glossed and categorized lexical information into TEI-Dictionary;</item>
                           <item>link orthographical forms of lexical items with phonetic transcriptions of spoken forms;</item></list></item>
                     
                     <item>iii) Classification of Document Types and Ontology Linking;
                        <list>
                           <item>replace current SIL document classification model<ref target="#tax.sil-mix"/> with established and expansible document system which can be used to classify any new document or literature produced in the language in the future;</item>
                           <item>as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);</item>
                        </list>
                     </item>
                  </list>
               </p>
            </projectDesc>
         </encodingDesc>
      </teiHeader>
      <text decls="#fiction-fantasy">
         <body>
            <div xml:id="L098-01">
               <head>
                  <graphic url="L098-01-armadillo.png"></graphic>                        
               </head>
               <p>
                  <seg xml:id="L098-01-01"><seg xml:id="L098-01-01-01">Ntsio</seg> <seg xml:id="L098-01-01-02">iin</seg> <seg xml:id="L098-01-01-03">kii</seg> <seg xml:id="L098-01-01-04">iin</seg> <seg xml:id="L098-01-01-05">yakui</seg> <seg xml:id="L098-01-01-06">ntivixi</seg><c>,</c> <seg xml:id="L098-01-01-07">yii</seg> <seg xml:id="L098-01-01-08">yuku</seg> <seg xml:id="L098-01-01-09">koo</seg> <seg xml:id="L098-01-01-10"><seg xml:id="L098-01-01-11">sa'mi</seg></seg><c>.</c></seg>
                  <seg xml:id="L098-01-02"><seg xml:id="L098-01-02-01">Sara</seg> <seg xml:id="L098-01-02-02">ntakaninia</seg> <seg xml:id="L098-01-02-03">nixi</seg> <seg xml:id="L098-01-02-04">sava'i</seg> <seg xml:id="L098-01-02-05">sa'mi</seg><c>.</c></seg>
                  <seg xml:id="L098-01-03"><seg xml:id="L098-01-03-01">Ntantuki</seg> <seg xml:id="L098-01-03-02">yu'a</seg> <seg xml:id="L098-01-03-03">sara</seg> <seg xml:id="L098-01-03-04">nikitsai</seg> <seg xml:id="L098-01-03-05">kikui</seg><c>.</c></seg>
                  <seg xml:id="L098-01-04"><seg xml:id="L098-01-04-01">Kiku</seg> <seg xml:id="L098-01-04-02">xeen</seg> <seg xml:id="L098-01-04-03">kiku</seg> <seg xml:id="L098-01-04-04">nchu'i</seg> <seg xml:id="L098-01-04-05">ñantu'i</seg> <seg xml:id="L098-01-04-06">yuku</seg> <seg xml:id="L098-01-04-07">ka</seg><c>.</c></seg>
                  
                  <seg xml:id="L098-01-05"><seg xml:id="L098-01-05-01">Nikitsaa</seg> <seg xml:id="L098-01-05-02">iin</seg> <seg xml:id="L098-01-05-03">tsoko</seg> <seg xml:id="L098-01-05-04">sara</seg> <seg xml:id="L098-01-05-05">nikachia</seg> <seg xml:id="L098-01-05-06">saan</seg><c>:</c></seg>
                  <seg xml:id="L098-01-06"><c>¿</c><seg xml:id="L098-01-06-01">Nchii</seg> <seg xml:id="L098-01-06-02">sa'u</seg><c>,</c> <seg xml:id="L098-01-06-03">mpaa</seg> <seg xml:id="L098-01-06-04">luu</seg><c>?</c></seg>
                  <seg xml:id="L098-01-07"><c>-</c><seg xml:id="L098-01-07-01">Kiku</seg> <seg xml:id="L098-01-07-02">sa'ma</seg> <seg xml:id="L098-01-07-03">nikachi</seg> <seg xml:id="L098-01-07-04">yakui</seg> <seg xml:id="L098-01-07-05">ka</seg><c>.</c></seg>
                  <seg xml:id="L098-01-08"><c>-</c><seg xml:id="L098-01-08-01">Sara</seg> <seg xml:id="L098-01-08-02">nikachi</seg> <seg xml:id="L098-01-08-03">tsoko</seg> <seg xml:id="L098-01-08-04">ka</seg><c>:</c></seg>
                  <seg xml:id="L098-01-09"><seg xml:id="L098-01-09-01">Kua'a</seg> <seg xml:id="L098-01-09-02">sa'mu</seg><c>,</c> <seg xml:id="L098-01-09-03">nakiku</seg> <seg xml:id="L098-01-09-04">nachinchee</seg> <seg xml:id="L098-01-09-05">yo</seg><c>.</c></seg>
                  <seg xml:id="L098-01-10"><seg xml:id="L098-01-10-01">Natsinu</seg> <seg xml:id="L098-01-10-02">sa'mu</seg> <seg xml:id="L098-01-10-03">ra</seg> <seg xml:id="L098-01-10-04">nako'on</seg> <seg xml:id="L098-01-10-05">viko</seg><c>.</c></seg>
               </p>
            </div>
            <div xml:id="L098-02">
               <head>
                  <graphic url="L098-02-armadillos-w-snake.png"></graphic>
               </head>
               <p>
                  <seg xml:id="L098-02-01"><seg xml:id="L098-02-01-01">Sara</seg> <seg xml:id="L098-02-01-02">nchinchee</seg> <seg xml:id="L098-02-01-03">tsoko</seg> <seg xml:id="L098-02-01-04">ka</seg> <seg xml:id="L098-02-01-05">yakui</seg> <seg xml:id="L098-02-01-06">ka</seg><c>.</c></seg>
                  <seg xml:id="L098-02-02"><seg xml:id="L098-02-02-01">Kiku</seg> <seg xml:id="L098-02-02-02">kueyi</seg> <seg xml:id="L098-02-02-03">vari</seg> <seg xml:id="L098-02-02-04">yakui</seg> <seg xml:id="L098-02-02-05">ka</seg> <seg xml:id="L098-02-02-06">ra</seg> <seg xml:id="L098-02-02-07">koo</seg> <seg xml:id="L098-02-02-08">sa'mi</seg><c>.</c></seg>
                  <seg xml:id="L098-02-03"><seg xml:id="L098-02-03-01">Cha</seg> <seg xml:id="L098-02-03-02">nikitsaa</seg> <seg xml:id="L098-02-03-03">tsoko</seg> <seg xml:id="L098-02-03-04">ka</seg> <seg xml:id="L098-02-03-05">kikui</seg> <seg xml:id="L098-02-03-06">tiku</seg> <seg xml:id="L098-02-03-07">nani</seg> <seg xml:id="L098-02-03-08">nchu'a</seg><c>.</c></seg>
                  <seg xml:id="L098-02-04"><seg xml:id="L098-02-04-01">Sara</seg> <seg xml:id="L098-02-04-02">nikachi</seg> <seg xml:id="L098-02-04-03">yakui</seg> <seg xml:id="L098-02-04-04">ka</seg><c>:</c></seg>
                  <seg xml:id="L098-02-05"><seg xml:id="L098-02-05-01">Kue</seg> <seg xml:id="L098-02-05-02">va'a</seg> <seg xml:id="L098-02-05-03">kiku</seg> <seg xml:id="L098-02-05-04">ku</seg><c>.</c></seg>
                  <seg xml:id="L098-02-06"><seg xml:id="L098-02-06-01">Kini</seg> <seg xml:id="L098-02-06-02">xeen</seg> <seg xml:id="L098-02-06-03">na'a</seg> <seg xml:id="L098-02-06-04">sa'ma</seg> <seg xml:id="L098-02-06-05">sa'u</seg><c>.</c></seg>
                  <seg xml:id="L098-02-07"><seg xml:id="L098-02-07-01">Ntakinchai</seg> <seg xml:id="L098-02-07-02">sa'mi</seg> <seg xml:id="L098-02-07-03">kiku</seg> <seg xml:id="L098-02-07-04">mii</seg> <seg xml:id="L098-02-07-05">tiku</seg> <seg xml:id="L098-02-07-06">kuachi</seg><c>.</c></seg>
                  <seg xml:id="L098-02-08"><seg xml:id="L098-02-08-01">Sara</seg> <seg xml:id="L098-02-08-02">ntsinu</seg> <seg xml:id="L098-02-08-03">sa'mi</seg> <seg xml:id="L098-02-08-04">nikuu</seg> <seg xml:id="L098-02-08-05">yakua</seg> <seg xml:id="L098-02-08-06">xeen</seg> <seg xml:id="L098-02-08-07">ña</seg><c>.</c></seg>
                  <seg xml:id="L098-02-09"><seg xml:id="L098-02-09-01">Sara</seg> <seg xml:id="L098-02-09-02">nikachi</seg> <seg xml:id="L098-02-09-03">tsoko</seg> <seg xml:id="L098-02-09-04">ka</seg><c>:</c></seg>
                  <seg xml:id="L098-02-10"><seg xml:id="L098-02-10-01">Nako'on</seg> <seg xml:id="L098-02-10-02">viko</seg><c>.</c></seg>
                  <seg xml:id="L098-02-11"><seg xml:id="L098-02-11-01">Ma</seg> <seg xml:id="L098-02-11-02">ku'un</seg> <seg xml:id="L098-02-11-03">yu</seg><c>,</c> <seg xml:id="L098-02-11-04">vari</seg> <seg xml:id="L098-02-11-05">kini</seg> <seg xml:id="L098-02-11-06">xeen</seg> <seg xml:id="L098-02-11-07">kaa</seg> <seg xml:id="L098-02-11-08">sa'ma</seg><c>.</c></seg>
                  <seg xml:id="L098-02-12"><seg xml:id="L098-02-12-01">Nchaa</seg> <seg xml:id="L098-02-12-02">ntakacha</seg> <seg xml:id="L098-02-12-03">ña</seg> <seg xml:id="L098-02-12-04">nikachi</seg> <seg xml:id="L098-02-12-05">yakui</seg> <seg xml:id="L098-02-12-06">ka</seg><c>.</c></seg>
                  <seg xml:id="L098-02-14"><seg xml:id="L098-02-13-01">Kue</seg> <seg xml:id="L098-02-13-02">sa'a</seg> <seg xml:id="L098-02-13-03">ni</seg><c>,</c> <seg xml:id="L098-02-13-04">nako'on</seg> <seg xml:id="L098-02-13-05">nikachia</seg><c>.</c></seg>                            
               </p>
            </div>             
            <div xml:id="L098-03">
               <p>
                  <seg xml:id="L098-03-01"><seg xml:id="L098-03-01-01">Ntatsai</seg> <seg xml:id="L098-03-01-02">vi'i</seg> <seg xml:id="L098-03-01-03">kisi</seg> <seg xml:id="L098-03-01-04">si</seg> <seg xml:id="L098-03-01-05">ña</seg> <seg xml:id="L098-03-01-06">vitsi</seg><c>.</c></seg>
                  <seg xml:id="L098-03-02"><seg xml:id="L098-03-02-01">Sara</seg> <seg xml:id="L098-03-02-02">ntasta'in</seg> <seg xml:id="L098-03-02-03">ñu'i</seg> <seg xml:id="L098-03-02-04">takua</seg> <seg xml:id="L098-03-02-05">kutunia</seg><c>.</c></seg>
                  <seg xml:id="L098-03-03"><seg xml:id="L098-03-03-01">Nikuyachia</seg> <seg xml:id="L098-03-03-02">nuu</seg> <seg xml:id="L098-03-03-03">ituu</seg> <seg xml:id="L098-03-03-04">tsio</seg> <seg xml:id="L098-03-03-05">ña</seg> <seg xml:id="L098-03-03-06">ra</seg> <seg xml:id="L098-03-03-07">niki'vi</seg> <seg xml:id="L098-03-03-08">yaa</seg> <seg xml:id="L098-03-03-09">sa'mi</seg><c>.</c></seg>
                  <seg xml:id="L098-03-04"><seg xml:id="L098-03-04-01">Tsa'a</seg> <seg xml:id="L098-03-04-02">ña</seg> <seg xml:id="L098-03-04-03">ntiñu'u</seg> <seg xml:id="L098-03-04-04">kui</seg> <seg xml:id="L098-03-04-05">yee</seg> <seg xml:id="L098-03-04-06">vichi</seg><c>.</c></seg>
                  <graphic url="L098-03-armadillo.png"/>
               </p>                     
            </div>   
         </body>
      </text>
   </TEI>
Nixi ntsikoo sa'ma yakui ka Como el armadillo obtuvo su caparazón María Gómez Hernández Gisela Beckmann María M. Nieves TEI Encoding Jack T. Bowers

This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of: archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix). Encoded documents used with the permission of original authors.

Nixi ntsikoo sa'ma yakui ka Gómez Hernández, María; Beckmann, Gisela; Nieves, María M..2007. (2nd ed.). Instituto Lingüístico de Verano, A.C. Tlalpan, D.F., México Obtained from:http://www-01.sil.org/mexico/mixteca/mixtepec/L098-Armadillo-mix.htm
@mode:'w' means that the primary means of utilising this document is by reading it's written form; this document is labeled "original" because the XML-TEI markup of it's contents have not made any alterations to the original content; not applicable

Un armadillo se puso a tejer ropa para cubrirse. Un día se fue a una fiesta con un tlacuache, se emborrachó y se ensució. Es por eso que los armadillos son de color gris.

In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate: i) the entire sentences from original document (first level); ii) the individual lexical items contained within the sentences (second level); Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element; In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated: a sentence that begins with: "¿" (and ends with: "?") can be immediately recognized as sentence type "question/interrogative"; Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting

project description to come

PEDAGOGICAL PEDAGOGICAL:INTERACTIVE PEDAGOGICAL:REFERENCE FICTION FICTION:FANTASY FICTION:REALISTIC FOLKLORE

This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety. The encoding of this portion of the MIX Language resources can be divided into the following components: i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties; ii) Annotation and glossing of lexical information and structures in documents; tasks include; glossing of orthographical lexical items; assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.); linking corresponding bilingual items (Spanish-Mixtec) within each document; entry of analyzed, glossed and categorized lexical information into TEI-Dictionary; link orthographical forms of lexical items with phonetic transcriptions of spoken forms; iii) Classification of Document Types and Ontology Linking; replace current SIL document classification model#tax.sil-mix with established and expansible document system which can be used to classify any new document or literature produced in the language in the future; as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);

Ntsio iin kii iin yakui ntivixi , yii yuku koo sa'mi . Sara ntakaninia nixi sava'i sa'mi . Ntantuki yu'a sara nikitsai kikui . Kiku xeen kiku nchu'i ñantu'i yuku ka . Nikitsaa iin tsoko sara nikachia saan : ¿ Nchii sa'u , mpaa luu ? - Kiku sa'ma nikachi yakui ka . - Sara nikachi tsoko ka : Kua'a sa'mu , nakiku nachinchee yo . Natsinu sa'mu ra nako'on viko .

Sara nchinchee tsoko ka yakui ka . Kiku kueyi vari yakui ka ra koo sa'mi . Cha nikitsaa tsoko ka kikui tiku nani nchu'a . Sara nikachi yakui ka : Kue va'a kiku ku . Kini xeen na'a sa'ma sa'u . Ntakinchai sa'mi kiku mii tiku kuachi . Sara ntsinu sa'mi nikuu yakua xeen ña . Sara nikachi tsoko ka : Nako'on viko . Ma ku'un yu , vari kini xeen kaa sa'ma . Nchaa ntakacha ña nikachi yakui ka . Kue sa'a ni , nako'on nikachia .

Ntatsai vi'i kisi si ña vitsi . Sara ntasta'in ñu'i takua kutunia . Nikuyachia nuu ituu tsio ña ra niki'vi yaa sa'mi . Tsa'a ña ntiñu'u kui yee vichi .

Toolbox

Themes:

Nixi ntsikoo sa'ma yakui ka Como el armadillo obtuvo su caparazón María Gómez Hernández Gisela Beckmann María M. Nieves TEI Encoding Jack T. Bowers

This is an XML-TEI markup of the original publications by SIL Mexico for the purposes of: archival and for the creation of a body of Mixtepec-Mixtec language resources (LR) (ISO: mix). Encoded documents used with the permission of original authors.

Nixi ntsikoo sa'ma yakui ka Gómez Hernández, María; Beckmann, Gisela; Nieves, María M..2007. (2nd ed.). Instituto Lingüístico de Verano, A.C. Tlalpan, D.F., México Obtained from:http://www-01.sil.org/mexico/mixteca/mixtepec/L098-Armadillo-mix.htm
@mode:'w' means that the primary means of utilising this document is by reading it's written form; this document is labeled "original" because the XML-TEI markup of it's contents have not made any alterations to the original content; not applicable

Un armadillo se puso a tejer ropa para cubrirse. Un día se fue a una fiesta con un tlacuache, se emborrachó y se ensució. Es por eso que los armadillos son de color gris.

In this document (and in all others in the SIL collection) the 'seg' element is used to encapsulate: i) the entire sentences from original document (first level); ii) the individual lexical items contained within the sentences (second level); Within the sentence level "seg" (segment) elements all punctuation characters (".", ";", ":", "¿", "?", "!", etc.) are encapsulated in the "c" (character) element; In future stages of the project, each of these punctuation characters will be formally linked with the linguistic categories with which it is associated: a sentence that begins with: "¿" (and ends with: "?") can be immediately recognized as sentence type "question/interrogative"; Where the page divisions in the original documents are, this encoding uses "div" (division) elements; this is to separate the linguistic, and knowledge data from formatting

project description to come

PEDAGOGICAL PEDAGOGICAL:INTERACTIVE PEDAGOGICAL:REFERENCE FICTION FICTION:FANTASY FICTION:REALISTIC FOLKLORE

This portion of the lexicography project deals with the only known body of literature in Mixtepec-Mixtec language which is comprised of series of 34* orthographic children's texts published by SIL Mexico (Summer Institute of Linguistics). The target audience of these documents is primarily young MIX speakers and are designed for use as classroom handouts and/or lessons for primary/elementary school-level. The topical content of these publications contain both culturally specific, and non-culturally specific subject matter supplemented by illustrations. Topics covered in these documents include: vocabulary, mathematics, telling time, geography, seasons, weather, local agricultural practices, fables, local Mixtec legends amongst others. These publications make up the second most important collection in this project and they remain the only source of reference for examples of how the working orthography of the Mixtec language is applied to the Mixtepec variety. The encoding of this portion of the MIX Language resources can be divided into the following components: i) TEI-XML markup of documents that supports reuse and extension within this project and/or for other interested parties; ii) Annotation and glossing of lexical information and structures in documents; tasks include; glossing of orthographical lexical items; assignment of lexical properties to sentences, phrases and lexical items (morpho-/syntactic; pragmatic; semantic etc.); linking corresponding bilingual items (Spanish-Mixtec) within each document; entry of analyzed, glossed and categorized lexical information into TEI-Dictionary; link orthographical forms of lexical items with phonetic transcriptions of spoken forms; iii) Classification of Document Types and Ontology Linking; replace current SIL document classification model with established and expansible document system which can be used to classify any new document or literature produced in the language in the future; as per the goal of the overall project, the use of language-specific glosses (including 'Interlingua') will be replaced by grounding the semantic profile of each lexical item by referencing ontological entities (from existing ontologies such as DBpedia, SKOS, etc.);

Ntsio iin kii iin yakui ntivixi , yii yuku koo sa'mi . Sara ntakaninia nixi sava'i sa'mi . Ntantuki yu'a sara nikitsai kikui . Kiku xeen kiku nchu'i ñantu'i yuku ka . Nikitsaa iin tsoko sara nikachia saan : ¿ Nchii sa'u , mpaa luu ? - Kiku sa'ma nikachi yakui ka . - Sara nikachi tsoko ka : Kua'a sa'mu , nakiku nachinchee yo . Natsinu sa'mu ra nako'on viko .

Sara nchinchee tsoko ka yakui ka . Kiku kueyi vari yakui ka ra koo sa'mi . Cha nikitsaa tsoko ka kikui tiku nani nchu'a . Sara nikachi yakui ka : Kue va'a kiku ku . Kini xeen na'a sa'ma sa'u . Ntakinchai sa'mi kiku mii tiku kuachi . Sara ntsinu sa'mi nikuu yakua xeen ña . Sara nikachi tsoko ka : Nako'on viko . Ma ku'un yu , vari kini xeen kaa sa'ma . Nchaa ntakacha ña nikachi yakui ka . Kue sa'a ni , nako'on nikachia .

Ntatsai vi'i kisi si ña vitsi . Sara ntasta'in ñu'i takua kutunia . Nikuyachia nuu ituu tsio ña ra niki'vi yaa sa'mi . Tsa'a ña ntiñu'u kui yee vichi .