This presentation relates the ongoing construction of a multilayer corpus of Mbyá (Tupi Guarani: Argentina, Brazil, Paraguay). It will discuss (i) corpus composition (ii) ethical, linguistic and technological issues in corpus design and annotation, and (iii) usefulness for leveraging legacy texts in documenting language variation and recent evolution. (session 1.1.6