<?xml version="1.0" encoding="UTF-8"?><mets:mets xmlns:mets="http://www.loc.gov/METS/" xmlns:suj="http://www.theses.fr/namespace/sujets" xmlns:tef="http://www.abes.fr/abes/documents/tef" xmlns:local="http://www.local.univ.fr/theses" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:metsRights="http://cosimo.stanford.edu/sdr/metsrights/" xmlns:xlink="http://www.w3.org/1999/xlink" xsi:schemaLocation="http://www.loc.gov/METS/ http://www.abes.fr/abes/documents/stef/stef_schemas.xsd" ID="STRA_ORI_OAI_35" OBJID="ORI_OAI_35">
<mets:dmdSec ID="STRA.IMPORT.DESCRIPTION_BIBLIOGRAPHIQUE" CREATED="2022-01-12T17:45:03">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_desc_these">
<mets:xmlData>
<tef:thesisRecord>
<dc:title xml:lang="fr">Système de conjugaison, reconnaissance morphosyntaxique statistique et lemmatisation automatique de la classe verbale du grec moderne standard</dc:title>
<dcterms:alternative xml:lang="en">Conjugation system, statistical morphosyntactic recognition and automatic lemmatization of the modern Greek standard verb class</dcterms:alternative>
<dc:subject xml:lang="fr">Grec (langue) moderne</dc:subject>
<dc:subject xml:lang="en">No keywords in English</dc:subject>
<dc:subject xsi:type="dcterms:DDC">006.3</dc:subject>
<dc:subject xsi:type="dcterms:DDC">489.3</dc:subject>
<tef:sujetRameau xml:lang="fr">
<tef:vedetteRameauNomCommun>
<tef:elementdEntree autoriteSource="Sudoc" autoriteExterne="02722841X">Grec (langue) moderne</tef:elementdEntree>
<tef:subdivision autoriteExterne="027253139" autoriteSource="Sudoc" type="subdivisionDeForme">Thèses et écrits académiques</tef:subdivision>
</tef:vedetteRameauNomCommun>
</tef:sujetRameau>
<dcterms:abstract xml:lang="fr">Dans cette thèse, nous présentons les résultats ainsi que la méthodologie adoptée pour la création d'un système d'analyse morphosyntaxique automatique et de lemmatisation sans dictionnaire des formes verbales monolexicales du grec moderne standard. Avec le modèle rétrograde MOSAIC (Koktova 1985) sur le tchèque comme point de départ, ainsi que d'autres modèles similaires sur le français (Caradec &amp; Saada 1982) et le grec moderne (Lexifanis, Kotsanis &amp; Maistros 1985), notre recherche a couvert 8.485 lexèmes verbaux grecs, en prenant les données des dictionnaires les plus récents (Kyriacopoulou 1990, Iordanidou 1992, Kriaras 1995, Babiniotis 1998, Institut d'Études Néohelléniques 1998). Il a ainsi été créé : un nouveau système de conjugaison de 385 modèles, qui sert à la génération automatique de tous les morphèmes lexicaux/radicaux ainsi que de toutes les formes flexionnelles monolexicales ; une base de données des séquences graphémiques finales, qui permet l'attribution automatique d'un modèle de conjugaison à n'importe quel lemme verbal ; une base de données de 151.527 séquences graphémiques finales, statistiquement établie et manuellement perfectionnée, qui peut s'employer pour la reconnaissance automatique de n'importe quelle forme verbale monolexicale ; et un système de règles morphophonologiques rétrogrades utilisées pour la lemmatisation linéaire des formes flexionnelles, qui fonctionne sur la base du nouveau système de conjugaison de 385 modèles.</dcterms:abstract>
<dcterms:abstract xml:lang="en">In this dissertation we present the final results of our 10-year research on the Modern Greek verbal system. The objective of the research has been twofold: i) the development of a statistical database containing word-final grapheme sequences, which, on the basis of Koktova's (1985) retrograde analysis model MOSAIC, allow for the automatic morphosyntactic recognition (tagging) of all monolexical verbal forms of the language without any access to relevant electronic lexicons, and ii) the development of a verb lemmatization morphophonological rule system, both providing various applications in all major areas of Text Processing as well as Teaching of Modern Greek Standard. Within this framework, 24Mb of verbal linguistic data have been collected, generated and classified automatically, and manually checked and enriched. These have been submitted to the University in the form of an appendix. Only representative extracts appear in the dissertation. More specifically, they consist of: a) a file of 8,485 Modern Greek verbal lemmas, developed in accordance with the evidence provided by the most recent dictionaries of the language (Dictionary of the Modern Greek Dhemotiki, Kriaras 1995, Greek Dictionary, Tegopoulos-Fytrakis 1993, Abridged Dictionary of Modern Greek, Pagoulatou Publ.
1991, Dictionary of Modern Greek, Babiniotis 1998 and Dictionary of Modern Greek Koine, Triantafyllidis Inst., Aristotle University of Thessaloniki 1998); b) a new conjugation system of 385 paradigmatic models, which allows for the automatic generation of all verbal stems and monolexical forms; c) a file of 1st person singular word-final grapheme sequences, which allows for the automatic attribution of paradigmatic model codes to any verbal lemmas of the language; d) a file of 27,383 verbal stems characterized solely on the basis of their conjugation model and their permissible suffix set; e) 103 files of 519,694 automatically generated and classified verbal forms; f) 17 files of 151,527 word-final grapheme sequences, which declare the conjugation model, morphosyntactic content, absolute frequency and lemmatization code of verbal forms; g) a linear lemmatization morphophonological rule system, which functions on the basis of the newly developed 385 conjugation model system. More analytically, in the 1st chapter we discuss the role which morphological lexicons and statistical approaches play in the automatic morphosyntactic recognition of word tokens (tagging). In the 2nd chapter we discuss the Modern Greek verb system, including a brief description of the vocabulary of the language (Katharevousa-Dhemotiki): the morphosyntactic categories marked, the conjugation system as presented in various grammatical descriptions over the last 40 years, the verbal stems and inflectional affixes involved, the stress pattern, the external/internal augment and reduplication occurrences, as well as the 2 most recently developed conjugation systems (Kyriacopoulou 1990 and Iordanidou 1992), in an effort to account for the need to develop a new conjugation system. In the 3rd chapter we describe the methodology employed for the collection and processing of data, whereas in the 4th chapter we present extensive extracts from the 10 databases developed in total.
A pilot application of the proposed language tool is available on the Internet and can be found on the site of the Language and Education Technology Laboratory of the University of Athens Informatics/Telecommunications Department.</dcterms:abstract>
<dc:type>Electronic Thesis or Dissertation</dc:type>
<dc:type xsi:type="dcterms:DCMIType">Text</dc:type>
<dc:language xsi:type="dcterms:RFC3066">fr</dc:language>
<dcterms:spatial xml:lang="fr">France</dcterms:spatial>
</tef:thesisRecord>
</mets:xmlData>
</mets:mdWrap>
</mets:dmdSec>
<mets:dmdSec ID="STRA.IMPORT.VERSION_COMPLETE.DESCRIPTION.EDITION_ARCHIVAGE" CREATED="2022-01-12T17:45:03">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_desc_edition">
<mets:xmlData>
<tef:edition>
<dcterms:medium xsi:type="dcterms:IMT">PDF</dcterms:medium>
<dc:identifier xsi:type="dcterms:URI">https://publication-theses.unistra.fr/public/theses_doctorat/2006/LEMBESSI_Penelope_2006.pdf</dc:identifier>
<dcterms:extent/>
<tef:editeur>
<tef:nom>Université de Strasbourg</tef:nom>
<tef:place>Strasbourg</tef:place>
</tef:editeur>
<dcterms:issued xsi:type="dcterms:W3CDTF"/>
</tef:edition>
</mets:xmlData>
</mets:mdWrap>
</mets:dmdSec>
<mets:amdSec>
<mets:techMD ID="STRA.IMPORT.ADMINISTRATION">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_admin_these">
<mets:xmlData>
<tef:thesisAdmin>
<tef:auteur>
<tef:nom>Lembessi</tef:nom>
<tef:prenom>Zacharoula-Pénélope</tef:prenom>
<tef:dateNaissance>1900-01-02T00:00:00</tef:dateNaissance>
<tef:nationalite scheme="ISO-3166-1">FR</tef:nationalite>
<tef:autoriteExterne autoriteSource="Sudoc">086806718</tef:autoriteExterne>
</tef:auteur>
<dc:identifier xsi:type="tef:nationalThesisPID">http://www.theses.fr/2006STR20003</dc:identifier>
<dc:identifier xsi:type="tef:NNT">2006STR20003</dc:identifier>
<dcterms:dateAccepted xsi:type="dcterms:W3CDTF">2006-03-30T00:00:00</dcterms:dateAccepted>
<tef:thesis.degree>
<tef:thesis.degree.discipline xml:lang="fr">Linguistique</tef:thesis.degree.discipline>
<tef:thesis.degree.grantor>
<tef:nom>Université Marc Bloch (Strasbourg)</tef:nom>
<tef:autoriteExterne autoriteSource="Sudoc">026438763</tef:autoriteExterne>
</tef:thesis.degree.grantor>
<tef:thesis.degree.level>Doctorat</tef:thesis.degree.level>
<tef:thesis.degree.name xml:lang="fr">Docteur es</tef:thesis.degree.name>
</tef:thesis.degree>
<tef:theseSurTravaux>non</tef:theseSurTravaux>
<tef:avisJury>oui</tef:avisJury>
<tef:directeurThese>
<tef:nom>Eytan</tef:nom>
<tef:prenom>Michel</tef:prenom>
<tef:autoriteExterne autoriteSource="Sudoc">030032253</tef:autoriteExterne>
</tef:directeurThese>
<tef:ecoleDoctorale>
<tef:nom>École doctorale Humanités (Strasbourg ; 2009-....)</tef:nom>
<tef:autoriteExterne autoriteSource="None">ED520</tef:autoriteExterne>
<tef:autoriteExterne autoriteSource="Sudoc">156498324</tef:autoriteExterne>
</tef:ecoleDoctorale>
<tef:oaiSetSpec>ddc:004</tef:oaiSetSpec>
<tef:oaiSetSpec>ddc:480</tef:oaiSetSpec>
</tef:thesisAdmin>
</mets:xmlData>
</mets:mdWrap>
</mets:techMD>
<mets:rightsMD ID="STRA.IMPORT.DROITS_UNIVERSITE">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_droits_etablissement_these">
<mets:xmlData>
<metsRights:RightsDeclarationMD RIGHTSCATEGORY="CONTRACTUAL">
<metsRights:Context CONTEXTCLASS="GENERAL PUBLIC">
<metsRights:Permissions DISPLAY="true" DUPLICATE="true" PRINT="true" COPY="true" MODIFY="false" DELETE="false"/>
</metsRights:Context>
<metsRights:Context CONTEXTCLASS="INSTITUTIONAL AFFILIATE">
<metsRights:Permissions DISPLAY="true" DUPLICATE="true" PRINT="true" COPY="true" MODIFY="false" DELETE="false"/>
</metsRights:Context>
</metsRights:RightsDeclarationMD>
</mets:xmlData>
</mets:mdWrap>
</mets:rightsMD>
<mets:rightsMD ID="STRA.IMPORT.DROITS_DOCTORANT">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_droits_auteur_these">
<mets:xmlData>
<metsRights:RightsDeclarationMD RIGHTSCATEGORY="CONTRACTUAL"/>
</mets:xmlData>
</mets:mdWrap>
</mets:rightsMD>
<mets:rightsMD ID="STRA.IMPORT.VERSION_COMPLETE.DROITS">
<mets:mdWrap MDTYPE="OTHER" OTHERMDTYPE="tef_droits_version">
<mets:xmlData>
<metsRights:RightsDeclarationMD RIGHTSCATEGORY="CONTRACTUAL"/>
</mets:xmlData>
</mets:mdWrap>
</mets:rightsMD>
</mets:amdSec>
<mets:fileSec>
<mets:fileGrp ID="FGrID1" USE="diffusion">
<mets:file ID="FID1" MIMETYPE="application/pdf" USE="maitre">
<mets:FLocat LOCTYPE="URL" xlink:href="https://publication-theses.unistra.fr/public/theses_doctorat/2006/LEMBESSI_Penelope_2006.pdf"/>
</mets:file>
</mets:fileGrp>
</mets:fileSec>
<mets:structMap TYPE="logical">
<mets:div TYPE="THESE" CONTENTIDS="http://mon-univ.fr/uid/uds-ori-283404/oeuvre" DMDID="STRA.IMPORT.DESCRIPTION_BIBLIOGRAPHIQUE" ADMID="STRA.IMPORT.ADMINISTRATION STRA.IMPORT.DROITS_UNIVERSITE STRA.IMPORT.DROITS_DOCTORANT">
<mets:div TYPE="VERSION_COMPLETE" CONTENTIDS="http://mon-univ.fr/uid/uds-ori-283404/oeuvre/version" ADMID="STRA.IMPORT.VERSION_COMPLETE.DROITS">
<mets:div TYPE="EDITION" CONTENTIDS="http://mon-univ.fr/uid/uds-ori-283404/oeuvre/version/edition" DMDID="STRA.IMPORT.VERSION_COMPLETE.DESCRIPTION.EDITION_ARCHIVAGE"/>
</mets:div>
</mets:div>
</mets:structMap>
</mets:mets>
