Treebank Statistics: UD_Old_Occitan-CorAG: POS Tags: NUM
There is 1 NUM lemma (7%), 110 NUM types (2%) and 436 NUM tokens (1%).
Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: tres, .vi., dus, .LXVI., .v., .xx., .i., .XXX., .x., .XL.
The 10 most frequent ambiguous lemmas: _ (NOUN 8359, ADP 6278, VERB 5468, DET 5372, PUNCT 4269, PRON 3493, CCONJ 3134, SCONJ 2046, ADV 1984, PROPN 1865, ADJ 1418, AUX 1213, NUM 436, PART 54)
The 10 most frequent ambiguous types: tres (NUM 41, ADV 2), .i. (NUM 8, DET 2), dos (NUM 9, PRON 2), XX (NUM 7, ADJ 1), une (DET 27, NUM 7, PRON 2), X (NUM 6, ADJ 1), XV (NUM 4, ADJ 1), VII (NUM 3, ADJ 1), III (ADJ 2, NUM 2), un (DET 112, PRON 23, NUM 2)
- tres
  - NUM 41: Item tot thianser deu aver paus per tres dies abantz que ost de exir ;
  - ADV 2: Sapiatz totz et sengles , nos aver vist , aver legit et diligemment aver regardat una inquisition feyta no y a gayres , de mandament de- -lo tres excellent et tres puissant seynhor Henric , rey d’ Anglaterra , per Johan , qui fo abat per La-Gracia-de-Diu et per mossen Hubert Hose , cavaley , suber las libertatz de la terra de Entre-dos-Mars et de- -los exces et alienations de- -los bayliatges et dreyts de- -lo subredeyt Rey en ladeyta terra , so es a ssaber en lo die de dissapta , prope la festa de Saincta-Agata , verges comenssada et terminada en la dominica de la septuagesima , en l’ an de nostre seynhor M (IIe) XXX V de laqual la tenor s’ ensec de mot à mot :
- .i.
  - NUM 8: e lodiit en Fortaner que- -us deu saubar e segui enta que part se boilhen anadere de .i. die .
  - DET 2: E que- -los dam per for que si nulhs hom plage de plage legal autre hom , la lei de- -lo plagad es .c. e .l. fl , e la lei nostre .lxv. fl , si es pravade leialmentz per testimonis , o per .i. judge jurad qui leialmentz la age menade e gardeade .
- dos
- XX
- une
- X
- XV
- VII
  - NUM 3: Item , aucuns homes de Laseuba , per lo usatge de- -lo bosc de Capianc VII cair de leynha o de busqua a obs de- -los sercles .
  - ADJ 1: En testimoni de laquau causa nos trametem questas nostras letras a vos autres , dadas vert o a Sulwerk lo VII jorn de hagost , en l- -an XLII de- -lo regne de- -lo seynhor Rey nostre payre .
- III
  - ADJ 2: Vert Westm. lo III jorn de agost .
  - NUM 2: A totz a- -losquaus las presens letras vindran , sapiat que nos autreyam a- -los nostres homes de Entre-dos-Mars de la diocesa de Bordeu , losquaus son tengudz a nos o a nostre prebost de aquerra terra a las albergadas , que aqueras medeyssas albergadas sian recebudas am edz per lodeyt prebost o autre per nostre nome en los locx o vilatges en losquaus de temps antic an acostumat esser recebut tant solament una vequada l’ an , am III homes a cavat et III a peys , sens plus . Ayssique sian refresquit competentment de viandas , et de autres necessarias , ayssi cum a acostumat .
- un
  - DET 112: Judyat fo per un homi de Gurtz et autre de Bideren qui- -s deffene .
  - PRON 23: l’ un es per man de senhor mayor ,
  - NUM 2: fan a luy questz dreytz homanatge et de certas causas que tenen de luy fan a luy usatge o ost de I. cavaley , o de dos , o de escudey et am armas o arnes , certas et determinadas ayssi cum de antiqua costuma es cert et determinat sinauque ssia qui ten las causas que deben lo ost de un cavaley o de dos sino que sia tengut per privilegi segont la part que ten ,
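The list of ambiguous types above can be thought of as the result of grouping tokens by word form and keeping the forms that occur with more than one tag. The following is a minimal sketch of that idea, not the script that produced this page: the function name `ambiguous_types` and the toy `(form, UPOS)` pairs are hypothetical, and forms are assumed to be compared case-sensitively.

```python
from collections import Counter, defaultdict


def ambiguous_types(tokens):
    """Return {form: Counter of UPOS tags} for forms seen with more than one tag.

    `tokens` is an iterable of (form, upos) pairs; forms are compared
    case-sensitively (an assumption made for this sketch).
    """
    tags_by_form = defaultdict(Counter)
    for form, upos in tokens:
        tags_by_form[form][upos] += 1
    return {form: tags for form, tags in tags_by_form.items() if len(tags) > 1}


# Toy input for illustration only; these are not the treebank's real counts.
sample = [
    ("tres", "NUM"), ("tres", "NUM"), ("tres", "ADV"),
    ("dus", "NUM"),
    ("un", "DET"), ("un", "NUM"),
]

result = ambiguous_types(sample)
for form, tags in result.items():
    # Print tags in descending frequency, mirroring the "NUM 41, ADV 2" layout.
    print(form, "(" + ", ".join(f"{tag} {n}" for tag, n in tags.most_common()) + ")")
```

In this toy input, `dus` is dropped because it occurs with a single tag, while `tres` and `un` are kept.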
Morphology
The form / lemma ratio of NUM is 110.000000 (the average of all parts of speech is 457.357143).
The 1st highest number of forms (110) was observed with the lemma “_”: (IIe), -lx., .CL., .CLX., .II.LV, .II.LXXX.VI, .IIC., .IIC.LII., .III.XCVIII., .IIIC., .IIIC.LXIIII., .IIII., .IIIIte., .IIIItre., .IIIes., .LV., .LXVI, .LXVI., .Lxvi., .M., .M.II.LXXXVIII.LXXXVIII., .VIC., .XII., .XIIII., .XL., .XXX., .XXXta., .c., .ccc., .i., .ii., .iii., .iiiien., .ix., .l., .lx., .lxv., .v., .ve., .vi., .viii., .x., .xv., .xviii., .xx., .xx.vii., I, I., II., III, IIIC., IIIC.LVIII, IIII, IX., L, LX, LXV, LXX, LXXIX., LXXXVIII., M, M., M.CC.XXXVI, MCC, MCCXXX, V, V., VI., VII, VIII, X, X., XCVIII., XII., XL, XLII, XV, XVII, XX, XXII., XXX, bint, cens, cent, des, dissapta, dissapte, doas, dos, dues, dus, mil, milia, miu, nau, quaranta, quatre, quinza, quoate, sed, seis, senglas, seys, sinc, tres, un, una, une, ung, vint.
NUM does not occur with any features.
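The form / lemma ratio reported above is consistent with dividing the number of distinct forms by the number of distinct lemmas (110 types over the single lemma “_”). A small sketch under that assumed definition follows; the function name `form_lemma_ratio` and the toy pairs are hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    This definition is an assumption that matches the figures on this page.
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Toy data: three tokens, two distinct forms, one placeholder lemma.
toy = [("tres", "_"), ("dus", "_"), ("tres", "_")]
toy_ratio = form_lemma_ratio(toy)

# With the counts reported for NUM (110 types, 1 lemma):
num_ratio = 110 / 1
print(toy_ratio, num_ratio)
```

Because every NUM token carries the same lemma “_”, the ratio simply equals the number of NUM types.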
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (365; 84% instances), conj (56; 13% instances), nmod (3; 1% instances), obj (3; 1% instances), obl (3; 1% instances), nsubj (2; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances)
Parents of NUM nodes belong to 3 different parts of speech: NOUN (376; 86% instances), NUM (49; 11% instances), VERB (11; 3% instances)
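The percentages in the two lines above appear to be each count divided by the 436 NUM tokens, rounded to the nearest whole percent. The sketch below recomputes them from the counts listed on this page; the rounding rule is an assumption, and the variable names are illustrative.

```python
# Counts copied from the relation and parent-POS lines above.
relation_counts = {
    "nummod": 365, "conj": 56, "nmod": 3, "obj": 3, "obl": 3,
    "nsubj": 2, "advcl": 1, "appos": 1, "ccomp": 1, "orphan": 1,
}
parent_counts = {"NOUN": 376, "NUM": 49, "VERB": 11}


def shares(counts):
    """Return each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


relation_shares = shares(relation_counts)
parent_shares = shares(parent_counts)

# Both breakdowns should account for every NUM token.
total_relations = sum(relation_counts.values())
total_parents = sum(parent_counts.values())
print(total_relations, total_parents)
print(relation_shares)
print(parent_shares)
```

A share shown as 0% therefore means fewer than 0.5% of instances, not zero occurrences.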
345 (79%) NUM nodes are leaves.
58 (13%) NUM nodes have one child.
23 (5%) NUM nodes have two children.
10 (2%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
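The child-degree figures above can be derived from the head index of each token: a node's degree is the number of tokens that name it as their head. The following sketch assumes 1-based head indices with 0 for the root, as in CoNLL-U; the function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(heads, is_target):
    """Count target nodes by number of children, capping the bucket at 3.

    heads[i] is the 1-based head index of token i+1 (0 marks the root);
    is_target[i] is True when token i+1 has the tag of interest.
    Bucket 3 stands for "three or more children".
    """
    n_children = Counter(h for h in heads if h > 0)
    histogram = Counter()
    for index, target in enumerate(is_target, start=1):
        if target:
            histogram[min(n_children[index], 3)] += 1
    return histogram


# Toy sentence with four tokens; tokens 1 and 3 play the role of NUM nodes.
heads = [2, 0, 2, 3]
is_num = [True, False, True, False]
hist = child_degree_histogram(heads, is_num)
print(dict(hist))

# The four buckets reported on this page cover all NUM tokens.
reported_total = 345 + 58 + 23 + 10
print(reported_total)
```

In the toy sentence, token 1 is a leaf and token 3 has one child (token 4), so each of those buckets receives one node.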
Children of NUM nodes are attached using 14 different relations: conj (57; 40% instances), cc (34; 24% instances), case (18; 13% instances), punct (11; 8% instances), det (6; 4% instances), nmod (4; 3% instances), advmod (2; 1% instances), cop (2; 1% instances), mark (2; 1% instances), nsubj (2; 1% instances), acl (1; 1% instances), amod (1; 1% instances), nummod (1; 1% instances), orphan (1; 1% instances)
Children of NUM nodes belong to 11 different parts of speech: NUM (49; 35% instances), CCONJ (34; 24% instances), ADP (18; 13% instances), PUNCT (11; 8% instances), ADV (9; 6% instances), NOUN (8; 6% instances), DET (6; 4% instances), AUX (2; 1% instances), SCONJ (2; 1% instances), VERB (2; 1% instances), ADJ (1; 1% instances)