Treebank Statistics: UD_Old_Occitan-CorAG: POS Tags: NUM
There are 1 NUM lemmas (7%), 118 NUM types (2%) and 477 NUM tokens (1%).
Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 9 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: _
The 10 most frequent NUM types: tres, .vi., dus, .LXVI., .v., .xx., dues, mil, .i., dos
The 10 most frequent ambiguous lemmas: _ (NOUN 9917, ADP 7391, VERB 6408, DET 6335, PUNCT 4963, PRON 3913, CCONJ 3587, ADV 2277, SCONJ 2220, PROPN 1872, ADJ 1680, AUX 1446, NUM 477, PART 53)
The 10 most frequent ambiguous types: tres (NUM 56, ADV 2), .i. (NUM 8, DET 2), XX (NUM 7, ADJ 1), une (DET 27, NUM 7, PRON 2), X (NUM 6, ADJ 1), XV (NUM 4, ADJ 1), VII (NUM 3, ADJ 1), III (ADJ 2, NUM 2), cens (NOUN 2, NUM 1), un (DET 112, PRON 23, NUM 2)
- tres
- NUM 56: Car sus- -ço à vous commettem nos tres vegades ,
- ADV 2: Sapiatz totz et sengles , nos aver vist , aver legit et diligemment aver regardat una inquisition feyta no y a gayres , de mandament de- -lo tres excellent et tres puissant seynhor Henric , rey d’ Anglaterra , per Johan , qui fo abat per La-Gracia-de-Diu et per mossen Hubert Hose , cavaley , suber las libertatz de la terra de Entre-dos-Mars et de- -los exces et alienations de- -los bayliatges et dreyts de- -lo subredeyt Rey en ladeyta terra , so es a ssaber en lo die de dissapta , prope la festa de Saincta-Agata , verges comenssada et terminada en la dominica de la septuagesima , en l’ an de nostre seynhor M (IIe) XXX V de laqual la tenor s’ ensec de mot à mot :
- .i.
- NUM 8: e lodiit en Fortaner que- -us deu saubar e segui enta que part se boilhen anadere de .i. die .
- DET 2: E que- -los dam per for que si nulhs hom plage de plage legal autre hom , la lei de- -lo plagad es .c. e .l. fl , e la lei nostre .lxv. fl , si es pravade leialmentz per testimonis , o per .i. judge jurad qui leialmentz la age menade e gardeade .
- XX
- une
- X
- XV
- VII
- NUM 3: Item , aucuns homes de Laseuba , per lo usatge de- -lo bosc de Capianc VII cair de leynha o de busqua a obs de- -los sercles .
- ADJ 1: En testimoni de laquau causa nos trametem questas nostras letras a vos autres , dadas vert o a Sulwerk lo VII jorn de hagost , en l- -an XLII de- -lo regne de- -lo seynhor Rey nostre payre .
- III
- ADJ 2: Vert Westm. lo III jorn de agost .
- NUM 2: A totz a- -losquaus las presens letras vindran , sapiat que nos autreyam a- -los nostres homes de Entre-dos-Mars de la diocesa de Bordeu , losquaus son tengudz a nos o a nostre prebost de aquerra terra a las albergadas , que aqueras medeyssas albergadas sian recebudas am edz per lodeyt prebost o autre per nostre nome en los locx o vilatges en losquaus de temps antic an acostumat esser recebut tant solament una vequada l’ an , am III homes a cavat et III a peys , sens plus . Ayssique sian refresquit competentment de viandas , et de autres necessarias , ayssi cum a acostumat .
- cens
- NOUN 2: Item , li cens et li esporle desobre escruitz foren assignat de las parropias per la protection et deffension de eras que es aperat captenhs en romans .
- NUM 1: Item , recebut de- -los homes de- -lo seynhor Rey de XV parropias cens livras per que los deffendessa de- -los nautoneys de- -lo port de Trejeyt , losquaus los arrauban quascun an , quascun temps de estat , de- -lo blat et de lurs causas no tant solament esta de lur deffendre , mas donet ayssimedeys sonx propres servient a- -los nautoneys a ffar ladeyta arraubeyria , recebuda la pecunia de la part adversa .
- un
- DET 112: En los autres temps , n’ i aye lo menhs un cert jorn ;
- PRON 23: aras a penas pot esser un o dos sustentat ;
- NUM 2: fan a luy questz dreytz homanatge et de certas causas que tenen de luy fan a luy usatge o ost de I. cavaley , o de dos , o de escudey et am armas o arnes , certas et determinadas ayssi cum de antiqua costuma es cert et determinat sinauque ssia qui ten las causas que deben lo ost de un cavaley o de dos sino que sia tengut per privilegi segont la part que ten ,
Morphology
The form / lemma ratio of NUM is 118.000000 (the average of all parts of speech is 542.857143).
The 1st highest number of forms (118) was observed with the lemma “_”: (IIe), -lx., .CL., .CLX., .II.LV, .II.LXXX.VI, .IIC., .IIC.LII., .III.XCVIII., .IIIC., .IIIC.LXIIII., .IIII., .IIIIte., .IIIItre., .IIIes., .LV., .LXVI, .LXVI., .Lxvi., .M., .M.II.LXXXVIII.LXXXVIII., .VIC., .XII., .XIIII., .XL., .XXX., .XXXta., .c., .ccc., .i., .ii., .iii., .iiiien., .ix., .l., .lx., .lxv., .v., .ve., .vi., .viii., .x., .xv., .xviii., .xx., .xx.vii., CINCQ, I, I., II., III, IIIC., IIIC.LVIII, IIII, IX., L, LX, LXV, LXX, LXXIX., LXXXVIII., M, M., M.CC.XXXVI, MCC, MCCXXX, V, V., VI., VII, VIII, X, X., XCVIII., XII., XL, XLII, XV, XVII, XX, XXII., XXX, bint, cens, cent, cinq, cinque, cinquoante, des, deux, dissapta, dissapte, doas, dos, dues, dus, mil, milia, miu, nau, oeyt, quaranta, quatre, quinza, quoate, sed, seis, senglas, seys, sieys, sinc, tres, un, una, une, ung, vingt, vint.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 11 different relations: nummod (400; 84% instances), conj (57; 12% instances), obl (4; 1% instances), flat (3; 1% instances), nmod (3; 1% instances), nsubj (3; 1% instances), obj (3; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances)
Parents of NUM nodes belong to 4 different parts of speech: NOUN (411; 86% instances), NUM (51; 11% instances), VERB (14; 3% instances), PRON (1; 0% instances)
379 (79%) NUM nodes are leaves.
62 (13%) NUM nodes have one child.
24 (5%) NUM nodes have two children.
12 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
Children of NUM nodes are attached using 16 different relations: conj (60; 38% instances), cc (36; 23% instances), case (19; 12% instances), punct (11; 7% instances), det (6; 4% instances), nmod (5; 3% instances), advmod (3; 2% instances), cop (3; 2% instances), flat (3; 2% instances), acl (2; 1% instances), mark (2; 1% instances), nsubj (2; 1% instances), amod (1; 1% instances), appos (1; 1% instances), nummod (1; 1% instances), orphan (1; 1% instances)
Children of NUM nodes belong to 13 different parts of speech: NUM (51; 33% instances), CCONJ (36; 23% instances), ADP (19; 12% instances), PUNCT (11; 7% instances), ADV (10; 6% instances), NOUN (10; 6% instances), DET (6; 4% instances), AUX (3; 2% instances), PRON (3; 2% instances), VERB (3; 2% instances), SCONJ (2; 1% instances), ADJ (1; 1% instances), PROPN (1; 1% instances)