Treebank Statistics: UD_Old_French-PROFITEROLE: POS Tags: NUM
There are 30 NUM lemmas (1%), 128 NUM types (1%) and 1052 NUM tokens (0%).
Out of 15 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: _, @card@, deux, cent, mille1, trois, quatre, vingt, douze, sept
The 10 most frequent NUM types: deus, .ii., trois, quatre, dous, dis, cent, dui, set, .iiii.
The 10 most frequent ambiguous lemmas: _ (VERB 13207, NOUN 12804, PUNCT 12347, DET 11020, PRON 10386, ADP 9918, ADV 9278, CCONJ 4482, PROPN 3203, AUX 2986, SCONJ 2870, ADJ 2419, NUM 300, INTJ 47, X 8), @card@ (NUM 299, PRON 141, VERB 8, ADJ 4, DET 4), deux (NUM 73, PRON 19, ADJ 1), cent (NUM 51, PRON 5), mille1 (NUM 48, PRON 23), trois (NUM 42, PRON 14, ADJ 1), quatre (NUM 37, PRON 5), vingt (NUM 29, PRON 6), douze (NUM 28, PRON 2), sept (NUM 27, PRON 4, ADJ 1)
The 10 most frequent ambiguous types: deus (NUM 73, PRON 16, NOUN 15, AUX 1), .ii. (NUM 47, PRON 17), trois (NUM 47, PRON 20, VERB 3), quatre (NUM 31, PRON 8), dous (NUM 39, PRON 6, ADJ 2), dis (NUM 34, NOUN 19, VERB 10, PRON 6, ADV 1), cent (NUM 28, PRON 1), dui (NUM 29, PRON 18, ADJ 3, AUX 2, VERB 1), set (VERB 90, NUM 23, AUX 8, PRON 2, SCONJ 1), .iiii. (NUM 9, PRON 3)
- deus
- NUM 73: Ne vos en qier mentir deus moz .
- PRON 16: N’ avoit qu’ eus deus en cel païs ;
- NOUN 15: Ce est grant deus ;
- AUX 1: einsi te perdi Nostre Sires qui t’ avoit norri et escreu , et garni de toutes bones vertuz , et t’ avoit si haut levé que en son servise t’ avoit mis . Si que quant cil quida que tu fusses ses serjanz , et le servisses de les biens qu’ il t’ avoit prestez . Tu le lessas maintenant . Si que quant tu deus estre serjanz Jhesucrist tu devenis serjanz a le deable ,
- .ii.
- trois
- quatre
- dous
- dis
- cent
- dui
- NUM 29: A la lune Bien vit josté erent ensenble Li dui amant .
- PRON 18: Li dui en furent mort d’ espees , Li tierz d’ une seete ocis ;
- ADJ 3: De ces .ix. sont li .vii. roi , et li dui chevalier ,
- AUX 2: mais je li dui Anor faire non trop frarine .
- VERB 1: Se je ai fet ce que je dui , Si m’ an doit an tel gré savoir Con celi qui autrui avoir Anprunte ,
- set
- .iiii.
Morphology
The form / lemma ratio of NUM is 4.266667 (the average of all parts of speech is 3.463337).
The 1st highest number of forms (64) was observed with the lemma “_”: .c., .c.m., .cc., .iiij., .iiij.m., .iij., .iij.m., .ij., .l., .l.m., .lx., .lxij.m., .lxxx.m., .m., .vij., .vij.c., .x., .x.m., .xiiij., .xij., .xv., .xvij., .xx., .xx.m., .xxv., .xxv.m., .xxx.m., .xxxvi.m., cc., cent, chens, chent, chinc, chinquante, cinquante, deus, deux, dis, doi, dous, douze, m., mile, mille, nuef, quarante, quarte, quatorze, quatre, quinze, set, sis, soisante, soixante, tierce, trente, troi, trois, uit, un, une, uns, vint, úít.
The 2nd highest number of forms (54) was observed with the lemma “@card@”: .c., .i., .ii., .iii., .iiii., .iv., .lx., .v., .vi., .vii., .viii., .xiiii., .xvi., .xvii., .xviii., .xx., .xxx., Amedui, Anbedeus, Mil, XX, ambesdous, an.ii., c., cent, cenz, cinc, cinquante, deus, deux, dis, dous, doze, dui, huit, mile, milie, nof, premier, quarante, quatre, quinze, seisante, set, sis, treis, trente, treze, troi, trois, un, une, vint, vinz.
The 3rd highest number of forms (8) was observed with the lemma “ambedeux”: ambdui, ambe.ii., ambedui, ambesdous, ambsdous, amsdous, andex, ansdous.
NUM occurs with 1 features: NumType (1044; 99% instances)
NUM occurs with 2 feature-value pairs: NumType=Card, NumType=Ord
NUM occurs with 3 feature combinations.
The most frequent feature combination is NumType=Card (1042 tokens).
Examples: deus, .ii., trois, quatre, dous, cent, dis, dui, set, .iiii.
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (577; 55% instances), amod (306; 29% instances), flat (92; 9% instances), conj (36; 3% instances), obl (17; 2% instances), obj (16; 2% instances), nsubj (5; 0% instances), dislocated (1; 0% instances), nmod (1; 0% instances), root (1; 0% instances)
Parents of NUM nodes belong to 8 different parts of speech: NOUN (813; 77% instances), NUM (124; 12% instances), PRON (42; 4% instances), VERB (41; 4% instances), PROPN (26; 2% instances), ADV (3; 0% instances), ADJ (2; 0% instances), (1; 0% instances)
859 (82%) NUM nodes are leaves.
145 (14%) NUM nodes have one child.
31 (3%) NUM nodes have two children.
17 (2%) NUM nodes have three or more children.
The highest child degree of a NUM node is 8.
Children of NUM nodes are attached using 14 different relations: flat (104; 38% instances), conj (44; 16% instances), cc (35; 13% instances), nmod (19; 7% instances), advmod (18; 7% instances), punct (14; 5% instances), det (13; 5% instances), case (9; 3% instances), acl:relcl (5; 2% instances), amod (5; 2% instances), appos (2; 1% instances), cc:nc (2; 1% instances), cop (1; 0% instances), dislocated (1; 0% instances)
Children of NUM nodes belong to 12 different parts of speech: NUM (124; 46% instances), CCONJ (44; 16% instances), ADV (29; 11% instances), PRON (16; 6% instances), DET (14; 5% instances), PUNCT (14; 5% instances), NOUN (11; 4% instances), ADP (9; 3% instances), VERB (5; 2% instances), ADJ (4; 1% instances), AUX (1; 0% instances), PROPN (1; 0% instances)