Treebank Statistics: UD_Occitan-TTB: POS Tags: NUM
There are 104 NUM lemmas (2%), 129 NUM types (2%) and 292 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 13 in number of tokens.
The 10 most frequent NUM lemmas: 2, 3, 4, 5, 1, 20, 7, 30, 15, 23
The 10 most frequent NUM types: tres, dos, dus, cinc, quatre, sèt, un, 23, doas, 15
The 10 most frequent ambiguous lemmas: 1000 (NUM 2, ADJ 1)
The 10 most frequent ambiguous types: dos (NUM 19, NOUN 1), un (DET 329, PRON 18, NUM 7, ADV 1), I (PRON 17, INTJ 1, NUM 1), nòu (ADJ 3, NUM 1)
- dos
- un
  - DET 329: Sus aubres e dins los bartasses bresilhavan un fum d’ aucelons .
  - PRON 18: N’ aviái ja un de vtt , mas totcòp decidiguèri qu’ aviá fach son temps .
  - NUM 7: Lo bacèl : d’ un diamètre de 18 cm a lo cap d’ un margue de 70 cm a un mètre .
  - ADV 1: - Anem ! anem ! li fasiá , me sembla qu’ èretz plus coratjós un còp èra !
- I
  - PRON 17: I deu aver una explicacion a n’ aquò .
  - INTJ 1: I , i , i !
  - NUM 1: Se sap pas quora la latinizacion foguèt acabada mas sembla que i aguèt de diferéncias regionalas plan grandas : cèrtas regions occitanas perifericas ( dins los Pirenèus ) aurián poscut èsser latinizadas , o mai romanizadas , a los sègle VIII e IX apC . , mentre que Provença e lo Bas Lengadòc o foguèron a lo sègle I apC .
- nòu
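Per-type tag counts such as "un (DET 329, PRON 18, NUM 7, ADV 1)" can be approximated by tallying the UPOS column of a CoNLL-U file for each word form. The following is a minimal sketch, not the script that produced this page: the function names `tag_counts_by_form`, `ambiguous_forms` and `row` are hypothetical, the input is a hand-made toy sample rather than treebank data, and forms are compared exactly as written (whether the page normalizes case is not stated).

```python
from collections import Counter, defaultdict


def tag_counts_by_form(conllu_text):
    """Map each word form to a Counter of the UPOS tags it appears with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only regular tokens: 10 columns and a plain integer ID
        # (this skips multiword ranges such as 1-2 and empty nodes such as 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, upos = cols[1], cols[3]
        counts[form][upos] += 1
    return counts


def ambiguous_forms(counts, tag):
    """Return forms seen with `tag` and with at least one other UPOS tag."""
    return {f: dict(c) for f, c in counts.items() if tag in c and len(c) > 1}


def row(idx, form, lemma, upos):
    """Build one CoNLL-U token line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, lemma, upos] + ["_"] * 6)


# Hand-made toy input (NOT taken from the treebank files).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "un", "un", "DET"),
    row(2, "mètre", "mètre", "NOUN"),
    "",
    "# sent_id = toy-2",
    row(1, "un", "1", "NUM"),
    row(2, "un", "un", "DET"),
    row(3, "tres", "3", "NUM"),
])

print(ambiguous_forms(tag_counts_by_form(SAMPLE), "NUM"))
```

In the toy sample only "un" is reported, because "tres" occurs with NUM alone and "mètre" never occurs as NUM.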
Morphology
The form / lemma ratio of NUM is 1.240385 (the average of all parts of speech is 1.368971).
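The reported ratio matches the number of NUM types divided by the number of NUM lemmas given at the top of the page. A quick check, assuming that this is how the figure is derived (the variable names are illustrative):

```python
num_types = 129   # NUM types reported on this page
num_lemmas = 104  # NUM lemmas reported on this page

ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.240385
```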
The highest number of forms (7) was observed with the lemma “2”: 2, II, doas, dos, doàs, duas, dus.
The second highest number of forms (4) was observed with the lemma “6”: 6, sieis, sièis, siès.
The third highest number of forms (3) was observed with the lemma “1”: 1, I, un.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 12 different relations: nummod (174; 60% of instances), obl (44; 15% of instances), nmod (29; 10% of instances), conj (16; 5% of instances), flat (14; 5% of instances), orphan (6; 2% of instances), appos (2; 1% of instances), nsubj (2; 1% of instances), parataxis (2; 1% of instances), ccomp (1; 0% of instances), obj (1; 0% of instances), xcomp (1; 0% of instances)
Parents of NUM nodes belong to 7 different parts of speech: NOUN (204; 70% of instances), VERB (49; 17% of instances), NUM (26; 9% of instances), PROPN (10; 3% of instances), ADJ (1; 0% of instances), ADV (1; 0% of instances), PRON (1; 0% of instances)
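The percentages above are the relation and parent-tag counts divided by the 292 NUM tokens, rounded to whole numbers. The sketch below recomputes them from the counts listed on this page. It assumes plain rounding to the nearest integer, which reproduces the listed values; the variable names are illustrative.

```python
# Counts copied from the two lists above.
rel_counts = {
    "nummod": 174, "obl": 44, "nmod": 29, "conj": 16, "flat": 14,
    "orphan": 6, "appos": 2, "nsubj": 2, "parataxis": 2,
    "ccomp": 1, "obj": 1, "xcomp": 1,
}
parent_pos_counts = {
    "NOUN": 204, "VERB": 49, "NUM": 26, "PROPN": 10,
    "ADJ": 1, "ADV": 1, "PRON": 1,
}

total = sum(rel_counts.values())  # every NUM token has exactly one incoming relation
rel_pct = {rel: round(100 * n / total) for rel, n in rel_counts.items()}
pos_pct = {pos: round(100 * n / total) for pos, n in parent_pos_counts.items()}

print(total, sum(parent_pos_counts.values()))
print(rel_pct)
print(pos_pct)
```

Both lists sum to the same total, which is expected because each NUM token has exactly one parent and one incoming relation. Relations with a single instance show as 0% only because of rounding.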
158 (54%) NUM nodes are leaves.
74 (25%) NUM nodes have one child.
49 (17%) NUM nodes have two children.
11 (4%) NUM nodes have three or more children.
The highest child degree of a NUM node is 6.
Children of NUM nodes are attached using 15 different relations: case (70; 33% of instances), punct (60; 28% of instances), cc (16; 8% of instances), conj (16; 8% of instances), det (16; 8% of instances), nmod (14; 7% of instances), flat (8; 4% of instances), nummod (3; 1% of instances), advmod (2; 1% of instances), amod (1; 0% of instances), cop (1; 0% of instances), dep (1; 0% of instances), mark (1; 0% of instances), nsubj (1; 0% of instances), obl (1; 0% of instances)
Children of NUM nodes belong to 11 different parts of speech: ADP (69; 33% of instances), PUNCT (60; 28% of instances), NUM (26; 12% of instances), CCONJ (17; 8% of instances), DET (16; 8% of instances), NOUN (16; 8% of instances), ADV (3; 1% of instances), ADJ (1; 0% of instances), AUX (1; 0% of instances), PRON (1; 0% of instances), SCONJ (1; 0% of instances)
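The child statistics can be cross-checked against the child-degree figures reported earlier in this section. The sketch below only rearranges numbers already given on this page; the variable names are illustrative, and the checks are a sanity test rather than part of the original statistics.

```python
# Counts copied from the lists in this section.
child_rel_counts = {
    "case": 70, "punct": 60, "cc": 16, "conj": 16, "det": 16, "nmod": 14,
    "flat": 8, "nummod": 3, "advmod": 2, "amod": 1, "cop": 1, "dep": 1,
    "mark": 1, "nsubj": 1, "obl": 1,
}
child_pos_counts = {
    "ADP": 69, "PUNCT": 60, "NUM": 26, "CCONJ": 17, "DET": 16, "NOUN": 16,
    "ADV": 3, "ADJ": 1, "AUX": 1, "PRON": 1, "SCONJ": 1,
}
nodes_by_degree = {0: 158, 1: 74, 2: 49}  # leaves, one child, two children
nodes_three_plus = 11                     # NUM nodes with three or more children
max_degree = 6

total_children = sum(child_rel_counts.values())
# Children accounted for by NUM nodes with at most two children.
children_small = sum(deg * n for deg, n in nodes_by_degree.items())
# The remainder must belong to the nodes with three or more children.
children_three_plus = total_children - children_small

print(total_children, sum(child_pos_counts.values()))
print(children_small, children_three_plus)
print(3 * nodes_three_plus <= children_three_plus <= max_degree * nodes_three_plus)
```

The relation and part-of-speech lists describe the same set of children, so their totals agree. The 26 NUM children also match the 26 NUM parents listed among the parents of NUM nodes, since each NUM-to-NUM attachment is counted once from each side.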