home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Occitan-TTB: POS Tags: NUM

There are 104 NUM lemmas (2%), 129 NUM types (2%) and 292 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: 2, 3, 4, 5, 1, 20, 7, 30, 15, 23

The 10 most frequent NUM types: tres, dos, dus, cinc, quatre, sèt, un, 23, doas, 15

The 10 most frequent ambiguous lemmas: 1000 (NUM 2, ADJ 1)

The 10 most frequent ambiguous types: dos (NUM 19, NOUN 1), un (DET 329, PRON 18, NUM 7, ADV 1), I (PRON 17, INTJ 1, NUM 1), nòu (ADJ 3, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.240385 (the average of all parts of speech is 1.368971).

The 1st highest number of forms (7) was observed with the lemma “2”: 2, II, doas, dos, doàs, duas, dus.

The 2nd highest number of forms (4) was observed with the lemma “6”: 6, sieis, sièis, siès.

The 3rd highest number of forms (3) was observed with the lemma “1”: 1, I, un.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (174; 60% instances), obl (44; 15% instances), nmod (29; 10% instances), conj (16; 5% instances), flat (14; 5% instances), orphan (6; 2% instances), appos (2; 1% instances), nsubj (2; 1% instances), parataxis (2; 1% instances), ccomp (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (204; 70% instances), VERB (49; 17% instances), NUM (26; 9% instances), PROPN (10; 3% instances), ADJ (1; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances)

158 (54%) NUM nodes are leaves.

74 (25%) NUM nodes have one child.

49 (17%) NUM nodes have two children.

11 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 15 different relations: case (70; 33% instances), punct (60; 28% instances), cc (16; 8% instances), conj (16; 8% instances), det (16; 8% instances), nmod (14; 7% instances), flat (8; 4% instances), nummod (3; 1% instances), advmod (2; 1% instances), amod (1; 0% instances), cop (1; 0% instances), dep (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), obl (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: ADP (69; 33% instances), PUNCT (60; 28% instances), NUM (26; 12% instances), CCONJ (17; 8% instances), DET (16; 8% instances), NOUN (16; 8% instances), ADV (3; 1% instances), ADJ (1; 0% instances), AUX (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances)