Treebank Statistics: UD_Breton-KEB: POS Tags: NUM
There are 123 NUM
lemmas (7%), 136 NUM
types (5%) and 233 NUM
tokens (2%).
Out of 17 observed tags, the rank of NUM
is: 4 in number of lemmas, 4 in number of types and 11 in number of tokens.
The 10 most frequent NUM
lemmas: daou, unan, tri, 2007, 20, 4, pevar, 000, 1950, 3
The 10 most frequent NUM
types: daou, unan, 2007, 4, 000, 1950, 20, 3, 30, div
The 10 most frequent ambiguous lemmas: 4 (NUM 5, X 1), eil (ADJ 5, NUM 4), kentañ (ADJ 14, NUM 2), trede (NUM 2, ADJ 1)
The 10 most frequent ambiguous types: 4 (NUM 5, X 1), eil (NUM 4, ADJ 3), drede (ADJ 1, NUM 1), gant (ADP 114, NUM 1), kentañ (ADJ 6, NUM 1)
- 4
- eil
- drede
- gant
- kentañ
Morphology
The form / lemma ratio of NUM
is 1.105691 (the average of all parts of speech is 1.395664).
The 1st highest number of forms (3) was observed with the lemma “daou”: daou, div, zaou.
The 2nd highest number of forms (3) was observed with the lemma “tri”: dri, teir, tri.
The 3rd highest number of forms (2) was observed with the lemma “15”: 15, 15 %.
NUM
occurs with 3 features: Number (222; 95% instances), Gender (38; 16% instances), NumType (16; 7% instances)
NUM
occurs with 5 feature-value pairs: Gender=Fem
, Gender=Masc
, NumType=Ord
, Number=Plur
, Number=Sing
NUM
occurs with 8 feature combinations.
The most frequent feature combination is Number=Plur
(161 tokens).
Examples: daou, 2007, 4, 000, 1950, 20, 3, 30, 10, 2
Relations
NUM
nodes are attached to their parents using 18 different relations: nummod (96; 41% instances), obl (39; 17% instances), nmod (36; 15% instances), amod (16; 7% instances), nsubj (10; 4% instances), conj (8; 3% instances), root (5; 2% instances), appos (4; 2% instances), dep (4; 2% instances), nmod:gen (4; 2% instances), parataxis (3; 1% instances), flat:name (2; 1% instances), advcl (1; 0% instances), compound (1; 0% instances), fixed (1; 0% instances), obj (1; 0% instances), obl:agent (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM
nodes belong to 8 different parts of speech: NOUN (158; 68% instances), VERB (49; 21% instances), NUM (14; 6% instances), (5; 2% instances), ADJ (3; 1% instances), PROPN (2; 1% instances), CCONJ (1; 0% instances), SYM (1; 0% instances)
139 (60%) NUM
nodes are leaves.
56 (24%) NUM
nodes have one child.
20 (9%) NUM
nodes have two children.
18 (8%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 6.
Children of NUM
nodes are attached using 19 different relations: case (54; 33% instances), nmod (27; 16% instances), punct (18; 11% instances), dep (11; 7% instances), det (10; 6% instances), advmod (8; 5% instances), cc (8; 5% instances), conj (8; 5% instances), cop (5; 3% instances), nsubj (5; 3% instances), acl (2; 1% instances), aux (2; 1% instances), nummod (2; 1% instances), amod (1; 1% instances), aux:pass (1; 1% instances), fixed (1; 1% instances), mark (1; 1% instances), obl (1; 1% instances), parataxis (1; 1% instances)
Children of NUM
nodes belong to 14 different parts of speech: ADP (59; 36% instances), NOUN (27; 16% instances), PUNCT (18; 11% instances), NUM (14; 8% instances), ADV (10; 6% instances), DET (9; 5% instances), AUX (8; 5% instances), CCONJ (8; 5% instances), X (6; 4% instances), VERB (3; 2% instances), ADJ (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances), SCONJ (1; 1% instances)