This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Atis: POS Tags: NUM

There are 149 NUM lemmas (12%), 152 NUM types (7%) and 888 NUM tokens (2%). Out of 13 observed tags, the rank of NUM is: 4 in number of lemmas, 5 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: 7, on, 1, 27, 2, 6, 5, yirmi, 8, bir

The 10 most frequent NUM types: 7, on, 1, 27, 2, 6, 5, yirmi, bir, 12

The 10 most frequent ambiguous lemmas: 7 (NUM 68, NOUN 30), 1 (NUM 42, NOUN 11), 27 (NUM 33, NOUN 3), 2 (NUM 31, NOUN 13), 6 (NOUN 54, NUM 30), 5 (NOUN 64, NUM 27), 8 (NOUN 46, NUM 25), bir (DET 691, NUM 25, NOUN 3), 12 (NOUN 34, NUM 24), 4 (NOUN 28, NUM 24)

The 10 most frequent ambiguous types: bir (DET 683, NUM 23), yedi (NUM 13, VERB 1), birinci (ADJ 72, NUM 1), yedinci (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.020134 (the average of all parts of speech is 1.744205).
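The reported ratio appears consistent with dividing the number of NUM types by the number of NUM lemmas quoted above (152 and 149). The short check below is a sketch under that assumption; the variable names are illustrative and not part of any UD tool.

```python
# Counts quoted on this page for NUM in UD_Turkish-Atis.
num_types = 152   # distinct NUM word forms
num_lemmas = 149  # distinct NUM lemmas

# Assumption: the form / lemma ratio is types divided by lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")
```

A value close to 1 means that almost every NUM lemma occurs in a single form, which fits the fact that most numerals here are written as digits.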

The 1st highest number of forms (2) was observed with the lemma “28”: “28”, “28.”.

The 2nd highest number of forms (2) was observed with the lemma “430”: “430”, “4:30”.

The 3rd highest number of forms (2) was observed with the lemma “8”: “8”, “8.”.

NUM occurs with 1 feature: NumType (888; 100% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (881 tokens). Examples: 7, on, 1, 27, 2, 6, 5, yirmi, bir, 12

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (649; 73% instances), conj (51; 6% instances), compound (46; 5% instances), nmod (45; 5% instances), obl:tmod (44; 5% instances), nmod:tmod (28; 3% instances), nsubj (9; 1% instances), amod (6; 1% instances), flat (6; 1% instances), obl (2; 0% instances), parataxis (1; 0% instances), root (1; 0% instances)
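Counts like these can be reproduced from a treebank file in CoNLL-U format by tallying the DEPREL column of every token whose UPOS is NUM. The following is a minimal sketch, not the script that generated this page; `relation_counts`, `as_percentages` and the three-token `SAMPLE` are hypothetical and invented for illustration only.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment (not taken from the treebank).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
1\ton\ton\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tbilet\tbilet\tNOUN\t_\t_\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def relation_counts(conllu_text, upos="NUM"):
    """Count the incoming relation (DEPREL) of every token tagged `upos`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts


def as_percentages(counts):
    """Convert a Counter of raw counts to whole-number percentages."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.most_common()}


print(relation_counts(SAMPLE))
```

Applied to the figures above, 649 nummod attachments out of 888 NUM tokens round to 73%, matching the listed percentage.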

Parents of NUM nodes belong to 7 different parts of speech: PROPN (440; 50% instances), NOUN (261; 29% instances), NUM (92; 10% instances), ADJ (81; 9% instances), VERB (11; 1% instances), ADV (2; 0% instances), no part of speech, i.e. the one NUM node attached as root (1; 0% instances)

635 (72%) NUM nodes are leaves.

135 (15%) NUM nodes have one child.

88 (10%) NUM nodes have two children.

30 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.
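The child degree of a node is the number of tokens whose HEAD points to it. The sketch below shows one way to compute the degree buckets used above for a single sentence; `child_degrees`, `bucket` and the three-token example are hypothetical and do not come from the treebank.

```python
from collections import Counter


def child_degrees(rows, upos="NUM"):
    """rows: (id, upos, head) tuples for one sentence; returns degrees of `upos` nodes."""
    # Each occurrence of an ID in the HEAD column is one child of that node.
    children = Counter(head for _, _, head in rows)
    return [children[tok_id] for tok_id, tok_upos, _ in rows if tok_upos == upos]


def bucket(degrees):
    """Group degrees into the four classes reported on this page."""
    labels = Counter()
    for d in degrees:
        if d == 0:
            labels["leaf"] += 1
        elif d == 1:
            labels["one child"] += 1
        elif d == 2:
            labels["two children"] += 1
        else:
            labels["three or more"] += 1
    return labels


# Hypothetical sentence: token 1 depends on token 2, token 2 on token 3 (the root).
rows = [(1, "NUM", 2), (2, "NUM", 3), (3, "NOUN", 0)]
print(bucket(child_degrees(rows)))

# Consistency check on the figures quoted above: the four buckets cover all 888 NUM tokens.
assert 635 + 135 + 88 + 30 == 888
```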

Children of NUM nodes are attached using 15 different relations: nmod (131; 32% instances), case (66; 16% instances), compound (62; 15% instances), conj (50; 12% instances), cc (44; 11% instances), nmod:tmod (24; 6% instances), nmod:poss (6; 1% instances), punct (6; 1% instances), det (5; 1% instances), aux:q (4; 1% instances), acl (3; 1% instances), amod (2; 0% instances), fixed (2; 0% instances), nsubj (2; 0% instances), advmod (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NOUN (206; 50% instances), NUM (92; 23% instances), CCONJ (44; 11% instances), PROPN (24; 6% instances), ADJ (11; 3% instances), ADP (10; 2% instances), DET (9; 2% instances), PUNCT (6; 1% instances), AUX (4; 1% instances), ADV (1; 0% instances), VERB (1; 0% instances)