
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Atis: POS Tags: NUM

There are 197 NUM lemmas (19%), 270 NUM types (13%) and 1306 NUM tokens (3%). Out of 14 observed tags, the rank of NUM is: 3 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent NUM lemmas: 7, 5, 6, 10, 8, 12, on, 1, 4, 2

The 10 most frequent NUM types: 7, on, 5’ten, 1, 10’dan, 6’dan, 8’den, 2, 12’den, 6

The 10 most frequent ambiguous lemmas: bir (DET 669, NUM 25, ADJ 2), birinci (ADJ 71, NUM 1), yedinci (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: bir (DET 661, NUM 23), yedi (NUM 13, VERB 1), birinci (ADJ 70, NUM 1), yedinci (ADJ 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.370558 (the average of all parts of speech is 2.025565).
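The ratio above appears to be the number of distinct NUM word forms (types) divided by the number of distinct NUM lemmas. That reading is an assumption inferred from the counts reported on this page, not a documented definition. A minimal sketch that reproduces the figure, where `form_lemma_ratio` is a hypothetical helper name:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Distinct word forms per distinct lemma (assumed definition)."""
    return n_types / n_lemmas

# Counts reported on this page for NUM: 270 types, 197 lemmas.
ratio = form_lemma_ratio(270, 197)
print(f"{ratio:.6f}")  # 1.370558
```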

The 1st highest number of forms (5) was observed with the lemma “8”: 8, 8’de, 8’den, 8’e, 8. (the last form includes the trailing period).

The 2nd highest number of forms (4) was observed with the lemma “10”: 10, 10’a, 10’da, 10’dan.

The 3rd highest number of forms (4) was observed with the lemma “12”: 12, 12’de, 12’den, 12’si.

NUM occurs with 5 features: NumType (1306; 100% instances), Case (457; 35% instances), Number (457; 35% instances), Number[psor] (16; 1% instances), Person[psor] (16; 1% instances)

NUM occurs with 11 feature-value pairs: Case=Abl, Case=Dat, Case=Gen, Case=Loc, Case=Nom, NumType=Card, NumType=Ord, Number=Sing, Number[psor]=Sing, Person[psor]=2, Person[psor]=3

NUM occurs with 12 feature combinations. The most frequent feature combination is NumType=Card (842 tokens). Examples: 7, on, 1, 2, 6, bir, yirmi, 27, 8, 5’ten

Relations

NUM nodes are attached to their parents using 15 different relations: nummod (615; 47% instances), obl:tmod (257; 20% instances), nmod:tmod (143; 11% instances), nmod (79; 6% instances), conj (62; 5% instances), obl (55; 4% instances), compound (54; 4% instances), amod (15; 1% instances), nsubj (9; 1% instances), flat (7; 1% instances), root (4; 0% instances), fixed (2; 0% instances), nmod:poss (2; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)
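Counts like these can be obtained by tallying the DEPREL column for every token whose UPOS is NUM. The following is a rough sketch under stated assumptions: `deprel_counts` is a hypothetical helper, and `SAMPLE` is an invented two-token fragment in CoNLL-U layout, not a sentence from the treebank.

```python
from collections import Counter

# Hypothetical fragment in CoNLL-U layout (10 tab-separated columns);
# invented for illustration, not taken from UD_Turkish-Atis.
SAMPLE = "\n".join([
    "# text = yedi uçuş",
    "1\tyedi\tyedi\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_",
    "2\tuçuş\tuçuş\tNOUN\t_\tCase=Nom|Number=Sing\t0\troot\t_\t_",
    "",
])

def deprel_counts(conllu: str, upos: str = "NUM") -> Counter:
    """Count the relations that attach tokens with the given UPOS to their parents."""
    counts = Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:       # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts

print(deprel_counts(SAMPLE))
```

Run over the full treebank files, the same tally should yield one entry per relation listed above.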

Parents of NUM nodes belong to 7 different parts of speech: PROPN (412; 32% instances), NOUN (396; 30% instances), VERB (307; 24% instances), NUM (114; 9% instances), ADJ (71; 5% instances), no parent, i.e. the NUM node is the root (4; 0% instances), ADV (2; 0% instances)

640 (49%) NUM nodes are leaves.

247 (19%) NUM nodes have one child.

226 (17%) NUM nodes have two children.

193 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.
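As a quick consistency check, the four child-degree buckets above should add up to the 1306 NUM tokens, and each percentage should follow from its count. A small sketch using only the figures on this page; rounding to whole percentages is an assumption about how the page was generated:

```python
# NUM nodes by number of children, as reported above.
degree_counts = {"0": 640, "1": 247, "2": 226, "3+": 193}

total = sum(degree_counts.values())
# Share of each bucket, rounded to a whole percentage.
shares = {k: round(100 * v / total) for k, v in degree_counts.items()}

print(total)   # 1306
print(shares)  # {'0': 49, '1': 19, '2': 17, '3+': 15}
```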

Children of NUM nodes are attached using 18 different relations: nmod (574; 44% instances), case (369; 28% instances), nmod:tmod (156; 12% instances), conj (62; 5% instances), compound (61; 5% instances), cc (54; 4% instances), det (7; 1% instances), nmod:poss (7; 1% instances), punct (6; 0% instances), amod (5; 0% instances), aux:q (4; 0% instances), acl (3; 0% instances), nsubj (3; 0% instances), advmod (2; 0% instances), fixed (2; 0% instances), dislocated (1; 0% instances), list (1; 0% instances), obl:tmod (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: NOUN (725; 55% instances), ADP (315; 24% instances), NUM (114; 9% instances), PROPN (63; 5% instances), CCONJ (54; 4% instances), ADV (11; 1% instances), ADJ (10; 1% instances), DET (10; 1% instances), PUNCT (6; 0% instances), VERB (5; 0% instances), AUX (4; 0% instances), PRON (1; 0% instances)