home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Atis: POS Tags: NOUN

There are 309 NOUN lemmas (30%), 686 NOUN types (33%) and 12279 NOUN tokens (28%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: uçuş, ara, gün, havayol, sabah, akşam, saat, ücret, öğle, uçak

The 10 most frequent NOUN types: uçuşları, uçuş, uçuşlar, günü, uçuşu, uçuşlarını, öğleden, akşam, arasındaki, sabah

The 10 most frequent ambiguous lemmas: ara (NOUN 509, VERB 22), çarşamba (NOUN 187, PROPN 73), pazartesi (NOUN 101, PROPN 67), perşembe (NOUN 95, PROPN 52), pazar (NOUN 70, PROPN 67), kara (NOUN 56, ADJ 20), cuma (NOUN 55, PROPN 47), mevcut (ADJ 45, NOUN 42), ağustos (PROPN 48, NOUN 40), la (NOUN 20, PROPN 12)

The 10 most frequent ambiguous types: kara (NOUN 55, ADJ 20), cuma (NOUN 55, PROPN 1), aktarma (NOUN 43, VERB 1), mevcut (NOUN 42, ADJ 32), oturma (NOUN 19, VERB 1), Eylül (NOUN 9, PROPN 3), Northwest (PROPN 20, NOUN 9), bağlantılı (NOUN 8, ADJ 2), Express (NOUN 6, PROPN 2), Houston’a (PROPN 43, NOUN 5)

Morphology

The form / lemma ratio of NOUN is 2.220065 (the average of all parts of speech is 2.025565).

The 1st highest number of forms (37) was observed with the lemma “uçuş”: Uçuşumu, uçuş, uçuşa, uçuşla, uçuşlar, uçuşlara, uçuşlarda, uçuşlardaki, uçuşlardan, uçuşlardır, uçuşlarla, uçuşları, uçuşlarım, uçuşların, uçuşlarına, uçuşlarında, uçuşlarından, uçuşlarını, uçuşlarının, uçuşlarınız, uçuşlarınızın, uçuşlarıyla, uçuşta, uçuştur, uçuşu, uçuşum, uçuşun, uçuşuna, uçuşunda, uçuşundaki, uçuşundan, uçuşunu, uçuşunun, uçuşunuz, uçuşunuzu, uçuşunuzun, uçuşuyla.

The 2nd highest number of forms (21) was observed with the lemma “havayol”: Havayolları’nda, Havayolları’nı, Havayolları’nın, Havayolları’yla, Havayolu’nda, havayolları, havayollarıdır, havayollarına, havayollarında, havayollarındaki, havayollarından, havayollarını, havayollarının, havayollarıyla, havayolu, havayoludur, havayolundaki, havayolunu, havayolunun, havayolunuz, havayoluyla.

The 3rd highest number of forms (20) was observed with the lemma “uçak”: uçak, uçakla, uçaklar, uçaklarda, uçakları, uçakların, uçaklarının, uçaklarınızın, uçakta, uçaktır, uçağa, uçağı, uçağın, uçağına, uçağındaki, uçağından, uçağını, uçağının, uçağınız, uçağıyla.

NOUN occurs with 5 features: Number (11817; 96% instances), Case (11790; 96% instances), Number[psor] (5215; 42% instances), Person[psor] (5215; 42% instances), Person (26; 0% instances)

NOUN occurs with 16 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 69 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (4426 tokens). Examples: uçuş, akşam, sabah, saat, çarşamba, salı, yön, cumartesi, uçak, pazartesi

Relations

NOUN nodes are attached to their parents using 24 different relations: nmod (3211; 26% instances), obj (2515; 20% instances), nsubj (1657; 13% instances), compound (924; 8% instances), nmod:tmod (765; 6% instances), obl (669; 5% instances), root (612; 5% instances), amod (599; 5% instances), flat (379; 3% instances), obl:tmod (308; 3% instances), nmod:poss (266; 2% instances), conj (168; 1% instances), case (68; 1% instances), csubj (44; 0% instances), nsubj:pass (23; 0% instances), discourse (21; 0% instances), acl (15; 0% instances), parataxis (9; 0% instances), ccomp (8; 0% instances), xcomp (8; 0% instances), dislocated (4; 0% instances), fixed (4; 0% instances), dep (1; 0% instances), nummod (1; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: NOUN (4376; 36% instances), VERB (3938; 32% instances), ADJ (1293; 11% instances), NUM (725; 6% instances), (612; 5% instances), X (578; 5% instances), PROPN (428; 3% instances), ADV (145; 1% instances), PRON (118; 1% instances), ADP (60; 0% instances), DET (6; 0% instances)

3751 (31%) NOUN nodes are leaves.

3908 (32%) NOUN nodes have one child.

2675 (22%) NOUN nodes have two children.

1945 (16%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 6.

Children of NOUN nodes are attached using 28 different relations: nmod (6385; 41% instances), amod (2764; 18% instances), acl (1930; 12% instances), det (1248; 8% instances), nmod:tmod (992; 6% instances), case (444; 3% instances), compound (393; 3% instances), nmod:poss (334; 2% instances), nummod (192; 1% instances), cc (168; 1% instances), conj (150; 1% instances), advmod (112; 1% instances), flat (106; 1% instances), nsubj (102; 1% instances), obl (95; 1% instances), obj (35; 0% instances), discourse (22; 0% instances), advcl (19; 0% instances), ccomp (13; 0% instances), parataxis (11; 0% instances), aux:q (8; 0% instances), obl:tmod (7; 0% instances), xcomp (7; 0% instances), csubj (6; 0% instances), punct (4; 0% instances), dep (3; 0% instances), fixed (2; 0% instances), mark (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (4376; 28% instances), PROPN (4249; 27% instances), VERB (2278; 15% instances), ADJ (2008; 13% instances), DET (1255; 8% instances), ADP (412; 3% instances), NUM (396; 3% instances), ADV (315; 2% instances), CCONJ (174; 1% instances), PRON (77; 0% instances), AUX (8; 0% instances), PUNCT (4; 0% instances), INTJ (1; 0% instances)