home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Atis: POS Tags: NOUN

There are 453 NOUN lemmas (38%), 876 NOUN types (42%) and 13512 NOUN tokens (29%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: uç, gün, havayol, sabah, akşam, saat, ücret, öğle, ara, git

The 10 most frequent NOUN types: uçuşları, uçuş, uçuşlar, günü, uçuşu, uçuşlarını, akşam, öğleden, dönüş, gidiş

The 10 most frequent ambiguous lemmas: (NOUN 4040, VERB 141, ADJ 52, ADV 6), gün (NOUN 448, ADJ 59), havayol (NOUN 420, VERB 13, ADJ 2), saat (NOUN 313, ADJ 10), ücret (NOUN 305, ADJ 1), ara (NOUN 299, ADJ 226, VERB 23), git (NOUN 292, ADJ 280, VERB 76, ADV 6), dön (NOUN 257, ADJ 10, VERB 7, ADV 1), uçak (NOUN 227, VERB 2, ADJ 1), ulaşım (NOUN 214, VERB 1)

The 10 most frequent ambiguous types: dönüş (NOUN 245, VERB 1), cuma (NOUN 60, PROPN 1), kara (NOUN 45, ADJ 30), mevcut (NOUN 44, ADJ 30), dl (NOUN 9, ADJ 1), Eylül (NOUN 10, PROPN 3), Northwest (PROPN 20, NOUN 9), kalkış (NOUN 8, VERB 1), Express (NOUN 6, PROPN 2), Houston’a (PROPN 43, NOUN 6)

Morphology

The form / lemma ratio of NOUN is 1.933775 (the average of all parts of speech is 1.744205).

The 1st highest number of forms (44) was observed with the lemma “uç”: Uçuşumu, ucuna, ucundan, uçabilmemin, uçacağınızı, uçmadığı, uçmak, uçmakla, uçmam, uçmanın, uçmaya, uçtuğunu, uçuş, uçuşa, uçuşla, uçuşlar, uçuşlara, uçuşlarda, uçuşlardan, uçuşlarla, uçuşları, uçuşlarım, uçuşların, uçuşlarına, uçuşlarında, uçuşlarından, uçuşlarını, uçuşlarının, uçuşlarınız, uçuşlarınızın, uçuşlarıyla, uçuşta, uçuşu, uçuşum, uçuşun, uçuşuna, uçuşunda, uçuşundan, uçuşunu, uçuşunun, uçuşunuz, uçuşunuzu, uçuşunuzun, uçuşuyla.

The 2nd highest number of forms (18) was observed with the lemma “uçak”: uçak, uçakla, uçaklar, uçaklarda, uçakları, uçakların, uçaklarının, uçaklarınızın, uçakta, uçağa, uçağı, uçağın, uçağına, uçağından, uçağını, uçağının, uçağınız, uçağıyla.

The 3rd highest number of forms (16) was observed with the lemma “havayol”: Havayolları’nda, Havayolları’nı, Havayolları’nın, Havayolları’yla, Havayolu’nda, havayolları, havayollarına, havayollarında, havayollarından, havayollarını, havayollarının, havayollarıyla, havayolu, havayolunu, havayolunun, havayoluyla.

NOUN occurs with 5 features: Number (13512; 100% instances), Person (13512; 100% instances), Case (13508; 100% instances), Number[psor] (5756; 43% instances), Person[psor] (5756; 43% instances)

NOUN occurs with 15 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3

NOUN occurs with 67 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (5264 tokens). Examples: uçuş, akşam, dönüş, gidiş, sabah, saat, salı, uçmak, yön, cumartesi

Relations

NOUN nodes are attached to their parents using 25 different relations: nmod (3302; 24% instances), obj (2643; 20% instances), nsubj (1648; 12% instances), compound (1183; 9% instances), nmod:tmod (881; 7% instances), obl (749; 6% instances), root (622; 5% instances), obl:tmod (617; 5% instances), xcomp (450; 3% instances), flat (374; 3% instances), nmod:poss (293; 2% instances), amod (232; 2% instances), conj (204; 2% instances), case (82; 1% instances), ccomp (73; 1% instances), csubj (70; 1% instances), nsubj:pass (25; 0% instances), discourse (24; 0% instances), acl (15; 0% instances), parataxis (9; 0% instances), fixed (6; 0% instances), dislocated (5; 0% instances), nummod (3; 0% instances), advcl (1; 0% instances), dep (1; 0% instances)

Parents of NOUN nodes belong to 10 different parts of speech: NOUN (5446; 40% instances), VERB (4182; 31% instances), ADJ (2207; 16% instances), (622; 5% instances), PROPN (433; 3% instances), ADV (232; 2% instances), NUM (206; 2% instances), PRON (118; 1% instances), ADP (60; 0% instances), DET (6; 0% instances)

3812 (28%) NOUN nodes are leaves.

4360 (32%) NOUN nodes have one child.

3076 (23%) NOUN nodes have two children.

2264 (17%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 6.

Children of NOUN nodes are attached using 30 different relations: nmod (6720; 38% instances), amod (2772; 16% instances), acl (1997; 11% instances), det (1284; 7% instances), nmod:tmod (1154; 6% instances), case (826; 5% instances), compound (666; 4% instances), obl (553; 3% instances), nmod:poss (359; 2% instances), obj (247; 1% instances), cc (205; 1% instances), advmod (198; 1% instances), nummod (190; 1% instances), conj (180; 1% instances), flat (114; 1% instances), nsubj (107; 1% instances), obl:tmod (94; 1% instances), advcl (27; 0% instances), csubj (26; 0% instances), discourse (26; 0% instances), ccomp (24; 0% instances), parataxis (11; 0% instances), mark (10; 0% instances), xcomp (10; 0% instances), aux:q (8; 0% instances), nsubj:pass (5; 0% instances), punct (5; 0% instances), dep (3; 0% instances), fixed (2; 0% instances), dislocated (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (5446; 31% instances), PROPN (4757; 27% instances), ADJ (4528; 25% instances), DET (1291; 7% instances), ADP (794; 4% instances), ADV (405; 2% instances), NUM (261; 1% instances), CCONJ (212; 1% instances), PRON (78; 0% instances), VERB (38; 0% instances), AUX (8; 0% instances), PUNCT (5; 0% instances), INTJ (1; 0% instances)