Treebank Statistics: UD_Uzbek-TueCL: POS Tags: NOUN
There are 101 NOUN lemmas (36%), 135 NOUN types (32%) and 214 NOUN tokens (23%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: kitob, uy, maktab, bola, mashina, ona, doʻst, xona, er, kun
The 10 most frequent NOUN types: kitob, uyda, kitobni, maktabga, bola, onasini, Oʻqituvchi, mashina, non, shifokor
The 10 most frequent ambiguous lemmas: Piter (PROPN 4, NOUN 1), koʻk (ADJ 4, NOUN 1)
The 10 most frequent ambiguous types: none listed.
Morphology
The form / lemma ratio of NOUN is 1.336634 (the average of all parts of speech is 1.489437).
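The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas (135 / 101 ≈ 1.336634). The following is a minimal sketch of how such counts could be derived from CoNLL-U text, assuming forms are compared case-sensitively and that multiword-token ranges and empty nodes are skipped. The helper names `row` and `pos_counts` and the toy sample are hypothetical and are not taken from the treebank or its tooling.

```python
from collections import defaultdict

def row(tok_id, form, lemma, upos):
    """Build one 10-column CoNLL-U line; unused columns are left as '_'."""
    return "\t".join([str(tok_id), form, lemma, upos] + ["_"] * 6)

# Toy sample for illustration only; NOT taken from the treebank files.
SAMPLE = "\n".join([
    "# text = (toy sentence)",
    row(1, "uy", "uy", "NOUN"),
    row(2, "uyda", "uy", "NOUN"),
    row(3, "uyga", "uy", "NOUN"),
    row(4, "kitob", "kitob", "NOUN"),
    row(5, "kitobni", "kitob", "NOUN"),
    row(6, "kitob", "kitob", "NOUN"),
    row(7, "oʻqidi", "oʻqi", "VERB"),
])

def pos_counts(conllu_text, upos="NOUN"):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    forms_by_lemma = defaultdict(set)
    tokens = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            forms_by_lemma[cols[2]].add(cols[1])
            tokens += 1
    lemmas = len(forms_by_lemma)
    # Distinct word forms across all lemmas (case-sensitive by assumption).
    types = len({form for forms in forms_by_lemma.values() for form in forms})
    return lemmas, types, tokens

lemmas, types, tokens = pos_counts(SAMPLE)
print(lemmas, types, tokens, types / lemmas)
```

On the toy sample this gives 2 lemmas, 5 types and 6 tokens; applied to the figures reported above, the same division gives 135 / 101.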
The 1st highest number of forms (9) was observed with the lemma “uy”: Uyning, uy, uyda, uydagi, uydaginiki, uydami, uydamikan, uyga, uyiga.
The 2nd highest number of forms (5) was observed with the lemma “kitob”: kitob, kitobga, kitobi, kitobki, kitobni.
The 3rd highest number of forms (4) was observed with the lemma “bola”: bola, bolalar, bolalarga, bolaning.
NOUN occurs with 3 features: Case (214; 100% instances), Number (213; 100% instances), Poss (31; 14% instances)
NOUN occurs with 9 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Number=Plur, Number=Sing, Poss=Yes
NOUN occurs with 19 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing (83 tokens).
Examples: kitob, bola, Oʻqituvchi, mashina, non, shifokor, eʼlon, harakat, kuni, rasm
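A feature combination here is the full value of the FEATS column for a token, such as Case=Nom|Number=Sing. As a rough sketch of how the combinations could be tallied, the snippet below counts FEATS strings for one tag. The function name `feature_combinations` and the toy rows are hypothetical; the annotations given for kitobni, bolalarga and Piter are assumed for illustration and were not checked against the treebank.

```python
from collections import Counter

def feature_combinations(rows, upos="NOUN"):
    """Count FEATS strings for tokens with the given UPOS tag.

    rows: iterable of (form, upos, feats) tuples, with feats written
    as in the CoNLL-U FEATS column.
    """
    return Counter(feats for _, tag, feats in rows if tag == upos)

# Toy rows for illustration only; annotations are assumptions.
TOY_ROWS = [
    ("kitob", "NOUN", "Case=Nom|Number=Sing"),
    ("bola", "NOUN", "Case=Nom|Number=Sing"),
    ("kitobni", "NOUN", "Case=Acc|Number=Sing"),
    ("bolalarga", "NOUN", "Case=Dat|Number=Plur"),
    ("Piter", "PROPN", "Case=Nom|Number=Sing"),
]

combos = feature_combinations(TOY_ROWS)
top_combo, top_count = combos.most_common(1)[0]
print(len(combos), top_combo, top_count)
```

For scale, the 83 tokens reported for Case=Nom|Number=Sing amount to roughly 39% of the 214 NOUN tokens.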
Relations
NOUN nodes are attached to their parents using 12 different relations: obl (63; 29% instances), obj (53; 25% instances), nsubj (44; 21% instances), root (19; 9% instances), nmod (10; 5% instances), compound:lvc (8; 4% instances), nsubj:pass (5; 2% instances), compound (4; 2% instances), conj (3; 1% instances), nmod:poss (2; 1% instances), xcomp (2; 1% instances), amod (1; 0% instances)
Parents of NOUN nodes fall into 6 categories (5 parts of speech, plus the artificial root, which has no tag): VERB (135; 63% instances), ADJ (32; 15% instances), NOUN (23; 11% instances), [root] (19; 9% instances), PROPN (3; 1% instances), PRON (2; 1% instances)
130 (61%) NOUN nodes are leaves.
51 (24%) NOUN nodes have one child.
19 (9%) NOUN nodes have two children.
14 (7%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 5.
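The child-degree figures above (leaves, one child, two children, three or more) can be read as a histogram over the number of dependents per NOUN node. The sketch below shows one way to compute such a histogram from the ID, UPOS and HEAD columns, under the assumption that every node, including punctuation, counts as a child. The function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="NOUN"):
    """Histogram of child counts for nodes with the given UPOS tag.

    sentences: list of sentences, each a list of (id, upos, head) tuples.
    Keys are 0, 1, 2 and 3, where 3 stands for "three or more" children.
    """
    hist = Counter()
    for sent in sentences:
        # Number of dependents per head ID within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[min(n_children[tok_id], 3)] += 1
    return hist

# Toy sentence for illustration only: ADJ -> NOUN, both NOUNs -> VERB (root).
TOY = [[
    (1, "ADJ", 2),
    (2, "NOUN", 4),
    (3, "NOUN", 4),
    (4, "VERB", 0),
    (5, "PUNCT", 4),
]]

hist = child_degree_histogram(TOY)
print(hist[0], hist[1], hist[2], hist[3])
```

The four buckets reported above sum to the NOUN token count: 130 + 51 + 19 + 14 = 214.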
Children of NOUN nodes are attached using 17 different relations: punct (22; 16% instances), nsubj (21; 16% instances), amod (19; 14% instances), cop (13; 10% instances), nmod (11; 8% instances), nmod:poss (8; 6% instances), case (7; 5% instances), det (6; 4% instances), acl (5; 4% instances), nummod (5; 4% instances), obl (5; 4% instances), conj (4; 3% instances), compound (3; 2% instances), advmod (2; 1% instances), xcomp (2; 1% instances), aux (1; 1% instances), parataxis (1; 1% instances)
Children of NOUN nodes belong to 11 different parts of speech: NOUN (23; 17% instances), PUNCT (22; 16% instances), PROPN (20; 15% instances), ADJ (16; 12% instances), AUX (14; 10% instances), VERB (11; 8% instances), PRON (8; 6% instances), DET (7; 5% instances), ADP (6; 4% instances), NUM (5; 4% instances), ADV (3; 2% instances)