Treebank Statistics: UD_Uzbek-UT: POS Tags: PROPN
There are 244 PROPN lemmas (10%), 266 PROPN types (8%) and 308 PROPN tokens (5%).
Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent PROPN lemmas: Oʻzbekiston, Toshkent, Rossiya, Samarqand, Ukraina, Koreya, AQSH, AQSh, Buxoro, Erdoʻgʻan
The 10 most frequent PROPN types: Oʻzbekiston, Toshkent, Rossiya, Ukraina, Koreya, Samarqand, Toshkentda, AQSh, Amerika, Asqar
The 10 most frequent ambiguous lemmas: Ukraina (PROPN 5, NOUN 2), koʻz (NOUN 12, PROPN 1)
The 10 most frequent ambiguous types: Ukraina (PROPN 4, NOUN 2), Hokim (NOUN 1, PROPN 1), Togʻli (ADJ 1, PROPN 1)
- Ukraina
- Hokim
- Togʻli
Morphology
The form / lemma ratio of PROPN is 1.090164 (the average of all parts of speech is 1.456660).
The 1st highest number of forms (4) was observed with the lemma “Oʻzbekiston”: Oʻzbekiston, Oʻzbekistonda, Oʻzbekistonga, Oʻzbekistonning.
The 2nd highest number of forms (3) was observed with the lemma “AQSH”: AQSH, AQSHdagi, AQSHga.
The 3rd highest number of forms (3) was observed with the lemma “Samarqand”: Samarqand, Samarqandda, Samarqandga.
PROPN occurs with 5 features: Case (304; 99% instances), Number (301; 98% instances), Poss (38; 12% instances), Abbr (10; 3% instances), Foreign (2; 1% instances)
PROPN occurs with 11 feature-value pairs: Abbr=Yes, Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Foreign=Yes, Number=Plur, Number=Sing, Poss=Yes
PROPN occurs with 19 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing (214 tokens).
Examples: Oʻzbekiston, Toshkent, Koreya, Rossiya, Ukraina, Amerika, Asqar, Buxoro, Erdoʻgʻan, Isroil
Relations
PROPN nodes are attached to their parents using 14 different relations: flat (84; 27% instances), nmod (55; 18% instances), nsubj (45; 15% instances), compound (36; 12% instances), obl (30; 10% instances), nmod:poss (20; 6% instances), conj (17; 6% instances), obj (10; 3% instances), iobj (4; 1% instances), appos (2; 1% instances), root (2; 1% instances), ccomp (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Parents of PROPN nodes belong to 7 different parts of speech: NOUN (173; 56% instances), VERB (84; 27% instances), PROPN (43; 14% instances), ADJ (3; 1% instances), PRON (2; 1% instances), (2; 1% instances), INTJ (1; 0% instances)
220 (71%) PROPN nodes are leaves.
62 (20%) PROPN nodes have one child.
19 (6%) PROPN nodes have two children.
7 (2%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 6.
Children of PROPN nodes are attached using 13 different relations: flat (35; 27% instances), punct (31; 24% instances), conj (18; 14% instances), compound (15; 12% instances), cc (9; 7% instances), case (6; 5% instances), amod (4; 3% instances), obl (3; 2% instances), advmod (2; 2% instances), aux (2; 2% instances), nsubj (2; 2% instances), acl (1; 1% instances), nmod:poss (1; 1% instances)
Children of PROPN nodes belong to 11 different parts of speech: PROPN (43; 33% instances), PUNCT (31; 24% instances), NOUN (25; 19% instances), ADJ (9; 7% instances), CCONJ (7; 5% instances), ADP (6; 5% instances), ADV (2; 2% instances), AUX (2; 2% instances), VERB (2; 2% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)