
This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: NOUN

There are 1671 NOUN lemmas (34%), 3061 NOUN types (35%) and 12681 NOUN tokens (24%). Of the 16 observed tags, NOUN ranks first in number of lemmas, in number of types and in number of tokens.

The 10 most frequent NOUN lemmas: колега, питання, закон, депутат, рада, ласка, рішення, законопроект, проект, комітет

The 10 most frequent NOUN types: колеги, ласка, питання, рішення, ради, закону, законопроект, проект, слово, комітету

The 10 most frequent ambiguous lemmas: раз (NOUN 43, ADV 1), уповноважений (NOUN 22, ADJ 2), військовий (ADJ 22, NOUN 14), перше (NOUN 14, ADV 1), друге (NOUN 12, ADJ 3), головуючий (NOUN 9, ADJ 1), ТСК (NOUN 3, PROPN 1), правда (NOUN 2, PART 1), Держдума (PROPN 2, NOUN 1), виступаючий (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: цілому (NOUN 40, ADJ 4), права (NOUN 31, ADJ 1), раді (NOUN 5, ADJ 1), перше (NOUN 7, ADJ 2), друге (ADJ 12, NOUN 4), головуючий (NOUN 7, ADJ 1), військових (NOUN 6, ADJ 1), військові (NOUN 6, ADJ 2), головне (NOUN 4, ADJ 2), рівні (NOUN 5, ADJ 1)

Morphology

The form / lemma ratio of NOUN is 1.831837 (the average of all parts of speech is 1.786380).
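The reported ratio matches the number of NOUN types divided by the number of NOUN lemmas given above (3061 / 1671 ≈ 1.831837). The following is a minimal sketch of how such counts could be derived from CoNLL-U text. The function name `form_lemma_ratio` and the `SAMPLE` data are hypothetical, and the sketch assumes that word forms are compared case-sensitively, which may differ from how this page was generated.

```python
from collections import defaultdict


def form_lemma_ratio(conllu_text, upos="NOUN"):
    """Return (lemmas, types, tokens, ratio) for one UPOS tag.

    Assumption: a "type" is a distinct word form, compared case-sensitively.
    """
    forms_by_lemma = defaultdict(set)
    types = set()
    tokens = 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag != upos:
            continue
        tokens += 1
        types.add(form)
        forms_by_lemma[lemma].add(form)
    lemmas = len(forms_by_lemma)
    ratio = len(types) / lemmas if lemmas else 0.0
    return lemmas, len(types), tokens, ratio


# Hypothetical toy input, not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tколеги\tколега\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\tколег\tколега\tNOUN\t_\t_\t1\tnmod\t_\t_",
    "3\tколеги\tколега\tNOUN\t_\t_\t1\tconj\t_\t_",
    "4\tзакону\tзакон\tNOUN\t_\t_\t1\tnmod\t_\t_",
    "5\tпрошу\tпросити\tVERB\t_\t_\t1\tparataxis\t_\t_",
    "",
])

print(form_lemma_ratio(SAMPLE))
# Ratio implied by the figures reported on this page:
print(round(3061 / 1671, 6))
```

On the toy input there are 2 lemmas, 3 types and 4 tokens, so the ratio is 1.5.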

The highest number of forms (9) was observed with the lemma “колега”: колег, колега, колегам, колегами, колеги, колего, колегою, колегу, колезі.

The same number of forms (9) was observed with the lemma “мова”: мов, мова, мовам, мовами, мовах, мови, мовою, мову, мові.

The next highest number of forms (8) was observed with the lemma “громадянин”: громадян, громадянам, громадянами, громадяни, громадянин, громадянина, громадянином, громадянину.

NOUN occurs with 10 features: Case (12679; 100% instances), Number (12678; 100% instances), Animacy (12658; 100% instances), Gender (12609; 99% instances), NumType (69; 1% instances), InflClass (37; 0% instances), BadStyle (31; 0% instances), Abbr (17; 0% instances), Typo (17; 0% instances), Animacy[gram] (1; 0% instances)

NOUN occurs with 24 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Animacy[gram]=Inan, BadStyle=Yes, Case=Acc, Case=Dat, Case=Dat,Gen, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, InflClass=Ind, NumType=Card, NumType=Ord, Number=Plur, Number=Ptan, Number=Sing, Typo=Yes

NOUN occurs with 152 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing (1391 tokens). Examples: ради, постанови, партії, комісії, статті, освіти, країни, Конституції, частини, держави
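Counts of features, feature-value pairs and feature combinations like those above can be sketched as three tallies over the FEATS column of a CoNLL-U file. The function name `noun_feature_stats` and the `SAMPLE` rows are hypothetical. The sketch keeps multi-valued entries such as `Case=Dat,Gen` as a single pair, which is how they appear in the list above; whether the page generator does the same internally is an assumption.

```python
from collections import Counter


def noun_feature_stats(conllu_text, upos="NOUN"):
    """Tally feature names, feature=value pairs and whole FEATS strings."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for line in conllu_text.splitlines():
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        # Keep only regular tokens (integer ID) with the requested UPOS.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        feats = cols[5]
        combos[feats] += 1
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            name = pair.partition("=")[0]
            names[name] += 1
            pairs[pair] += 1  # "Case=Dat,Gen" stays one pair
    return names, pairs, combos


# Hypothetical toy input, not taken from the treebank.
SAMPLE = "\n".join([
    "1\tради\tрада\tNOUN\t_\tAnimacy=Inan|Case=Gen|Gender=Fem|Number=Sing\t0\troot\t_\t_",
    "2\tпартії\tпартія\tNOUN\t_\tAnimacy=Inan|Case=Gen|Gender=Fem|Number=Sing\t1\tconj\t_\t_",
    "3\tколеги\tколега\tNOUN\t_\tAnimacy=Anim|Case=Nom|Gender=Masc|Number=Plur\t1\tconj\t_\t_",
    "4\tпрошу\tпросити\tVERB\t_\tMood=Ind|Number=Sing\t1\tparataxis\t_\t_",
    "",
])

names, pairs, combos = noun_feature_stats(SAMPLE)
print(len(names), len(pairs), len(combos))
print(combos.most_common(1))
```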

Relations

NOUN nodes are attached to their parents using 29 different relations: nmod (4262; 34% instances), obj (1853; 15% instances), obl (1737; 14% instances), nsubj (1350; 11% instances), conj (952; 8% instances), root (440; 3% instances), vocative (417; 3% instances), appos (342; 3% instances), obl:arg (279; 2% instances), fixed (270; 2% instances), iobj (173; 1% instances), parataxis (171; 1% instances), orphan (78; 1% instances), nsubj:pass (73; 1% instances), obl:agent (57; 0% instances), advcl (51; 0% instances), ccomp (31; 0% instances), xcomp (30; 0% instances), nummod:gov (26; 0% instances), acl:relcl (20; 0% instances), dislocated (20; 0% instances), nummod (16; 0% instances), reparandum (9; 0% instances), acl (7; 0% instances), flat (5; 0% instances), nsubj:outer (5; 0% instances), amod (4; 0% instances), flat:range (2; 0% instances), flat:title (1; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech (counting the artificial root, which has no part of speech, as one category): VERB (5603; 44% instances), NOUN (5549; 44% instances), ADJ (543; 4% instances), [root, no part of speech] (440; 3% instances), ADV (200; 2% instances), PROPN (113; 1% instances), PRON (72; 1% instances), NUM (53; 0% instances), DET (40; 0% instances), AUX (33; 0% instances), ADP (31; 0% instances), X (3; 0% instances), SCONJ (1; 0% instances)
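The relation and parent statistics above can be approximated by reading the HEAD and DEPREL columns of each sentence. The sketch below is illustrative only: `read_sentences`, `attachment_stats`, the `"ROOT"` label for tokens whose head is 0, and the `SAMPLE` sentences are all hypothetical names and data. The 440 NOUN nodes with the `root` relation correspond to the 440 parents without a part of speech.

```python
from collections import Counter


def read_sentences(conllu_text):
    """Yield each sentence as a list of 10-column rows for regular tokens."""
    rows = []
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            if rows:
                yield rows
            rows = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword-token ranges and empty nodes.
            if len(cols) == 10 and cols[0].isdigit():
                rows.append(cols)


def attachment_stats(conllu_text, upos="NOUN"):
    """Count DEPREL labels and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for rows in read_sentences(conllu_text):
        tag_by_id = {cols[0]: cols[3] for cols in rows}
        for cols in rows:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # HEAD "0" has no entry in tag_by_id; label it "ROOT" (assumption).
            parent_tags[tag_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_tags


# Hypothetical toy input, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Комітет підтримав рішення ради",
    "1\tКомітет\tкомітет\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tпідтримав\tпідтримати\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tрішення\tрішення\tNOUN\t_\t_\t2\tobj\t_\t_",
    "4\tради\tрада\tNOUN\t_\t_\t3\tnmod\t_\t_",
    "",
    "# text = Питання",
    "1\tПитання\tпитання\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
])

relations, parent_tags = attachment_stats(SAMPLE)
print(relations)
print(parent_tags)
```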

1981 (16%) NOUN nodes are leaves.

4270 (34%) NOUN nodes have one child.

3948 (31%) NOUN nodes have two children.

2482 (20%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 10.
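The four child-count groups above partition all NOUN tokens (1981 + 4270 + 3948 + 2482 = 12681). A minimal sketch of how such a distribution could be computed from CoNLL-U text follows. The function name `child_degree_stats`, the bucket labels and the `SAMPLE` sentence are hypothetical.

```python
from collections import Counter


def child_degree_stats(conllu_text, upos="NOUN"):
    """Return (buckets, max_degree) for nodes tagged `upos`.

    `buckets` maps "0", "1", "2" and "3+" to the number of nodes
    with that many children.
    """
    buckets = Counter()
    max_degree = 0
    rows = []
    for line in conllu_text.splitlines() + [""]:
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows.append(cols)  # regular token row
        elif not line.strip() and rows:
            # End of sentence: count how often each ID is used as a HEAD.
            children = Counter(row[6] for row in rows)
            for row in rows:
                if row[3] != upos:
                    continue
                degree = children[row[0]]
                max_degree = max(max_degree, degree)
                buckets[str(degree) if degree < 3 else "3+"] += 1
            rows = []
    return buckets, max_degree


# Hypothetical toy input, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Комітет підтримав рішення ради",
    "1\tКомітет\tкомітет\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tпідтримав\tпідтримати\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tрішення\tрішення\tNOUN\t_\t_\t2\tobj\t_\t_",
    "4\tради\tрада\tNOUN\t_\t_\t3\tnmod\t_\t_",
    "",
])

buckets, max_degree = child_degree_stats(SAMPLE)
print(dict(buckets), max_degree)
```

In the toy sentence, "Комітет" and "ради" are leaves and "рішення" has one child, so the highest child degree of a NOUN node is 1.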

Children of NOUN nodes are attached using 40 different relations: nmod (4785; 23% instances), amod (4247; 20% instances), case (3387; 16% instances), punct (2474; 12% instances), det (1574; 7% instances), conj (961; 5% instances), cc (656; 3% instances), appos (617; 3% instances), acl:relcl (531; 3% instances), nsubj (239; 1% instances), nummod (204; 1% instances), acl (194; 1% instances), advmod (170; 1% instances), parataxis (168; 1% instances), nummod:gov (148; 1% instances), mark (136; 1% instances), orphan (85; 0% instances), cop (81; 0% instances), discourse (70; 0% instances), advmod:emph (62; 0% instances), expl (61; 0% instances), vocative (57; 0% instances), advmod:neg (51; 0% instances), obl (37; 0% instances), det:numgov (31; 0% instances), iobj (20; 0% instances), ccomp (15; 0% instances), advcl (11; 0% instances), det:nummod (7; 0% instances), reparandum (7; 0% instances), flat (6; 0% instances), csubj (5; 0% instances), flat:title (5; 0% instances), obj (4; 0% instances), compound (2; 0% instances), fixed (2; 0% instances), obl:agent (2; 0% instances), flat:name (1; 0% instances), obl:arg (1; 0% instances), parataxis:rel (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (5549; 26% instances), ADJ (4376; 21% instances), ADP (3382; 16% instances), PUNCT (2474; 12% instances), DET (1650; 8% instances), PROPN (867; 4% instances), VERB (729; 3% instances), CCONJ (648; 3% instances), NUM (382; 2% instances), PRON (341; 2% instances), ADV (287; 1% instances), PART (198; 1% instances), SCONJ (129; 1% instances), AUX (90; 0% instances), X (13; 0% instances)