home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Polish-LFG: POS Tags: NOUN

There are 6238 NOUN lemmas (37%), 11161 NOUN types (37%) and 25323 NOUN tokens (19%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: pan, rok, człowiek, czas, pani, ręka, raz, głowa, dziecko, chwila

The 10 most frequent NOUN types: pan, pani, lat, domu, roku, pana, drzwi, raz, oczy, panie

The 10 most frequent ambiguous lemmas: czas (NOUN 183, VERB 3), rzecz (NOUN 52, ADP 2), sposób (NOUN 38, VERB 2), złoty (NOUN 29, ADJ 10), pora (NOUN 28, VERB 4), temat (NOUN 17, ADP 7), rano (ADV 25, NOUN 15), znajomy (NOUN 12, ADJ 3), dziś (ADV 46, NOUN 11), koło (ADP 14, NOUN 11)

The 10 most frequent ambiguous types: pana (NOUN 58, PROPN 1), sposób (NOUN 33, VERB 2), razem (NOUN 32, ADV 31), czasie (ADP 26, NOUN 25), myśli (NOUN 23, VERB 10), stanie (NOUN 23, VERB 4), sprawie (NOUN 15, ADP 6), złotych (NOUN 15, ADJ 1), drogi (NOUN 14, ADJ 3), dziś (ADV 25, NOUN 11)

Morphology

The form / lemma ratio of NOUN is 1.789195 (the average of all parts of speech is 1.795702).

The 1st highest number of forms (11) was observed with the lemma “oko”: oczach, oczami, oczom, oczu, oczy, oczyma, oczów, oka, okiem, oko, oku.

The 2nd highest number of forms (10) was observed with the lemma “człowiek”: człowiek, człowieka, człowiekiem, człowiekowi, człowieku, ludzi, ludziach, ludzie, ludziom, ludźmi.

The 3rd highest number of forms (10) was observed with the lemma “ręka”: rąk, ręce, ręka, rękach, rękami, ręki, rękoma, ręku, ręką, rękę.

NOUN occurs with 8 features: Case (25323; 100% instances), Gender (25323; 100% instances), Number (25323; 100% instances), SubGender (11867; 47% instances), Aspect (562; 2% instances), Polarity (562; 2% instances), VerbForm (562; 2% instances), Polite (17; 0% instances)

NOUN occurs with 21 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Polarity=Neg, Polarity=Pos, Polite=Depr, SubGender=Masc1, SubGender=Masc2, SubGender=Masc3, VerbForm=Vnoun

NOUN occurs with 81 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (1879 tokens). Examples: chwilę, głowę, twarz, rację, rękę, niedzielę, sprawę, sobotę, drogę, uwagę

Relations

NOUN nodes are attached to their parents using 19 different relations: obl (9286; 37% instances), nsubj (6242; 25% instances), obj (4316; 17% instances), nmod (1574; 6% instances), nmod:poss (1467; 6% instances), conj (909; 4% instances), root (340; 1% instances), iobj (334; 1% instances), nsubj:pass (211; 1% instances), flat (159; 1% instances), appos (146; 1% instances), xcomp (138; 1% instances), vocative (102; 0% instances), obl:agent (58; 0% instances), ccomp (17; 0% instances), advcl (12; 0% instances), ccomp:obj (7; 0% instances), csubj (3; 0% instances), acl:relcl (2; 0% instances)

Parents of NOUN nodes belong to 9 different parts of speech: VERB (19285; 76% instances), NOUN (4155; 16% instances), ADJ (1130; 4% instances), (340; 1% instances), PROPN (119; 0% instances), ADV (102; 0% instances), PRON (93; 0% instances), DET (83; 0% instances), NUM (16; 0% instances)

7460 (29%) NOUN nodes are leaves.

10727 (42%) NOUN nodes have one child.

5435 (21%) NOUN nodes have two children.

1701 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 9.

Children of NOUN nodes are attached using 29 different relations: case (8536; 31% instances), amod (5654; 21% instances), det (2591; 9% instances), nmod:poss (2339; 8% instances), nmod (1754; 6% instances), punct (945; 3% instances), conj (906; 3% instances), advmod (807; 3% instances), flat (770; 3% instances), acl (733; 3% instances), nummod (716; 3% instances), cc (695; 3% instances), cop (337; 1% instances), nsubj (266; 1% instances), acl:relcl (99; 0% instances), appos (97; 0% instances), cop:locat (79; 0% instances), mark (55; 0% instances), obl (53; 0% instances), cc:preconj (27; 0% instances), expl:pv (24; 0% instances), aux:aglt (21; 0% instances), advcl (9; 0% instances), aux (7; 0% instances), csubj (4; 0% instances), aux:mood (3; 0% instances), vocative (3; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: ADP (8536; 31% instances), ADJ (6439; 23% instances), NOUN (4155; 15% instances), DET (2614; 9% instances), PROPN (1402; 5% instances), PUNCT (945; 3% instances), NUM (725; 3% instances), CCONJ (722; 3% instances), PART (718; 3% instances), PRON (556; 2% instances), AUX (447; 2% instances), VERB (129; 0% instances), ADV (100; 0% instances), SCONJ (44; 0% instances)