Treebank Statistics: UD_Czech-PDT: POS Tags: NOUN
There are 9001 NOUN
lemmas (33%), 18229 NOUN
types (34%) and 83173 NOUN
tokens (25%).
Out of 17 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: rok, strana, léta, cena, firma, doba, vláda, zákon, společnost, země
The 10 most frequent NOUN
types: p, let, roku, korun, roce, Kč, r, strany, firmy, případě
The 10 most frequent ambiguous lemmas: bod (NOUN 338, PROPN 1), stát (VERB 344, NOUN 329), den (NOUN 272, X 1), místo (NOUN 222, ADP 45, ADV 6), a (CCONJ 7162, NOUN 17, X 6), teplo (NOUN 91, ADV 1), pravda (NOUN 69, PART 2), s (ADP 2504, NOUN 27, X 10, PART 6), růst (NOUN 60, VERB 26), x (NOUN 32, SYM 19)
The 10 most frequent ambiguous types: p (NOUN 163, ADJ 2), s (ADP 1960, NOUN 72, X 10, PART 6), a (CCONJ 6945, ADJ 32, NOUN 17, X 6), září (NOUN 102, VERB 2), j (NOUN 9, ADJ 1), bod (NOUN 87, PROPN 1), stát (NOUN 75, VERB 50), den (NOUN 70, X 1), místo (NOUN 69, ADP 34, ADV 4), x (NOUN 32, SYM 19)
- p
- s
- a
- CCONJ 6945: Je naprosto bezbřehý a nevypočitatelný .
- ADJ 32: a . s . Malostranské nám . 2 118 00 Praha 1 Tel . / fax : 684 62 55
- NOUN 17: Ušetříte téměř 90 % proti variantě a ) .
- X 6: V gigantickém výstavním komplexu Porte de Versailles začal včera v Paříži jeden z největších a nejprestižnějších světových veletrhů módy Pret a Porter .
- září
- j
- bod
- stát
- den
- místo
- x
Morphology
The form / lemma ratio of NOUN
is 2.025219 (the average of all parts of speech is 1.964432).
The 1st highest number of forms (11) was observed with the lemma “strana”: s, str, stran, strana, stranami, stranou, stranu, strany, stranách, stranám, straně.
The 2nd highest number of forms (10) was observed with the lemma “hodina”: Hodina, h, hod, hodin, hodinami, hodinou, hodinu, hodiny, hodinách, hodině.
The 3rd highest number of forms (10) was observed with the lemma “ministr”: ministr, ministra, ministrem, ministrovi, ministru, ministry, ministrů, ministrům, ministře, ministři.
NOUN
occurs with 10 features: Polarity (81649; 98% instances), Gender (79711; 96% instances), Case (78979; 95% instances), Number (78979; 95% instances), Animacy (34831; 42% instances), VerbForm (5750; 7% instances), Abbr (4056; 5% instances), Style (80; 0% instances), Typo (18; 0% instances), Foreign (1; 0% instances)
NOUN
occurs with 25 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
, Polarity=Neg
, Polarity=Pos
, Style=Coll
, Style=Expr
, Style=Slng
, Style=Vrnc
, Typo=Yes
, VerbForm=Vnoun
NOUN
occurs with 151 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|Polarity=Pos
(6407 tokens).
Examples: strany, práce, vlády, společnosti, firmy, republiky, rady, přímky, doby, obrany
Relations
NOUN
nodes are attached to their parents using 27 different relations: nmod (27374; 33% instances), obl (14678; 18% instances), nsubj (13009; 16% instances), obj (9158; 11% instances), conj (5652; 7% instances), obl:arg (5181; 6% instances), root (2493; 3% instances), nsubj:pass (1407; 2% instances), appos (1005; 1% instances), dep (908; 1% instances), fixed (493; 1% instances), xcomp (457; 1% instances), orphan (418; 1% instances), advcl (305; 0% instances), ccomp (165; 0% instances), case (139; 0% instances), acl:relcl (107; 0% instances), acl (60; 0% instances), iobj (58; 0% instances), flat (29; 0% instances), csubj (28; 0% instances), parataxis (22; 0% instances), vocative (18; 0% instances), csubj:pass (4; 0% instances), advmod (3; 0% instances), amod (1; 0% instances), discourse (1; 0% instances)
Parents of NOUN
nodes belong to 17 different parts of speech: VERB (35581; 43% instances), NOUN (32629; 39% instances), ADJ (6271; 8% instances), (2493; 3% instances), PROPN (1409; 2% instances), AUX (1379; 2% instances), NUM (894; 1% instances), DET (814; 1% instances), ADV (753; 1% instances), ADP (496; 1% instances), PRON (148; 0% instances), SYM (127; 0% instances), X (115; 0% instances), PART (54; 0% instances), CCONJ (7; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)
13665 (16%) NOUN
nodes are leaves.
29092 (35%) NOUN
nodes have one child.
24612 (30%) NOUN
nodes have two children.
15804 (19%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 17.
Children of NOUN
nodes are attached using 36 different relations: amod (33138; 24% instances), nmod (29596; 22% instances), case (24959; 18% instances), punct (9363; 7% instances), det (6631; 5% instances), conj (5408; 4% instances), cc (4413; 3% instances), nummod (3702; 3% instances), advmod:emph (3048; 2% instances), acl:relcl (2878; 2% instances), flat (2288; 2% instances), cop (1788; 1% instances), nsubj (1392; 1% instances), nummod:gov (1109; 1% instances), mark (1106; 1% instances), appos (1026; 1% instances), acl (982; 1% instances), dep (941; 1% instances), advmod (484; 0% instances), obl (374; 0% instances), orphan (344; 0% instances), xcomp (343; 0% instances), det:numgov (205; 0% instances), csubj (170; 0% instances), det:nummod (116; 0% instances), parataxis (76; 0% instances), advcl (75; 0% instances), aux (71; 0% instances), obl:arg (25; 0% instances), discourse (14; 0% instances), fixed (13; 0% instances), obj (7; 0% instances), ccomp (6; 0% instances), flat:foreign (6; 0% instances), expl:pv (1; 0% instances), vocative (1; 0% instances)
Children of NOUN
nodes belong to 17 different parts of speech: ADJ (33647; 25% instances), NOUN (32629; 24% instances), ADP (24736; 18% instances), PUNCT (9363; 7% instances), DET (7574; 6% instances), PROPN (6848; 5% instances), NUM (5143; 4% instances), CCONJ (5137; 4% instances), VERB (3877; 3% instances), ADV (2125; 2% instances), AUX (2021; 1% instances), SCONJ (1125; 1% instances), PART (970; 1% instances), X (508; 0% instances), PRON (334; 0% instances), SYM (60; 0% instances), INTJ (2; 0% instances)