home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: NOUN

There are 2418 NOUN lemmas (25%), 8013 NOUN types (30%) and 27244 NOUN tokens (20%). Out of 17 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: панъ, милость, чоловекъ, листъ, земля, место, князь, мещанинъ, день, право

The 10 most frequent NOUN types: м(и)л(о)сти, м(и)л(о)сть, панъ, люди, земли, пан, мѣста, пана, копъ, имѧ

The 10 most frequent ambiguous lemmas: право (NOUN 318, ADV 10), полочанинъ (NOUN 246, ADJ 1), приятель (NOUN 245, PRON 1), справа (NOUN 175, ADV 1), озеро (NOUN 107, ADV 1), приязнь (NOUN 38, ADV 1), потреба (NOUN 32, VERB 3), ближний (NOUN 24, ADJ 5), гора (NOUN 14, ADV 1), восковничий (NOUN 11, ADJ 3)

The 10 most frequent ambiguous types: право (NOUN 98, ADV 10, ADJ 1), справа (NOUN 39, ADV 1), вря(д) (NOUN 50, ADV 2), приязнь (NOUN 26, ADV 1), речи (NOUN 21, VERB 1), рада (NOUN 15, ADJ 1), правъ (NOUN 11, ADJ 2), праве (NOUN 10, ADV 3), потреба (NOUN 6, VERB 3), правѣ (ADV 11, NOUN 6)

Morphology

The form / lemma ratio of NOUN is 3.313896 (the average of all parts of speech is 2.698737).

The 1st highest number of forms (75) was observed with the lemma “бурмистръ”: бормистрꙋ, боръмистром, боръмистроу, боурмистром, боурмистромъ, боурмистроу, боурмистру, боурмистрꙋ, боуръмистром, боуръмистроу, боуръмистру, бу(р)мис(т)рами, бу(р)мис(т)рахъ, бу(р)мис(т)ровъ, бу(р)мистра, бу(р)мистра(х), бу(р)мистрами, бу(р)мистрахъ, бу(р)мистро(в), бу(р)мистро(въ), бу(р)мистро(м), бу(р)мистровъ, бу(р)мистромъ, бу(р)митровъ, бурми, бурмистрами, бурмистро(въ), бурмистров, бурмистрове, бурмистровъ, бурмистром, бурмистромъ, бурмистроу, бурмистру, бурмистры, бурмистрꙋ, буръми(с)тро(въ), буръмистра, буръмистрами, буръмистро(въ), буръмистро(м), буръмистрове, буръмистровъ, буръмистром, буръмистромъ, буръмистру, буръмистръ, буръмистры, буръмистрѡм, буръмистрꙋ, буръстрове, бурьмистрꙋ, бꙋрмистра, бꙋрмистров, бꙋрмистровъ, бꙋрмистром, бꙋрмистромъ, бꙋрмистроу, бꙋрмистру, бꙋрмистръ, бꙋрмистры, бꙋрмистрѡм, бꙋрмистрꙋ, бꙋрмисту, бꙋръмистра, бꙋръмистров, бꙋръмистрове, бꙋръмистровъ, бꙋръмистром, бꙋръмистромъ, бꙋръмистру, бꙋръмистры, бꙋръмистрꙋ, бꙋрьмистръ, бꙋрьмистры.

The 2nd highest number of forms (64) was observed with the lemma “купецъ”: коупцем, коупцовъ, коупцом, коупцомъ, коупцѣмъ, коупъцомъ, коупѣць, купец, купець, купцев, купцеви, купцем, купци, купцов, купцовъ, купцом, купцомъ, купцы, купцю, купцюви, купцѣ, купцѣви, купцѣм, купцѣмъ, купцꙋ, купъца, купъцевъ, купъцемъ, купъци, купъцо(въ), купъцо(м), купъцовъ, купъцѣ, купъцѣви, купъцѣвъ, купьцев, купьцеви, купьцевъ, купьцем, купьцемъ, купьцемь, купьци, купьцом, купьцомъ, купьцю, купѣць, кꙋпец, кꙋпецъ, кꙋпець, кꙋпца, кꙋпцев, кꙋпцевъ, кꙋпци, кꙋпцов, кꙋпцовъ, кꙋпцом, кꙋпцы, кꙋпцѣви, кꙋпцѣх, кꙋпцꙋ, кꙋпъца, кꙋпъцевъ, кꙋпъцы, кꙋпѣць.

The 3rd highest number of forms (56) was observed with the lemma “панъ”: п(а)н, п(а)н(а), п(а)на, п(а)не, п(а)нов, п(а)нове, п(а)новъ, п(а)ном, п(а)номъ, п(а)ну, п(а)ны, п(а)нꙋ, па, па<нъ, па(н), па(н)ъ, па(н̑), пану, пан, пан(а), пан[а], пана, пане, панех, панехъ, пано(м), пано(м)ъ, панов, панове, пановъ, пановє, пановѣ, паном, паномъ, паноу, паноум, пану, панъ, паны, панє, панѡм, панѣ, панѣхъ, панꙋ, панꙑ, пна, пну, пн҃а, пн҃е, пн҃о(м)ъ, пн҃у, п҃(н), п҃на, п҃новє, п҃ну, п҃нъ.

NOUN occurs with 7 features: Case (27188; 100% instances), Gender (27188; 100% instances), Number (27188; 100% instances), Animacy (110; 0% instances), Abbr (63; 0% instances), Typo (5; 0% instances), InflClass (2; 0% instances)

NOUN occurs with 18 feature-value pairs: Abbr=Yes, Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=Ind, Number=Count, Number=Dual, Number=Plur, Number=Sing, Typo=Yes

NOUN occurs with 64 feature combinations. The most frequent feature combination is Case=Gen|Gender=Masc|Number=Sing (3061 tokens). Examples: пана, брата, року, королѧ, підпису, днѧ, вряду, дня, м҃(с)ца, м(е)с(е)ца

Relations

NOUN nodes are attached to their parents using 31 different relations: conj (5205; 19% instances), obl (5132; 19% instances), nmod (4168; 15% instances), obj (3531; 13% instances), appos (3173; 12% instances), nsubj (2297; 8% instances), iobj (1322; 5% instances), root (1109; 4% instances), orphan (323; 1% instances), obl:tmod (206; 1% instances), nsubj:pass (137; 1% instances), advcl (83; 0% instances), acl:relcl (81; 0% instances), vocative (78; 0% instances), parataxis (74; 0% instances), flat:name (46; 0% instances), ccomp (45; 0% instances), xcomp (45; 0% instances), dislocated (32; 0% instances), nummod:gov (30; 0% instances), compound (26; 0% instances), obl:agent (24; 0% instances), acl (21; 0% instances), reparandum (19; 0% instances), flat (7; 0% instances), list (7; 0% instances), nummod (7; 0% instances), dep (6; 0% instances), csubj (4; 0% instances), amod (3; 0% instances), fixed (3; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (12229; 45% instances), NOUN (10717; 39% instances), PROPN (1675; 6% instances), (1109; 4% instances), PRON (681; 2% instances), ADJ (528; 2% instances), ADV (135; 0% instances), DET (59; 0% instances), NUM (40; 0% instances), AUX (21; 0% instances), PART (19; 0% instances), INTJ (12; 0% instances), ADP (11; 0% instances), CCONJ (3; 0% instances), SCONJ (3; 0% instances), X (2; 0% instances)

3012 (11%) NOUN nodes are leaves.

8072 (30%) NOUN nodes have one child.

8129 (30%) NOUN nodes have two children.

8031 (29%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 55.

Children of NOUN nodes are attached using 43 different relations: det (9146; 17% instances), punct (8050; 15% instances), case (7985; 14% instances), amod (6922; 13% instances), nmod (4881; 9% instances), conj (4763; 9% instances), cc (4557; 8% instances), appos (4248; 8% instances), nummod:gov (528; 1% instances), orphan (506; 1% instances), acl (503; 1% instances), advmod (465; 1% instances), nummod (456; 1% instances), acl:relcl (444; 1% instances), cop (292; 1% instances), nsubj (291; 1% instances), parataxis (236; 0% instances), obl (189; 0% instances), mark (144; 0% instances), iobj (137; 0% instances), dep (74; 0% instances), advcl (66; 0% instances), csubj (37; 0% instances), flat (31; 0% instances), obj (20; 0% instances), reparandum (20; 0% instances), discourse (19; 0% instances), flat:name (17; 0% instances), aux (14; 0% instances), dislocated (14; 0% instances), xcomp (9; 0% instances), compound (8; 0% instances), ccomp (5; 0% instances), expl (5; 0% instances), obl:tmod (5; 0% instances), list (4; 0% instances), parataxis:discourse (3; 0% instances), fixed (2; 0% instances), nsubj:pass (2; 0% instances), obl:float (2; 0% instances), vocative (2; 0% instances), expl:pv (1; 0% instances), nsubj:outer (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (10717; 19% instances), PUNCT (8050; 15% instances), ADP (7904; 14% instances), DET (7884; 14% instances), ADJ (7151; 13% instances), CCONJ (4537; 8% instances), PROPN (3432; 6% instances), PRON (2025; 4% instances), VERB (1121; 2% instances), NUM (994; 2% instances), AUX (316; 1% instances), PART (281; 1% instances), ADV (252; 0% instances), SCONJ (238; 0% instances), SYM (137; 0% instances), X (55; 0% instances), INTJ (10; 0% instances)