home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: NOUN

There are 971 NOUN lemmas (20%), 3256 NOUN types (28%) and 4853 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: гривна, куна, поклонъ, рожь, господинъ, осподинъ, грамота, бѣлка, рубль, конь

The 10 most frequent NOUN types: ржи, поклонъ, гн҃е, поклоно, кѹно, покланѧние, соли, кѹнъ, грамота, жита

The 10 most frequent ambiguous lemmas: коробьꙗ (NOUN 57, PRON 1), дѣти (NOUN 47, VERB 1), вершь (NOUN 17, ADV 1), сорокъ (NOUN 12, NUM 3), сто (NOUN 9, NUM 2), добро (NOUN 8, SCONJ 1), ручии (NOUN 8, PROPN 1), вода (NOUN 6, PROPN 1), другъ (NOUN 3, ADV 1), море (NOUN 2, PROPN 1)

The 10 most frequent ambiguous types: соли (NOUN 22, PROPN 1), намо (NOUN 6, PRON 4), добро (ADJ 8, NOUN 3, SCONJ 1), межи (NOUN 3, ADP 1), [к] (NOUN 2, NUM 1), вода (NOUN 2, VERB 2), ги (NOUN 2, X 1), кнѧзѧ (NOUN 2, PROPN 1), лихо (NOUN 2, ADJ 1), море (NOUN 2, PROPN 1)

Morphology

The form / lemma ratio of NOUN is 3.353244 (the average of all parts of speech is 2.412613).

The 1st highest number of forms (169) was observed with the lemma “гривна”: (г)[р]в҃не, (г)ри[вьнь], (гр)[и]в[ь]нъ, (гр)ивена, (гри)вна, (гри)внахъ, (гри)вну, (гри)внъ, (гри)вьн[ѣ], (гри)вьне, (гри)вьно, (гри)[в]ьна, (гри)[вь]но, (гри)вна, (гри)вьно, (гри)вьнъ, (гриве)но, (гривено), (гривенѣ), (гривь)но, (гривь)н[ѣ], [г]рвну, [г]рв҃нѣ, [г]ривна, [г]ривьнѫ, [гр]ивнѣ, [грв҃]н[а], [гри]вь[н]е, [гри]вьнѹ, [грив]но, [гривна, гривь, [гривь]нѹ, [грн]е, вне, г)р(и)в[е], г)ривьне, г)ривьно, гр, г[ри]в[ь]н…, г]р[ив]нѣ, г]ри(в), гивьна, гиривьнѣ, гр(и)вьнъ, гр)[ив]ьнъ, гр[и]…, грив[в]н[е], гри)[в]ьне, гри:в[не, гри:вь:нѹ, гри(внь), гривне, гривни, гривно, гривнꙑ, гривън[о], гривьне, гривьнѣ, грив, грив[ено, грив[ь]но, грив[ь]нѹ, грив]ена, гриве[н]ѹ, гривене, гривенѣ, гривен[а], гривен[е], гривена, гривена], гривенахъ, гривене, гривено, гривенъ, гривень, гривенѣ, гривенѹ, гривен…, гривини, гривн, гривна, гривнама, гривне, гривни, гривно, гривнъ, гривнь, гривнѣ, гривнѹ, гривн…, гривнꙑ, гривоно, гривонъ, гривонѹ, гривъвъно, гривън)ꙑ, гривън-, гривънъ, гривънѣ, гривънꙑ, гривь(но), гривь(нъ), гривьна, гривьне, гривьни:, гривьнѣ, гривьнꙋ, гривь]нь, гривь]но, гривьии, гривьн[ь], гривьн[ѣ], гривьна, гривьне, гривьно, гривьною, гривьнъ, гривьнь, гривьнѣ, гривьнѫ, гривьнѹ, гривьнꙑ, грив…, гривꙑ, гринво, гринꙑ, грин, гри…, грн҃ве, грьн[и], грѣ, грѣвенѣ, грѣвни, грѣвну, грѣвону, грѣвонꙑ, гр҃(вн)-, гр҃в-, гр҃вне, гр҃внѣ, гр҃ивь, гр҃нъ, р[ги]в[нѣ], …(грв)[н҃ѹ], …(гри)вьнѣ.

The 2nd highest number of forms (94) was observed with the lemma “куна”: (к)[у]она[хо], (к)уно[ӏ], (к)ѹно, (к)ꙋнѣ, (ко)[у]н[ахъ], (ку)нѣ, (кѫнѣ), (кѹ)не, (кѹ)нѹ, [к], [к]ѹн[ѣ], [к]ѹно, [ко]уно, [кѹ]нѹ, {кѹ}кѹно, к(ѹнъ), к)ѹнѣ, к[у]н[у, кѫ, к[ѹнѹ, к]нь, кн[ь], кна, кне, кно, кнъ, кнь, кнь:, кнѣ, кн҃а, кн҃ъ, кн҃ѣ, коне, коно, ку]не, кунами, кунахъ, куне, куни, куно, куну, кунъ, кунѧми, кунꙑ, куонь, куо[н]-, куона, куоно, куонь, кѫно, кѫнъ, кѫнѣ, кѹ(н)…, кѹ(нъ, кѹ(нꙑ, кѹ:нꙑ, кѹ(н)[е, кѹ(но), кѹне, кѹно, кѹ[нѹ], кѹ·но, кѹн, кѹн(о, кѹн-, кѹн[ахо], кѹн[е], кѹно, кѹн[ъ], кѹн[ь], кѹн[ѣ], кѹна, кѹнахъ, кѹнами, кѹнамо, кѹнамъ, кѹне, кѹни, кѹно, кѹнъ, кѹнь, кѹнѣ, кѹнѹ, кѹн…, кѹнꙑ, кѹн, кꙋнами, кꙋнахъ, кꙋне, кꙋне」, кꙋно, кꙋнъ, кꙋнѣ, …нꙑ.

The 3rd highest number of forms (52) was observed with the lemma “осподинъ”: (ѡ)споди(не), (ѡсподин)у, (ꙩсподи)не, [ѻ]спъд(инь), [ѻс]подине, ги҃не, ги҃ну, о)[с]подине, оги҃динѹ, огну, огн҃у, ос(подину, осподи)ну, осподине, осподину, ѡсподину, ѡгн҃(е), ѡспд҃не, ѡсподинь, ѡспод[у], ѻсподине, ѻсподину, ѻсподинь, ѻсподѣну, ꙩгне, ꙩгну, ꙩспо[и](не, ꙩсподине, ꙩспн҃е, ꙩсподну, ꙩспод(ину)…, ꙩсподине, ꙩсподине, ꙩсподину, ꙩсподѣну, ги҃не, сподине, сподиню.

NOUN occurs with 5 features: Gender (4736; 98% instances), Case (4707; 97% instances), Number (4707; 97% instances), Typo (6; 0% instances), Animacy (3; 0% instances)

NOUN occurs with 18 feature-value pairs: Animacy=Anim, Case=Acc, Case=Acc,Gen, Case=Acc,Nom, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Count, Number=Dual, Number=Plur, Number=Sing, Typo=Yes

NOUN occurs with 94 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (405 tokens). Examples: поклонъ, поклоно, приказъ, тимо, гл҃со, бо҃, покло, попъ, приказо, сꙑно

Relations

NOUN nodes are attached to their parents using 25 different relations: nsubj (884; 18% instances), obj (834; 17% instances), conj (691; 14% instances), nmod (625; 13% instances), root (610; 13% instances), obl (436; 9% instances), iobj (186; 4% instances), vocative (167; 3% instances), appos (144; 3% instances), dep (72; 1% instances), orphan (70; 1% instances), parataxis (26; 1% instances), nsubj:pass (24; 0% instances), dislocated (22; 0% instances), list (18; 0% instances), flat:name (12; 0% instances), advcl (10; 0% instances), reparandum (7; 0% instances), nummod:gov (5; 0% instances), xcomp (5; 0% instances), acl (1; 0% instances), amod (1; 0% instances), case (1; 0% instances), ccomp (1; 0% instances), nummod (1; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (1880; 39% instances), NOUN (1333; 27% instances), PROPN (658; 14% instances), (610; 13% instances), X (118; 2% instances), PRON (103; 2% instances), ADJ (72; 1% instances), NUM (40; 1% instances), DET (20; 0% instances), ADV (7; 0% instances), PART (6; 0% instances), ADP (4; 0% instances), SCONJ (2; 0% instances)

1345 (28%) NOUN nodes are leaves.

1559 (32%) NOUN nodes have one child.

1042 (21%) NOUN nodes have two children.

907 (19%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 21.

Children of NOUN nodes are attached using 30 different relations: punct (1193; 17% instances), case (1003; 14% instances), nmod (925; 13% instances), nummod:gov (858; 12% instances), conj (713; 10% instances), cc (536; 8% instances), amod (448; 6% instances), det (412; 6% instances), dep (219; 3% instances), nsubj (166; 2% instances), appos (165; 2% instances), orphan (104; 1% instances), nummod (71; 1% instances), parataxis (62; 1% instances), advmod (58; 1% instances), iobj (25; 0% instances), mark (25; 0% instances), obl (25; 0% instances), acl (20; 0% instances), advcl (19; 0% instances), cop (15; 0% instances), list (14; 0% instances), acl:relcl (13; 0% instances), vocative (9; 0% instances), reparandum (8; 0% instances), flat:name (7; 0% instances), dislocated (4; 0% instances), obj (2; 0% instances), csubj (1; 0% instances), goeswith (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (1333; 19% instances), PUNCT (1193; 17% instances), ADP (1027; 14% instances), NUM (953; 13% instances), PROPN (601; 8% instances), CCONJ (527; 7% instances), ADJ (486; 7% instances), DET (380; 5% instances), X (198; 3% instances), PRON (128; 2% instances), VERB (116; 2% instances), PART (63; 1% instances), SYM (46; 1% instances), SCONJ (28; 0% instances), AUX (22; 0% instances), ADV (20; 0% instances)