home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Church_Slavonic-PROIEL: POS Tags: NOUN

There are 993 NOUN lemmas (32%), 2701 NOUN types (27%) and 9630 NOUN tokens (17%). Out of 14 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: отьць, сꙑнъ, оученикъ, чловѣкъ, господь, дьнь, народъ, богъ, слово, домъ

The 10 most frequent NOUN types: с҃нъ, г҃и, оученици, о҃тца, слово, день, домъ, б҃а, фарисѣи, ч҃лвкъ

The 10 most frequent ambiguous lemmas: богъ (NOUN 147, ADJ 1), цѣсарь (NOUN 55, ADJ 4), вьсь (DET 253, ADJ 168, NOUN 29), дроугъ (ADJ 73, PRON 39, NOUN 25), вечеръ (NOUN 7, ADV 2), оутро (ADV 7, NOUN 3), рѣчь (NOUN 2, ADV 1), близньць (PROPN 2, NOUN 1), десѧть (NUM 97, NOUN 1), прѣлюбодѣи (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: июдеи (NOUN 33, PROPN 2), дѣла (NOUN 21, VERB 1), ц҃сръ (NOUN 16, ADJ 2), весь (DET 23, NOUN 14, ADJ 1), вьси (ADJ 50, DET 32, NOUN 13), селѣ (ADV 15, NOUN 12), дроугꙑ (ADJ 17, NOUN 9), г҃лъ (NOUN 8, VERB 1), дроугъ (PRON 19, NOUN 8), г҃нъ (NOUN 7, ADJ 1)

Morphology

The form / lemma ratio of NOUN is 2.720040 (the average of all parts of speech is 3.275325).

The 1st highest number of forms (38) was observed with the lemma “чловѣкъ”: члвкъ, члвѣкови, чловкꙑ, чловѣкомъ, чловѣкъ, чловѣкъи, члⷦ҇а, члⷦ҇ва, ч҃вкоу, ч҃ка, ч҃комъ, ч҃коу, ч҃къ, ч҃кꙑ, ч҃лва, ч҃лвк, ч҃лвка, ч҃лвкмъ, ч҃лвкомъ, ч҃лвкоу, ч҃лвкъ, ч҃лвкꙑ, ч҃лвци, ч҃лвцхъ, ч҃лвцѣ, ч҃лвцѣхъ, ч҃лвче, ч҃лвѣко, ч҃лвѣкъ, ч҃лка, ч҃лкмъ, ч҃лкомь, ч҃лкоу, ч҃лкъ, ч҃лкꙑ, ч҃лци, ч҃лче, ч҃лⷦ҇а.

The 2nd highest number of forms (36) was observed with the lemma “отьць”: отецъ, отца, отцъ, отъца, отъцемъ, отъцемь, отъци, отъцоу, отъцъ, отъцю, отъче, отьца, отьци, отьцю, отьче, от҃ц, отⷰ҇ъ, о҃тца, о҃тци, о҃тцмъ, о҃тцмь, о҃тцоу, о҃тцъ, о҃тцю, о҃тче, о҃тъца, о҃тъцю, о҃тъче, о҃ца, о҃ци, о҃цъ, о҃цю, о҃че, о҅тч҃е, Ѡ҃̆че, Ѡ҅тьче.

The 3rd highest number of forms (24) was observed with the lemma “цѣсарьствиѥ”: цсарествиѣ, цѣсарествие, ц҃рствие, ц҃рствии, ц҃сарествиѣ, ц҃срествие, ц҃срстви, ц҃срствие, ц҃срствии, ц҃срствию, ц҃срствиѣ, ц҃сртви, ц҃сртвие, ц҃сртвиѣ, ц҃ствие, ц҃ствиѣ, ц҃ствіѣ, ц҃с҃рствие, ц҃с҃рствиѣ, ц҃ѣсарествии, ц҃ѣсрствие, ц҃ѣсрствиѣ, ц҃ⷭ҇рствиѣ, цⷭ҇рвие.

NOUN occurs with 3 features: Case (9599; 100% instances), Gender (9599; 100% instances), Number (9599; 100% instances)

NOUN occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Dat,Gen, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NOUN occurs with 65 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (1105 tokens). Examples: с҃нъ, о҃тцъ, б҃ъ, ч҃лвкъ, г҃ъ, народъ, миръ, г҃ь, ч҃лкъ, д҃хъ

Relations

NOUN nodes are attached to their parents using 19 different relations: obl (2478; 26% instances), nsubj (1938; 20% instances), obj (1913; 20% instances), nmod (791; 8% instances), conj (776; 8% instances), iobj (485; 5% instances), appos (286; 3% instances), vocative (261; 3% instances), root (257; 3% instances), xcomp (97; 1% instances), advcl (81; 1% instances), orphan (53; 1% instances), nsubj:pass (50; 1% instances), obl:agent (50; 1% instances), ccomp (44; 0% instances), dep (38; 0% instances), dislocated (19; 0% instances), parataxis (10; 0% instances), fixed (3; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (6650; 69% instances), NOUN (1376; 14% instances), ADJ (379; 4% instances), AUX (261; 3% instances), (257; 3% instances), PROPN (210; 2% instances), ADV (193; 2% instances), NUM (143; 1% instances), PRON (109; 1% instances), INTJ (21; 0% instances), CCONJ (18; 0% instances), ADP (12; 0% instances), DET (1; 0% instances)

3061 (32%) NOUN nodes are leaves.

4016 (42%) NOUN nodes have one child.

1734 (18%) NOUN nodes have two children.

819 (9%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 14.

Children of NOUN nodes are attached using 25 different relations: case (2748; 26% instances), nmod (1881; 18% instances), amod (1227; 12% instances), det (1183; 11% instances), cc (664; 6% instances), conj (651; 6% instances), acl (518; 5% instances), cop (433; 4% instances), advmod (297; 3% instances), nsubj (291; 3% instances), nummod (175; 2% instances), appos (157; 1% instances), mark (85; 1% instances), orphan (81; 1% instances), discourse (67; 1% instances), obl (47; 0% instances), advcl (38; 0% instances), vocative (25; 0% instances), ccomp (16; 0% instances), dislocated (16; 0% instances), iobj (13; 0% instances), aux (9; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: ADP (2748; 26% instances), ADJ (2612; 25% instances), NOUN (1376; 13% instances), PRON (768; 7% instances), CCONJ (667; 6% instances), DET (643; 6% instances), VERB (628; 6% instances), AUX (462; 4% instances), ADV (297; 3% instances), NUM (203; 2% instances), PROPN (116; 1% instances), SCONJ (86; 1% instances), INTJ (21; 0% instances)