This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cu/pos issue tracker

NOUN: noun

This document is a placeholder for the language-specific documentation for NOUN.


Treebank Statistics (UD_Old_Church_Slavonic)

There are 996 NOUN lemmas (33%), 2705 NOUN types (27%) and 9649 NOUN tokens (17%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: отьць, сꙑнъ#1, оученикъ, чловѣкъ, господь, дьнь, народъ, богъ, слово, домъ

The 10 most frequent NOUN types: с҃нъ, г҃и, оученици, о҃тца, слово, день, домъ, б҃а, фарисѣи, ч҃лвкъ

The 10 most frequent ambiguous lemmas: богъ (NOUN 147, ADJ 1), цѣсарь (NOUN 55, ADJ 4), вьсь (PRON 423, NOUN 29), дроугъ (ADJ 73, PRON 39, NOUN 25), вечеръ (NOUN 7, ADV 2), оутро (ADV 7, NOUN 3), близньць (PROPN 2, NOUN 1), десѧть (NUM 97, NOUN 1), прѣлюбодѣи (ADJ 2, NOUN 1)

The 10 most frequent ambiguous types: июдеи (NOUN 33, PROPN 2), дѣла (NOUN 21, VERB 1), ц҃сръ (NOUN 16, ADJ 2), весь (PRON 24, NOUN 14), вьси (PRON 82, NOUN 13), дроугꙑ (ADJ 17, NOUN 9), г҃лъ (NOUN 8, VERB 1), дроугъ (PRON 19, NOUN 8), г҃нъ (NOUN 7, ADJ 1), г҃ла (VERB 218, NOUN 3)

Morphology

The form / lemma ratio of NOUN is 2.715863 (the average of all parts of speech is 3.336884).

The 1st highest number of forms (38) was observed with the lemma “чловѣкъ”: члвкъ, члвѣкови, чловкꙑ, чловѣкомъ, чловѣкъ, чловѣкъи, члⷦ҇а, члⷦ҇ва, ч҃вкоу, ч҃ка, ч҃комъ, ч҃коу, ч҃къ, ч҃кꙑ, ч҃лва, ч҃лвк, ч҃лвка, ч҃лвкмъ, ч҃лвкомъ, ч҃лвкоу, ч҃лвкъ, ч҃лвкꙑ, ч҃лвци, ч҃лвцхъ, ч҃лвцѣ, ч҃лвцѣхъ, ч҃лвче, ч҃лвѣко, ч҃лвѣкъ, ч҃лка, ч҃лкмъ, ч҃лкомь, ч҃лкоу, ч҃лкъ, ч҃лкꙑ, ч҃лци, ч҃лче, ч҃лⷦ҇а.

The 2nd highest number of forms (36) was observed with the lemma “отьць”: отецъ, отца, отцъ, отъца, отъцемъ, отъцемь, отъци, отъцоу, отъцъ, отъцю, отъче, отьца, отьци, отьцю, отьче, от҃ц, отⷰ҇ъ, о҃тца, о҃тци, о҃тцмъ, о҃тцмь, о҃тцоу, о҃тцъ, о҃тцю, о҃тче, о҃тъца, о҃тъцю, о҃тъче, о҃ца, о҃ци, о҃цъ, о҃цю, о҃че, о҅тч҃е, Ѡ҃̆че, Ѡ҅тьче.

The 3rd highest number of forms (24) was observed with the lemma “цѣсарьствиѥ”: цсарествиѣ, цѣсарествие, ц҃рствие, ц҃рствии, ц҃сарествиѣ, ц҃срествие, ц҃срстви, ц҃срствие, ц҃срствии, ц҃срствию, ц҃срствиѣ, ц҃сртви, ц҃сртвие, ц҃сртвиѣ, ц҃ствие, ц҃ствиѣ, ц҃ствіѣ, ц҃с҃рствие, ц҃с҃рствиѣ, ц҃ѣсарествии, ц҃ѣсрствие, ц҃ѣсрствиѣ, ц҃ⷭ҇рствиѣ, цⷭ҇рвие.

NOUN occurs with 3 features: Case (9618; 100% instances), Gender (9618; 100% instances), Number (9618; 100% instances)

NOUN occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Dat,Gen, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NOUN occurs with 65 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (1108 tokens). Examples: с҃нъ, о҃тцъ, б҃ъ, ч҃лвкъ, г҃ъ, народъ, миръ, г҃ь, ч҃лкъ, д҃хъ

Relations

NOUN nodes are attached to their parents using 16 different relations: nmod (2214; 23% instances), nsubj (1968; 20% instances), dobj (1924; 20% instances), iobj (1586; 16% instances), conj (660; 7% instances), xcomp (512; 5% instances), appos (276; 3% instances), vocative (261; 3% instances), remnant (127; 1% instances), root (54; 1% instances), dep (38; 0% instances), parataxis (9; 0% instances), nsubjpass (7; 0% instances), advmod (5; 0% instances), aux (5; 0% instances), ccomp (3; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (7512; 78% instances), NOUN (1264; 13% instances), PROPN (190; 2% instances), ADJ (178; 2% instances), ADV (147; 2% instances), NUM (137; 1% instances), PRON (97; 1% instances), ROOT (54; 1% instances), ADP (30; 0% instances), INTJ (19; 0% instances), CONJ (17; 0% instances), SCONJ (4; 0% instances)

3145 (33%) NOUN nodes are leaves.

4281 (44%) NOUN nodes have one child.

1751 (18%) NOUN nodes have two children.

472 (5%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 14.

Children of NOUN nodes are attached using 21 different relations: nmod (3041; 32% instances), case (2753; 29% instances), amod (1310; 14% instances), conj (587; 6% instances), cc (580; 6% instances), acl (577; 6% instances), nummod (176; 2% instances), remnant (123; 1% instances), appos (120; 1% instances), advmod (101; 1% instances), nsubj (32; 0% instances), vocative (21; 0% instances), neg (19; 0% instances), ccomp (15; 0% instances), discourse (9; 0% instances), iobj (7; 0% instances), advcl (5; 0% instances), det (3; 0% instances), aux (2; 0% instances), dep (2; 0% instances), mark (1; 0% instances)

Children of NOUN nodes belong to 12 different parts of speech: ADP (2753; 29% instances), PRON (2028; 21% instances), ADJ (1782; 19% instances), NOUN (1264; 13% instances), VERB (616; 6% instances), CONJ (580; 6% instances), NUM (204; 2% instances), ADV (132; 1% instances), PROPN (102; 1% instances), INTJ (19; 0% instances), DET (2; 0% instances), SCONJ (2; 0% instances)


NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]