home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Church_Slavonic-PROIEL: POS Tags: NOUN

There are 2871 NOUN lemmas (34%), 12584 NOUN types (28%) and 40890 NOUN tokens (21%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: господь, богъ, отьць, чловѣкъ, дьнь, слово, сꙑнъ, землꙗ, имѧ, рѫка

The 10 most frequent NOUN types: г҃і, слово, с҃нъ, б҃ъ, вѣкъ, г҃ь, б҃а, градъ, богъ, г҃и

The 10 most frequent ambiguous lemmas: богъ (NOUN 1344, PRON 2), рѫка (NOUN 388, DET 1), цѣсарь (NOUN 343, ADJ 24), нога (NOUN 176, ADJ 2), чоудо (NOUN 142, ADJ 1), свѣтъ (NOUN 106, ADJ 2), крьстиꙗнъ (NOUN 103, ADJ 4, PROPN 1), епискоупъ (NOUN 94, PROPN 1), часъ (NOUN 92, NUM 1), диꙗволъ (NOUN 82, PROPN 2)

The 10 most frequent ambiguous types: г҃і (NOUN 243, ADJ 1), бж҃е (NOUN 92, ADJ 1), г҃ѣ (NOUN 91, ADJ 1), дѣла (NOUN 63, ADP 1, VERB 1), г҃ъ (NOUN 63, ADJ 1), июдеи (NOUN 33, PROPN 2), срѣдѣ (NOUN 31, ADP 1), цѣсарь҆ (NOUN 25, ADJ 1), гнѣва (NOUN 25, VERB 1), цѣсароу (NOUN 23, ADJ 3)

Morphology

The form / lemma ratio of NOUN is 4.383142 (the average of all parts of speech is 5.263244).

The 1st highest number of forms (133) was observed with the lemma “отьць”: о͑т, о͑тʼца, о͑тʼцемъ, о͑тʼци, о͑тʼцоу, о͑тʼць, о͑тецъ, о͑тець͗, о͑тца, о͑тцⱕ, о͑тче, о͑ть͗ца, о͑ть͗ци, о͑ть͗цоу, о͑ц͆а, о͑ц͆ъ, о͑ц͆ь, о͗тʼцемъ, от, отʼца, отʼцоу, отецъ, отца, отцъ, отъца, отъцемъ, отъцемь, отъци, отъцоу, отъцъ, отъцю, отъці, отъче, отьца, отьцемь, отьци, отьцъ, отьць, отьцю, отьці, отьче, от҃ц, от҃цъ, от҃ць, отⷰ҇ъ, оц҃а, оц҃емъ, оц҃емь, оц҃ъ, оц҃ь, оц҃і, оц꙯а, оц꙯ь, о҃тца, о҃тци, о҃тцмъ, о҃тцмь, о҃тцоу, о҃тцъ, о҃тцю, о҃тче, о҃тъца, о҃тъцю, о҃тъче, о҃ца, о҃цемъ, о҃цемь, о҃ци, о҃цмъ, о҃цмь, о҃цъ, о҃ць, о҃цю, о҃че, о҅тецъ, о҅тца, о҅тцемъ, о҅тцемь, о҅тцемь҆, о҅тци, о҅тцоу, о҅тц꙯оу, о҅тче, о҅тч҃е, о҅тъца, о҅тъци, о҅тъцоу, о҅тъць, о҅тъць҆, о҅тьца, о҅тьцемъ, о҅тьци, о҅тьцоу, о҅тьцъ, о҅тьць, о҅тьцьмъ, о҅тьць҆, о҅тьць҆мь҆, о҅тьче, о҅ть҆ца, о҅ть҆цемъ, о҅ть҆цемь, о҅ть҆ци, о҅ть҆цоу, о҅ть҆цъ, о҅ть҆цъ҆, о҅ть҆ць, о҅ть҆ць҆, о҅ть҆ць҆мъ, о҅ть҆че, о҅тꙿца, о҅тꙿце, о҅тꙿцемъ, о҅тꙿцемь, о҅тꙿцемь҆, о҅тꙿцоу, о҅тꙿць, о҅тꙿче, о҅цо꙯у, о҅ц꙯а, о҅ц꙯и, о҅ц꙯ихъ, о҅ц꙯мъ, о҅ц꙯оу, о҅ц꙯ъ, о҅ц꙯ь, о҅ц꙯꙯а, о҅ч꙯е, о҆ц꙯оу, Ѡ҃̆че, Ѡ҃че, Ѡ҅тьче, ҅Оч꙯е.

The 2nd highest number of forms (103) was observed with the lemma “срьдьце”: cр҃це, Срдца, срдьце, срд҃ца, срд҃це, срд҃цемъ, срд҃ци, срд҃цмъ, срд҃ці, срц҃мь, сръдца, сръдце, сръдцемь, сръдъца, сръдъце, сръдъцемъ, сръдъцю, сръдъці, сръдъціхъ, сръдъцїхъ, сръдьца, сръдьце, сръдьцемъ, сръдьцемь, сръдьци, сръдьцихъ, сръдьцю, сръдьці, сръдьціхъ, сръдь҆ца, сръдь҆цемъ, сръдⸯца, сръдⸯцемь, срь͗дʼце, срь͗децъ, срь͗дь͗ца, срь͗дь͗ци, срь͗дьца, срьдʼца, срьдʼцемь͗, срьдцемъ, срьдъці, срьдь͗це, срьдьца, срьдьцемъ, срьдьцемь҆, срьдьцѣ, срьдь҆це, срьдь҆ць҆, срьдꙿцемъ, срьдꙿцемꙿ, срьдꙿци, срьдꙿцихъ, срь҆дце, срь҆дцемь, срь҆дцемь҆, срь҆дци, срь҆дьца, срь҆дьце, срь҆дьцихъ, срь҆дьцоу, срь҆дь҆ца, срь҆дь҆це, срь҆дь҆цемъ, срь҆дь҆цемь, срь҆дь҆цемь҆, срь҆дь҆ци, срь҆дь҆цихъ, срь҆дь҆ць҆, срь҆дꙿце, срь҆дꙿцемъ, ср҃дца, ср҃дце, ср҃дцемъ, ср҃дцемь, ср҃дці, ср҃ца, ср҃це, ср҃цемь, ср҃цемьмь, ср҃цмь, ср҃ці, ср҃ціхъ, сц҃а, сц҃е, сц҃емь, сц҃і, с҃дцмь, с҃рдца, с҃рдце, с҃рдцемъ, с҃рдцемь, с҃рдци, с҃рдцихъ, с҃рдцмъ, с҃рдцоу, с҃рдцхъ, с҃рдцъ, с҃рдъцихъ, с҃рдьца, с҃рце, с҃рці, с҃ръдци.

The 3rd highest number of forms (88) was observed with the lemma “чловѣкъ”: Чк҃ъї, овѣкъ, чв҃ка, чв҃къ, чв҃че, чк҃омъ, чк҃оу, чк҃ъ, чк҃ꙑ, чк꙯а, чк꙯ъ, чл͆къ, чл͆кы, члвкъ, члвѣкови, члв҃ка, члв҃къ, члк҃оу, чловкꙑ, чловѣка, чловѣкови, чловѣкомъ, чловѣкомь, чловѣкомꙿ, чловѣкоу, чловѣкъ, чловѣкъи, чловѣкꙑ, чловѣкꙿ, чловѣци, чловѣцѣ, чловѣцѣхъ, чловѣче, чловѣчі, чл҃вка, чл҃вкомъ, чл҃вкъ, чл҃вкъі, чл҃вцѣхъ, чл҃вѣкъ, чл҃вѣцѣхъ, чл҃ка, чл҃кмъ, чл҃комъ, чл҃коу, чл҃ку, чл҃къ, чл҃кꙑ, чл҃овѣкъ, чл҃ці, члⷦ҇а, члⷦ҇ва, чл꙯ка, чл꙯комъ, чл꙯къ, чл꙯че, ч҃вкоу, ч҃ка, ч҃комъ, ч҃коу, ч҃къ, ч҃къ[і, ч҃кꙑ, ч҃лва, ч҃лвк, ч҃лвка, ч҃лвкмъ, ч҃лвкомъ, ч҃лвкоу, ч҃лвкъ, ч҃лвкꙑ, ч҃лвци, ч҃лвцхъ, ч҃лвцѣ, ч҃лвцѣхъ, ч҃лвче, ч҃лвѣко, ч҃лвѣкъ, ч҃лка, ч҃лкмъ, ч҃лкомь, ч҃лкоу, ч҃лкъ, ч҃лкꙑ, ч҃лци, ч҃лче, ч҃лⷦ҇а, ч҃ци.

NOUN occurs with 3 features: Case (40831; 100% instances), Number (40831; 100% instances), Gender (40828; 100% instances)

NOUN occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Dat,Gen, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NOUN occurs with 70 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (4263 tokens). Examples: б҃ъ, г҃ь, с҃нъ, богъ, г҃ъ, гь҃, народъ, рабъ, чловѣкъ, о҃тцъ

Relations

NOUN nodes are attached to their parents using 23 different relations: obl (10327; 25% instances), obj (7960; 19% instances), nsubj (6883; 17% instances), nmod (3836; 9% instances), conj (3185; 8% instances), obl:arg (2184; 5% instances), root (1664; 4% instances), vocative (1224; 3% instances), appos (1147; 3% instances), xcomp (447; 1% instances), orphan (387; 1% instances), obl:agent (314; 1% instances), advcl (265; 1% instances), nsubj:pass (252; 1% instances), acl (247; 1% instances), advcl:cmp (179; 0% instances), ccomp (113; 0% instances), dep (113; 0% instances), dislocated (81; 0% instances), parataxis (45; 0% instances), fixed (22; 0% instances), nsubj:outer (13; 0% instances), csubj (2; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (27377; 67% instances), NOUN (7430; 18% instances), (1664; 4% instances), ADJ (1290; 3% instances), ADV (692; 2% instances), AUX (672; 2% instances), PROPN (630; 2% instances), PRON (622; 2% instances), NUM (345; 1% instances), INTJ (116; 0% instances), ADP (28; 0% instances), SCONJ (19; 0% instances), DET (5; 0% instances)

11090 (27%) NOUN nodes are leaves.

18013 (44%) NOUN nodes have one child.

8293 (20%) NOUN nodes have two children.

3494 (9%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 19.

Children of NOUN nodes are attached using 32 different relations: case (10842; 23% instances), det (9532; 20% instances), amod (7037; 15% instances), nmod (3992; 9% instances), cc (3142; 7% instances), conj (2929; 6% instances), acl (2530; 5% instances), cop (1295; 3% instances), nsubj (1109; 2% instances), appos (1050; 2% instances), advmod (892; 2% instances), orphan (647; 1% instances), nummod (474; 1% instances), discourse (433; 1% instances), mark (393; 1% instances), obl (216; 0% instances), advcl (155; 0% instances), dislocated (70; 0% instances), vocative (64; 0% instances), ccomp (44; 0% instances), obl:arg (21; 0% instances), csubj (19; 0% instances), fixed (18; 0% instances), aux (14; 0% instances), parataxis (13; 0% instances), obj (7; 0% instances), obl:agent (7; 0% instances), advcl:cmp (6; 0% instances), dep (3; 0% instances), xcomp (2; 0% instances), expl:pv (1; 0% instances), nsubj:outer (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: ADP (10851; 23% instances), DET (7525; 16% instances), ADJ (7516; 16% instances), NOUN (7430; 16% instances), PRON (3223; 7% instances), CCONJ (3142; 7% instances), VERB (2597; 6% instances), ADV (1670; 4% instances), AUX (1390; 3% instances), PROPN (789; 2% instances), NUM (538; 1% instances), SCONJ (212; 0% instances), INTJ (74; 0% instances), X (1; 0% instances)