Treebank Statistics: UD_Old_Church_Slavonic-PROIEL: POS Tags: NOUN
There are 2871 NOUN
lemmas (34%), 12584 NOUN
types (28%) and 40890 NOUN
tokens (21%).
Out of 14 observed tags, the rank of NOUN
is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent NOUN
lemmas: господь, богъ, отьць, чловѣкъ, дьнь, слово, сꙑнъ, землꙗ, имѧ, рѫка
The 10 most frequent NOUN
types: г҃і, слово, с҃нъ, б҃ъ, вѣкъ, г҃ь, б҃а, градъ, богъ, г҃и
The 10 most frequent ambiguous lemmas: богъ (NOUN 1344, PRON 2), рѫка (NOUN 388, DET 1), цѣсарь (NOUN 343, ADJ 24), нога (NOUN 176, ADJ 2), чоудо (NOUN 142, ADJ 1), свѣтъ (NOUN 106, ADJ 2), крьстиꙗнъ (NOUN 103, ADJ 4, PROPN 1), епискоупъ (NOUN 94, PROPN 1), часъ (NOUN 92, NUM 1), диꙗволъ (NOUN 82, PROPN 2)
The 10 most frequent ambiguous types: г҃і (NOUN 243, ADJ 1), бж҃е (NOUN 92, ADJ 1), г҃ѣ (NOUN 91, ADJ 1), дѣла (NOUN 63, ADP 1, VERB 1), г҃ъ (NOUN 63, ADJ 1), июдеи (NOUN 33, PROPN 2), срѣдѣ (NOUN 31, ADP 1), цѣсарь҆ (NOUN 25, ADJ 1), гнѣва (NOUN 25, VERB 1), цѣсароу (NOUN 23, ADJ 3)
- г҃і
- бж҃е
- г҃ѣ
- дѣла
- г҃ъ
- июдеи
- срѣдѣ
- цѣсарь҆
- гнѣва
- цѣсароу
Morphology
The form / lemma ratio of NOUN
is 4.383142 (the average of all parts of speech is 5.263244).
The 1st highest number of forms (133) was observed with the lemma “отьць”: о͑т, о͑тʼца, о͑тʼцемъ, о͑тʼци, о͑тʼцоу, о͑тʼць, о͑тецъ, о͑тець͗, о͑тца, о͑тцⱕ, о͑тче, о͑ть͗ца, о͑ть͗ци, о͑ть͗цоу, о͑ц͆а, о͑ц͆ъ, о͑ц͆ь, о͗тʼцемъ, от, отʼца, отʼцоу, отецъ, отца, отцъ, отъца, отъцемъ, отъцемь, отъци, отъцоу, отъцъ, отъцю, отъці, отъче, отьца, отьцемь, отьци, отьцъ, отьць, отьцю, отьці, отьче, от҃ц, от҃цъ, от҃ць, отⷰ҇ъ, оц҃а, оц҃емъ, оц҃емь, оц҃ъ, оц҃ь, оц҃і, оц꙯а, оц꙯ь, о҃тца, о҃тци, о҃тцмъ, о҃тцмь, о҃тцоу, о҃тцъ, о҃тцю, о҃тче, о҃тъца, о҃тъцю, о҃тъче, о҃ца, о҃цемъ, о҃цемь, о҃ци, о҃цмъ, о҃цмь, о҃цъ, о҃ць, о҃цю, о҃че, о҅тецъ, о҅тца, о҅тцемъ, о҅тцемь, о҅тцемь҆, о҅тци, о҅тцоу, о҅тц꙯оу, о҅тче, о҅тч҃е, о҅тъца, о҅тъци, о҅тъцоу, о҅тъць, о҅тъць҆, о҅тьца, о҅тьцемъ, о҅тьци, о҅тьцоу, о҅тьцъ, о҅тьць, о҅тьцьмъ, о҅тьць҆, о҅тьць҆мь҆, о҅тьче, о҅ть҆ца, о҅ть҆цемъ, о҅ть҆цемь, о҅ть҆ци, о҅ть҆цоу, о҅ть҆цъ, о҅ть҆цъ҆, о҅ть҆ць, о҅ть҆ць҆, о҅ть҆ць҆мъ, о҅ть҆че, о҅тꙿца, о҅тꙿце, о҅тꙿцемъ, о҅тꙿцемь, о҅тꙿцемь҆, о҅тꙿцоу, о҅тꙿць, о҅тꙿче, о҅цо꙯у, о҅ц꙯а, о҅ц꙯и, о҅ц꙯ихъ, о҅ц꙯мъ, о҅ц꙯оу, о҅ц꙯ъ, о҅ц꙯ь, о҅ц꙯꙯а, о҅ч꙯е, о҆ц꙯оу, Ѡ҃̆че, Ѡ҃че, Ѡ҅тьче, ҅Оч꙯е.
The 2nd highest number of forms (103) was observed with the lemma “срьдьце”: cр҃це, Срдца, срдьце, срд҃ца, срд҃це, срд҃цемъ, срд҃ци, срд҃цмъ, срд҃ці, срц҃мь, сръдца, сръдце, сръдцемь, сръдъца, сръдъце, сръдъцемъ, сръдъцю, сръдъці, сръдъціхъ, сръдъцїхъ, сръдьца, сръдьце, сръдьцемъ, сръдьцемь, сръдьци, сръдьцихъ, сръдьцю, сръдьці, сръдьціхъ, сръдь҆ца, сръдь҆цемъ, сръдⸯца, сръдⸯцемь, срь͗дʼце, срь͗децъ, срь͗дь͗ца, срь͗дь͗ци, срь͗дьца, срьдʼца, срьдʼцемь͗, срьдцемъ, срьдъці, срьдь͗це, срьдьца, срьдьцемъ, срьдьцемь҆, срьдьцѣ, срьдь҆це, срьдь҆ць҆, срьдꙿцемъ, срьдꙿцемꙿ, срьдꙿци, срьдꙿцихъ, срь҆дце, срь҆дцемь, срь҆дцемь҆, срь҆дци, срь҆дьца, срь҆дьце, срь҆дьцихъ, срь҆дьцоу, срь҆дь҆ца, срь҆дь҆це, срь҆дь҆цемъ, срь҆дь҆цемь, срь҆дь҆цемь҆, срь҆дь҆ци, срь҆дь҆цихъ, срь҆дь҆ць҆, срь҆дꙿце, срь҆дꙿцемъ, ср҃дца, ср҃дце, ср҃дцемъ, ср҃дцемь, ср҃дці, ср҃ца, ср҃це, ср҃цемь, ср҃цемьмь, ср҃цмь, ср҃ці, ср҃ціхъ, сц҃а, сц҃е, сц҃емь, сц҃і, с҃дцмь, с҃рдца, с҃рдце, с҃рдцемъ, с҃рдцемь, с҃рдци, с҃рдцихъ, с҃рдцмъ, с҃рдцоу, с҃рдцхъ, с҃рдцъ, с҃рдъцихъ, с҃рдьца, с҃рце, с҃рці, с҃ръдци.
The 3rd highest number of forms (88) was observed with the lemma “чловѣкъ”: Чк҃ъї, овѣкъ, чв҃ка, чв҃къ, чв҃че, чк҃омъ, чк҃оу, чк҃ъ, чк҃ꙑ, чк꙯а, чк꙯ъ, чл͆къ, чл͆кы, члвкъ, члвѣкови, члв҃ка, члв҃къ, члк҃оу, чловкꙑ, чловѣка, чловѣкови, чловѣкомъ, чловѣкомь, чловѣкомꙿ, чловѣкоу, чловѣкъ, чловѣкъи, чловѣкꙑ, чловѣкꙿ, чловѣци, чловѣцѣ, чловѣцѣхъ, чловѣче, чловѣчі, чл҃вка, чл҃вкомъ, чл҃вкъ, чл҃вкъі, чл҃вцѣхъ, чл҃вѣкъ, чл҃вѣцѣхъ, чл҃ка, чл҃кмъ, чл҃комъ, чл҃коу, чл҃ку, чл҃къ, чл҃кꙑ, чл҃овѣкъ, чл҃ці, члⷦ҇а, члⷦ҇ва, чл꙯ка, чл꙯комъ, чл꙯къ, чл꙯че, ч҃вкоу, ч҃ка, ч҃комъ, ч҃коу, ч҃къ, ч҃къ[і, ч҃кꙑ, ч҃лва, ч҃лвк, ч҃лвка, ч҃лвкмъ, ч҃лвкомъ, ч҃лвкоу, ч҃лвкъ, ч҃лвкꙑ, ч҃лвци, ч҃лвцхъ, ч҃лвцѣ, ч҃лвцѣхъ, ч҃лвче, ч҃лвѣко, ч҃лвѣкъ, ч҃лка, ч҃лкмъ, ч҃лкомь, ч҃лкоу, ч҃лкъ, ч҃лкꙑ, ч҃лци, ч҃лче, ч҃лⷦ҇а, ч҃ци.
NOUN
occurs with 3 features: Case (40831; 100% instances), Number (40831; 100% instances), Gender (40828; 100% instances)
NOUN
occurs with 16 feature-value pairs: Case=Acc
, Case=Dat
, Case=Dat,Gen
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Case=Voc
, Gender=Fem
, Gender=Fem,Masc
, Gender=Masc
, Gender=Masc,Neut
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
NOUN
occurs with 70 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing
(4263 tokens).
Examples: б҃ъ, г҃ь, с҃нъ, богъ, г҃ъ, гь҃, народъ, рабъ, чловѣкъ, о҃тцъ
Relations
NOUN
nodes are attached to their parents using 23 different relations: obl (10327; 25% instances), obj (7960; 19% instances), nsubj (6883; 17% instances), nmod (3836; 9% instances), conj (3185; 8% instances), obl:arg (2184; 5% instances), root (1664; 4% instances), vocative (1224; 3% instances), appos (1147; 3% instances), xcomp (447; 1% instances), orphan (387; 1% instances), obl:agent (314; 1% instances), advcl (265; 1% instances), nsubj:pass (252; 1% instances), acl (247; 1% instances), advcl:cmp (179; 0% instances), ccomp (113; 0% instances), dep (113; 0% instances), dislocated (81; 0% instances), parataxis (45; 0% instances), fixed (22; 0% instances), nsubj:outer (13; 0% instances), csubj (2; 0% instances)
Parents of NOUN
nodes belong to 13 different parts of speech: VERB (27377; 67% instances), NOUN (7430; 18% instances), (1664; 4% instances), ADJ (1290; 3% instances), ADV (692; 2% instances), AUX (672; 2% instances), PROPN (630; 2% instances), PRON (622; 2% instances), NUM (345; 1% instances), INTJ (116; 0% instances), ADP (28; 0% instances), SCONJ (19; 0% instances), DET (5; 0% instances)
11090 (27%) NOUN
nodes are leaves.
18013 (44%) NOUN
nodes have one child.
8293 (20%) NOUN
nodes have two children.
3494 (9%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 19.
Children of NOUN
nodes are attached using 32 different relations: case (10842; 23% instances), det (9532; 20% instances), amod (7037; 15% instances), nmod (3992; 9% instances), cc (3142; 7% instances), conj (2929; 6% instances), acl (2530; 5% instances), cop (1295; 3% instances), nsubj (1109; 2% instances), appos (1050; 2% instances), advmod (892; 2% instances), orphan (647; 1% instances), nummod (474; 1% instances), discourse (433; 1% instances), mark (393; 1% instances), obl (216; 0% instances), advcl (155; 0% instances), dislocated (70; 0% instances), vocative (64; 0% instances), ccomp (44; 0% instances), obl:arg (21; 0% instances), csubj (19; 0% instances), fixed (18; 0% instances), aux (14; 0% instances), parataxis (13; 0% instances), obj (7; 0% instances), obl:agent (7; 0% instances), advcl:cmp (6; 0% instances), dep (3; 0% instances), xcomp (2; 0% instances), expl:pv (1; 0% instances), nsubj:outer (1; 0% instances)
Children of NOUN
nodes belong to 14 different parts of speech: ADP (10851; 23% instances), DET (7525; 16% instances), ADJ (7516; 16% instances), NOUN (7430; 16% instances), PRON (3223; 7% instances), CCONJ (3142; 7% instances), VERB (2597; 6% instances), ADV (1670; 4% instances), AUX (1390; 3% instances), PROPN (789; 2% instances), NUM (538; 1% instances), SCONJ (212; 0% instances), INTJ (74; 0% instances), X (1; 0% instances)