Treebank Statistics: UD_Veps-VWT: POS Tags: NOUN
There are 144 NOUN lemmas (37%), 221 NOUN types (37%) and 310 NOUN tokens (24%).
Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: elo, kelʼ, külä, ristit, rad, rahvaz, aig, lapsʼ, škol, kanz
The 10 most frequent NOUN types: kelel, lapsed, ristitud, külän, rad, elo, jurid, kelʼt, elod, kanzan
The most frequent ambiguous lemmas (only 2 observed): vepsläine (NOUN 7, ADJ 5), polʼ (ADP 1, NOUN 1)
The most frequent ambiguous types (only 2 observed): kerdan (ADV 1, NOUN 1), vepsläižid (ADJ 1, NOUN 1)
Morphology
The form / lemma ratio of NOUN is 1.534722 (the average of all parts of speech is 1.550649).
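The reported ratio is consistent with dividing the number of distinct NOUN forms (221 types) by the number of NOUN lemmas (144). The following is a minimal Python sketch of that calculation, not the tool that produced these statistics. The function name `form_lemma_ratio` is hypothetical, and the toy input reuses only the forms listed for “rahvaz” and “elo” in this section. It assumes that a form shared by several lemmas is counted once.

```python
def form_lemma_ratio(pairs):
    """Return (distinct forms) / (distinct lemmas) for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Counts reported for NOUN in this section: 221 types over 144 lemmas.
reported_ratio = round(221 / 144, 6)

# Toy input built from the forms listed for two lemmas in this section.
rahvaz_forms = ["rahvahad", "rahvahan", "rahvahaze", "rahvahid", "rahvast", "rahvaz"]
elo_forms = ["elo", "elod", "eloho", "elon", "elos"]
toy_pairs = [(f, "rahvaz") for f in rahvaz_forms] + [(f, "elo") for f in elo_forms]

# 11 distinct forms over 2 lemmas.
toy_ratio = form_lemma_ratio(toy_pairs)

print(reported_ratio, toy_ratio)
```

The toy ratio is far above the treebank value because these two lemmas are the ones with the most observed forms.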
The 1st highest number of forms (6) was observed with the lemma “rahvaz”: rahvahad, rahvahan, rahvahaze, rahvahid, rahvast, rahvaz.
The 2nd highest number of forms (5) was observed with the lemma “elo”: elo, elod, eloho, elon, elos.
The 3rd highest number of forms (5) was observed with the lemma “kodima”: kodimad, kodimaha, kodimal, kodimale, kodiman.
NOUN occurs with 3 features: Case (310; 100% instances), Number (310; 100% instances), Clitic (1; 0% instances)
NOUN occurs with 16 feature-value pairs: Case=Abl, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Ter, Case=Tra, Clitic=Ki, Number=Plur, Number=Sing
NOUN occurs with 23 feature combinations.
The most frequent feature combination is Case=Nom|Number=Plur (45 tokens).
Examples: lapsed, ristitud, vanhembad, vepsläižed, Päžarʼlaižed, adivod, aigad, aldod, astjad, avtobusad
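Feature combinations such as Case=Nom|Number=Plur are written in the CoNLL-U FEATS notation, where pairs are separated by `|` and an underscore marks an empty feature set. As a rough illustration of how features, feature-value pairs and combinations can be counted, here is a small Python sketch. The helper `parse_feats` and the three-item sample list are hypothetical and do not reproduce the treebank data.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string like 'Case=Nom|Number=Plur' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical sample of FEATS values for three NOUN tokens.
sample_feats = [
    "Case=Nom|Number=Plur",
    "Case=Nom|Number=Plur",
    "Case=Gen|Number=Sing",
]

# One count per whole combination, per feature name, and per feature=value pair.
combination_counts = Counter(sample_feats)
feature_counts = Counter(
    name for feats in sample_feats for name in parse_feats(feats)
)
pair_counts = Counter(
    f"{name}={value}"
    for feats in sample_feats
    for name, value in parse_feats(feats).items()
)

print(combination_counts.most_common(1))
```

In this sketch the number of distinct keys in `feature_counts`, `pair_counts` and `combination_counts` corresponds to the three kinds of totals reported above (features, feature-value pairs, feature combinations).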
Relations
NOUN nodes are attached to their parents using 13 different relations: obl (111; 36% instances), obj (50; 16% instances), nsubj (47; 15% instances), conj (31; 10% instances), nmod (31; 10% instances), nsubj:cop (19; 6% instances), root (14; 5% instances), xcomp (2; 1% instances), acl:relcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), parataxis (1; 0% instances)
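The percentages in this list can be rechecked from the raw counts. The sketch below copies the relation counts from the line above and formats each share in the same style; the helper `format_share` is hypothetical, and rounding to the nearest whole percent is an assumption that happens to match the figures shown here.

```python
# Relation counts for NOUN nodes, copied from the list above.
relation_counts = {
    "obl": 111, "obj": 50, "nsubj": 47, "conj": 31, "nmod": 31,
    "nsubj:cop": 19, "root": 14, "xcomp": 2, "acl:relcl": 1,
    "appos": 1, "ccomp": 1, "csubj": 1, "parataxis": 1,
}


def format_share(label, count, total):
    """Format one entry as 'label (count; pct% instances)'."""
    return f"{label} ({count}; {round(100 * count / total)}% instances)"


# The counts should add up to the number of NOUN tokens (310).
total = sum(relation_counts.values())
lines = [format_share(rel, n, total) for rel, n in relation_counts.items()]

print(total)
print(", ".join(lines))
```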
Parents of NOUN nodes belong to 8 different parts of speech (counting the artificial root, which has no tag): VERB (207; 67% instances), NOUN (69; 22% instances), root (14; 5% instances), PRON (8; 3% instances), ADJ (4; 1% instances), PROPN (4; 1% instances), ADV (3; 1% instances), AUX (1; 0% instances)
89 (29%) NOUN nodes are leaves.
148 (48%) NOUN nodes have one child.
39 (13%) NOUN nodes have two children.
34 (11%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 7.
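The child-degree figures above (leaves, one child, two children, three or more) can be derived from the HEAD column of a dependency tree. The following Python sketch shows one way to do it. The function `child_degree_histogram` and the five-node toy tree are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def child_degree_histogram(nodes, target_upos="NOUN"):
    """Count nodes of one UPOS by number of children, in buckets 0, 1, 2, 3+.

    nodes: list of (id, head, upos) tuples for one sentence; head 0 is the root.
    """
    # Number of children per node id, read off the head column.
    children = Counter(head for _, head, _ in nodes)
    histogram = Counter()
    for node_id, _, upos in nodes:
        if upos == target_upos:
            # Bucket 3 stands for "three or more children".
            histogram[min(children[node_id], 3)] += 1
    return histogram


# Hypothetical tree: node 3 (VERB) is the root with three children;
# node 2 (NOUN) has one child, node 4 (NOUN) is a leaf.
toy_sentence = [
    (1, 2, "ADJ"),
    (2, 3, "NOUN"),
    (3, 0, "VERB"),
    (4, 3, "NOUN"),
    (5, 3, "PUNCT"),
]

noun_histogram = child_degree_histogram(toy_sentence)
print(dict(noun_histogram))
```

Summing such histograms over all sentences would give the four counts reported above, which add up to the 310 NOUN tokens (89 + 148 + 39 + 34).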
Children of NOUN nodes are attached using 19 different relations: nmod (114; 31% instances), amod (62; 17% instances), punct (43; 12% instances), conj (30; 8% instances), cc (22; 6% instances), cop (19; 5% instances), nsubj:cop (17; 5% instances), advmod (13; 4% instances), case (12; 3% instances), acl:relcl (11; 3% instances), nummod (10; 3% instances), mark (3; 1% instances), obl (3; 1% instances), advcl (2; 1% instances), appos (2; 1% instances), det (2; 1% instances), nsubj (2; 1% instances), parataxis (2; 1% instances), aux (1; 0% instances)
Children of NOUN nodes belong to 12 different parts of speech: PRON (78; 21% instances), NOUN (69; 19% instances), ADJ (63; 17% instances), PUNCT (43; 12% instances), CCONJ (22; 6% instances), PROPN (22; 6% instances), AUX (20; 5% instances), VERB (15; 4% instances), ADV (13; 4% instances), ADP (12; 3% instances), NUM (10; 3% instances), SCONJ (3; 1% instances)