home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latgalian-Cairo: POS Tags: NOUN

There are 25 NOUN lemmas (23%), 26 NOUN types (22%) and 27 NOUN tokens (16%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: mašyna, viestule, bronza, bruoļs, draudzine, dzeršona, dīna, golvyspiļsāta, jausma, kruosa

The 10 most frequent NOUN types: mašynu, Meitine, bronzu, bruoļs, draudzinei, dzeršonu, dīnā, golvyspiļsātā, jausmys, kruosā

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.040000 (the average of all parts of speech is 1.081818).

The 1st highest number of forms (2) was observed with the lemma “viestule”: viestule, viestuli.

The 2nd highest number of forms (1) was observed with the lemma “bronza”: bronzu.

The 3rd highest number of forms (1) was observed with the lemma “bruoļs”: bruoļs.

NOUN occurs with 3 features: Case (27; 100% instances), Gender (27; 100% instances), Number (27; 100% instances)

NOUN occurs with 9 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

NOUN occurs with 11 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (7 tokens). Examples: mašynu, bronzu, dzeršonu, peipiešonu, ustobu, viestuli

Relations

NOUN nodes are attached to their parents using 8 different relations: obj (9; 33% instances), nsubj (6; 22% instances), obl (3; 11% instances), orphan (3; 11% instances), conj (2; 7% instances), iobj (2; 7% instances), appos (1; 4% instances), root (1; 4% instances)

Parents of NOUN nodes belong to 5 different parts of speech: VERB (19; 70% instances), PROPN (4; 15% instances), NOUN (2; 7% instances), ADJ (1; 4% instances), (1; 4% instances)

12 (44%) NOUN nodes are leaves.

10 (37%) NOUN nodes have one child.

3 (11%) NOUN nodes have two children.

2 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 14 different relations: det (6; 24% instances), nmod (3; 12% instances), punct (3; 12% instances), amod (2; 8% instances), cc (2; 8% instances), acl (1; 4% instances), advmod:emph (1; 4% instances), advmod:neg (1; 4% instances), case (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), discourse (1; 4% instances), nsubj (1; 4% instances), orphan (1; 4% instances)

Children of NOUN nodes belong to 11 different parts of speech: DET (6; 24% instances), PART (3; 12% instances), PROPN (3; 12% instances), PUNCT (3; 12% instances), ADJ (2; 8% instances), CCONJ (2; 8% instances), NOUN (2; 8% instances), ADP (1; 4% instances), AUX (1; 4% instances), PRON (1; 4% instances), VERB (1; 4% instances)