
This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-Cairo: POS Tags: NOUN

There are 25 NOUN lemmas (23%), 26 NOUN types (22%) and 27 NOUN tokens (16%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent NOUN lemmas: mašīna, vēstule, bronza, brālis, draudzene, dzeršana, galvaspilsēta, iemesls, istaba, jausma

The 10 most frequent NOUN types: mašīnu, Marija, Meitene, bronzu, brālis, draudzenei, dzeršanu, galvaspilsētā, iemesla, istabu

The 10 most frequent ambiguous lemmas: (none)

The 10 most frequent ambiguous types: Marija (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of NOUN is 1.040000 (the average of all parts of speech is 1.102804).

The 1st highest number of forms (2) was observed with the lemma “vēstule”: vēstule, vēstuli.

The 2nd highest number of forms (1) was observed with the lemma “bronza”: bronzu.

The 3rd highest number of forms (1) was observed with the lemma “brālis”: brālis.
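The form / lemma ratio reported above can be reproduced with a short sketch. It assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, which agrees with 26 types over 25 lemmas giving 1.04. The function name `form_lemma_ratio` is hypothetical, and the sample holds only the three lemmas listed above, not the full treebank.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Return distinct forms per lemma for an iterable of (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        raise ValueError("no (form, lemma) pairs given")
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Partial sample: only the three lemmas named on this page.
sample = [
    ("vēstule", "vēstule"),
    ("vēstuli", "vēstule"),
    ("bronzu", "bronza"),
    ("brālis", "brālis"),
]
sample_ratio = form_lemma_ratio(sample)  # 4 forms over 3 lemmas

# Whole-treebank figure from the counts on this page (assumed definition).
page_ratio = 26 / 25
```

Because only "vēstule" has more than one form, the treebank-wide ratio stays close to 1.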

NOUN occurs with 3 features: Case (27; 100% instances), Gender (27; 100% instances), Number (27; 100% instances)

NOUN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Coll, Number=Plur, Number=Sing

NOUN occurs with 12 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (7 tokens). Examples: mašīnu, bronzu, dzeršanu, istabu, smēķēšanu, vēstuli
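A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. The following minimal sketch splits such a string into a dictionary; the helper name `parse_feats` is hypothetical and not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if not feats or feats == "_":
        return {}
    # Split on the first "=" only, so the value is kept whole.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = parse_feats("Case=Acc|Gender=Fem|Number=Sing")
```

Here `most_frequent["Case"]` is `"Acc"`, so tokens such as mašīnu or vēstuli can be grouped by any single feature.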

Relations

NOUN nodes are attached to their parents using 8 different relations: obj (9; 33% instances), nsubj (7; 26% instances), orphan (3; 11% instances), conj (2; 7% instances), iobj (2; 7% instances), obl (2; 7% instances), appos (1; 4% instances), root (1; 4% instances)
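The percentages in the relation list can be checked against the raw counts. This sketch assumes each share is the count divided by the total number of NOUN tokens, rounded to the nearest whole percent; `percent_shares` is a hypothetical helper name.

```python
def percent_shares(counts):
    """Map each label to its share of the total, in whole percent."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Counts copied from the relation list above.
noun_relations = {
    "obj": 9, "nsubj": 7, "orphan": 3, "conj": 2,
    "iobj": 2, "obl": 2, "appos": 1, "root": 1,
}
total_nouns = sum(noun_relations.values())
shares = percent_shares(noun_relations)
```

The counts sum to 27, the number of NOUN tokens, and the rounded shares match the listed values, which is why they add up to 99 rather than 100.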

Parents of NOUN nodes belong to 5 different parts of speech: VERB (19; 70% instances), PROPN (4; 15% instances), NOUN (2; 7% instances), ADJ (1; 4% instances), [no tag: artificial root] (1; 4% instances)

13 (48%) NOUN nodes are leaves.

9 (33%) NOUN nodes have one child.

3 (11%) NOUN nodes have two children.

2 (7%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.
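The leaf and child-count figures above come from counting, for every NOUN node, how many tokens name it as their head. A minimal sketch of that count follows. It assumes one sentence given as (id, head, upos) triples with head 0 marking the root. The function name `child_degree_buckets` and the toy sentence are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def child_degree_buckets(tokens, upos="NOUN"):
    """Bucket nodes with the given tag by number of children: 0, 1, 2 or '3+'."""
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, tag in tokens:
        if tag == upos:
            degree = n_children[tok_id]
            buckets["3+" if degree >= 3 else degree] += 1
    return buckets


# Hypothetical toy sentence: DET -> NOUN -> VERB <- NOUN, plus final punctuation.
toy = [
    (1, 2, "DET"),
    (2, 3, "NOUN"),
    (3, 0, "VERB"),
    (4, 3, "NOUN"),
    (5, 3, "PUNCT"),
]
toy_buckets = child_degree_buckets(toy)

# Figures reported on this page, for a consistency check.
page_buckets = {0: 13, 1: 9, 2: 3, "3+": 2}
page_total = sum(page_buckets.values())
```

In the toy sentence one NOUN is a leaf and one has a single child. The reported buckets sum to 27, the number of NOUN tokens.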

Children of NOUN nodes are attached using 12 different relations: det (6; 25% instances), discourse (3; 13% instances), punct (3; 13% instances), amod (2; 8% instances), cc (2; 8% instances), nmod (2; 8% instances), acl (1; 4% instances), case (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), nsubj (1; 4% instances), orphan (1; 4% instances)

Children of NOUN nodes belong to 11 different parts of speech: DET (5; 21% instances), PART (3; 13% instances), PROPN (3; 13% instances), PUNCT (3; 13% instances), ADJ (2; 8% instances), CCONJ (2; 8% instances), NOUN (2; 8% instances), ADP (1; 4% instances), AUX (1; 4% instances), PRON (1; 4% instances), VERB (1; 4% instances)