home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tswana-Popapolelo: POS Tags: NOUN

There are 1 NOUN lemmas (9%), 24 NOUN types (21%) and 26 NOUN tokens (12%). Out of 11 observed tags, the rank of NOUN is: 5 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent NOUN lemmas: _

The 10 most frequent NOUN types: koloi, lekwalo, Moagisani, Mosetsana, Rre, baesekele, bohibidu, boronse, gauta, kakanyo

The 10 most frequent ambiguous lemmas: _ (PRON 43, VERB 34, PART 32, NOUN 26, PUNCT 23, PROPN 15, ADV 12, AUX 12, CCONJ 8, SCONJ 5, ADJ 4)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 24.000000 (the average of all parts of speech is 10.181818).

The 1st highest number of forms (24) was observed with the lemma “_”: Moagisani, Mosetsana, Rre, baesekele, bohibidu, boronse, gauta, kakanyo, koloi, lebaka, lebelo, legora, lekwalo, letlhabaphefo, letsatsi, monna, moriri, morwarraagwe, mošate, naga, phaposing, pula, selefera, tsala.

NOUN occurs with 1 features: NounClass (26; 100% instances)

NOUN occurs with 6 feature-value pairs: NounClass=Bantu1, NounClass=Bantu14, NounClass=Bantu3, NounClass=Bantu5, NounClass=Bantu7, NounClass=Bantu9

NOUN occurs with 6 feature combinations. The most frequent feature combination is NounClass=Bantu9 (10 tokens). Examples: koloi, baesekele, boronse, gauta, kakanyo, naga, phaposing, pula, tsala

Relations

NOUN nodes are attached to their parents using 11 different relations: obj (8; 31% instances), nsubj (6; 23% instances), orphan (3; 12% instances), obl (2; 8% instances), advcl (1; 4% instances), appos (1; 4% instances), conj (1; 4% instances), iobj (1; 4% instances), obl:lmod (1; 4% instances), obl:tmod (1; 4% instances), root (1; 4% instances)

Parents of NOUN nodes belong to 6 different parts of speech: VERB (17; 65% instances), PROPN (3; 12% instances), AUX (2; 8% instances), NOUN (2; 8% instances), ADJ (1; 4% instances), (1; 4% instances)

11 (42%) NOUN nodes are leaves.

9 (35%) NOUN nodes have one child.

5 (19%) NOUN nodes have two children.

1 (4%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 11 different relations: nmod (9; 38% instances), case (5; 21% instances), punct (2; 8% instances), advmod (1; 4% instances), amod (1; 4% instances), cc (1; 4% instances), conj (1; 4% instances), cop (1; 4% instances), discourse (1; 4% instances), nsubj (1; 4% instances), orphan (1; 4% instances)

Children of NOUN nodes belong to 9 different parts of speech: PART (6; 25% instances), PRON (6; 25% instances), PROPN (3; 13% instances), ADJ (2; 8% instances), NOUN (2; 8% instances), PUNCT (2; 8% instances), ADV (1; 4% instances), AUX (1; 4% instances), CCONJ (1; 4% instances)