home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bokota-ChibErgIS: POS Tags: NOUN

There are 129 NOUN lemmas (35%), 144 NOUN types (32%) and 560 NOUN tokens (21%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: koi, kiro, chi, enusuli, u, chebi, die, gli, giska, nga

The 10 most frequent NOUN types: koi, kiro, chi, u, die, enusuli, gli, giska, nga, bla

The 10 most frequent ambiguous lemmas: chi (VERB 43, NOUN 23), bli (NOUN 8, VERB 3), i (NOUN 6, CCONJ 1), be (AUX 42, NOUN 2), soli (ADJ 4, NOUN 2), ke (ADP 55, NOUN 1), ski (VERB 20, NOUN 1)

The 10 most frequent ambiguous types: i (NOUN 6, CCONJ 1), be (AUX 42, NOUN 2), soli (ADJ 4, NOUN 2), ke (ADP 55, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.116279 (the average of all parts of speech is 1.193029).

The 1st highest number of forms (3) was observed with the lemma “nienu”: nienu, nienua, nienunga.

The 2nd highest number of forms (3) was observed with the lemma “samla”: samla, samlaia, samlale.

The 3rd highest number of forms (2) was observed with the lemma “chebi”: chebi, chebia.

NOUN occurs with 1 features: Foreign (11; 2% instances)

NOUN occurs with 1 feature-value pairs: Foreign=Yes

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (549 tokens). Examples: koi, kiro, chi, u, die, enusuli, gli, giska, nga, bla

Relations

NOUN nodes are attached to their parents using 15 different relations: obj (233; 42% instances), obl:arg (110; 20% instances), nsubj (98; 18% instances), nmod:poss (25; 4% instances), obl:lmod (19; 3% instances), obl:mod (18; 3% instances), conj (17; 3% instances), nsubj:pass (13; 2% instances), nmod (12; 2% instances), reparandum (4; 1% instances), obl:tmod (3; 1% instances), xcomp (3; 1% instances), compound (2; 0% instances), root (2; 0% instances), obl (1; 0% instances)

Parents of NOUN nodes belong to 10 different parts of speech: VERB (491; 88% instances), NOUN (54; 10% instances), ADJ (3; 1% instances), ADV (3; 1% instances), ADP (2; 0% instances), PRON (2; 0% instances), (2; 0% instances), NUM (1; 0% instances), PROPN (1; 0% instances), X (1; 0% instances)

307 (55%) NOUN nodes are leaves.

184 (33%) NOUN nodes have one child.

57 (10%) NOUN nodes have two children.

12 (2%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 5.

Children of NOUN nodes are attached using 18 different relations: case (132; 39% instances), nmod:poss (70; 21% instances), amod (38; 11% instances), nmod (18; 5% instances), conj (16; 5% instances), advmod (12; 4% instances), dep (12; 4% instances), det (10; 3% instances), flat (8; 2% instances), cc (5; 1% instances), punct (4; 1% instances), acl (2; 1% instances), acl:relcl (2; 1% instances), nummod (2; 1% instances), obl:lmod (2; 1% instances), reparandum (2; 1% instances), compound (1; 0% instances), cop (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: ADP (134; 40% instances), NOUN (54; 16% instances), PRON (44; 13% instances), ADJ (38; 11% instances), DET (21; 6% instances), ADV (12; 4% instances), PROPN (12; 4% instances), NUM (6; 2% instances), X (6; 2% instances), PUNCT (4; 1% instances), VERB (4; 1% instances), AUX (1; 0% instances), PART (1; 0% instances)