Treebank Statistics: UD_Bokota-ChibErgIS: POS Tags: NOUN
There are 129 NOUN lemmas (35%), 144 NOUN types (32%) and 560 NOUN tokens (21%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN lemmas: koi, kiro, chi, enusuli, u, chebi, die, gli, giska, nga
The 10 most frequent NOUN types: koi, kiro, chi, u, die, enusuli, gli, giska, nga, bla
The 10 most frequent ambiguous lemmas: chi (VERB 43, NOUN 23), bli (NOUN 8, VERB 3), i (NOUN 6, CCONJ 1), be (AUX 42, NOUN 2), soli (ADJ 4, NOUN 2), ke (ADP 55, NOUN 1), ski (VERB 20, NOUN 1)
The 10 most frequent ambiguous types: i (NOUN 6, CCONJ 1), be (AUX 42, NOUN 2), soli (ADJ 4, NOUN 2), ke (ADP 55, NOUN 1)
- i
- be
- soli
- ke
Morphology
The form / lemma ratio of NOUN is 1.116279 (the average of all parts of speech is 1.193029).
The 1st highest number of forms (3) was observed with the lemma “nienu”: nienu, nienua, nienunga.
The 2nd highest number of forms (3) was observed with the lemma “samla”: samla, samlaia, samlale.
The 3rd highest number of forms (2) was observed with the lemma “chebi”: chebi, chebia.
NOUN occurs with 1 features: Foreign (11; 2% instances)
NOUN occurs with 1 feature-value pairs: Foreign=Yes
NOUN occurs with 2 feature combinations.
The most frequent feature combination is _ (549 tokens).
Examples: koi, kiro, chi, u, die, enusuli, gli, giska, nga, bla
Relations
NOUN nodes are attached to their parents using 15 different relations: obj (233; 42% instances), obl:arg (110; 20% instances), nsubj (98; 18% instances), nmod:poss (25; 4% instances), obl:lmod (19; 3% instances), obl:mod (18; 3% instances), conj (17; 3% instances), nsubj:pass (13; 2% instances), nmod (12; 2% instances), reparandum (4; 1% instances), obl:tmod (3; 1% instances), xcomp (3; 1% instances), compound (2; 0% instances), root (2; 0% instances), obl (1; 0% instances)
Parents of NOUN nodes belong to 10 different parts of speech: VERB (491; 88% instances), NOUN (54; 10% instances), ADJ (3; 1% instances), ADV (3; 1% instances), ADP (2; 0% instances), PRON (2; 0% instances), (2; 0% instances), NUM (1; 0% instances), PROPN (1; 0% instances), X (1; 0% instances)
307 (55%) NOUN nodes are leaves.
184 (33%) NOUN nodes have one child.
57 (10%) NOUN nodes have two children.
12 (2%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 5.
Children of NOUN nodes are attached using 18 different relations: case (132; 39% instances), nmod:poss (70; 21% instances), amod (38; 11% instances), nmod (18; 5% instances), conj (16; 5% instances), advmod (12; 4% instances), dep (12; 4% instances), det (10; 3% instances), flat (8; 2% instances), cc (5; 1% instances), punct (4; 1% instances), acl (2; 1% instances), acl:relcl (2; 1% instances), nummod (2; 1% instances), obl:lmod (2; 1% instances), reparandum (2; 1% instances), compound (1; 0% instances), cop (1; 0% instances)
Children of NOUN nodes belong to 13 different parts of speech: ADP (134; 40% instances), NOUN (54; 16% instances), PRON (44; 13% instances), ADJ (38; 11% instances), DET (21; 6% instances), ADV (12; 4% instances), PROPN (12; 4% instances), NUM (6; 2% instances), X (6; 2% instances), PUNCT (4; 1% instances), VERB (4; 1% instances), AUX (1; 0% instances), PART (1; 0% instances)