This page pertains to UD version 2.

Treebank Statistics: UD_Nenets-Tundra: POS Tags: NOUN

There are 74 NOUN lemmas (22%), 194 NOUN types (35%) and 361 NOUN tokens (28%). Out of 12 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.
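As a rough illustration of how lemma, type and token counts like these could be derived, the sketch below counts them for one part of speech in CoNLL-U text. It assumes a "type" is a distinct word form and a "token" is any word line; the helper names `make_row` and `pos_counts` are hypothetical, and the sample rows are a toy fragment with invented annotation, not sentences from the treebank.

```python
def make_row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U word line (unused columns are '_')."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for words tagged with `upos`."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])
            tokens += 1
    return len(lemmas), len(types), tokens


# Toy fragment: the heads and relations are made up for illustration only.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    make_row(1, "груша", "груша", "NOUN", 3, "obj"),
    make_row(2, "груши", "груша", "NOUN", 3, "obj"),
    make_row(3, "пя", "пя", "VERB", 0, "root"),
    make_row(4, "тарка", "тарка", "NOUN", 3, "nsubj"),
    make_row(5, "груша", "груша", "NOUN", 3, "obj"),
])

noun_lemmas, noun_types, noun_tokens = pos_counts(SAMPLE, "NOUN")
print(noun_lemmas, noun_types, noun_tokens)
```

Whether the page's generator treats forms case-sensitively is not stated here, so the type count in this sketch may differ from the official figure on real data.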

The 10 most frequent NOUN lemmas: хасава, груша, яха, ӈацекы, тарка, хыдя, си, харад, хэв, марˮ

The 10 most frequent NOUN types: хасава, яхаʼ, марядʼ, пулʼ, яхамʼ, ӈацекы, танцяʼ, харадмʼ, нёʼ, си

The 10 most frequent ambiguous lemmas: сава (NOUN 8, ADJ 1), пя (NOUN 7, VERB 2), яд (NOUN 4, VERB 1), ӈэ (NOUN 2, VERB 1), пэр (VERB 2, NOUN 1), хая- (VERB 2, NOUN 1)

The 10 most frequent ambiguous types: нянда (NOUN 2, ADP 1, PRON 1), нимня (ADP 2, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 2.621622 (the average of all parts of speech is 1.619469).
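The reported ratio is consistent with dividing the number of NOUN types (194) by the number of NOUN lemmas (74) given at the top of the page. A minimal check, assuming that is indeed how the figure is computed (the function name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# Counts taken from the summary line of this page.
ratio = form_lemma_ratio(194, 74)
print(f"{ratio:.6f}")
```

A value of about 2.6 means that each NOUN lemma appears in roughly two to three distinct forms on average, which is well above the treebank-wide average of about 1.6.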

The 1st highest number of forms (12) was observed with the lemma “груша”: груша, грушаˮ, грушадар, грушамʼ, грушамда?мэ?, груши, грушиʼ, грушиˮ, грушида, грушидам’, грушина, грушита.

The 2nd highest number of forms (10) was observed with the lemma “ӈацекы”: ӈацекэко, ӈацекы, ӈацекыˮ, ӈацекымʼ, ӈацекэко, ӈацекэко?мэ?, ӈацекэкоʼ?мэ?, ӈацекэкоˮ, ӈацекэр?мэ?, ӈацекэра.

The 3rd highest number of forms (8) was observed with the lemma “тарка”: тарка, таркаʼ, таркавна, таркамʼ, таркамда, тарканʼ, тарканда, таркахаюта.

NOUN occurs with 2 features: Number (28; 8% instances), Person (28; 8% instances)

NOUN occurs with 4 feature-value pairs: Number=Sing, Person=1, Person=2, Person=3

NOUN occurs with 4 feature combinations. The most frequent feature combination is _ (333 tokens). Examples: хасава, яхаʼ, пулʼ, яхамʼ, ӈацекы, танцяʼ, харадмʼ, нёʼ, си, хыдяʼ

Relations

NOUN nodes are attached to their parents using 11 different relations: obl:mod (122; 34% instances), obj (76; 21% instances), nmod:poss (60; 17% instances), nsubj (47; 13% instances), nmod (32; 9% instances), reparandum (10; 3% instances), root (7; 2% instances), parataxis (4; 1% instances), nsubj:outer (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
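The percentages in this list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of NOUN tokens, rounded to the nearest whole percent; the variable names are illustrative only.

```python
# Relation counts for NOUN nodes, copied from the list above.
parent_relations = {
    "obl:mod": 122, "obj": 76, "nmod:poss": 60, "nsubj": 47,
    "nmod": 32, "reparandum": 10, "root": 7, "parataxis": 4,
    "nsubj:outer": 1, "vocative": 1, "xcomp": 1,
}

# Every NOUN token has exactly one incoming relation, so the counts
# should add up to the number of NOUN tokens.
total = sum(parent_relations.values())

# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in parent_relations.items()}

for rel, pct in shares.items():
    print(f"{rel}: {parent_relations[rel]} ({pct}%)")
```

Because each share is rounded independently, the rounded percentages need not sum to exactly 100, and relations with a single instance show up as 0%.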

Parents of NOUN nodes belong to 7 different parts of speech: VERB (230; 64% instances), NOUN (106; 29% instances), ADV (8; 2% instances), ADJ (7; 2% instances), none, i.e. the artificial root (7; 2% instances), AUX (2; 1% instances), ADP (1; 0% instances)

161 (45%) NOUN nodes are leaves.

121 (34%) NOUN nodes have one child.

56 (16%) NOUN nodes have two children.

23 (6%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.
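The child-degree figures above describe how many dependents each NOUN node has. A minimal sketch of that computation, assuming one sentence is given as parallel lists of 1-based head indices (0 for the root) and UPOS tags; the function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(heads, upos_tags, upos):
    """Map child count -> number of nodes tagged `upos` with that many children.

    heads[i] is the 1-based index of the head of token i+1 (0 = root).
    """
    # Number of children per node, keyed by the node's 1-based index.
    children = Counter(h for h in heads if h != 0)
    return Counter(
        children.get(i, 0)
        for i, tag in enumerate(upos_tags, start=1)
        if tag == upos
    )


# Toy sentence: token 2 is the root; tokens 4 and 5 depend on token 3.
heads = [2, 0, 2, 3, 3]
tags = ["NOUN", "VERB", "NOUN", "ADJ", "ADP"]

hist = child_degree_histogram(heads, tags, "NOUN")
print(dict(hist))
```

In this toy sentence one NOUN is a leaf and the other has two children. Summing such histograms over all sentences of a treebank would give counts comparable to the leaf, one-child and two-children figures reported here.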

Children of NOUN nodes are attached using 16 different relations: case (63; 20% instances), nmod:poss (60; 19% instances), discourse (54; 17% instances), nmod (36; 11% instances), amod (23; 7% instances), nummod (18; 6% instances), acl (17; 5% instances), punct (13; 4% instances), reparandum (10; 3% instances), det (7; 2% instances), parataxis (5; 2% instances), nsubj (4; 1% instances), dep (3; 1% instances), advcl (1; 0% instances), aux (1; 0% instances), cop (1; 0% instances)

Children of NOUN nodes belong to 11 different parts of speech: NOUN (106; 34% instances), ADP (64; 20% instances), INTJ (34; 11% instances), ADJ (23; 7% instances), NUM (20; 6% instances), VERB (19; 6% instances), X (19; 6% instances), PUNCT (13; 4% instances), DET (8; 3% instances), PRON (8; 3% instances), AUX (2; 1% instances)