Treebank Statistics: UD_Nenets-Tundra: POS Tags: NOUN
There are 74 NOUN lemmas (22%), 194 NOUN types (35%) and 361 NOUN tokens (28%).
Out of 12 observed tags, the rank of NOUN is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.
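The lemma, type and token counts above can in principle be recomputed from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the names `pos_counts` and `rank`, the toy sample, and the decision to count types case-sensitively are all assumptions.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by tuple index (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

# Toy sample (hypothetical sentences; "xxx" is a placeholder verb form).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tтарка\tтарка\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\txxx\txxx\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\tтаркамʼ\tтарка\tNOUN\t_\t_\t2\tobj\t_\t_",
    "2\txxx\txxx\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tтарка\tтарка\tNOUN\t_\t_\t2\tobl\t_\t_",
])

counts = pos_counts(SAMPLE)
print(counts["NOUN"], rank(counts, "NOUN", 2))
```

Ties in `rank` are broken by first appearance in the file, which may differ from how the page resolves them.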
The 10 most frequent NOUN lemmas: хасава, груша, яха, ӈацекы, тарка, хыдя, си, харад, хэв, марˮ
The 10 most frequent NOUN types: хасава, яхаʼ, марядʼ, пулʼ, яхамʼ, ӈацекы, танцяʼ, харадмʼ, нёʼ, си
The 10 most frequent ambiguous lemmas: сава (NOUN 8, ADJ 1), пя (NOUN 7, VERB 2), яд (NOUN 4, VERB 1), ӈэ (NOUN 2, VERB 1), пэр (VERB 2, NOUN 1), хая- (VERB 2, NOUN 1)
The 10 most frequent ambiguous types: нянда (NOUN 2, ADP 1, PRON 1), нимня (ADP 2, NOUN 1)
Morphology
The form / lemma ratio of NOUN is 2.621622 (the average of all parts of speech is 1.619469).
The highest number of forms (12) was observed with the lemma “груша”: груша, грушаˮ, грушадар, грушамʼ, грушамда?мэ?, груши, грушиʼ, грушиˮ, грушида, грушидам’, грушина, грушита.
The 2nd highest number of forms (10) was observed with the lemma “ӈацекы”:
The 3rd highest number of forms (8) was observed with the lemma “тарка”: тарка, таркаʼ, таркавна, таркамʼ, таркамда, тарканʼ, тарканда, таркахаюта.
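The form / lemma ratio reported above is consistent with dividing the number of distinct NOUN types by the number of distinct NOUN lemmas given at the top of this page (194 / 74). The sketch below works through that arithmetic and counts the forms of “тарка” listed above; the helper names are illustrative, not part of any UD tooling.

```python
from collections import defaultdict

def form_lemma_ratio(n_types, n_lemmas):
    """Distinct word forms divided by distinct lemmas."""
    return n_types / n_lemmas

def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms

# Counts reported for NOUN on this page: 194 types, 74 lemmas.
noun_ratio = form_lemma_ratio(194, 74)
print(f"{noun_ratio:.6f}")  # 2.621622

# The forms listed on this page for the lemma "тарка".
tarka_forms = ["тарка", "таркаʼ", "таркавна", "таркамʼ",
               "таркамда", "тарканʼ", "тарканда", "таркахаюта"]
by_lemma = forms_per_lemma((form, "тарка") for form in tarka_forms)
print(len(by_lemma["тарка"]))  # 8
```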
NOUN occurs with 2 features: Number (28; 8% instances), Person (28; 8% instances)
NOUN occurs with 4 feature-value pairs: Number=Sing, Person=1, Person=2, Person=3
NOUN occurs with 4 feature combinations.
The most frequent feature combination is _, i.e. no features at all (333 tokens).
Examples: хасава, яхаʼ, пулʼ, яхамʼ, ӈацекы, танцяʼ, харадмʼ, нёʼ, си, хыдяʼ
Relations
NOUN nodes are attached to their parents using 11 different relations: obl:mod (122; 34% instances), obj (76; 21% instances), nmod:poss (60; 17% instances), nsubj (47; 13% instances), nmod (32; 9% instances), reparandum (10; 3% instances), root (7; 2% instances), parataxis (4; 1% instances), nsubj:outer (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Parents of NOUN nodes belong to 7 different parts of speech: VERB (230; 64% instances), NOUN (106; 29% instances), ADV (8; 2% instances), ADJ (7; 2% instances), the artificial root node, which has no tag (7; 2% instances), AUX (2; 1% instances), ADP (1; 0% instances)
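Each parenthesised pair above gives an absolute count and its share of all 361 NOUN tokens as a whole percent. The sketch below recomputes those shares for the incoming relations; `share_table` is a hypothetical helper, and rounding to the nearest integer is an assumption that happens to agree with every value listed.

```python
def share_table(counts):
    """Return (label, count, percent) rows, largest count first."""
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(label, n, round(100 * n / total)) for label, n in ordered]

# Incoming relation counts for NOUN nodes as reported on this page.
NOUN_RELATIONS = {
    "obl:mod": 122, "obj": 76, "nmod:poss": 60, "nsubj": 47, "nmod": 32,
    "reparandum": 10, "root": 7, "parataxis": 4,
    "nsubj:outer": 1, "vocative": 1, "xcomp": 1,
}

rows = share_table(NOUN_RELATIONS)
for label, n, pct in rows:
    print(f"{label} ({n}; {pct}%)")
```

Because each share is rounded independently, the percentages need not sum to exactly 100.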
161 (45%) NOUN nodes are leaves.
121 (34%) NOUN nodes have one child.
56 (16%) NOUN nodes have two children.
23 (6%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 8.
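The leaf and child-count figures above can be derived from the HEAD column alone: a node's child degree is the number of tokens in the same sentence whose HEAD points to it. A minimal sketch, assuming each sentence is already parsed into (id, upos, head) triples; the function name and the toy sentence are illustrative.

```python
from collections import Counter

def child_degree_histogram(sentences, upos):
    """Count nodes of the given UPOS by their number of children.

    `sentences` is an iterable of sentences, each a list of
    (token_id, upos, head_id) triples, with head_id 0 for the root.
    """
    histogram = Counter()
    for sentence in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _, _, head in sentence)
        for token_id, tag, _ in sentence:
            if tag == upos:
                histogram[n_children[token_id]] += 1
    return histogram

# Toy sentence (hypothetical): token 2 is the root VERB; NOUN 1 is a leaf,
# NOUN 4 has one child (ADJ 3).
toy = [[(1, "NOUN", 2), (2, "VERB", 0), (3, "ADJ", 4), (4, "NOUN", 2)]]
hist = child_degree_histogram(toy, "NOUN")
print(dict(hist))

# Sanity check on the page's own figures: the four classes cover all tokens.
reported = {"leaves": 161, "one child": 121, "two children": 56, "three or more": 23}
print(sum(reported.values()))  # 361
```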
Children of NOUN nodes are attached using 16 different relations: case (63; 20% instances), nmod:poss (60; 19% instances), discourse (54; 17% instances), nmod (36; 11% instances), amod (23; 7% instances), nummod (18; 6% instances), acl (17; 5% instances), punct (13; 4% instances), reparandum (10; 3% instances), det (7; 2% instances), parataxis (5; 2% instances), nsubj (4; 1% instances), dep (3; 1% instances), advcl (1; 0% instances), aux (1; 0% instances), cop (1; 0% instances)
Children of NOUN nodes belong to 11 different parts of speech: NOUN (106; 34% instances), ADP (64; 20% instances), INTJ (34; 11% instances), ADJ (23; 7% instances), NUM (20; 6% instances), VERB (19; 6% instances), X (19; 6% instances), PUNCT (13; 4% instances), DET (8; 3% instances), PRON (8; 3% instances), AUX (2; 1% instances)