home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Soi-AHA: POS Tags: NOUN

There are 16 NOUN lemmas (42%), 16 NOUN types (39%) and 16 NOUN tokens (29%). Out of 8 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: اُوِ, بار, برنج, بِرا, سات, سال, صب, عبدولو, علی, لباس

The 10 most frequent NOUN types: اُوِ, بار, برنج, بِرا, سات, سال, صبا, عبدولو, علی, لباس

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.078947).

The 1st highest number of forms (1) was observed with the lemma “اُوِ”: اُوِ.

The 2nd highest number of forms (1) was observed with the lemma “بار”: بار.

The 3rd highest number of forms (1) was observed with the lemma “برنج”: برنج.

NOUN occurs with 1 features: Number (16; 100% instances)

NOUN occurs with 1 feature-value pairs: Number=Sing

NOUN occurs with 1 feature combinations. The most frequent feature combination is Number=Sing (16 tokens). Examples: اُوِ, بار, برنج, بِرا, سات, سال, صبا, عبدولو, علی, لباس

Relations

NOUN nodes are attached to their parents using 7 different relations: obl (4; 25% instances), nmod:poss (3; 19% instances), nsubj (3; 19% instances), obj (3; 19% instances), compound:lvc (1; 6% instances), flat (1; 6% instances), nmod (1; 6% instances)

Parents of NOUN nodes belong to 4 different parts of speech: VERB (11; 69% instances), NOUN (3; 19% instances), ADV (1; 6% instances), NUM (1; 6% instances)

8 (50%) NOUN nodes are leaves.

7 (44%) NOUN nodes have one child.

1 (6%) NOUN nodes have two children.

The highest child degree of a NOUN node is 2.

Children of NOUN nodes are attached using 4 different relations: nummod (4; 44% instances), nmod:poss (3; 33% instances), case (1; 11% instances), flat (1; 11% instances)

Children of NOUN nodes belong to 4 different parts of speech: NUM (4; 44% instances), NOUN (3; 33% instances), ADP (1; 11% instances), PRON (1; 11% instances)