Treebank Statistics: UD_Azerbaijani-TueCL: POS Tags: NOUN
There are 111 NOUN
lemmas (39%), 150 NOUN
types (35%) and 243 NOUN
tokens (27%).
Out of 15 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: kitab, ev, mədrəsə, uşaq, ana, söbh, maşın, qardaş, fikr, otaq
The 10 most frequent NOUN
types: evdə, kitab, kitabı, mədrəsəyə, söbh, Fikr, evə, uşaq, Moəllim, anasın
The 10 most frequent ambiguous lemmas: ev (NOUN 25, PROPN 1), Deniz (PROPN 58, NOUN 2), _ (PUNCT 3, NOUN 2, PROPN 1, VERB 1), abi (ADJ 3, NOUN 2), ki (SCONJ 4, ADP 2, NOUN 2)
The 10 most frequent ambiguous types: Deniz (PROPN 55, NOUN 2), abisi (NOUN 2, ADJ 1), Ayşə (PROPN 3, NOUN 1), ki (SCONJ 3, ADP 2, NOUN 1), kinin (NOUN 1, PRON 1)
- Deniz
- abisi
- Ayşə
- ki
- kinin
Morphology
The form / lemma ratio of NOUN
is 1.351351 (the average of all parts of speech is 1.486014).
The 1st highest number of forms (8) was observed with the lemma “ev”: Evin, ev, evdə, evdədir, evdəki, evdəyidi, evlərdə, evə.
The 2nd highest number of forms (4) was observed with the lemma “kitab”: kitab, kitaba, kitabdır, kitabı.
The 3rd highest number of forms (4) was observed with the lemma “mədrəsə”: mədrəsəde, mədrəsədə, mədrəsələrdə, mədrəsəyə.
NOUN
occurs with 7 features: Number (233; 96% instances), Case (224; 92% instances), Number[psor] (55; 23% instances), Person[psor] (53; 22% instances), Person (10; 4% instances), Tense (6; 2% instances), Mood (3; 1% instances)
NOUN
occurs with 18 feature-value pairs: Case=Abl
, Case=Acc
, Case=Com
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Mood=Ind
, Number=Plur
, Number=Sing
, Number[psor]=Plur
, Number[psor]=Sing
, Person=3
, Person[psor]=1
, Person[psor]=3
, Tense=Past
, Tense=Pres
NOUN
occurs with 36 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing
(91 tokens).
Examples: kitab, söbh, Fikr, uşaq, Moəllim, dərs, çörək, Deniz, düktür, maşın
Relations
NOUN
nodes are attached to their parents using 13 different relations: obl (68; 28% instances), obj (55; 23% instances), nsubj (48; 20% instances), nmod (20; 8% instances), root (20; 8% instances), compound:lvc (13; 5% instances), compound (10; 4% instances), advcl (3; 1% instances), orphan (2; 1% instances), ccomp (1; 0% instances), conj (1; 0% instances), nmod:poss (1; 0% instances), nsubj:outer (1; 0% instances)
Parents of NOUN
nodes belong to 8 different parts of speech: VERB (165; 68% instances), NOUN (26; 11% instances), ADJ (23; 9% instances), (20; 8% instances), PROPN (3; 1% instances), AUX (2; 1% instances), NUM (2; 1% instances), PRON (2; 1% instances)
155 (64%) NOUN
nodes are leaves.
54 (22%) NOUN
nodes have one child.
18 (7%) NOUN
nodes have two children.
16 (7%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 5.
Children of NOUN
nodes are attached using 18 different relations: nmod (29; 20% instances), punct (24; 16% instances), nsubj (19; 13% instances), amod (15; 10% instances), det (13; 9% instances), case (7; 5% instances), nummod (7; 5% instances), aux (6; 4% instances), advcl (5; 3% instances), cop (5; 3% instances), acl (4; 3% instances), advmod (3; 2% instances), advmod:emph (2; 1% instances), appos (2; 1% instances), conj (2; 1% instances), ccomp (1; 1% instances), nmod:poss (1; 1% instances), obj (1; 1% instances)
Children of NOUN
nodes belong to 11 different parts of speech: NOUN (26; 18% instances), PUNCT (24; 16% instances), PROPN (21; 14% instances), ADJ (17; 12% instances), AUX (11; 8% instances), DET (11; 8% instances), PRON (8; 5% instances), VERB (8; 5% instances), ADP (7; 5% instances), NUM (7; 5% instances), ADV (6; 4% instances)