Treebank Statistics: UD_Albanian-TSA: POS Tags: NOUN
There are 184 NOUN
lemmas (44%), 217 NOUN
types (44%) and 238 NOUN
tokens (26%).
Out of 14 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: njeri, vend, edukim, familje, kohë, mënyrë, person, qytet, shoqëri, anëtar
The 10 most frequent NOUN
types: Dashuria, Evolucioni, Ishulli, dramaturgu, drejtimet, kohës, lloj, marrëdhënieve, mënyrë, njeriut
The 10 most frequent ambiguous lemmas: bashkim (NOUN 1, PROPN 1), shumë (ADV 4, NOUN 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NOUN
is 1.179348 (the average of all parts of speech is 1.167464).
The 1st highest number of forms (4) was observed with the lemma “edukim”: edukim, edukimi, edukimin, edukimit.
The 2nd highest number of forms (4) was observed with the lemma “vend”: vend, vende, vendet, vendin.
The 3rd highest number of forms (3) was observed with the lemma “familje”: Familja, familjen, familjes.
NOUN
occurs with 5 features: Case (235; 99% instances), Definite (235; 99% instances), Gender (235; 99% instances), Number (235; 99% instances), NounType (30; 13% instances)
NOUN
occurs with 13 feature-value pairs: Case=Abl
, Case=Acc
, Case=Acc,Nom
, Case=Dat
, Case=Gen
, Case=Nom
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, NounType=Het
, Number=Plur
, Number=Sing
NOUN
occurs with 45 feature combinations.
The most frequent feature combination is Case=Acc|Definite=Ind|Gender=Fem|Number=Sing
(21 tokens).
Examples: mënyrë, administrim, anë, bizhuteri, jetë, kuadër, kullotë, lehtësi, lindje, liri
Relations
NOUN
nodes are attached to their parents using 12 different relations: nsubj (55; 23% instances), obl (49; 21% instances), nmod:poss (43; 18% instances), obj (32; 13% instances), conj (25; 11% instances), nmod (23; 10% instances), root (6; 3% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), csubj (1; 0% instances), orphan (1; 0% instances)
Parents of NOUN
nodes belong to 8 different parts of speech: VERB (120; 50% instances), NOUN (91; 38% instances), ADJ (14; 6% instances), (6; 3% instances), PRON (3; 1% instances), ADV (2; 1% instances), DET (1; 0% instances), PROPN (1; 0% instances)
26 (11%) NOUN
nodes are leaves.
97 (41%) NOUN
nodes have one child.
72 (30%) NOUN
nodes have two children.
43 (18%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 8.
Children of NOUN
nodes are attached using 20 different relations: det (81; 20% instances), case (70; 18% instances), amod (68; 17% instances), nmod:poss (53; 13% instances), conj (23; 6% instances), cc (21; 5% instances), nmod (20; 5% instances), punct (17; 4% instances), acl:relcl (9; 2% instances), cop (9; 2% instances), nsubj (7; 2% instances), advmod (5; 1% instances), advcl (4; 1% instances), nummod (4; 1% instances), mark (3; 1% instances), obl (2; 1% instances), appos (1; 0% instances), csubj:pass (1; 0% instances), det:noun (1; 0% instances), orphan (1; 0% instances)
Children of NOUN
nodes belong to 14 different parts of speech: NOUN (91; 23% instances), ADP (69; 17% instances), ADJ (68; 17% instances), DET (59; 15% instances), PRON (31; 8% instances), CCONJ (21; 5% instances), PUNCT (17; 4% instances), VERB (12; 3% instances), AUX (9; 2% instances), PROPN (8; 2% instances), ADV (6; 2% instances), NUM (4; 1% instances), SCONJ (3; 1% instances), PART (2; 1% instances)