Treebank Statistics: UD_Albanian-STAF: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
1807 tokens (51%) have a non-empty value of Number
.
990 types (81%) occur at least once with a non-empty value of Number
.
750 lemmas (76%) occur at least once with a non-empty value of Number
.
The feature is used with 7 part-of-speech tags: NOUN (602; 17% instances), VERB (337; 9% instances), PRON (333; 9% instances), DET (234; 7% instances), ADJ (162; 5% instances), AUX (108; 3% instances), PROPN (31; 1% instances).
NOUN
602 NOUN tokens (96% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Fem (354; 59%), Definite=Def (349; 58%).
NOUN
tokens may have the following values of Number
:
Plur
(109; 18% of non-emptyNumber
): sytë, njerëz, njerëzit, njerëzve, ditë, ditët, përkujdesjet, rrethana, çaste, BisedimetSing
(493; 82% of non-emptyNumber
): gjenerali, shi, Nëna, fillim, gjendjes, prifti, shtëpia, arsye, babai, borësEMPTY
(23): Mysafiri, babait, brejtja, djalë, errur, fillin, fundin, gjendje, here, ide
Paradigm njeri | Sing | Plur |
---|---|---|
Case=Acc|Definite=Def | njerëzit | |
Case=Dat|Definite=Def | njerëzve | |
Case=Dat|Definite=Ind | njeriu | |
Case=Gen|Definite=Def | njerëzve | |
Case=Nom|Definite=Def | njerëzit | |
Case=Nom|Definite=Ind | njeri, njeriu | njerëz |
Number
seems to be lexical feature of NOUN
. 95% lemmas (365) occur only with one value of Number
.
VERB
337 VERB tokens (78% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: VerbForm=EMPTY (337; 100%), Mood=Ind (317; 94%), Voice=Act (268; 80%), Person=3 (240; 71%), Tense=Past (181; 54%).
VERB
tokens may have the following values of Number
:
Plur
(62; 18% of non-emptyNumber
): gjejmë, dilnim, gjej, kalonin, prijnë, Dua, Kemi, Mbetemi, Mjafton, bindenSing
(275; 82% of non-emptyNumber
): di, tha, ka, ndodhesha, bëri, shfaq, bëhet, bën, dinte, kamEMPTY
(93): bërë, filluar, thënë, mbyllur, ngjarë, hequr, largova, marrë, mësuar, përpjekur
Paradigm kam | Sing | Plur |
---|---|---|
Mood=Ind|Person=1|Tense=Past|Voice=Act | kam, kisha | |
Mood=Ind|Person=1|Tense=Pres | kam | |
Mood=Ind|Person=1|Tense=Pres|Voice=Act | kam, ke | Kemi |
Mood=Ind|Person=3|Tense=Past|Voice=Act | kishte | |
Mood=Ind|Person=3|Tense=Pres|Voice=Act | ka | kapin |
Mood=Sub|Person=3|Tense=Pres|Voice=Act | ketë |
Number
seems to be lexical feature of VERB
. 92% lemmas (178) occur only with one value of Number
.
PRON
333 PRON tokens (77% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=Prs (248; 74%).
PRON
tokens may have the following values of Number
:
Plur
(55; 17% of non-emptyNumber
): i, cilët, na, ata, këto, ato, ne, tjerë, tyre, KëtaSing
(278; 83% of non-emptyNumber
): e, i, më, unë, ai, kjo, tij, ky, ajo, atëEMPTY
(97): që, ç’, asgjë, e, më, ndonjë, asnjë, kush, çdo, diçka
Paradigm ai | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc|Person=3|PronType=Prs | e, atë, i, të | i |
Case=Acc|Gender=Masc|PronType=Dem | atë | |
Case=Acc|Gender=Masc|PronType=Prs | i | |
Case=Acc|Gender=Fem|Person=3|PronType=Prs | e | i |
Case=Acc|Person=1|PronType=Prs | e | |
Case=Acc|Person=3|PronType=Prs | e | i |
Case=Dat|Gender=Masc|Person=3|PronType=Prs | atij, i | |
Case=Dat|Gender=Fem|Person=3|PronType=Prs | i | |
Case=Dat|Person=3|PronType=Prs | i | |
Case=Nom|Gender=Masc|Person=3|PronType=Dem | ai | |
Case=Nom|Gender=Masc|Person=3|PronType=Prs | ai | |
Case=Nom|Gender=Masc|PronType=Dem | atë | |
Case=Nom|Person=1|PronType=Prs | e | |
Case=Nom|Person=2|PronType=Prs | e |
DET
234 DET tokens (78% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=EMPTY (234; 100%), PronType=Art (157; 67%), Gender=Fem (146; 62%).
DET
tokens may have the following values of Number
:
Plur
(49; 21% of non-emptyNumber
): të, e, sëSing
(185; 79% of non-emptyNumber
): e, të, i, sëEMPTY
(66): një, e, të, i, nja, pak
Paradigm e | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc|PronType=Art | e | e |
Case=Acc|Gender=Fem|PronType=Art | e | e |
Case=Gen|Gender=Fem|PronType=Art | së | |
Case=Nom|Gender=Masc|PronType=Art | e | |
Case=Nom|Gender=Fem | e | |
Case=Nom|Gender=Fem|PronType=Art | e | e |
Gender=Masc | e | e |
Gender=Fem | e | e |
Gender=Fem|PronType=Art | e |
ADJ
162 ADJ tokens (91% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (151; 93%), Gender=Fem (101; 62%), Case=Nom (82; 51%).
ADJ
tokens may have the following values of Number
:
Plur
(30; 19% of non-emptyNumber
): para, bardha, befta, devotshme, dinjitoze, dridhura, errëta, fshehura, fundit, fëmijëshSing
(132; 81% of non-emptyNumber
): bardhë, bukur, fundit, parë, sigurt, djathtë, huaj, re, errët, gabuarEMPTY
(16): dytë, fundit, hijerëndë, imperiale, kureshtar, lodhun, relative, rrallë, saktë, shqiptari
Paradigm bardhë | Sing | Plur |
---|---|---|
Case=Acc|Gender=Fem | bardhë | |
Case=Gen|Gender=Masc | bardhë | |
Case=Gen|Gender=Fem | bardhë | |
Case=Nom|Gender=Masc | bardhë | |
Case=Nom|Gender=Fem | bardhë | bardha |
Number
seems to be lexical feature of ADJ
. 93% lemmas (103) occur only with one value of Number
.
AUX
108 AUX tokens (78% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Voice=Act (104; 96%), Mood=Ind (100; 93%), Person=3 (92; 85%).
AUX
tokens may have the following values of Number
:
Plur
(15; 14% of non-emptyNumber
): kishin, kanë, ishin, janë, jemi, kam, keni, paskëshinSing
(93; 86% of non-emptyNumber
): ishte, është, kishte, ka, jam, isha, jesh, jetë, ketë, kishaEMPTY
(30): u, duhet, ishim, ishte, është
Paradigm jam | Sing | Plur |
---|---|---|
Aspect=Imp|Mood=Ind|Person=3|Tense=Past|Voice=Act | ishte | |
Mood=Ind|Person=1|Tense=Past|Voice=Act | isha, kisha | |
Mood=Ind|Person=1|Tense=Pres|Voice=Act | jam | jemi |
Mood=Ind|Person=2|Tense=Pres|Voice=Act | je | |
Mood=Ind|Person=3|Tense=Past|Voice=Act | ishte | ishin |
Mood=Ind|Person=3|Tense=Pres | Ishte | |
Mood=Ind|Person=3|Tense=Pres|Voice=Act | është, qe | janë |
Mood=Sub|Person=3|Tense=Pres|Voice=Act | jesh, jetë |
PROPN
31 PROPN tokens (79% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Definite=Def (24; 77%), Gender=Masc (20; 65%).
PROPN
tokens may have the following values of Number
:
Plur
(1; 3% of non-emptyNumber
): VedatSing
(30; 97% of non-emptyNumber
): Ernesti, Ernestit, Shqipëri, Linda, Vedati, Berti, Dizit, Ernest, Ervehenë, HadiEMPTY
(8): Bamit, Dizi, Dizin, Ernesti, Lindën, Nerminja, Odise, Varrit
Paradigm Vedat | Sing | Plur |
---|---|---|
Case=Gen|Gender=Masc | Vedatit | |
Case=Nom|Gender=Masc | Vedati | |
Case=Nom|Gender=Fem | Vedat |
Number
seems to be lexical feature of PROPN
. 94% lemmas (16) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
ADJ –[det:adj]–> DET (110; 92%),
NOUN –[amod]–> ADJ (107; 91%),
VERB –[obl]–> NOUN (83; 55%),
VERB –[nsubj]–> NOUN (77; 86%),
VERB –[obj]–> NOUN (76; 68%),
VERB –[obj]–> PRON (51; 52%),
VERB –[iobj]–> PRON (47; 64%),
VERB –[conj]–> VERB (44; 73%),
VERB –[nsubj]–> PRON (40; 60%),
NOUN –[nmod]–> NOUN (35; 71%).