Treebank Statistics: UD_Pashto-Sikaram: Features: Number
This feature is universal.
It occurs with 4 different values: Coll, Plur, Ptan, Sing.
2651 tokens (48%) have a non-empty value of Number.
1141 types (83%) occur at least once with a non-empty value of Number.
860 lemmas (83%) occur at least once with a non-empty value of Number.
The feature is used with 7 part-of-speech tags: NOUN (1156; 21% of instances), ADJ (578; 11% of instances), VERB (333; 6% of instances), AUX (230; 4% of instances), PROPN (217; 4% of instances), DET (83; 2% of instances), PRON (54; 1% of instances).
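The per-tag counts above can be reproduced in spirit with a small script over a CoNLL-U file. The following is a minimal sketch, not the tool that generated this page: the function names and the toy `SAMPLE` fragment are hypothetical, and it assumes that only regular syntactic words are counted (multiword token ranges and empty nodes are skipped).

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Pashto-Sikaram).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tbooks\tbook\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_",
    "2\tare\tbe\tAUX\t_\tNumber=Plur|Person=3\t0\troot\t_\t_",
    "3\there\there\tADV\t_\t_\t2\tadvmod\t_\t_",
    "",
])


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def count_feature_by_upos(conllu_text, feature="Number"):
    """Per UPOS tag, count tokens with a non-empty `feature` and all tokens."""
    with_feature = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip ranges like 1-2 and empty nodes like 1.1
        upos = cols[3]
        total[upos] += 1
        if feature in parse_feats(cols[5]):
            with_feature[upos] += 1
    return with_feature, total


with_feature, total = count_feature_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_feature[upos], total[upos])
```

Dividing `with_feature[upos]` by `total[upos]` gives the share of a tag's tokens that carry the feature, which is how the per-tag percentages in the sections below are worded.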
NOUN
1156 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (658; 57%), Case=Nom (602; 52%).
NOUN tokens may have the following values of Number:
Coll (23; 2% of non-empty Number): خلکو, خلک, اطرافو, سلنه, شتو, مواد
Plur (308; 27% of non-empty Number): ژبو, کتابونه, خبرې, کتابونو, کسان, ارزښتونه, هېوادونو, ژبې, ارزښتونو, ماشومان
Ptan (1; 0% of non-empty Number): معلومات
Sing (824; 71% of non-empty Number): ژبه, ژبې, خوا, ژباړې, کار, ژباړه, توګه, وخت, ډول, برخه
| Paradigm ژبه | Sing | Plur |
|---|---|---|
| Case=Acc | ژبې | ژبو |
| Case=Loc | ژبه, ژبې | ژبو |
| Case=Nom | ژبه, ژبې | ژبې |
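A paradigm table like the one above groups the observed forms of one lemma by their remaining features (rows) and by Number (columns). The sketch below illustrates that grouping; `build_paradigm` and the placeholder forms are hypothetical, and the row key is simply the other features sorted by name and joined with `|`, which may differ from how the page orders them.

```python
from collections import defaultdict


def build_paradigm(rows, feature="Number"):
    """Group forms of one lemma into {other_feats: {feature_value: [forms]}}.

    rows: iterable of (form, feats) pairs, where feats is a dict such as
    {"Case": "Nom", "Number": "Sing"}. Tokens without `feature` are ignored.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, feats in rows:
        if feature not in feats:
            continue
        # Row label: all features except the one that defines the columns.
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[rest][feats[feature]].add(form)
    return {
        rest: {value: sorted(forms) for value, forms in cells.items()}
        for rest, cells in table.items()
    }


# Placeholder forms (hypothetical, not real treebank data).
toy_rows = [
    ("form_a", {"Case": "Nom", "Number": "Sing"}),
    ("form_b", {"Case": "Nom", "Number": "Sing"}),
    ("form_b", {"Case": "Nom", "Number": "Plur"}),
    ("form_c", {"Case": "Acc", "Number": "Plur"}),
]
paradigm = build_paradigm(toy_rows)
for rest in sorted(paradigm):
    print(rest, paradigm[rest])
```

A cell with more than one form, or the same form in both columns, shows that the form alone does not determine Number for that feature combination.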
ADJ
578 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Case=Nom (389; 67%), Gender=Masc (351; 61%).
ADJ tokens may have the following values of Number:
Plur (212; 37% of non-empty Number): نورو, زيات, ټولنیزو, پوهنیزو, نور, زياتې, اسلامي, نورې, هنري, اکثره
Sing (366; 63% of non-empty Number): زده, ښه, سمه, ټولنیز, بله, ناسمه, اصلي, اړ, اړوند, لږ
EMPTY (8): ق, هـ, خپور, م
| Paradigm نور | Sing | Plur |
|---|---|---|
| Case=Acc\|Gender=Masc | نورو | |
| Case=Acc\|Gender=Fem | نورو | |
| Case=Loc\|Gender=Masc | نورو | |
| Case=Loc\|Gender=Fem | نورو | |
| Case=Nom\|Gender=Masc | نور | نور |
| Case=Nom\|Gender=Fem | نوره | نورې |
VERB
333 VERB tokens (74% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (275; 83%), Case=EMPTY (273; 82%), Person=3 (255; 77%), Mood=Ind (205; 62%), Gender=EMPTY (196; 59%).
VERB tokens may have the following values of Number:
Plur (120; 36% of non-empty Number): کوي, ورکوي, شوي, وايي, کړي, کوو, کېږي, شي, ويلي, وڅېړو
Sing (213; 64% of non-empty Number): لري, کړي, شي, کېږي, کړه, کړې, شو, راځي, شوی, کړی
EMPTY (115): کولای, شته, ژباړل, کارول, کولو, لیکل, اخیستلای, لاړ, وهل, ګڼل
| Paradigm کول | Sing | Plur |
|---|---|---|
| Aspect=Imp\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | کاوه | کول |
| Aspect=Imp\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Pres\|Typo=Yes\|VerbForm=Fin | کوو | |
| Aspect=Imp\|Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | کوم | کوو |
| Aspect=Imp\|Mood=Ind\|Person=2\|Tense=Pres\|VerbForm=Fin | کوې | |
| Aspect=Imp\|Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | کوي | کوي |
| Aspect=Perf\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|Variant=Long\|VerbForm=Fin | وکړ | |
| Aspect=Perf\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | کړ | کړل |
| Aspect=Perf\|Gender=Fem\|Mood=Ind\|Person=3\|Tense=Past\|Variant=Short\|VerbForm=Fin | کړه, وکړه | |
| Aspect=Perf\|Mood=Imp\|Person=2\|Tense=Pres\|VerbForm=Fin | کړه | |
| Aspect=Perf\|Mood=Imp\|Person=2\|Variant=Long\|VerbForm=Fin | وکړه | |
| Case=Nom\|Gender=Masc\|Tense=Past\|VerbForm=Part | کړی | کړي |
| Case=Nom\|Gender=Fem\|Tense=Past\|VerbForm=Part | کړې | |
| Mood=Sub\|Person=1\|Tense=Past\|Variant=Long\|VerbForm=Fin | وکړو | |
| Mood=Sub\|Person=1\|Tense=Pres\|VerbForm=Fin | کړو | |
| Mood=Sub\|Person=1\|VerbForm=Fin | کړو | |
| Mood=Sub\|Person=3\|Variant=Long\|VerbForm=Fin | وکړي | وکړي |
| Mood=Sub\|Person=3\|VerbForm=Fin | کړي | کړي |
AUX
230 AUX tokens (85% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (210; 91%), Person=3 (204; 89%), Aspect=EMPTY (203; 88%), Mood=Ind (149; 65%), Tense=Pres (118; 51%).
AUX tokens may have the following values of Number:
Plur (73; 32% of non-empty Number): دي, شي, کېږي, شوي, وو, وي, شو, شول, وې, کېدل
Sing (157; 68% of non-empty Number): ده, شي, دی, وي, کېږي, وه, شوی, و, دﺉ, شوې
EMPTY (41): به, شوای, وای, وکولای, وکړای, کولای, کېدای
| Paradigm ول | Sing | Plur |
|---|---|---|
| Gender=Masc\|Mood=Ind\|Tense=Past | و | وو |
| Gender=Masc\|Mood=Ind\|Tense=Pres | دی, دﺉ | |
| Gender=Fem\|Mood=Ind\|Tense=Past | وه | وې |
| Gender=Fem\|Mood=Ind\|Tense=Pres | ده | دي |
| Mood=Ind\|Tense=Pres | دي | |
| Mood=Sub | وي | وي |
PROPN
217 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (109; 50%).
PROPN tokens may have the following values of Number:
Coll (1; 0% of non-empty Number): مغلو
Plur (24; 11% of non-empty Number): پښتانه, پښتنو, ايرانیان, ايرانیانو, فرانسويانو, پنجابیانو
Sing (192; 88% of non-empty Number): پښتو, اردو, افغانستان, پاړسي, کابل, احمد, بابا, وحید, پیتر, خوشال
Number seems to be a lexical feature of PROPN. 100% of lemmas (68) occur with only one value of Number.
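The "lexical feature" observation amounts to checking, for one part-of-speech tag, how many lemmas are attested with exactly one value of the feature. A minimal sketch of that check follows; `lemmas_with_single_value` and the toy lemma names are hypothetical, and the exact criterion used to generate this page is not stated in the source.

```python
from collections import defaultdict


def lemmas_with_single_value(observations):
    """Count lemmas attested with exactly one feature value.

    observations: iterable of (lemma, value) pairs for one UPOS tag,
    one pair per token that has a non-empty value.
    Returns (lemmas_with_one_value, all_lemmas).
    """
    values = defaultdict(set)
    for lemma, value in observations:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy observations (hypothetical lemma names, not real treebank data).
toy = [("lemma_a", "Sing"), ("lemma_a", "Sing"), ("lemma_b", "Plur")]
single, total = lemmas_with_single_value(toy)
print(f"{100 * single / total:.0f}% of lemmas ({total}) occur with only one value")
```

In a small treebank many lemmas occur only once or twice, so a 100% result may partly reflect sparse data rather than a property of the language.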
DET
83 DET tokens (42% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Variant=EMPTY (70; 84%), Poss=EMPTY (57; 69%), Reflex=EMPTY (57; 69%), Deixis=EMPTY (55; 66%), Gender=Fem (42; 51%).
DET tokens may have the following values of Number:
Plur (21; 25% of non-empty Number): خپلو, ټول, خپل, دغو, ټولې, کومې, دې, هغو, همدغو, ځینو
Sing (62; 75% of non-empty Number): خپل, دې, کوم, کومه, خپله, هغه, همدې, ټوله, دغه, هره
EMPTY (113): دغه, هغه, دې, څو, داسې, ځینې, هر, همدغه, دا, دغسې
| Paradigm دا | Sing | Plur |
|---|---|---|
| Case=Abl\|Gender=Fem | دې | |
| Case=Abl\|Gender=Fem\|Variant=Short | دې | |
| Case=Loc\|Gender=Masc | دې | |
| Case=Loc\|Gender=Fem | دې | دې |
| Case=Loc\|Gender=Fem\|Variant=Short | دې | |
PRON
54 PRON tokens (25% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Variant=EMPTY (54; 100%), Poss=EMPTY (50; 93%), PronType=Prs (46; 85%), Person=3 (28; 52%).
PRON tokens may have the following values of Number:
Plur (16; 30% of non-empty Number): دوی, هغوی, زموږ, موږ, ټولو
Sing (38; 70% of non-empty Number): ده, خپله, هغې, ځان, دا, دې, ما, هغه, هغۀ, ستا
EMPTY (162): يې, ور, دا, یې, دې, څه, همدا, چا, هرڅه, هغه
Number seems to be a lexical feature of PRON. 100% of lemmas (12) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (315; 95%),
NOUN –[nmod]–> NOUN (119; 60%),
VERB –[obj]–> NOUN (115; 61%),
VERB –[nsubj]–> NOUN (97; 64%),
NOUN –[conj]–> NOUN (87; 82%),
VERB –[conj]–> VERB (59; 60%),
NOUN –[nmod]–> PROPN (56; 79%),
VERB –[compound:lvc]–> ADJ (50; 76%),
NOUN –[cop]–> AUX (45; 83%),
ADJ –[cop]–> AUX (40; 98%).
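Agreement counts of this kind can be sketched by walking each dependency edge and comparing the Number values of parent and child. The code below is an illustration under stated assumptions, not the generator of this page: `agreement_by_relation` and the toy sentence are hypothetical, and it assumes the percentage is the share of agreeing pairs among pairs where both nodes carry the feature, since the source does not state the denominator.

```python
from collections import Counter


def agreement_by_relation(sentence, feature="Number"):
    """Count agreeing and comparable parent-child pairs per relation type.

    sentence: list of token dicts with keys id, head, upos, deprel, feats.
    Returns two Counters keyed by (parent_upos, deprel, child_upos).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree = Counter()
    comparable = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0, which is not a token
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue  # assumption: only pairs where both have the feature count
        key = (parent["upos"], child["deprel"], child["upos"])
        comparable[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return agree, comparable


# Toy sentence (hypothetical annotation, not real treebank data).
toy_sentence = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"Number": "Sing"}},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "feats": {"Number": "Sing"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Number": "Plur"}},
]
agree, comparable = agreement_by_relation(toy_sentence)
for (parent_upos, deprel, child_upos), n in comparable.items():
    share = 100 * agree[(parent_upos, deprel, child_upos)] / n
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[(parent_upos, deprel, child_upos)]}; {share:.0f}%)")
```

For a whole treebank the two Counters would be summed over all sentences before computing the percentages.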