Treebank Statistics: UD_Pashto-Sikaram: Features: Number
This feature is universal.
It occurs with 4 different values: Coll, Plur, Ptan, Sing.
1970 tokens (48%) have a non-empty value of Number.
882 types (82%) occur at least once with a non-empty value of Number.
691 lemmas (83%) occur at least once with a non-empty value of Number.
The feature is used with 7 part-of-speech tags (token count; share of all tokens in the treebank): NOUN (870; 21%), ADJ (434; 11%), VERB (241; 6%), AUX (177; 4%), PROPN (147; 4%), DET (65; 2%), PRON (36; 1%).
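The per-tag counts above could be reproduced from the treebank's CoNLL-U files roughly as follows. This is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `number_by_upos` and the toy sample with placeholder forms (`w1`, `w2`) are assumptions for illustration. Only the column layout (UPOS in the fourth column, FEATS in the sixth) comes from the CoNLL-U format itself.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))

def number_by_upos(conllu_text):
    """Count tokens carrying a Number value, grouped by UPOS tag."""
    counts = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separators and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Number" in parse_feats(cols[5]):
            counts[cols[3]] += 1
    return counts, total

# Hypothetical toy input; these rows are NOT taken from UD_Pashto-Sikaram.
SAMPLE = "\n".join([
    "# text = toy sentence",
    "1\tw1\tw1\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tnsubj\t_\t_",
    "2\tw2\tw2\tAUX\t_\tNumber=Sing|Person=3\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

counts, total = number_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos} ({n}; {round(100 * n / total)}% of all tokens)")
```

Dividing each count by the total number of tokens, rather than by the number of tokens that have Number, matches the percentages listed above, which sum to the 48% reported for the feature as a whole.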
NOUN
870 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (490; 56%), Case=Nom (477; 55%).
NOUN tokens may have the following values of Number:
Coll (21; 2% of non-empty Number): خلکو, خلک, سلنه, شتو, مواد
Plur (234; 27% of non-empty Number): ژبو, کتابونه, کتابونو, کسان, خبرې, هېوادونو, ارزښتونه, ارزښتونو, ملتونو, ژبې
Ptan (1; 0% of non-empty Number): معلومات
Sing (614; 71% of non-empty Number): ژبه, ژباړې, ژبې, ژباړه, کار, توګه, خوا, مانا, ډول, برخه
| Paradigm ژبه | Sing | Plur |
|---|---|---|
| Case=Acc | ژبې | ژبو |
| Case=Loc | ژبه, ژبې | ژبو |
| Case=Nom | ژبه | ژبې |
Number seems to be a lexical feature of NOUN: 91% of lemmas (350) occur with only one value of Number.
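The "lexical feature" heuristic can be illustrated with a short sketch that groups the observed Number values by lemma and counts the lemmas that only ever show one value. The function name `single_value_share` and the placeholder lemmas are assumptions for illustration, and the page does not state which threshold it uses to call a feature lexical.

```python
from collections import defaultdict

def single_value_share(pairs):
    """pairs: iterable of (lemma, number_value) for tokens of one UPOS tag.

    Returns (lemmas seen with exactly one Number value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, number in pairs:
        values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical observations; the lemmas are placeholders, not treebank data.
observations = [
    ("lemma_a", "Sing"), ("lemma_a", "Plur"),
    ("lemma_b", "Sing"), ("lemma_b", "Sing"),
    ("lemma_c", "Plur"),
]

single, total = single_value_share(observations)
print(f"{round(100 * single / total)}% of lemmas ({single}) occur with only one value")
```

A high share can also reflect a small corpus in which most lemmas occur only once, so the figure is suggestive rather than conclusive.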
ADJ
434 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Case=Nom (301; 69%), Gender=Masc (262; 60%).
ADJ tokens may have the following values of Number:
Plur (164; 38% of non-empty Number): نورو, زيات, ټولنیزو, پوهنیزو, زياتې, نور, اسلامي, نورې, اکثره, جوړ
Sing (270; 62% of non-empty Number): ښه, زده, سمه, ټولنیز, ناسمه, اصلي, اړ, اړوند, بل, بله
EMPTY (1): خپور
| Paradigm نور | Sing | Plur |
|---|---|---|
| Case=Acc\|Gender=Masc | نورو | |
| Case=Acc\|Gender=Fem | نورو | |
| Case=Loc\|Gender=Masc | نورو | |
| Case=Loc\|Gender=Fem | نورو | |
| Case=Nom\|Gender=Masc | نور | نور |
| Case=Nom\|Gender=Fem | نورې |
VERB
241 VERB tokens (74% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (205; 85%), Case=EMPTY (203; 84%), Person=3 (191; 79%), Gender=EMPTY (163; 68%), Mood=Ind (148; 61%).
VERB tokens may have the following values of Number:
Plur (94; 39% of non-empty Number): کوي, وايي, ورکوي, شوي, کړي, شي, وڅېړو, کوو, کېږي, شویو
Sing (147; 61% of non-empty Number): لري, کړي, شي, کېږي, کړی, راځي, شو, شوی, کړه, وواهه
EMPTY (86): ژباړل, کولای, شته, کارول, اخیستلای, لیکل, ګڼل, رااخیستل, راژباړل, لیدل
| Paradigm کول | Sing | Plur |
|---|---|---|
| Aspect=Imp\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | کاوه | |
| Aspect=Imp\|Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | کوم | کوو |
| Aspect=Imp\|Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | کوي | کوي |
| Aspect=Perf\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|Variant=Long\|VerbForm=Fin | وکړ | |
| Aspect=Perf\|Gender=Masc\|Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | کړ | کړل |
| Aspect=Perf\|Gender=Fem\|Mood=Ind\|Person=3\|Tense=Past\|Variant=Short\|VerbForm=Fin | کړه, وکړه | |
| Aspect=Perf\|Mood=Imp\|Person=2\|Tense=Pres\|VerbForm=Fin | کړه | |
| Case=Nom\|Gender=Masc\|Tense=Past\|VerbForm=Part | کړی | کړي |
| Case=Nom\|Gender=Fem\|Tense=Past\|VerbForm=Part | کړې | |
| Mood=Sub\|Person=1\|Tense=Pres\|VerbForm=Fin | کړو | |
| Mood=Sub\|Person=1\|VerbForm=Fin | کړو | |
| Mood=Sub\|Person=3\|Variant=Long\|VerbForm=Fin | وکړي | وکړي |
| Mood=Sub\|Person=3\|VerbForm=Fin | کړي | کړي |
AUX
177 AUX tokens (82% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (163; 92%), Person=3 (158; 89%), Aspect=EMPTY (155; 88%), Mood=Ind (109; 62%), Tense=Pres (97; 55%), Gender=EMPTY (91; 51%).
AUX tokens may have the following values of Number:
Plur (50; 28% of non-empty Number): دي, کېږي, شي, شوي, وي, شو, وو, کېدل
Sing (127; 72% of non-empty Number): ده, شي, وي, دی, کېږي, شوی, دﺉ, و, وه, شې
EMPTY (39): به, ونه, شوای, وای, ونۀ, وکولای, کولای, کېدای
| Paradigm ول | Sing | Plur |
|---|---|---|
| Gender=Masc\|Mood=Ind\|Tense=Past | و | وو |
| Gender=Masc\|Mood=Ind\|Tense=Pres | دی, دﺉ | |
| Gender=Fem\|Mood=Ind\|Tense=Past | وه | |
| Gender=Fem\|Mood=Ind\|Tense=Pres | ده | دي |
| Mood=Ind\|Tense=Pres | دي | |
| Mood=Sub | وي | وي |
PROPN
147 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Fem (79; 54%).
PROPN tokens may have the following values of Number:
Plur (16; 11% of non-empty Number): پښتانه, پښتنو, ايرانیان, ايرانیانو, فرانسويانو, پنجابیانو
Sing (131; 89% of non-empty Number): پښتو, اردو, پاړسي, احمد, وحید, پیتر, پنج, عربي, مریم, کتاب
Number seems to be a lexical feature of PROPN: 100% of lemmas (53) occur with only one value of Number.
DET
65 DET tokens (41% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Variant=EMPTY (55; 85%), Deixis=EMPTY (46; 71%), Poss=EMPTY (43; 66%), Reflex=EMPTY (43; 66%).
DET tokens may have the following values of Number:
Plur (16; 25% of non-empty Number): ټول, خپل, خپلو, ټولې, کومې, دغو, دې, همدغو, ځینو, ځینې
Sing (49; 75% of non-empty Number): خپل, دې, خپله, کوم, ټوله, کومه, هره, هغه, دغه, هر
EMPTY (93): دغه, هغه, داسې, څو, دې, هر, همدغه, ځینې, دغسې, هرې
| Paradigm خپل | Sing | Plur |
|---|---|---|
| Case=Acc\|Gender=Masc | خپل | خپلو |
| Case=Loc\|Gender=Masc | خپل | |
| Case=Loc\|Gender=Fem | خپله | |
| Case=Nom\|Gender=Masc | خپل | خپل |
| Case=Nom\|Gender=Fem | خپله |
PRON
36 PRON tokens (18% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Poss=EMPTY (36; 100%), Variant=EMPTY (36; 100%), PronType=Prs (29; 81%).
PRON tokens may have the following values of Number:
Plur (15; 42% of non-empty Number): دوی, هغوی, زموږ, موږ, ټولو
Sing (21; 58% of non-empty Number): ځان, دې, هغۀ, هغې, دا, ستا, ما, زما, څوک
EMPTY (160): يې, چې, ور, دا, دې, یې, همدا, څه, چا, هرڅه
Number seems to be a lexical feature of PRON: 100% of lemmas (10) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (246; 97%),
NOUN –[nmod]–> NOUN (87; 60%),
VERB –[obj]–> NOUN (79; 59%),
VERB –[nsubj]–> NOUN (77; 67%),
NOUN –[conj]–> NOUN (65; 79%),
VERB –[compound:lvc]–> ADJ (44; 81%),
NOUN –[cop]–> AUX (39; 83%),
VERB –[conj]–> VERB (37; 58%),
NOUN –[nmod]–> PROPN (35; 81%),
ADJ –[cop]–> AUX (31; 97%).
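
The agreement counts listed above could be gathered with a sketch like the one below, which walks each dependency edge and compares the Number values of parent and child. The token layout (dicts with `id`, `head`, `upos`, `deprel`, `number`) and the function name `number_agreement` are assumptions for illustration. The sketch also assumes that the percentage is taken over edges where both nodes carry a Number value, which this page does not state.

```python
from collections import Counter

def number_agreement(sentence):
    """Tally Number agreement per (parent UPOS, relation, child UPOS).

    sentence: list of token dicts; 'number' is None when the feature is empty.
    Returns (agree, both): edges that agree, and edges where both nodes
    have a Number value.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0 and no parent token
            continue
        if parent["number"] is None or child["number"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["number"] == child["number"]:
            agree[key] += 1
    return agree, both

# Hypothetical toy sentence; NOT taken from the treebank.
toy = [
    {"id": 1, "head": 3, "upos": "NOUN", "deprel": "nsubj", "number": "Sing"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "number": "Sing"},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "number": "Plur"},
    {"id": 4, "head": 3, "upos": "NOUN", "deprel": "obj", "number": "Plur"},
]

agree, both = number_agreement(toy)
for (p, rel, c), n in agree.most_common():
    print(f"{p} -[{rel}]-> {c} ({n}; {round(100 * n / both[(p, rel, c)])}%)")
```

Summing the two counters over all sentences of the treebank would give one total per relation triple, from which a ranking like the one above can be sorted.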