Treebank Statistics: UD_Catalan-AnCora: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
261785 tokens (48%) have a non-empty value of Number.
20436 types (63%) occur at least once with a non-empty value of Number.
11481 lemmas (49%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: NOUN (89723; 16% instances), DET (87168; 16% instances), ADJ (29667; 5% instances), VERB (25263; 5% instances), AUX (20481; 4% instances), PRON (6820; 1% instances), NUM (2655; 0% instances), PROPN (8; 0% instances).
NOUN
89723 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Plur(28005; 31% of non-emptyNumber): anys, milions, pessetes, persones, obres, mesos, joves, dies, euros, empresesSing(61718; 69% of non-emptyNumber): any, president, part, terme, grup, projecte, cap, lloc, cas, portaveuEMPTY(8923): any, través, temps, juny, partir, dia, fa, tal, maig, mes
| Paradigm any | Sing | Plur |
|---|---|---|
| any | anys |
DET
87168 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (68259; 78%), Definite=Def (63482; 73%).
DET tokens may have the following values of Number:
Plur(18491; 21% of non-emptyNumber): els, les, seus, altres, aquests, seves, tots, aquestes, uns, diferentsSing(68677; 79% of non-emptyNumber): el, la, l’, un, una, aquest, seva, aquesta, seu, totEMPTY(99): meva, prou, gaire, massa, meves, cada, força, teus, teva, Que
| Paradigm el | Sing | Plur |
|---|---|---|
| Definite=Def|Foreign=Yes|Gender=Masc|PronType=Art | el | |
| Definite=Def|Gender=Masc|PronType=Art | el | els |
| Definite=Def|Gender=Fem|PronType=Art | la, L' | les |
| Definite=Def|PronType=Art | l' | |
| Definite=Ind|Gender=Fem|PronType=Art | la | |
| Definite=Ind|PronType=Art | l' | |
| Gender=Masc|PronType=Art | el | els |
| Gender=Masc|PronType=Dem | el | els |
| Gender=Fem|Person=3|Poss=Yes|PronType=Prs | les | |
| Gender=Fem|PronType=Art | la | les |
| Gender=Fem|PronType=Dem | la | les |
| PronType=Art | l' | |
| PronType=Dem | l' |
ADJ
29667 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: VerbForm=EMPTY (24078; 81%).
ADJ tokens may have the following values of Number:
Plur(9066; 31% of non-emptyNumber): grans, principals, importants, municipals, noves, nous, socials, locals, últims, culturalsSing(20601; 69% of non-emptyNumber): gran, general, passat, primer, nou, primera, actual, nova, important, socialEMPTY(415): baix, gran, clau, especial, directe, nord, pilot, límit, xàrter, sud
| Paradigm nou | Sing | Plur |
|---|---|---|
| Gender=Masc | nou | nous |
| Gender=Fem | nova | noves |
VERB
25263 VERB tokens (60% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Gender=EMPTY (18449; 73%), VerbForm=Fin (18449; 73%), Person=3 (17603; 70%), Mood=Ind (16135; 64%), Tense=Pres (13042; 52%).
VERB tokens may have the following values of Number:
Plur(5346; 21% of non-emptyNumber): tenen, fan, tenim, faran, volen, van, formen, consideren, destaquen, volemSing(19917; 79% of non-emptyNumber): té, ha, fa, fet, explicat, dit, considera, cal, farà, volEMPTY(16633): fer, dir, tenir, donar, arribar, aconseguir, veure, passar, presentar, deixar
| Paradigm fer | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | fet | |
| Gender=Fem|Tense=Past|VerbForm=Part | fetes | |
| Mood=Cnd|Person=1|VerbForm=Fin | faria | faríem |
| Mood=Cnd|Person=3|VerbForm=Fin | faria | farien |
| Mood=Imp|Person=1|VerbForm=Fin | fem | |
| Mood=Imp|Person=3|VerbForm=Fin | Faci | |
| Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | faré | farem |
| Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | feia | fèiem |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | faig | fem |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | fas | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | farà | faran |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | feia | feien |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | fa | fan |
| Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | faci | |
| Mood=Sub|Person=2|Tense=Pres|VerbForm=Fin | feu | |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | fes | fessin |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | faci | facin |
AUX
20481 AUX tokens (93% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (19830; 97%), Person=3 (19285; 94%), Mood=Ind (18755; 92%), Tense=Pres (17563; 86%).
AUX tokens may have the following values of Number:
Plur(4881; 24% of non-emptyNumber): van, han, són, estan, hem, poden, havien, seran, podran, erenSing(15600; 76% of non-emptyNumber): va, ha, és, estat, està, havia, pot, serà, era, siguiEMPTY(1573): ser, haver, poder, estar, sent, saber, anar, essent, estant, havent
| Paradigm haver | Sing | Plur |
|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part | hagut | |
| Mood=Cnd|Person=1|VerbForm=Fin | hauríem | |
| Mood=Cnd|Person=3|VerbForm=Fin | hauria | haurien |
| Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin | hauré | haurem |
| Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | havia | havíem |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | he | hem |
| Mood=Ind|Person=2|Tense=Fut|VerbForm=Fin | haureu | |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | has | heu |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | haurà | hauran |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | havia | havien |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | ha | han |
| Mood=Sub|Person=1|Tense=Imp|VerbForm=Fin | haguéssim | |
| Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | haguem, hàgim | |
| Mood=Sub|Person=3|Tense=Imp|VerbForm=Fin | hagués | haguessin, haguéssin |
| Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | hagi | hagin |
PRON
6820 PRON tokens (29% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (6820; 100%), PrepCase=EMPTY (6594; 97%), Case=EMPTY (4112; 60%), Person=EMPTY (3657; 54%).
PRON tokens may have the following values of Number:
Plur(1873; 27% of non-emptyNumber): els, ens, quals, altres, uns, ells, les, los, alguns, nosaltresSing(4947; 73% of non-emptyNumber): un, li, tot, això, ho, qual, la, el, l’, ellEMPTY(16634): que, es, s’, hi, se, on, què, qui, n’, en
| Paradigm ell | Sing | Plur |
|---|---|---|
| Case=Acc,Dat | els, los, 'ls | |
| Case=Acc|Gender=Masc | el, lo, 'l, -lo, l | |
| Case=Acc|Gender=Fem,Masc | l' | |
| Case=Acc|Gender=Fem | la, -la | les |
| Case=Acc|Gender=Neut | ho, -ho | |
| Case=Acc | els, 'ls | |
| Case=Dat | li | els |
| Gender=Masc | ell | ells |
| Gender=Fem | ella | elles |
NUM
2655 NUM tokens (27% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (2655; 100%), NumForm=Word (2653; 100%), Gender=EMPTY (1345; 51%).
NUM tokens may have the following values of Number:
Plur(2219; 84% of non-emptyNumber): dos, tres, dues, quatre, cinc, sis, set, vuit, deu, nouSing(436; 16% of non-emptyNumber): un, una, mig, mitja, doble, quart, triple, desena, X, cinquenaEMPTY(7307): cent, 10, 15, 30, 5, 20, 2, 12, 4, 50
| Paradigm dos | Sing | Plur |
|---|---|---|
| Gender=Masc | dos | dos |
| Gender=Fem | dues | |
| dos |
Number seems to be lexical feature of NUM. 95% lemmas (62) occur only with one value of Number.
PROPN
8 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Sing(8; 100% of non-emptyNumber): Seu, Cobain, Companyia, Font, Justícia, Kurt, PlaEMPTY(46582): Catalunya, Barcelona, Generalitat, Govern, sant, Ajuntament, Girona, Josep, CiU, PP
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (66577; 96%),
NOUN –[amod]–> ADJ (22871; 97%),
NOUN –[nmod]–> NOUN (13623; 54%),
VERB –[nsubj]–> NOUN (8231; 66%),
NOUN –[conj]–> NOUN (4055; 78%),
NOUN –[acl]–> VERB (3985; 52%),
ADJ –[cop]–> AUX (1672; 86%),
VERB –[conj]–> VERB (1589; 68%),
NOUN –[cop]–> AUX (1486; 70%),
NOUN –[appos]–> NOUN (1373; 66%).