Treebank Statistics: UD_Italian-KIParlaForest: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
9580 tokens (51%) have a non-empty value of Number.
2083 types (72%) occur at least once with a non-empty value of Number.
1450 lemmas (69%) occur at least once with a non-empty value of Number.
The feature is used with 12 part-of-speech tags: NOUN (2328; 12% of instances), DET (1990; 11% of instances), VERB (1900; 10% of instances), PRON (1339; 7% of instances), AUX (1012; 5% of instances), ADJ (836; 4% of instances), PROPN (107; 1% of instances), NUM (33; 0% of instances), ADV (30; 0% of instances), ADP (2; 0% of instances), CCONJ (2; 0% of instances), SCONJ (1; 0% of instances).
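Counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentences in `ROWS` are invented for illustration, and the helper names `parse_feats`, `read_tokens` and `number_coverage` are hypothetical.

```python
from collections import Counter

# Hypothetical CoNLL-U sample (two invented sentences), for illustration only.
ROWS = [
    "1\tle\til\tDET\t_\tDefinite=Def|Gender=Fem|Number=Plur|PronType=Art\t2\tdet\t_\t_",
    "2\tlingue\tlingua\tNOUN\t_\tGender=Fem|Number=Plur\t3\tnsubj\t_\t_",
    "3\tsono\tessere\tVERB\t_\tMood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_",
    "4\tqui\tqui\tADV\t_\t_\t3\tadvmod\t_\t_",
    "",
    "1\tla\til\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tcittà\tcittà\tNOUN\t_\tGender=Fem\t4\tnsubj\t_\t_",
    "3\tè\tessere\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_",
    "4\tgrande\tgrande\tADJ\t_\tNumber=Sing\t0\troot\t_\t_",
]
SAMPLE = "\n".join(ROWS)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Plur' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def number_coverage(conllu_text):
    """Map each UPOS tag to (tokens with a Number value, all tokens)."""
    total = Counter()
    with_number = Counter()
    for _form, _lemma, upos, feats in read_tokens(conllu_text):
        total[upos] += 1
        if "Number" in feats:
            with_number[upos] += 1
    return {upos: (with_number[upos], total[upos]) for upos in total}


if __name__ == "__main__":
    for upos, (hits, n) in sorted(number_coverage(SAMPLE).items()):
        print(f"{upos}: {hits} of {n} tokens have a non-empty Number")
```

To run it on real data, the text of a treebank file would be passed to `number_coverage` in place of `SAMPLE`.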
NOUN
2328 NOUN tokens (87% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (1252; 54%).
NOUN tokens may have the following values of Number:
Plur (601; 26% of non-empty Number): anni, dialetti, cose, persone, iscrizioni, segni, lingue, sassi, bambini, arabi
Sing (1727; 74% of non-empty Number): tipo, casa, lingua, cosa, arabo, parte, centro, alfabeto, sacco, senso
EMPTY (340): città, po’, realtà, università, nord, sud, serie, brokering, luban, tesi
| Paradigm lingua | Sing | Plur |
|---|---|---|
| Gender=Masc | lingue | |
| Gender=Fem | lingua | lingue |
DET
1990 DET tokens (91% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (1725; 87%), Definite=Def (1334; 67%).
DET tokens may have the following values of Number:
Plur (477; 24% of non-empty Number): i, le, gli, queste, questi, dei, delle, tutti, tutte, tante
Sing (1513; 76% of non-empty Number): la, il, un, l’, una, questa, questo, un’, lo, mia
EMPTY (204): che, loro, tutto, tutti, il, alcuni, la, tutta, prima, qualche
| Paradigm il | Sing | Plur |
|---|---|---|
| Definite=Def\|Gender=Masc\|PronType=Art | il, lo, l | i, gli, il |
| Definite=Def\|Gender=Fem\|PronType=Art | la, le | le, lo |
| Definite=Def\|PronType=Art | l' | |
| Gender=Masc\|Person=3\|PronType=Prs | lo, l' | |
| Gender=Fem\|Person=3\|PronType=Prs | la | |
| Gender=Fem\|PronType=Art | la | i |
VERB
1900 VERB tokens (80% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Gender=EMPTY (1516; 80%), VerbForm=Fin (1508; 79%), Mood=Ind (1336; 70%), Tense=Pres (1269; 67%).
VERB tokens may have the following values of Number:
Plur (504; 27% of non-empty Number): sono, abbiamo, diciamo, erano, hanno, avete, scrivevano, avevano, stanno, dicono
Sing (1396; 73% of non-empty Number): è, so, detto, fa, era, fatto, ha, va, dice, ho
EMPTY (484): fare, dire, far, scrivere, andare, essere, vedere, parlare, trovare, anda’
| Paradigm essere | Sing | Plur |
|---|---|---|
| Gender=Masc\|Tense=Past\|VerbForm=Part | stato | stati |
| Gender=Fem\|Tense=Past\|VerbForm=Part | stata | |
| Mood=Cnd\|Person=3\|Tense=Pres\|VerbForm=Fin | sarebbe | |
| Mood=Ind\|Person=1\|Tense=Imp\|VerbForm=Fin | ero | |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | sono, so | siamo |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | sarà | |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | era | erano |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | è | sono, son, furono |
| Mood=Sub\|Person=3\|Tense=Imp\|VerbForm=Fin | fosse | fossero |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | sia | siano |
PRON
1339 PRON tokens (71% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (965; 72%), Gender=EMPTY (702; 52%).
PRON tokens may have the following values of Number:
Plur (397; 30% of non-empty Number): c’, ci, noi, tutti, li, loro, questi, ce, voi, vi
Sing (942; 70% of non-empty Number): lo, io, mi, me, quello, l’, questo, ti, lei, questa
EMPTY (543): si, che, c’, cui, ne, ci, cosa, niente, chi, le
| Paradigm ci | Sing | Plur |
|---|---|---|
| Person=1 | c', ci, ce | |
| Person=2 | ci | |
AUX
1012 AUX tokens (97% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (977; 97%), Mood=Ind (903; 89%), Tense=Pres (847; 84%), Person=3 (632; 62%).
AUX tokens may have the following values of Number:
Plur (248; 25% of non-empty Number): sono, hanno, possiamo, abbiamo, potete, son, siamo, dobbiamo, avevano, stiamo
Sing (764; 75% of non-empty Number): è, ho, ha, era, devi, sono, avevo, posso, stata, hai
EMPTY (32): essere, esse, essendo, son, avendo, aver, eran, eravam, esser, fare
| Paradigm essere | Sing | Plur |
|---|---|---|
| _ | eran | |
| Gender=Masc | ero | |
| Gender=Masc\|Tense=Past\|VerbForm=Part | stato | stati |
| Gender=Fem | esser | |
| Gender=Fem\|Tense=Past\|VerbForm=Part | stata | state |
| Mood=Cnd\|Person=3\|Tense=Pres\|VerbForm=Fin | sarebbe | sarebbero |
| Mood=Imp\|Person=1\|Tense=Pres\|VerbForm=Fin | stiamo | |
| Mood=Ind\|Person=1\|Tense=Fut\|VerbForm=Fin | saremo | |
| Mood=Ind\|Person=1\|Tense=Imp\|VerbForm=Fin | ero | eravamo |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | sono, son, sto, so | siamo, stiamo |
| Mood=Ind\|Person=2\|Tense=Pres\|VerbForm=Fin | sei | siete |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | sarà | |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | era | erano |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | fu | |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | è, é | sono, son, stanno |
| Mood=Sub\|Person=3\|Tense=Imp\|VerbForm=Fin | fosse | |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | sia | siano |
ADJ
836 ADJ tokens (88% of all ADJ tokens) have a non-empty value of Number.
ADJ tokens may have the following values of Number:
Plur (191; 23% of non-empty Number): miei, udenti, grande, indipendenti, arabi, disabili, fertili, importanti, pari, ricchi
Sing (645; 77% of non-empty Number): grande, difficile, esatto, araba, mia, arabo, piccola, comune, prima, proto
EMPTY (111): stessa, standard, altra, certo, po’, poco, pre, tris, altre, mezz’
| Paradigm arabo | Sing | Plur |
|---|---|---|
| Gender=Masc | arabo | arabi |
| Gender=Fem | araba | |
PROPN
107 PROPN tokens (25% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Fem (59; 55%).
PROPN tokens may have the following values of Number:
Plur (23; 21% of non-empty Number): rossi, verdi, nabatei, sinai, gialli, arancioni, gerusalemme, vagoni
Sing (84; 79% of non-empty Number): arabia, siria, giordania, saudita, yemen, erodoto, saba, turchia, egitto, fermo
EMPTY (317): [TOWN_NAME], ancona, bologna, pesaro, cristo, [PLACE_NAME], fermo, gialli, imola, marche
| Paradigm Arancioni | Sing | Plur |
|---|---|---|
| | arancioni | arancioni |
Number seems to be a lexical feature of PROPN: 98% of lemmas (50) occur with only one value of Number.
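The share of lemmas that occur with a single value of Number can be approximated as sketched below. The rule this page uses to decide that a feature "seems to be lexical" is not stated here, so the code only computes the share. The function name `single_value_share` and the sample observations are hypothetical.

```python
from collections import defaultdict


def single_value_share(observations):
    """Return (lemmas with exactly one Number value, all lemmas).

    `observations` is an iterable of (lemma, number_value) pairs, taken
    from tokens of one part of speech that carry a non-empty Number.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    n_single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return n_single, len(values_by_lemma)


# Hypothetical observations, invented for illustration only.
OBSERVATIONS = [
    ("Arabia", "Sing"),
    ("Siria", "Sing"),
    ("Siria", "Sing"),
    ("Nabatei", "Plur"),
    ("Arancioni", "Sing"),
    ("Arancioni", "Plur"),
]

if __name__ == "__main__":
    single, total = single_value_share(OBSERVATIONS)
    print(f"{single} of {total} lemmas occur with only one value of Number")
```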
NUM
33 NUM tokens (19% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: Gender=Masc (23; 70%), NumType=Ord (19; 58%).
NUM tokens may have the following values of Number:
Plur (8; 24% of non-empty Number): primi, sedici, seicentodieci
Sing (25; 76% of non-empty Number): prima, primo, seicento, trecentoventotto, duecento, duemiladiciotto, milleseicento, ottocento, seconda, secondo
EMPTY (139): due, quattro, tre, cinque, quattordici, sette, dieci, mille, undici, cinquanta
| Paradigm primo | Sing | Plur |
|---|---|---|
| Gender=Masc | primo | primi |
| Gender=Fem | prima | |
Number seems to be a lexical feature of NUM: 92% of lemmas (11) occur with only one value of Number.
ADV
30 ADV tokens (1% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: PronType=EMPTY (26; 87%).
ADV tokens may have the following values of Number:
Plur (4; 13% of non-empty Number): molte, quali, quanti, tutte
Sing (26; 87% of non-empty Number): quanto, cosa, etcetera, giusto, almeno, bene, esatto, fa, fino, invece
EMPTY (2182): non, sì, no, anche, più, poi, molto, così, bene, adesso
| Paradigm quanto | Sing | Plur |
|---|---|---|
| | quanto | |
| PronType=Int | | quanti |
ADP
2 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Sing (2; 100% of non-empty Number): a, in
EMPTY (1901): di, in, a, per, da, con, su, come, secondo, tra
CCONJ
2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Number.
CCONJ tokens may have the following values of Number:
Plur (2; 100% of non-empty Number): oppure
EMPTY (1000): e, cioè, ma, quindi, però, o, comunque, sia, che, infatti
SCONJ
1 SCONJ token (0% of all SCONJ tokens) has a non-empty value of Number.
The most frequent other feature values with which SCONJ and Number co-occurred: PronType=Rel (1; 100%).
SCONJ tokens may have the following values of Number:
Sing (1; 100% of non-empty Number): che
EMPTY (697): che, se, perché, quando, come, mentre, siccome, com’, finché, ovunque
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (1412; 83%),
NOUN –[amod]–> ADJ (399; 82%),
VERB –[nsubj]–> PRON (229; 68%),
VERB –[nsubj]–> NOUN (207; 76%),
VERB –[conj]–> VERB (140; 63%),
ADJ –[cop]–> AUX (138; 89%),
VERB –[iobj]–> PRON (119; 62%),
NOUN –[cop]–> AUX (102; 76%),
NOUN –[acl:relcl]–> VERB (93; 62%),
NOUN –[conj]–> NOUN (47; 60%).
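Agreement counts of this kind can be sketched as follows. This is an illustration under stated assumptions, not the script behind this page: the percentage is assumed to be taken over the edges where both parent and child carry a Number value, which the page does not confirm, and the function names and sample sentences are hypothetical.

```python
from collections import Counter


def parse_number(feats):
    """Return the Number value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Number="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(sentences):
    """Count Number agreement per (parent UPOS, deprel, child UPOS).

    Each sentence is a list of (id, upos, feats, head, deprel) tuples.
    Returns (agree, comparable), where `comparable` counts the edges on
    which both nodes have a Number value (assumed denominator).
    """
    agree = Counter()
    comparable = Counter()
    for sentence in sentences:
        by_id = {token[0]: token for token in sentence}
        for _tid, upos, feats, head, deprel in sentence:
            if head == 0:
                continue  # the root has no parent to agree with
            parent = by_id[head]
            child_number = parse_number(feats)
            parent_number = parse_number(parent[2])
            if child_number is None or parent_number is None:
                continue
            key = (parent[1], deprel, upos)
            comparable[key] += 1
            if child_number == parent_number:
                agree[key] += 1
    return agree, comparable


# Hypothetical sentences, invented for illustration only.
SENTENCES = [
    [
        (1, "DET", "Number=Plur|PronType=Art", 2, "det"),
        (2, "NOUN", "Gender=Fem|Number=Plur", 3, "nsubj"),
        (3, "VERB", "Number=Plur|VerbForm=Fin", 0, "root"),
    ],
    [
        (1, "DET", "Number=Sing|PronType=Art", 2, "det"),
        (2, "NOUN", "Number=Plur", 0, "root"),
    ],
]

if __name__ == "__main__":
    agree, comparable = agreement_counts(SENTENCES)
    for key, n in comparable.most_common():
        parent_upos, deprel, child_upos = key
        share = 100 * agree[key] / n
        print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {share:.0f}%)")
```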