Treebank Statistics: UD_Portuguese-Bosque: Features: ExtPos
This feature is language-specific.
It occurs with 10 different values: ADJ, ADP, ADV, AUX, CCONJ, INTJ, NOUN, NUM, PROPN, SCONJ.
5941 tokens (3%) have a non-empty value of ExtPos.
2232 types (9%) occur at least once with a non-empty value of ExtPos.
2093 lemmas (12%) occur at least once with a non-empty value of ExtPos.
The feature is used with 14 part-of-speech tags: PROPN (4103; 2% instances), VERB (573; 0% instances), ADP (418; 0% instances), ADV (392; 0% instances), NOUN (272; 0% instances), X (66; 0% instances), AUX (33; 0% instances), CCONJ (19; 0% instances), DET (19; 0% instances), NUM (14; 0% instances), ADJ (12; 0% instances), PRON (11; 0% instances), SCONJ (8; 0% instances), PART (1; 0% instances).
PROPN
4103 PROPN tokens (22% of all PROPN tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which PROPN and ExtPos co-occurred: Number=Sing (4021; 98%), Gender=Masc (2932; 71%).
PROPN tokens may have the following values of ExtPos:
NOUN(26; 1% of non-emptyExtPos): Câmara, Dívida, Estados, Meio, por, Assembleia, Direcção, Ensino, Guerra, LeiPROPN(4077; 99% of non-emptyExtPos): São, José, João, Fernando, Pedro, Carlos, Manuel, Nova, Banco, Paulo
| Paradigm Câmara | NOUN | PROPN |
|---|---|---|
| Câmara | Câmara |
ExtPos seems to be lexical feature of PROPN. 100% lemmas (1793) occur only with one value of ExtPos.
VERB
573 VERB tokens (3% of all VERB tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which VERB and ExtPos co-occurred: Gender=EMPTY (568; 99%), VerbForm=Fin (469; 82%), Person=3 (450; 79%), Mood=Ind (411; 72%), Number=Sing (352; 61%).
VERB tokens may have the following values of ExtPos:
AUX(566; 99% of non-emptyExtPos): está, continua, estão, tem, vir, voltou, acabou, começou, chegou, começaCCONJ(3; 1% of non-emptyExtPos): SendoINTJ(1; 0% of non-emptyExtPos): éSCONJ(3; 1% of non-emptyExtPos): dado, Visto
| Paradigm ser | AUX | CCONJ | INTJ |
|---|---|---|---|
| Mood=Ind|Number=Sing|Person=3|Tense=Imp|VerbForm=Fin | era | ||
| Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | É | é | |
| VerbForm=Ger | Sendo |
ExtPos seems to be lexical feature of VERB. 95% lemmas (20) occur only with one value of ExtPos.
ADP
418 ADP tokens (1% of all ADP tokens) have a non-empty value of ExtPos.
ADP tokens may have the following values of ExtPos:
ADJ(5; 1% of non-emptyExtPos): a, emADP(70; 17% of non-emptyExtPos): a, em, de, por, paraADV(72; 17% of non-emptyExtPos): por, em, a, de, eis, jáCCONJ(87; 21% of non-emptyExtPos): em, por, a, deNOUN(131; 31% of non-emptyExtPos): porPROPN(2; 0% of non-emptyExtPos): Em, PorSCONJ(51; 12% of non-emptyExtPos): a, de, sem, por, com, desde, para
| Paradigm em | ADJ | ADP | ADV | CCONJ | PROPN |
|---|---|---|---|---|---|
| em | em | em | em | Em |
ADV
392 ADV tokens (5% of all ADV tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which ADV and ExtPos co-occurred: Polarity=EMPTY (391; 100%).
ADV tokens may have the following values of ExtPos:
ADJ(2; 1% of non-emptyExtPos): além, maisADP(87; 22% of non-emptyExtPos): apesar, quanto, diante, graças, Além, acerca, antesADV(106; 27% of non-emptyExtPos): cerca, hoje, mesmo, mais, ontem, dentro, devido, in, ÀCCONJ(21; 5% of non-emptyExtPos): além, ainda, apesar, NãoINTJ(1; 0% of non-emptyExtPos): NãoNOUN(5; 1% of non-emptyExtPos): bem, sem, malPROPN(2; 1% of non-emptyExtPos): Hoje, logoSCONJ(168; 43% of non-emptyExtPos): depois, apesar, antes, tal, ainda, já, além, mesmo, quanto, Agora
| Paradigm além | ADJ | ADP | CCONJ | SCONJ |
|---|---|---|---|---|
| além | Além | além | além |
NOUN
272 NOUN tokens (1% of all NOUN tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which NOUN and ExtPos co-occurred: Number=Sing (222; 82%), Gender=Masc (148; 54%).
NOUN tokens may have the following values of ExtPos:
NOUN(210; 77% of non-emptyExtPos): ponto, mercado, guerra, ser, campanha, ensino, pano, fim, luz, opiniãoPROPN(62; 23% of non-emptyExtPos): Associação, Assembléia, Comissão, Força, União, Conselho, Volta, Álcool, Agência, Assembleia
| Paradigm fundo | NOUN | PROPN |
|---|---|---|
| fundo | Fundo |
ExtPos seems to be lexical feature of NOUN. 95% lemmas (157) occur only with one value of ExtPos.
X
66 X tokens (40% of all X tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which X and ExtPos co-occurred: Gender=EMPTY (54; 82%), Number=EMPTY (52; 79%).
X tokens may have the following values of ExtPos:
ADJ(3; 5% of non-emptyExtPos): in, madeADV(1; 2% of non-emptyExtPos): onNOUN(55; 83% of non-emptyExtPos): pole, body, drag, jet, market, network, dream, best, big, blackPROPN(7; 11% of non-emptyExtPos): Adventure, Journey, So, The, Body, Insight, MacMillan
ExtPos seems to be lexical feature of X. 100% lemmas (57) occur only with one value of ExtPos.
AUX
33 AUX tokens (1% of all AUX tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which AUX and ExtPos co-occurred: Mood=EMPTY (32; 97%), Number=EMPTY (32; 97%), Person=EMPTY (32; 97%), Tense=EMPTY (32; 97%), VerbForm=EMPTY (32; 97%).
AUX tokens may have the following values of ExtPos:
AUX(1; 3% of non-emptyExtPos): temINTJ(32; 97% of non-emptyExtPos): é
CCONJ
19 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of ExtPos.
CCONJ tokens may have the following values of ExtPos:
CCONJ(18; 95% of non-emptyExtPos): ouSCONJ(1; 5% of non-emptyExtPos): Ou
| Paradigm ou | CCONJ | SCONJ |
|---|---|---|
| ou | Ou |
DET
19 DET tokens (0% of all DET tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which DET and ExtPos co-occurred: PronType=Art (15; 79%), Gender=Fem (13; 68%), Number=Sing (13; 68%), Definite=Ind (10; 53%).
DET tokens may have the following values of ExtPos:
ADP(3; 16% of non-emptyExtPos): tais, talPROPN(6; 32% of non-emptyExtPos): The, As, O, OsSCONJ(10; 53% of non-emptyExtPos): uma
NUM
14 NUM tokens (0% of all NUM tokens) have a non-empty value of ExtPos.
NUM tokens may have the following values of ExtPos:
NOUN(4; 29% of non-emptyExtPos): meia, quintaNUM(7; 50% of non-emptyExtPos): meia, Setenta, Trinta, cento, cinquentaPROPN(3; 21% of non-emptyExtPos): Mil, VIII, X
| Paradigm meia | NOUN | NUM |
|---|---|---|
| meia | meia |
ADJ
12 ADJ tokens (0% of all ADJ tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which ADJ and ExtPos co-occurred: Number=Sing (11; 92%), Gender=Masc (9; 75%).
ADJ tokens may have the following values of ExtPos:
ADV(1; 8% of non-emptyExtPos): bomNOUN(7; 58% of non-emptyExtPos): bomPROPN(4; 33% of non-emptyExtPos): Alta, Real, Social, Sózinhos
| Paradigm bom | ADV | NOUN |
|---|---|---|
| bom | bom |
PRON
11 PRON tokens (0% of all PRON tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which PRON and ExtPos co-occurred: Case=EMPTY (11; 100%), Person=EMPTY (11; 100%), Gender=Masc (10; 91%), Number=Sing (10; 91%), PronType=Dem (10; 91%).
PRON tokens may have the following values of ExtPos:
ADV(2; 18% of non-emptyExtPos): isso, nadaCCONJ(9; 82% of non-emptyExtPos): isto
SCONJ
8 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of ExtPos.
SCONJ tokens may have the following values of ExtPos:
ADP(1; 13% of non-emptyExtPos): graçasSCONJ(7; 88% of non-emptyExtPos): se, de, em, por, visto
PART
1 PART tokens (33% of all PART tokens) have a non-empty value of ExtPos.
The most frequent other feature values with which PART and ExtPos co-occurred: Gender=EMPTY (1; 100%), Number=EMPTY (1; 100%).
PART tokens may have the following values of ExtPos:
NOUN(1; 100% of non-emptyExtPos): pré
Relations with Agreement in ExtPos
The 10 most frequent relations where parent and child node agree in ExtPos:
PROPN –[conj]–> PROPN (321; 57%),
ADP –[nmod]–> ADP (8; 100%),
PROPN –[nsubj]–> PROPN (6; 67%),
PROPN –[obj]–> PROPN (4; 100%),
ADP –[conj]–> ADP (3; 100%),
PROPN –[xcomp]–> PROPN (2; 100%),
ADJ –[nmod]–> PROPN (1; 100%).