home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: ADJ

There are 2445 ADJ lemmas (23%), 4293 ADJ types (12%) and 15057 ADJ tokens (5%). Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent ADJ lemmas: _, primeiro, novo, último, maior, grande, próximo, segundo, brasileiro, mesmo

The 10 most frequent ADJ types: maior, grande, primeiro, primeira, novo, última, segundo, mesmo, segunda, nova

The 10 most frequent ambiguous lemmas: _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1), primeiro (ADJ 388, NOUN 44, ADV 18), novo (ADJ 322, NOUN 1), último (ADJ 283, NOUN 14), maior (ADJ 243, NOUN 4), grande (ADJ 240, NOUN 2), próximo (ADJ 215, ADV 21, NOUN 2), segundo (ADJ 204, NOUN 25, ADV 2), brasileiro (ADJ 175, NOUN 64), mesmo (ADJ 168, ADV 130, NOUN 8)

The 10 most frequent ambiguous types: maior (ADJ 202, NOUN 4, PROPN 1), primeiro (ADJ 165, NOUN 21, ADV 12), primeira (ADJ 151, NOUN 17, NUM 1), novo (ADJ 117, NOUN 1, PROPN 1), última (ADJ 108, NOUN 6), segundo (ADP 122, ADJ 102, NOUN 14, CCONJ 7, ADV 1), mesmo (ADJ 101, ADV 98, PRON 14, NOUN 8, CCONJ 2, ADP 1), segunda (PROPN 106, ADJ 100, NOUN 8), próximo (ADJ 87, ADV 21, NOUN 2), melhor (ADJ 81, NOUN 18, ADV 13)

Morphology

The form / lemma ratio of ADJ is 1.755828 (the average of all parts of speech is 3.372737).

The 1st highest number of forms (323) was observed with the lemma “_”: 1, 10, 11, 119, 12, 13, 14, 15, 159, 16, 17, 18, 2, 20, 22, 25, 28, 29, 3, 30, 32, 33, 34, 35, 38, 4, 40, 41, 47, 5, 53, 56, 6, 60, 64, 7, 71, 76, 8, 80, 83, 8o, 9, 93, ABCdista, Asterixianas, Chief, II, IX, Meda, Senior, Sr., Wikipédia, administraviva, agrossilvipastoris, agrotécnicas, agrônoma, aguarda, alcóolico, alencarino, alividas, alocados, americanas.Harry, anabatista, anticíclicas, antifumo, antipetista, antitouradas, arena, assembly, autovalores, avant, aventurescas, bantas, best, blanche, blue, cabido, canado, caprichada, cartucheira, celebrativa, censo, centro, challenger, chavista, chavistas, checados, churrigueresca, cimentantes, cineastas, coalho, compact, compactuados, computadorizada, computadorizados, consorciadas, continuo, contracíclico, controvérsias, conveniadas, criminalistas, crimnosa, criptografada, cruzmaltina, cults, cyber, decretiva, defasada, degradê, densamente, desestimulada, devotadíssimo, diagonalizável, diesel, differential, ditadores, divida, dublada, dubladas, dunar, décima, décimo, e, edócrina, entrosado, específicoa, estiloso, estusiasmado, extendida, extrafusais, fair, fast, fatímidas, fibroblásticas, financeirso, finitos, flamboyant, flexionadas, fomos, free, freqüêntes, future, garde, georreferenciados, giallorossa, global., golden, gordinhas, gothic, grandense, graças, gripado, grávida, gutturata, hermano, hiperbárica, historica, historique, homem, homoafetiva, human, ideário, idioma, imunocastrado, inconstútil, indecidíveis, indenizado, infratores, inicial.A, iniciantes, inicias, interparoquial, intersecretarial, intersetorial, inuktitut, judeo, jurisdicionados, laranja, leg, legitima, libertadorística, liquido, logado, madeira, magisteriais, maliano, marauense, mato, meaningless, mediocres, merengue, merostomados, metal, micros, modulante, mulher, multicoloridos, multiplayer, multiuso, mundialmente, neuropsicomotor, new, nona, nono, offshore, oitava, oitavo, opposite, orientativo, ovalado, pale, palestrino, paradigma, paraensea, paralímpica, paranista, partial, parótida, paulina, paulino, paulinos, pinça, pinçado, planares, planejado, plantonista, plantonistas, polinesias, politica, porta, porto, poró, possivel, preconceitos, preferia, prestes, previdenciárias, previdenciário, primivito, professor, proteináceos, pseudoriemanniana, puba, publico, péssima, péssimas, quanto, quarta, quarto, quatiense, queer, quinta, quinto, quitadas, racialistas, registrada, registrado, regulamentados.Códig, reimplantada, relativamente, rena, reponsável, residencias, retangulates, retinoico, retroescavadeira, rio, saariana, safrinha, satelital, sedado, segreto, semana, semiativos, sexta, sextas, sexto, shakspeariano, shimaore, siguintes, sikh, single, skatista, socioeducativas, sucroenergético, sunitas, superurbano, supérior, sus, sétima, sétimo, taiuanesa, terceira, terceirizados, terceiro, termobáricas, terrorismo, terça, texturizado, tinteiras, toleíticos, torneios, tourística, transceptor, treineiro, tricologista, trigésimos, tumultuado, ultimo, ultimos, unica, varzeagrandense, versivos, vinda, vindas, vizinhas, vizinhos, volta, wide, wireless, áreas, ética.

The 2nd highest number of forms (6) was observed with the lemma “baixo”: baixa, baixas, baixinho, baixo, baixos, baixíssimas.

The 3rd highest number of forms (6) was observed with the lemma “europeu”: europeia, europeias, europeu, europeus, européia, européias.

ADJ occurs with 2 features: Gender (2; 0% instances), Number (2; 0% instances)

ADJ occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

ADJ occurs with 3 feature combinations. The most frequent feature combination is _ (15055 tokens). Examples: maior, grande, primeiro, primeira, novo, última, segundo, mesmo, segunda, nova

Relations

ADJ nodes are attached to their parents using 18 different relations: amod (12653; 84% instances), xcomp (1777; 12% instances), conj (439; 3% instances), nmod (130; 1% instances), root (13; 0% instances), obj (7; 0% instances), appos (6; 0% instances), dep (6; 0% instances), fixed (5; 0% instances), nsubj (4; 0% instances), advmod (3; 0% instances), case (3; 0% instances), flat (3; 0% instances), parataxis (3; 0% instances), acl (2; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances), obl (1; 0% instances)

Parents of ADJ nodes belong to 15 different parts of speech: NOUN (11815; 78% instances), VERB (1917; 13% instances), PROPN (620; 4% instances), ADJ (539; 4% instances), PRON (56; 0% instances), NUM (35; 0% instances), PART (33; 0% instances), (13; 0% instances), ADP (9; 0% instances), ADV (7; 0% instances), SYM (5; 0% instances), DET (3; 0% instances), X (3; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)

11895 (79%) ADJ nodes are leaves.

2276 (15%) ADJ nodes have one child.

652 (4%) ADJ nodes have two children.

234 (2%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 8.

Children of ADJ nodes are attached using 21 different relations: nmod (1173; 27% instances), advmod (934; 21% instances), punct (849; 19% instances), conj (505; 12% instances), cc (365; 8% instances), amod (160; 4% instances), nsubj (136; 3% instances), mark (109; 2% instances), det (57; 1% instances), case (35; 1% instances), appos (14; 0% instances), advcl (8; 0% instances), dep (8; 0% instances), csubj (4; 0% instances), cop (3; 0% instances), parataxis (3; 0% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), det:poss (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (953; 22% instances), ADV (949; 22% instances), PUNCT (849; 19% instances), ADJ (539; 12% instances), CCONJ (361; 8% instances), VERB (208; 5% instances), PROPN (140; 3% instances), ADP (134; 3% instances), PRON (79; 2% instances), DET (60; 1% instances), PART (39; 1% instances), X (23; 1% instances), SYM (21; 0% instances), NUM (13; 0% instances), AUX (3; 0% instances)