home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: X

There are 4 X lemmas (0%), 160 X types (0%) and 531 X tokens (0%). Out of 16 observed tags, the rank of X is: 12 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: _, dele, freedesktop.org, ~/Desktop

The 10 most frequent X types: disso, dele, deles, delas, do, dela, +, etc, @, comigo

The 10 most frequent ambiguous lemmas: _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1)

The 10 most frequent ambiguous types: disso (X 67, ADP 2), dele (X 49, PRON 2, ADP 1), do (X 21, CCONJ 1), + (X 13, PUNCT 6, PROPN 1), e (CCONJ 6349, ADJ 14, X 9, ADP 2, AUX 2, DET 1, VERB 1), no (X 8, PRON 2, NOUN 1), pelo (X 8, NOUN 1), x (PUNCT 16, X 6, NOUN 1), desses (ADP 16, X 4), da (X 4, PROPN 1)

Morphology

The form / lemma ratio of X is 40.000000 (the average of all parts of speech is 3.372737).

The 1st highest number of forms (158) was observed with the lemma “_”: #, &, +, 1940.http, ?, @, Amazon.com, Destes, Flyscoot.com, GameSpot.com, Neles, OBS., PiratenOnline, UltimoInstante, ], a, a_0, a_1, a_2, a_3, a_i, a_n, amaralcarvalho.org.br, ao, aos, art, atributos.Alucard, b, caput, cdots, comigo, conosco, consigo, contato@cinedireitoshumanos.org.br, contigo, cpae@unesc.net, d, da, daquelas, daquele, daqueles, das, dela, delas, dele, deles, denunciapropaganda@tre-rj.jus.br, dessa, dessas, desse, desses, desta, destas, deste, disso, disto, do, dos, durvalorlato, dx, e, eletrônicowww.cespe.unb.br/concursos/pc_al_12, etc, ex, fake, frac, g1.globo.com/economia, g1.globo.com/ma, g1.globo.com/para, g1.globo.com/piaui, g1.globo.com/politica, g1.globo.com/ribeirao, g1.globo.com/vanguarda, gmail.com, http://m.goal.com, http://t.co/HmrlNAqd, http://www.cmgww.com/stars/baker/about/biography.html, http://www.portal-gestao.com/financas/folhas-de-calculo.html, i, jabisbusqueti, k, m, n, na, naqueles, naquilo, nela, nelas, nele, nessa, nesta, neste, nisso, no, num, o, offs, os, ouvidoria@imepi.pi.gov.br, p, p.e., pelo, planeta1@sercomtel.com.br, play, por, poupatemposp, precisava.Seu, prev, que, r., simone.bavaroski, sum_, t, uol.com, up, usopera.com, v1, v2, vm, www.anac.gov.br., www.barracaodosamba.com, www.centropaulasouza.sp.gov.br, www.cotec.unimontes.br, www.detran.rj.gov.br, www.edraaeronautica.com.br, www.goobec.com.br, www.informalcool.org.br, www.ingresso.com, www.ipem.rj.gov.br., www.planetaeducacao.com.br, www.planexcon.com.br, www.receita.fazenda.gov.br, www.revitechpisos.com.br, www.saocaetanodosul.sp.gov.br, www.sc.gov.br/portalturismo/Default.asp?CodMunicipio=386&Pag=2, www.submarino.com.br, www.timedoemprego.sp.gov.br, www.universa.org.br, www.valeviagemcvc.com.br, www.vestibulinhoetec.com.br, x, y, z, à, àquela, àqueles, às, λ1, λ2, λm, الاذكار, مشرق, →, ☎, 天台, 日, 禅, 莲.

The 2nd highest number of forms (1) was observed with the lemma “dele”: deles.

The 3rd highest number of forms (1) was observed with the lemma “freedesktop.org”: freedesktop.org.

X does not occur with any features.

Relations

X nodes are attached to their parents using 20 different relations: nmod (264; 50% instances), det:poss (64; 12% instances), appos (41; 8% instances), flat (36; 7% instances), conj (32; 6% instances), fixed (26; 5% instances), case (11; 2% instances), dep (11; 2% instances), parataxis (10; 2% instances), cc (6; 1% instances), obj (6; 1% instances), advmod (5; 1% instances), root (5; 1% instances), nsubj (4; 1% instances), amod (3; 1% instances), iobj (2; 0% instances), mark (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: NOUN (182; 34% instances), VERB (128; 24% instances), ADV (64; 12% instances), X (49; 9% instances), PRON (38; 7% instances), PROPN (24; 5% instances), ADJ (23; 4% instances), NUM (9; 2% instances), (5; 1% instances), ADP (4; 1% instances), SYM (4; 1% instances), CCONJ (1; 0% instances)

341 (64%) X nodes are leaves.

118 (22%) X nodes have one child.

38 (7%) X nodes have two children.

34 (6%) X nodes have three or more children.

The highest child degree of a X node is 38.

Children of X nodes are attached using 21 different relations: punct (130; 36% instances), acl:relcl (60; 17% instances), flat (43; 12% instances), case (32; 9% instances), nmod (19; 5% instances), conj (14; 4% instances), det (13; 4% instances), cc (7; 2% instances), acl (6; 2% instances), advmod (5; 1% instances), appos (5; 1% instances), nsubj (5; 1% instances), cop (4; 1% instances), amod (3; 1% instances), dep (3; 1% instances), nummod (3; 1% instances), det:poss (1; 0% instances), mark (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (130; 36% instances), VERB (67; 19% instances), X (49; 14% instances), ADP (25; 7% instances), NOUN (25; 7% instances), PROPN (16; 4% instances), DET (15; 4% instances), ADV (10; 3% instances), CCONJ (7; 2% instances), NUM (6; 2% instances), ADJ (3; 1% instances), AUX (3; 1% instances), PRON (1; 0% instances)