Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: X
There are 182 X lemmas (1%), 182 X types (1%) and 246 X tokens (0%).
Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
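The lemma, type and token counts and the ranks above can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes that a type is a distinct word form exactly as written, and the helper names `parse_conllu`, `tag_counts` and `rank` are hypothetical.

```python
from collections import defaultdict


def parse_conllu(text):
    """Yield (form, lemma, upos) for each syntactic word in CoNLL-U text."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Keep plain integer IDs only: skip multiword ranges (3-4) and empty nodes (3.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(text):
    """Map each UPOS tag to (number of lemmas, number of types, number of tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in parse_conllu(text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive word forms
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(counts, tag, index):
    """1-based rank of `tag` by one count (0 = lemmas, 1 = types, 2 = tokens).

    Tags with equal counts keep their first-seen order.
    """
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1
```

With the full treebank read into `text`, `tag_counts(text)["X"]` would be expected to match the figures above only if these assumptions agree with how the page was produced.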
The 10 most frequent X lemmas: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country
The 10 most frequent X types: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country
The 10 most frequent ambiguous lemmas: corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), on-line (X 3, ADJ 2, ADV 2), premium (X 2, ADJ 1), acusar (VERB 17, X 1), and (PROPN 4, X 1), de (ADP 11129, X 1), denunciar (VERB 17, X 1), drag (NOUN 2, X 1)
The 10 most frequent ambiguous types: corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), on-line (X 3, ADJ 2, ADV 2), premium (X 2, ADJ 1), and (PROPN 4, X 1), arepas (NOUN 1, X 1), blockbusters (NOUN 2, X 1), de (ADP 11029, X 1), drag (NOUN 1, X 1)
- corpus
- habeas
- in
- on-line
- premium
- and
- arepas
- blockbusters
- de
- drag
  - NOUN 1: Embora não tenha vencido em a categoria melhor programa de competição ( o troféu foi para “ The Voice “ ) , a drag RuPaul foi homenageada em quadro rápido .
  - X 1: E a própria estatueta de o Emmy - que “ deu uma entrevista “ a Colbert - foi personificada por ninguém menos de o que a drag queen RuPaul .
Morphology
The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.496159).
The 1st highest number of forms (1) was observed with the lemma “Bad”: Bad.
The 2nd highest number of forms (1) was observed with the lemma “Italy”: Italy.
The 3rd highest number of forms (1) was observed with the lemma “acusar”: acusar.
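One plausible reading of the form / lemma ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas. This is an assumption, not a documented definition. Under that reading, a ratio of 1.000000 means every X lemma occurs in exactly one form, which agrees with the three lemmas listed above. A minimal sketch, with hypothetical function names:

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct word forms by lemma from an iterable of (form, lemma) pairs."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct (lemma, form) pairs divided by distinct lemmas (assumed definition)."""
    forms = forms_per_lemma(pairs)
    total_forms = sum(len(group) for group in forms.values())
    return total_forms / len(forms)
```

Sorting the result of `forms_per_lemma` by group size would give the "highest number of forms" ranking. For X every group has size 1, so that ranking carries no real information.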
X occurs with 1 feature: Foreign (231; 94% instances)
X occurs with 1 feature-value pair: Foreign=Yes
X occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes (231 tokens).
Examples: bitcoin, car, safety, ale, bitcoins, pale, rohingyas, capita, corpus, country
Relations
X nodes are attached to their parents using 17 different relations: nmod (72; 29% instances), flat:foreign (68; 28% instances), conj (27; 11% instances), obj (25; 10% instances), nsubj (20; 8% instances), obl (11; 4% instances), appos (4; 2% instances), parataxis (4; 2% instances), root (3; 1% instances), discourse (2; 1% instances), flat (2; 1% instances), nsubj:pass (2; 1% instances), xcomp (2; 1% instances), advcl (1; 0% instances), cc (1; 0% instances), ccomp (1; 0% instances), ccomp:speech (1; 0% instances)
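The percentages in this list are each relation's share of all X tokens, rounded to a whole number, which is why a relation seen once is reported as 0%. The sketch below copies the relation counts from the line above and checks that they sum to the 246 X tokens. The function name `shares` is hypothetical, and the rounding rule is inferred from the reported figures.

```python
# Relation counts for X nodes, copied from the list above.
RELATION_COUNTS = {
    "nmod": 72, "flat:foreign": 68, "conj": 27, "obj": 25, "nsubj": 20,
    "obl": 11, "appos": 4, "parataxis": 4, "root": 3, "discourse": 2,
    "flat": 2, "nsubj:pass": 2, "xcomp": 2, "advcl": 1, "cc": 1,
    "ccomp": 1, "ccomp:speech": 1,
}


def shares(counts):
    """Return each key's share of the total as a whole-number percentage."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


TOTAL = sum(RELATION_COUNTS.values())  # every X token has exactly one parent relation
PERCENT = shares(RELATION_COUNTS)
```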
Parents of X nodes belong to 10 different parts of speech: X (89; 36% instances), NOUN (78; 32% instances), VERB (58; 24% instances), PROPN (7; 3% instances), ADJ (3; 1% instances), ADV (3; 1% instances), (3; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), PRON (1; 0% instances)
84 (34%) X nodes are leaves.
29 (12%) X nodes have one child.
50 (20%) X nodes have two children.
83 (34%) X nodes have three or more children.
The highest child degree of an X node is 7.
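The leaf, one-child, two-children and three-or-more breakdown can be derived from the HEAD column alone: a node's child degree is the number of tokens in the same sentence whose head is that node. A minimal sketch follows, assuming each sentence is given as a list of `(id, head, upos)` tuples with integer IDs. The name `degree_buckets` and this input layout are hypothetical.

```python
from collections import Counter


def degree_buckets(sentences, tag="X"):
    """Count nodes tagged `tag` by child degree, capping the bucket key at 3.

    Returns (buckets, max_degree), where buckets[3] means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        # Number of dependents per head ID within this sentence.
        children = Counter(head for _, head, _ in sentence)
        for token_id, _, upos in sentence:
            if upos != tag:
                continue
            degree = children[token_id]  # 0 for leaves
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree
```

Applied to the whole treebank, the four bucket values would be expected to sum to the number of X tokens, as the figures above do (84 + 29 + 50 + 83 = 246).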
Children of X nodes are attached using 20 different relations: punct (123; 26% instances), det (81; 17% instances), flat:foreign (68; 15% instances), case (58; 12% instances), nmod (28; 6% instances), conj (27; 6% instances), appos (22; 5% instances), cc (16; 3% instances), amod (9; 2% instances), acl:relcl (6; 1% instances), advmod (6; 1% instances), acl (5; 1% instances), cop (5; 1% instances), nsubj (3; 1% instances), nummod (3; 1% instances), flat (2; 0% instances), mark (2; 0% instances), advcl (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)
Children of X nodes belong to 16 different parts of speech: PUNCT (123; 26% instances), X (89; 19% instances), DET (82; 18% instances), ADP (59; 13% instances), NOUN (40; 9% instances), CCONJ (16; 3% instances), PROPN (14; 3% instances), VERB (12; 3% instances), ADJ (11; 2% instances), ADV (5; 1% instances), AUX (5; 1% instances), NUM (4; 1% instances), PRON (3; 1% instances), SYM (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)