Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: X
There are 250 X lemmas (2%), 250 X types (1%) and 398 X tokens (0%).
Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
The 10 most frequent X lemmas: internet, e-mail, bitcoin, fintechs, deficit, car, chef, on-line, safety, ale
The 10 most frequent X types: internet, e-mail, bitcoin, fintechs, deficit, car, chef, on-line, safety, ale
The 10 most frequent ambiguous lemmas: on-line (X 5, ADV 2), corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), acusar (VERB 17, X 1), and (PROPN 4, X 1), de (ADP 11129, X 1), denunciar (VERB 17, X 1), e (CCONJ 3051, X 1), re (NOUN 1, X 1)
The 10 most frequent ambiguous types: on-line (X 5, ADV 2), corpus (X 3, NOUN 1), habeas (X 3, NOUN 1), in (PROPN 18, X 3), and (PROPN 4, X 1), arepas (NOUN 1, X 1), de (ADP 11029, X 1), e (CCONJ 2915, X 1), media (VERB 1, X 1), re (NOUN 1, X 1)
- on-line
- corpus
- habeas
- in
- and
- arepas
- de
- e
- media
- re
- NOUN 1: Uma é o conceito “ re “ , que já vinha de o “ Refazenda “ [ 1975 ] .
- X 1: Mas , se você quiser ( re ) ler lo , ou se acaba de concluir que um século traz com si “ distanciamento “ suficiente para entender melhor o que se passou em aqueles dez dias que abalaram o mundo , então deve saber que uma excelente introdução a o tema acaba de chegar a as livrarias .
Morphology
The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.491519).
The 1st highest number of forms (1) was observed with the lemma “Bad”: Bad.
The 2nd highest number of forms (1) was observed with the lemma “Italy”: Italy.
The 3rd highest number of forms (1) was observed with the lemma “Publisher”: Publisher.
X occurs with 1 features: Foreign (383; 96% instances)
X occurs with 1 feature-value pairs: Foreign=Yes
X occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes (383 tokens).
Examples: internet, e-mail, bitcoin, fintechs, deficit, car, chef, on-line, safety, ale
Relations
X nodes are attached to their parents using 18 different relations: nmod (124; 31% instances), flat:foreign (70; 18% instances), obj (49; 12% instances), nsubj (44; 11% instances), obl (37; 9% instances), conj (36; 9% instances), amod (8; 2% instances), appos (6; 2% instances), root (6; 2% instances), parataxis (5; 1% instances), nsubj:pass (4; 1% instances), discourse (2; 1% instances), xcomp (2; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), cc (1; 0% instances), ccomp (1; 0% instances), ccomp:speech (1; 0% instances)
Parents of X nodes belong to 11 different parts of speech: NOUN (148; 37% instances), VERB (123; 31% instances), X (89; 22% instances), PROPN (9; 2% instances), ADJ (8; 2% instances), ADV (6; 2% instances), (6; 2% instances), PRON (4; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), NUM (1; 0% instances)
99 (25%) X nodes are leaves.
78 (20%) X nodes have one child.
105 (26%) X nodes have two children.
116 (29%) X nodes have three or more children.
The highest child degree of a X node is 7.
Children of X nodes are attached using 19 different relations: det (156; 21% instances), punct (145; 19% instances), case (134; 18% instances), flat:foreign (70; 9% instances), nmod (65; 9% instances), conj (34; 5% instances), amod (31; 4% instances), appos (27; 4% instances), cc (25; 3% instances), acl:relcl (13; 2% instances), advmod (13; 2% instances), cop (9; 1% instances), acl (8; 1% instances), nummod (7; 1% instances), nsubj (4; 1% instances), mark (3; 0% instances), advcl (2; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)
Children of X nodes belong to 16 different parts of speech: DET (157; 21% instances), PUNCT (145; 19% instances), ADP (136; 18% instances), X (89; 12% instances), NOUN (68; 9% instances), ADJ (33; 4% instances), PROPN (31; 4% instances), CCONJ (25; 3% instances), VERB (23; 3% instances), ADV (13; 2% instances), AUX (9; 1% instances), NUM (9; 1% instances), PRON (4; 1% instances), SYM (4; 1% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)