Treebank Statistics: UD_Portuguese-PetroGold: POS Tags: PUNCT
There are 52 PUNCT lemmas (0%), 50 PUNCT types (0%) and 29428 PUNCT tokens (12%).
Out of 16 observed tags, the rank of PUNCT is: 9 in number of lemmas, 12 in number of types and 4 in number of tokens.
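The lemma, type and token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the function name `upos_counts` and the sample sentence are hypothetical, and it assumes that a "type" is a distinct case-sensitive word form and that multiword-token ranges and empty nodes are skipped.

```python
# Hypothetical CoNLL-U fragment used only for illustration (not PetroGold data).
SAMPLE = """\
# text = Fonte: Petrobras, 2015.
1\tFonte\tfonte\tNOUN\t_\t_\t0\troot\t_\t_
2\t:\t:\tPUNCT\t_\t_\t3\tpunct\t_\t_
3\tPetrobras\tPetrobras\tPROPN\t_\t_\t1\tappos\t_\t_
4\t,\t,\tPUNCT\t_\t_\t5\tpunct\t_\t_
5\t2015\t2015\tNUM\t_\t_\t3\tnmod\t_\t_
6\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag.

    Assumption: a type is a distinct, case-sensitive FORM value.
    """
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separators and sentence-level comments
        cols = line.split("\t")
        # Keep plain integer IDs only: skips ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(forms), tokens


print(upos_counts(SAMPLE, "PUNCT"))
```

Dividing each count by the corresponding total over all tags gives the percentages reported in parentheses.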
The 10 most frequent PUNCT lemmas: ,, ., ), (, :, ;, -, –, , =
The 10 most frequent PUNCT types: ,, ., ), (, :, ;, -, –, , =
The 10 most frequent ambiguous lemmas: , (PUNCT 12322, PROPN 14, NOUN 1), . (PUNCT 7995, PROPN 34, X 3), ; (PUNCT 351, PROPN 1), - (PUNCT 312, PROPN 5), – (PUNCT 222, SYM 6), (PUNCT 102, NOUN 2, PROPN 1), = (PUNCT 82, PROPN 2), / (PUNCT 39, ADP 11, PROPN 5), ” (PUNCT 33, PROPN 3), + (PUNCT 19, ADP 2, NOUN 2, PROPN 2, SYM 2)
The 10 most frequent ambiguous types: , (PUNCT 12322, PROPN 14, NOUN 1), . (PUNCT 7995, PROPN 34, X 5), ; (PUNCT 351, PROPN 1), - (PUNCT 312, PROPN 5), – (PUNCT 222, SYM 6), (PUNCT 104, NOUN 2, PROPN 1), = (PUNCT 82, PROPN 2), / (PUNCT 39, ADP 11, PROPN 5, X 2), + (PUNCT 19, ADP 2, NOUN 2, PROPN 2, SYM 2), x (PUNCT 19, ADP 10, NOUN 6, SYM 4)
- ,
  - PUNCT 12322: Fonte : Petrobras , 2015 .
  - PROPN 14: Logo o BE será : BE = Emissões de a CSN ( tCO , ) + Emissões de a CSA ( tCO , )
  - NOUN 1: Pode se observar em os perfis de modelos sintéticos de o item 4.1.2.3 que os corpos recuperados por a inversão compacta atingiram um bom grau de homogeneidade interna , a o passo , que em os casos práticos relativos a os alvos interpretados ( capitulo 6 ) se observa menos homogeneidade em as soluções finais .
- .
- ;
- -
- –
-
- =
- /
  - PUNCT 39: Fonte : http:/ / pt.wikipedia .
  - ADP 11: O cubo a o lado possui 1 cm de aresta , sua densidade é de 2g / cm3 ,
  - PROPN 5: A capacidade de troca de cátions de as argilas foi feita em o Laboratório de Análise Mineralógica – LAM / IQ-UFRJ .
  - X 2: De esta parcela , 60 % ( US$ 82 bilhões ) é destinado a o pré-sal ( Figura 7 ) ( Fonte : www.petrobras . com.br/pt/ quem-somos / estrategia/plano-de-negocios-e-gestao / )
- +
  - PUNCT 19: Fe + 3 CO2
  - ADP 2: Esse conceito levou a uma melhor compreensão de as propriedades de o sistema argila + água .
  - NOUN 2: O volume morto foi medido com água destilada e balança de precisão , apresentando o valor de 4,92 + / - 0,01 cc para a injeção de o fluido e 4,49 + / - 0,01 cc para o fluxo reverso .
  - PROPN 2: Os óleos foram caracterizados por viscosidade cinemática a 40 ° C ( ASTMD445 ) , índice de acidez ( ASTMD664 ) , teor de água ( ASTMD6304 ) , estabilidade a a oxidação 110 ° C ( h ) ( EM 14112 ) , massa específica a 20 ° C ( Kg m-3 ) ( ASTM D 4052 ) , enxofre total ( mg/Kg ) , Na + K ( mg/Kg ) , Mg + Ca ( mg/Kg ) ( NBR 15553 ) e perfil de ácidos graxos .
  - SYM 2: BE = 11.958.667 tCO , + 8.500.000 tCO , = 20.458.667 tCO ,
- x
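An ambiguity listing of this kind can be derived by counting, for every word form, how often it occurs with each tag. The sketch below illustrates the idea under stated assumptions: the helper names `tag_distribution` and `ambiguous_for` are hypothetical, the input pairs are made up, and a form is treated as ambiguous when it carries more than one tag.

```python
from collections import Counter, defaultdict


def tag_distribution(pairs):
    """Map each form to a Counter of the UPOS tags it was seen with."""
    dist = defaultdict(Counter)
    for form, upos in pairs:
        dist[form][upos] += 1
    return dist


def ambiguous_for(dist, upos):
    """Forms seen with `upos` and at least one other tag,
    ordered by how often they carry `upos` (most frequent first)."""
    hits = [(form, tags) for form, tags in dist.items()
            if upos in tags and len(tags) > 1]
    hits.sort(key=lambda item: -item[1][upos])
    return [(form, tags.most_common()) for form, tags in hits]


# Made-up (form, tag) pairs, not PetroGold counts.
pairs = [("/", "PUNCT")] * 3 + [("/", "ADP"), ("+", "PUNCT"),
                                ("+", "SYM"), (",", "PUNCT")]
for form, tags in ambiguous_for(tag_distribution(pairs), "PUNCT"):
    print(form, tags)
```

The same functions give the lemma-based listing if lemmas are passed in place of forms.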
Morphology
The form / lemma ratio of PUNCT is 0.961538 (the average of all parts of speech is 1.452383).
The 1st highest number of forms (2) was observed with the lemma “””: “, ”.
The 2nd highest number of forms (1) was observed with the lemma “!”: !.
The 3rd highest number of forms (1) was observed with the lemma “#”: #.
PUNCT does not occur with any features.
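The form / lemma ratio reported above is consistent with dividing the number of PUNCT types by the number of PUNCT lemmas given at the top of this page. That reading is an assumption about how the figure is derived, but the arithmetic matches:

```python
# Counts taken from the summary at the top of this page.
punct_types = 50   # distinct PUNCT word forms
punct_lemmas = 52  # distinct PUNCT lemmas

# Assumption: the ratio is types divided by lemmas.
ratio = punct_types / punct_lemmas
print(f"{ratio:.6f}")  # prints 0.961538
```

A ratio below 1 means there are fewer distinct forms than lemmas for this tag, which can happen when the same form is annotated with more than one lemma.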
Relations
PUNCT nodes are attached to their parents using 2 different relations: punct (29407; 100% instances), root (21; 0% instances)
Parents of PUNCT nodes belong to 17 different parts of speech: VERB (10974; 37% instances), NOUN (8020; 27% instances), PROPN (4361; 15% instances), NUM (2537; 9% instances), ADJ (1437; 5% instances), ADV (773; 3% instances), CCONJ (345; 1% instances), PRON (321; 1% instances), ADP (299; 1% instances), SYM (265; 1% instances), X (47; 0% instances), [no tag: artificial root] (21; 0% instances), AUX (11; 0% instances), PUNCT (7; 0% instances), DET (5; 0% instances), SCONJ (4; 0% instances), INTJ (1; 0% instances)
29422 (100%) PUNCT nodes are leaves.
3 (0%) PUNCT nodes have one child.
3 (0%) PUNCT nodes have two children.
The highest child degree of a PUNCT node is 2.
Children of PUNCT nodes are attached using 3 different relations: punct (7; 78% instances), appos (1; 11% instances), det (1; 11% instances)
Children of PUNCT nodes belong to 3 different parts of speech: PUNCT (7; 78% instances), DET (1; 11% instances), NOUN (1; 11% instances)
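The figures in this section can be cross-checked against each other: relation counts, parent counts and the degree histogram should each sum to the number of PUNCT tokens, and the number of children implied by the degree histogram should match the child relation and child tag counts. The sketch below performs that check with the numbers quoted on this page; the variable names are illustrative only.

```python
# All numbers are copied from this page.
tokens = 29428
relations = {"punct": 29407, "root": 21}
parent_counts = [10974, 8020, 4361, 2537, 1437, 773, 345, 321, 299,
                 265, 47, 21, 11, 7, 5, 4, 1]
degree_hist = {0: 29422, 1: 3, 2: 3}  # child degree -> number of PUNCT nodes
child_relations = {"punct": 7, "appos": 1, "det": 1}
child_upos = {"PUNCT": 7, "DET": 1, "NOUN": 1}

# Each node with degree d contributes d children.
total_children = sum(degree * n for degree, n in degree_hist.items())

checks = {
    "relations sum to tokens": sum(relations.values()) == tokens,
    "parents sum to tokens": sum(parent_counts) == tokens,
    "degree histogram sums to tokens": sum(degree_hist.values()) == tokens,
    "children match relation counts": total_children == sum(child_relations.values()),
    "children match tag counts": total_children == sum(child_upos.values()),
}
for name, ok in checks.items():
    print(f"{name}: {ok}")
```

Every check holds for the reported numbers, with 9 children in total.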