Treebank Statistics: UD_Spanish-GSD: POS Tags: PUNCT
There are 34 PUNCT lemmas (0%), 35 PUNCT types (0%) and 47474 PUNCT tokens (11%).
Out of 17 observed tags, the rank of PUNCT is: 12 in number of lemmas, 13 in number of types and 4 in number of tokens.
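Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, ...); the function name `upos_counts` and the inline sample are illustrative, and the script that generated this page may differ in details such as case handling.

```python
def upos_counts(conllu_text, upos="PUNCT"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types = set(), set()
    tokens = 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "3-4") and empty nodes (e.g. "5.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


# Hypothetical two-punctuation sample, not taken from the treebank.
sample = "\n".join([
    "# text = Hola, mundo.",
    "1\tHola\thola\tINTJ\t_\t_\t0\troot\t_\t_",
    "2\t,\t,\tPUNCT\t_\tPunctType=Comm\t3\tpunct\t_\t_",
    "3\tmundo\tmundo\tNOUN\t_\t_\t1\tvocative\t_\t_",
    "4\t.\t.\tPUNCT\t_\tPunctType=Peri\t1\tpunct\t_\t_",
])

print(upos_counts(sample))
```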
The 10 most frequent PUNCT lemmas: ,, ., “, (, ), -, :, ;, ‘, /
The 10 most frequent PUNCT types: ,, ., “, (, ), -, :, ;, ‘, /
The 10 most frequent ambiguous lemmas: ” (PUNCT 2718, X 5, SYM 1), - (PUNCT 1073, X 1), ’ (PUNCT 208, X 2), / (PUNCT 192, SYM 24), » (PUNCT 167, ADJ 2), – (PUNCT 156, X 1), ’’ (PUNCT 13, NOUN 2), • (PUNCT 5, X 1), * (PUNCT 4, SYM 4), = (SYM 14, PUNCT 2)
The 10 most frequent ambiguous types: ” (PUNCT 2718, X 5, SYM 1), - (PUNCT 1073, X 1), ’ (PUNCT 208, X 2), / (PUNCT 192, SYM 24), » (PUNCT 167, ADJ 2), – (PUNCT 156, X 1), ’’ (PUNCT 13, NOUN 2), • (PUNCT 5, X 1), * (PUNCT 4, SYM 4), = (SYM 14, PUNCT 2)
- ”
  - PUNCT 2718: ” Peul de ciudad ” .
  - X 5: Se encuentra en las coordenadas 32 ° 8’9 .71 ” S 60 ° 58’52 .38 ” O ;
  - SYM 1: Ha colaborado con él en las canciones Cum On Everybody , Drug Ballad , Superman , Renegades ( la versión de 8 minutos , contando también con la presencia de Ms. Korona y Royce Da 5’9 ” ) y We Made You .
- -
  - PUNCT 1073: Los no - lugares están muy presentes en la obra de J. G. Ballard .
  - X 1: Estas parameras se inician en la zona de contacto de la Cordillera Central con la Ibérica en la Sierra de Pela , prolongando se hacia el Este en amplias extensiones de topografía arrasada y plana , con áreas de pliegues suaves cortados por lapenillanura fundamental de la Meseta , por lo que sus relieves son en buena parte una superficie de erosión , horizontal o ligeramente deformada , donde los niveles aflorantes corresponden a materiales calcáreos que se depositaron durante el Triásico , desarrollando se un gran número de formas kársticas ( dolinas , cuevas , lapiaces ) , y con una altitud media que se sitúa en torno a los 1.000 - 1.200m .
- ’
- /
- »
- –
- ’’
  - PUNCT 13: Con este último grupo colaboró en la canción `` I can not dance ’’ .
  - NOUN 2: Castro Bergidum ( Bergdunum ) o Castro Ventosa es un yacimiento arqueológico situado en las coordenadas 6 º 45’06 ’’ de longitud Oeste y 42 º 36’04 ’’ de latitud Norte a 642 msnm de altitud , en el centro de la Hoya berciana en la comarca de El Bierzo , Provincia de León , a el noroeste de la península Ibérica .
- •
- *
- =
  - SYM 14: Palabra clave : desplazamiento = 0 , tamaño = 2 .
  - PUNCT 2: Filocalia o filokalia ( en griego Φιλοκαλια , de φιλíα = afición , amor y de καλóς = bello , belleza ) , nombre que recibe una colección ya clásica de textos dedicados a la mística y ascesis en la Iglesia Ortodoxa , uno de sus principales temas es el hesicasmo .
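An ambiguous lemma here is one that occurs with PUNCT and with at least one other tag. The sketch below shows one way such a list could be derived from (lemma, tag) pairs. The function name `ambiguous_lemmas` and the sample rows are illustrative, and sorting by the PUNCT count is an assumption based on how the list above appears to be ordered.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(rows, upos="PUNCT"):
    """rows: iterable of (lemma, upos_tag) pairs, one pair per token."""
    per_lemma = defaultdict(Counter)
    for lemma, tag in rows:
        per_lemma[lemma][tag] += 1
    # Keep lemmas seen with the target tag and with at least one other tag.
    ambiguous = {
        lemma: tags
        for lemma, tags in per_lemma.items()
        if upos in tags and len(tags) > 1
    }
    # Assumed ordering: count under the target tag, highest first.
    return sorted(ambiguous.items(), key=lambda kv: kv[1][upos], reverse=True)


# Hypothetical token sample, not the real treebank counts.
rows = (
    [("/", "PUNCT")] * 3
    + [("/", "SYM")]
    + [("=", "SYM")] * 2
    + [("=", "PUNCT")]
    + [(",", "PUNCT")] * 5
)

result = ambiguous_lemmas(rows)
for lemma, tags in result:
    print(lemma, dict(tags))
```

The comma is excluded from the result because it only ever occurs as PUNCT in the sample.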
Morphology
The form / lemma ratio of PUNCT is 1.029412 (the average of all parts of speech is 1.326443).
The highest number of forms (2) was observed with the lemma “.”: ., .ç.
The second-highest number of forms (1) was observed with the lemma “!”: !.
The third-highest number of forms (1) was observed with the lemma “””: ”.
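The form / lemma ratio reported above is consistent with dividing the number of PUNCT types (35) by the number of PUNCT lemmas (34), both taken from the counts at the top of this page. A quick check, with illustrative variable names:

```python
n_types = 35   # distinct PUNCT word forms reported on this page
n_lemmas = 34  # distinct PUNCT lemmas reported on this page

ratio = n_types / n_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 1.029412
```

A ratio this close to 1 reflects that only one lemma (“.”) has more than one form.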
PUNCT occurs with 4 features: PunctType (47260; 100% of instances), PunctSide (5206; 11% of instances), Foreign (3; 0% of instances), Typo (2; 0% of instances)
PUNCT occurs with 14 feature-value pairs: Foreign=Yes, PunctSide=Fin, PunctSide=Ini, PunctType=Brck, PunctType=Colo, PunctType=Comm, PunctType=Dash, PunctType=Elip, PunctType=Excl, PunctType=Peri, PunctType=Qest, PunctType=Quot, PunctType=Semi, Typo=Yes
PUNCT occurs with 16 feature combinations.
The most frequent feature combination is PunctType=Comm (21253 tokens).
Examples: ,
Relations
PUNCT nodes are attached to their parents using a single relation: punct (47474; 100% of instances)
Parents of PUNCT nodes belong to 17 different parts of speech: VERB (20649; 43% of instances), NOUN (13213; 28% of instances), PROPN (6843; 14% of instances), ADJ (2307; 5% of instances), NUM (1484; 3% of instances), PRON (793; 2% of instances), ADV (705; 1% of instances), X (679; 1% of instances), SYM (305; 1% of instances), ADP (195; 0% of instances), CCONJ (105; 0% of instances), DET (85; 0% of instances), AUX (32; 0% of instances), SCONJ (32; 0% of instances), PART (24; 0% of instances), PUNCT (16; 0% of instances), INTJ (7; 0% of instances)
47466 (100%) PUNCT nodes are leaves.
0 (0%) PUNCT nodes have one child.
8 (0%) PUNCT nodes have two children.
The highest child degree of a PUNCT node is 2.
Children of PUNCT nodes are attached using a single relation: punct (16; 100% of instances)
Children of PUNCT nodes belong to a single part of speech: PUNCT (16; 100% of instances)
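The child-degree figures above are internally consistent: 8 PUNCT nodes with two children each account for the 16 PUNCT children. The sketch below shows how such a histogram could be computed from head indices. The function name `child_degree_histogram`, the (id, upos, head) tuple layout, and the sample sentence are illustrative assumptions, not the page's actual tooling.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="PUNCT"):
    """sentences: list of sentences, each a list of (id, upos, head) tuples.

    Returns a Counter mapping number-of-children to number of `upos` nodes.
    """
    histogram = Counter()
    for sent in sentences:
        # Count how many tokens point at each head id within the sentence.
        n_children = Counter(head for _, _, head in sent)
        for node_id, tag, _ in sent:
            if tag == upos:
                histogram[n_children[node_id]] += 1
    return histogram


# Hypothetical sentence: token 2 is a PUNCT node with two PUNCT children.
sentence = [
    (1, "PUNCT", 2),
    (2, "PUNCT", 4),
    (3, "PUNCT", 2),
    (4, "NOUN", 0),
]

hist = child_degree_histogram([sentence])
print(dict(hist))
```

In the sample, two PUNCT nodes are leaves and one has two children, so the highest child degree is 2.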