
This page pertains to UD version 2.

Treebank Statistics: UD_Galician-PUD: POS Tags: X

There are 64 X lemmas (1%), 64 X types (1%) and 79 X tokens (0%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: the, a, of, really, you, ‘ya, Don’t, Hitchhiker’s, and, anyway

The 10 most frequent X types: the, a, of, Really, You, ‘Ya, Anyway, Breaking, Buck, Cena

The 10 most frequent ambiguous lemmas: the (X 7, PROPN 4), a (ADP 461, X 5), of (X 4, ADP 2), ground (PROPN 1, X 1), me (PRON 11, X 1), my (PROPN 1, X 1)

The 10 most frequent ambiguous types: a (DET 840, ADP 430, PRON 13, X 3), of (X 4, ADP 2), Ground (PROPN 1, X 1), My (PROPN 1, X 1), Son (AUX 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.319483).

The 1st highest number of forms (1) was observed with the lemma “’ya”: ‘Ya.

The 2nd highest number of forms (1) was observed with the lemma “Don’t”: Don’t.

The 3rd highest number of forms (1) was observed with the lemma “Hitchhiker’s”: Hitchhiker’s.
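The form / lemma ratio above can be reproduced with a short sketch. It assumes the ratio is the average number of distinct word forms per lemma, which is consistent with the 64 types and 64 lemmas reported for X. The function names and the sample pairs are illustrative, not part of the statistics tooling.

```python
from collections import defaultdict

def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct word forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return dict(forms)

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_per_lemma(pairs)
    return sum(len(s) for s in forms.values()) / len(forms)

# (form, lemma) pairs taken from the frequency lists above; the repeated
# pair shows that token frequency does not affect the ratio.
x_pairs = [("the", "the"), ("a", "a"), ("of", "of"),
           ("Really", "really"), ("the", "the")]
ratio_x = form_lemma_ratio(x_pairs)

# Hypothetical contrast: a second spelling of one lemma raises the ratio.
ratio_with_variant = form_lemma_ratio(x_pairs + [("The", "the")])
```

With one form per lemma, as for every X lemma in this treebank, the ratio is exactly 1; any lemma with a second form pushes it above 1.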

X occurs with 1 feature: Foreign (79; 100% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 1 feature combination. The most frequent feature combination is Foreign=Yes (79 tokens). Examples: the, a, of, Really, You, ‘Ya, Anyway, Breaking, Buck, Cena

Relations

X nodes are attached to their parents using 6 different relations: flat:foreign (58; 73% instances), appos (5; 6% instances), nmod (5; 6% instances), conj (4; 5% instances), nsubj (4; 5% instances), obl (3; 4% instances)

Parents of X nodes belong to 3 different parts of speech: X (61; 77% instances), NOUN (11; 14% instances), VERB (7; 9% instances)
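The percentages in the two lists above follow from the raw counts. A minimal sketch of the arithmetic, assuming each share is the count divided by the total number of X nodes (79) and rounded to the nearest whole percent; the helper name `share` is illustrative:

```python
def share(counts):
    """Return {label: (count, rounded percent)} for a mapping label -> count."""
    total = sum(counts.values())
    return {label: (n, round(100 * n / total)) for label, n in counts.items()}

# Counts copied from the lists above.
x_relations = {"flat:foreign": 58, "appos": 5, "nmod": 5,
               "conj": 4, "nsubj": 4, "obl": 3}
x_parent_pos = {"X": 61, "NOUN": 11, "VERB": 7}

for label, (n, pct) in share(x_relations).items():
    print(f"{label} ({n}; {pct}% instances)")
```

Because each share is rounded independently, the percentages in a list need not sum to exactly 100.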

56 (71%) X nodes are leaves.

2 (3%) X nodes have one child.

1 (1%) X node has two children.

20 (25%) X nodes have three or more children.

The highest child degree of an X node is 9.
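The child-degree breakdown above (leaves, one child, two children, three or more) can be sketched from the HEAD column of a CoNLL-U file. The fragment below is a hypothetical single sentence built from forms listed on this page, not an actual sentence from the treebank, and the function names are illustrative.

```python
from collections import Counter

# Hypothetical one-sentence CoNLL-U fragment (10 tab-separated columns:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = """\
1\tBreaking\tBreaking\tX\t_\tForeign=Yes\t0\troot\t_\t_
2\tthe\tthe\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_
3\tGround\tground\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_
4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""

def parse(conllu):
    """Read the basic token lines of ONE sentence into small dicts."""
    rows = []
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():  # skip multiword ranges (1-2) and empty nodes (1.1)
            continue
        rows.append({"id": int(cols[0]), "upos": cols[3], "head": int(cols[6])})
    return rows

def child_degree_buckets(rows, upos="X"):
    """Count nodes of the given UPOS by number of children: '0', '1', '2', '3+'."""
    n_children = Counter(r["head"] for r in rows)  # head id -> number of dependents
    buckets = Counter()
    for r in rows:
        if r["upos"] != upos:
            continue
        k = n_children[r["id"]]
        buckets["3+" if k >= 3 else str(k)] += 1
    return buckets

buckets = child_degree_buckets(parse(SAMPLE))
```

In this fragment the head of the foreign-language span collects the other X tokens and the punctuation as children, so it lands in the "3+" bucket while the remaining X tokens are leaves. For a whole treebank, the same counting would be applied per sentence and the buckets summed, since token IDs restart in every sentence.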

Children of X nodes are attached using 8 different relations: flat:foreign (58; 51% instances), punct (36; 32% instances), case (8; 7% instances), cc (4; 4% instances), appos (3; 3% instances), conj (3; 3% instances), det (1; 1% instances), nmod (1; 1% instances)

Children of X nodes belong to 8 different parts of speech: X (61; 54% instances), PUNCT (36; 32% instances), ADP (8; 7% instances), CCONJ (4; 4% instances), NOUN (2; 2% instances), DET (1; 1% instances), NUM (1; 1% instances), PROPN (1; 1% instances)