home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_French-PUD: POS Tags: X

There are 1 X lemmas (5%), 39 X types (1%) and 43 X tokens (0%). Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: El, Punta, a, posteriori, Cumae, Die, Este, Fjögur, Greco, Jin

The 10 most frequent ambiguous lemmas: _ (NOUN 4804, ADP 3323, DET 3036, PUNCT 2553, VERB 2277, ADJ 1603, PRON 1333, PROPN 1241, ADV 1035, AUX 1020, CCONJ 562, NUM 462, SCONJ 209, X 43, SYM 34, PART 9)

The 10 most frequent ambiguous types: a (AUX 287, VERB 30, DET 2, X 2, ADP 1), (NOUN 1, X 1), of (ADP 9, X 1)

Morphology

The form / lemma ratio of X is 39.000000 (the average of all parts of speech is 309.000000).

The 1st highest number of forms (39) was observed with the lemma “_”: Cumae, Die, El, Este, Fjögur, Greco, Jin, Mare, Mei, Mundo, Nostrum, Or, Ping, Pithekoussae, Punta, Píanó, Rasa, Roma, Rós, Sigur, Skylark, Spiegel, Traum, Tre, Valeron, Zeit, Zettel’s, a, beurk, del, der, dessus, hui, maiorum, mos, n°, of, posteriori, ème.

X occurs with 1 features: Foreign (18; 42% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (25 tokens). Examples: El, Punta, a, posteriori, Cumae, Die, Fjögur, Jin, Mare, Or

Relations

X nodes are attached to their parents using 10 different relations: flat (17; 40% instances), appos (9; 21% instances), nmod (4; 9% instances), fixed (3; 7% instances), goeswith (3; 7% instances), advmod (2; 5% instances), conj (2; 5% instances), obl (1; 2% instances), root (1; 2% instances), xcomp (1; 2% instances)

Parents of X nodes belong to 8 different parts of speech: X (20; 47% instances), NOUN (14; 33% instances), VERB (4; 9% instances), ADP (1; 2% instances), ADV (1; 2% instances), NUM (1; 2% instances), PROPN (1; 2% instances), (1; 2% instances)

23 (53%) X nodes are leaves.

8 (19%) X nodes have one child.

3 (7%) X nodes have two children.

9 (21%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 12 different relations: flat (17; 35% instances), punct (15; 31% instances), case (6; 12% instances), appos (2; 4% instances), fixed (2; 4% instances), advmod (1; 2% instances), cc (1; 2% instances), ccomp (1; 2% instances), conj (1; 2% instances), cop (1; 2% instances), det (1; 2% instances), nsubj (1; 2% instances)

Children of X nodes belong to 9 different parts of speech: X (20; 41% instances), PUNCT (15; 31% instances), ADP (7; 14% instances), NOUN (2; 4% instances), AUX (1; 2% instances), CCONJ (1; 2% instances), DET (1; 2% instances), NUM (1; 2% instances), VERB (1; 2% instances)