
This page pertains to UD version 2.

Treebank Statistics: UD_French-PUD: POS Tags: X

There is 1 X lemma (5%), 43 X types (1%) and 48 X tokens (0%). Out of 16 observed tags, the rank of X is: 16 in number of lemmas, 10 in number of types and 14 in number of tokens.
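As a rough illustration of how counts like these can be derived, the sketch below tallies lemmas, types (distinct word forms) and tokens for one UPOS tag from CoNLL-U text. The function name `tag_counts` and the three-token sample are hypothetical, and counting forms case-sensitively is an assumption; the column layout (FORM, LEMMA and UPOS in columns 2 to 4) follows the CoNLL-U format.

```python
def tag_counts(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for tokens tagged `upos`."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column (assumed case-sensitive)
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(forms), tokens


# Hypothetical sample; lemmas are "_" (unspecified), as in this treebank.
SAMPLE = "\n".join([
    "# text = El Mundo .",
    "1\tEl\t_\tX\t_\t_\t0\troot\t_\t_",
    "2\tMundo\t_\tX\t_\t_\t1\tflat\t_\t_",
    "3\t.\t_\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

print(tag_counts(SAMPLE, "X"))
```

Because every token shares the lemma "_", the lemma count stays at 1 no matter how many distinct forms are tagged X.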

The 10 most frequent X lemmas: _

The 10 most frequent X types: ., El, Punta, a, posteriori, !, ?, Cumae, Die, Este

The 10 most frequent ambiguous lemmas: _ (NOUN 4804, ADP 3324, DET 3037, VERB 3024, PUNCT 2548, ADJ 1607, PRON 1335, PROPN 1241, ADV 1035, CCONJ 562, NUM 458, AUX 274, SCONJ 206, X 48, SYM 34, PART 9)

The 10 most frequent ambiguous types: . (PUNCT 983, X 2), a (VERB 316, DET 2, X 2, ADP 1, AUX 1), ! (PUNCT 4, X 1), ? (PUNCT 12, X 1), (NOUN 1, X 1), of (ADP 9, X 1)

Morphology

The form / lemma ratio of X is 43.000000, that is, 43 forms for 1 lemma (the average of all parts of speech is 309.550000).

The highest number of forms (43) was observed with the lemma “_”: !, ., ?, Cumae, Die, El, Este, Fjögur, Greco, Jin, Mare, Mei, Mundo, Nostrum, Or, Ping, Pithekoussae, Punta, Píanó, Rasa, Roma, Rós, Sigur, Skylark, Spiegel, Traum, Tre, Valeron, Zeit, Zettel’s, a, beurk, del, der, dessus, hui, maiorum, mos, n°, of, posteriori, ème, ….

X occurs with 1 feature: Foreign (18; 38% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (30 tokens). Examples: ., El, Punta, a, posteriori, !, ?, Cumae, Die, Fjögur

Relations

X nodes are attached to their parents using 11 different relations: flat (17; 35% instances), appos (9; 19% instances), punct (5; 10% instances), nmod (4; 8% instances), fixed (3; 6% instances), goeswith (3; 6% instances), advmod (2; 4% instances), conj (2; 4% instances), obl (1; 2% instances), root (1; 2% instances), xcomp (1; 2% instances)
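The percentages in the relation list can be checked against the raw counts. The sketch below copies the counts from the list above and assumes each percentage is the count divided by the total number of X tokens, rounded to a whole number; the rounding rule and the helper name `percent` are assumptions, not something the page states.

```python
# Counts copied from the relation list above.
relation_counts = {
    "flat": 17, "appos": 9, "punct": 5, "nmod": 4, "fixed": 3,
    "goeswith": 3, "advmod": 2, "conj": 2, "obl": 1, "root": 1, "xcomp": 1,
}

# The counts should add up to the 48 X tokens reported for this treebank.
total = sum(relation_counts.values())


def percent(count, total):
    """Share of `count` in `total` as a whole-number percentage (assumed rounding)."""
    return round(100 * count / total)


for rel, count in relation_counts.items():
    print(f"{rel} ({count}; {percent(count, total)}% instances)")
```

Because each value is rounded separately, the printed percentages do not have to add up to exactly 100.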

Parents of X nodes belong to 10 different parts of speech: X (20; 42% instances), NOUN (15; 31% instances), VERB (5; 10% instances), PROPN (2; 4% instances), ADJ (1; 2% instances), ADP (1; 2% instances), ADV (1; 2% instances), AUX (1; 2% instances), NUM (1; 2% instances), (1; 2% instances)

28 (58%) X nodes are leaves.

8 (17%) X nodes have one child.

3 (6%) X nodes have two children.

9 (19%) X nodes have three or more children.

The highest child degree of an X node is 6.
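The leaf and child-degree figures above can be obtained by counting, for every node, how many other nodes name it as their head. The following is a minimal sketch under that assumption; the function name `child_degree_histogram` and the four-token sample are hypothetical, and each token is reduced to an (ID, HEAD, UPOS) triple.

```python
from collections import Counter


def child_degree_histogram(tokens, upos):
    """Map child degree -> number of `upos` nodes with that many children.

    `tokens` is a list of (id, head, upos) triples for one sentence.
    """
    # Number of children per node ID: count how often each ID occurs as a HEAD.
    n_children = Counter(head for _, head, _ in tokens)
    # A Counter returns 0 for IDs that never occur as a head, i.e. leaves.
    return Counter(n_children[tok_id] for tok_id, _, tag in tokens if tag == upos)


# Hypothetical sentence: one X root with two X children and one PUNCT child.
sentence = [(1, 0, "X"), (2, 1, "X"), (3, 1, "X"), (4, 1, "PUNCT")]

hist = child_degree_histogram(sentence, "X")
print("leaves:", hist[0])
print("highest child degree:", max(hist))
```

Summing such histograms over all sentences of a treebank would give the leaf, one-child, two-children and three-or-more buckets; the four counts reported above (28, 8, 3 and 9) add up to the 48 X tokens.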

Children of X nodes are attached using 12 different relations: flat (17; 35% instances), punct (15; 31% instances), case (6; 12% instances), appos (2; 4% instances), fixed (2; 4% instances), advmod (1; 2% instances), cc (1; 2% instances), ccomp (1; 2% instances), conj (1; 2% instances), cop (1; 2% instances), det (1; 2% instances), nsubj (1; 2% instances)

Children of X nodes belong to 9 different parts of speech: X (20; 41% instances), PUNCT (15; 31% instances), ADP (7; 14% instances), NOUN (2; 4% instances), AUX (1; 2% instances), CCONJ (1; 2% instances), DET (1; 2% instances), NUM (1; 2% instances), VERB (1; 2% instances)