
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PatentChar: POS Tags: X

There is 1 X lemma (7%), 12 X types (2%) and 14 X tokens (under 1%, shown as 0% after rounding). Out of 15 observed tags, the rank of X is: 15 in number of lemmas, 11 in number of types and 13 in number of tokens.
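As a rough illustration of how such counts can be obtained, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The sample sentences and the function name `tag_counts` are hypothetical, and the script that actually generated this page may differ in details such as case handling.

```python
# Hypothetical toy sample in CoNLL-U format (10 tab-separated columns);
# it is not taken from the treebank itself.
SAMPLE = (
    "# sent_id = demo-1\n"
    "1\tS3\t_\tX\t_\t_\t0\troot\t_\t_\n"
    "2\t加熱\t_\tVERB\t_\t_\t1\tappos\t_\t_\n"
    "\n"
    "# sent_id = demo-2\n"
    "1\tS4\t_\tX\t_\t_\t0\troot\t_\t_\n"
    "2\t冷卻\t_\tVERB\t_\t_\t1\tappos\t_\t_\n"
)


def tag_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column ("_" when unannotated)
            tokens += 1
    return len(lemmas), len(types), tokens


n_lemmas, n_types, n_tokens = tag_counts(SAMPLE, "X")
print(n_lemmas, n_types, n_tokens)
```

Because the lemma column holds only `_` in this sample, every X token collapses into a single lemma, which mirrors the "1 X lemma" figure reported for the treebank.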

The 10 most frequent X lemmas: _

The 10 most frequent X types: S3, S4, S1, S10, S11, S12, S2, S5, S6, S7

The 10 most frequent ambiguous lemmas: _ (NOUN 1661, VERB 948, PUNCT 560, ADJ 474, PART 346, ADP 259, NUM 185, CCONJ 106, ADV 68, PROPN 60, PRON 48, DET 39, X 14, SCONJ 10, AUX 6)

The 10 most frequent ambiguous types: (none listed)

Morphology

The form / lemma ratio of X is 12.0 (the average of all parts of speech is 50.4).

The highest number of forms (12) was observed with the lemma “_”: S1, S10, S11, S12, S2, S3, S4, S5, S6, S7, S8, S9.
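The form / lemma ratio reported for X can be reproduced from the 12 forms listed for the lemma “_”. The following is a minimal sketch; the helper name `form_lemma_ratio` is an assumption, and how the page computes the cross-tag average is not stated, so that figure is not reproduced here.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# The 12 X forms reported on this page, all sharing the placeholder lemma "_".
x_forms = ["S1", "S10", "S11", "S12", "S2", "S3",
           "S4", "S5", "S6", "S7", "S8", "S9"]
x_pairs = [(form, "_") for form in x_forms]

ratio = form_lemma_ratio(x_pairs)
print(ratio)
```

With a single placeholder lemma, the ratio simply equals the number of distinct forms, so it says more about the missing lemmatization than about the morphology of X.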

X does not occur with any features.

Relations

X nodes are attached to their parents using 2 different relations: root (12; 86% instances), appos (2; 14% instances)
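The percentages follow directly from the raw counts over the 14 X tokens. A small sketch of that arithmetic, with `percent_table` as a hypothetical helper name and rounding to whole percent assumed:

```python
from collections import Counter


def percent_table(counts):
    """Return (label, count, whole-number percent) rows, most frequent first."""
    total = sum(counts.values())
    return [(label, n, round(100 * n / total))
            for label, n in counts.most_common()]


# Relation counts for X nodes as reported on this page.
x_relations = Counter({"root": 12, "appos": 2})
rows = percent_table(x_relations)
for label, n, pct in rows:
    print(f"{label} ({n}; {pct}% instances)")
```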

Parents of X nodes fall into 2 different categories: no parent, because the X node is the root (12; 86% instances), NOUN (2; 14% instances)

2 (14%) X nodes are leaves.

12 (86%) X nodes have one child.

The highest child degree of an X node is 1.

Children of X nodes are attached using 1 relation: appos (12; 100% instances)

Children of X nodes belong to 1 part of speech: VERB (12; 100% instances)
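Child degree here means the number of tokens whose head is the node in question. The sketch below computes it for a single hypothetical sentence given as (id, form, upos, head, deprel) tuples; the sentence and the function name `child_degrees` are illustrative assumptions, not treebank data.

```python
from collections import Counter

# Hypothetical sentence: an X root with one VERB child attached as appos.
SENT = [
    (1, "S3", "X", 0, "root"),
    (2, "加熱", "VERB", 1, "appos"),
    (3, "。", "PUNCT", 2, "punct"),
]


def child_degrees(sent, upos):
    """Map each token id with the given UPOS to its number of children."""
    per_head = Counter(head for _, _, _, head, _ in sent)
    return {tid: per_head[tid] for tid, _, tag, _, _ in sent if tag == upos}


degrees = child_degrees(SENT, "X")
print(degrees)
```

A node with degree 0 is a leaf, so the leaf count and the highest child degree can both be read from the values of this mapping.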