home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: AUX

There are 165 AUX lemmas (1%), 165 AUX types (1%) and 2684 AUX tokens (2%). Out of 15 observed tags, the rank of AUX is: 9 in number of lemmas, 9 in number of types and 10 in number of tokens.

The 10 most frequent AUX lemmas: 是、 為、 會、 可以、 可、 也是、 能、 要、 可能、 就是

The 10 most frequent AUX types: 是、 為、 會、 可以、 可、 也是、 能、 要、 可能、 就是

The 10 most frequent ambiguous lemmas: 是 (AUX 883, VERB 322, X 1), 為 (AUX 553, VERB 507, ADP 131, PROPN 1, X 1), 會 (AUX 200, PART 137, NOUN 3), 可 (AUX 106, ADV 1), 也是 (AUX 76, VERB 5, ADV 3), 能 (AUX 72, PART 8), 要 (AUX 68, VERB 6), 可能 (AUX 60, NOUN 9), 就是 (AUX 29, VERB 16, ADV 6), 必須 (AUX 28, ADJ 1, VERB 1)

The 10 most frequent ambiguous types: 是 (AUX 883, VERB 322, X 1), 為 (AUX 553, VERB 507, ADP 131, PROPN 1, X 1), 會 (AUX 200, PART 137, NOUN 3), 可 (AUX 106, ADV 1), 也是 (AUX 76, VERB 5, ADV 3), 能 (AUX 72, PART 8), 要 (AUX 68, VERB 6), 可能 (AUX 60, NOUN 9), 就是 (AUX 29, VERB 16, ADV 6), 必須 (AUX 28, ADJ 1, VERB 1)

Morphology

The form / lemma ratio of AUX is 1.000000 (the average of all parts of speech is 1.000266).

The 1st highest number of forms (1) was observed with the lemma “一爭”: 一爭.

The 2nd highest number of forms (1) was observed with the lemma “上表”: 上表.

The 3rd highest number of forms (1) was observed with the lemma “不可”: 不可.

AUX does not occur with any features.

Relations

AUX nodes are attached to their parents using 7 different relations: cop (1795; 67% instances), aux (879; 33% instances), conj (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), acl:relcl (1; 0% instances), dep (1; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: NOUN (1192; 44% instances), VERB (852; 32% instances), PART (251; 9% instances), ADJ (219; 8% instances), NUM (93; 3% instances), PROPN (54; 2% instances), X (17; 1% instances), AUX (3; 0% instances), ADP (2; 0% instances), PRON (1; 0% instances)

2656 (99%) AUX nodes are leaves.

22 (1%) AUX nodes have one child.

5 (0%) AUX nodes have two children.

1 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 11 different relations: advmod (16; 43% instances), mark (5; 14% instances), punct (4; 11% instances), cc (3; 8% instances), conj (3; 8% instances), case (1; 3% instances), csubj (1; 3% instances), dep (1; 3% instances), mark:relcl (1; 3% instances), nsubj (1; 3% instances), xcomp (1; 3% instances)

Children of AUX nodes belong to 8 different parts of speech: ADV (21; 57% instances), PUNCT (4; 11% instances), AUX (3; 8% instances), CCONJ (3; 8% instances), VERB (3; 8% instances), ADP (1; 3% instances), NOUN (1; 3% instances), PART (1; 3% instances)