home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Welsh-CCG: POS Tags: DET

There are 4 DET lemmas (0%), 10 DET types (0%) and 3308 DET tokens (6%). Out of 15 observed tags, the rank of DET is: 14 in number of lemmas, 14 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: y, pa, an, The

The 10 most frequent DET types: y, ‘r, yr, pa, ba, na, An, P’, The, mha

The 10 most frequent ambiguous lemmas: y (DET 3277, PART 500), The (PROPN 3, DET 1)

The 10 most frequent ambiguous types: y (DET 1387, PART 142), yr (DET 458, PART 188), na (PART 52, ADP 30, DET 1), The (PROPN 2, DET 1)

Morphology

The form / lemma ratio of DET is 2.500000 (the average of all parts of speech is 1.452021).

The 1st highest number of forms (4) was observed with the lemma “pa”: P’, ba, mha, pa.

The 2nd highest number of forms (3) was observed with the lemma “y”: ‘r, y, yr.

The 3rd highest number of forms (2) was observed with the lemma “an”: An, na.

DET occurs with 2 features: Mutation (5; 0% instances), Foreign (4; 0% instances)

DET occurs with 3 feature-value pairs: Foreign=Yes, Mutation=NM, Mutation=SM

DET occurs with 4 feature combinations. The most frequent feature combination is _ (3299 tokens). Examples: y, ‘r, yr, pa, An, P’

Relations

DET nodes are attached to their parents using 3 different relations: det (3302; 100% instances), advmod (5; 0% instances), nmod (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (3031; 92% instances), PROPN (150; 5% instances), ADJ (38; 1% instances), VERB (35; 1% instances), PRON (28; 1% instances), NUM (21; 1% instances), ADV (3; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)

3306 (100%) DET nodes are leaves.

2 (0%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 1 different relations: fixed (2; 100% instances)

Children of DET nodes belong to 1 different parts of speech: ADV (2; 100% instances)