This page pertains to UD version 2.

Treebank Statistics: UD_Persian-PerDT: POS Tags: DET

There are 60 DET lemmas (0%), 57 DET types (0%) and 10438 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 11 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: این، هر، آن، هیچ، چه، چند، همان، همین، برخی، چنین

The 10 most frequent DET types: این، هر، آن، هیچ، چه، چند، همان، همین، برخی، چنین

The 10 most frequent ambiguous lemmas: این (DET 4859, PRON 904, NOUN 2), هر (DET 1703, NOUN 2), آن (PRON 2019, DET 1140, NOUN 16, ADJ 1), هیچ (DET 438, NOUN 19, ADV 7, ADJ 1, PRON 1), چه (DET 438, NOUN 213, PRON 79, ADV 67, SCONJ 6), چند (DET 418, NOUN 21, ADJ 5), همان (DET 348, PRON 18), همین (DET 267, PRON 49, NOUN 4), برخی (DET 153, NOUN 131), چنین (DET 125, PRON 40, NOUN 18)

The 10 most frequent ambiguous types: این (DET 4814, PRON 902, NOUN 2), هر (DET 1703, NOUN 2), آن (PRON 1991, DET 1130, NOUN 10, ADJ 1), هیچ (DET 438, NOUN 18, ADV 7, ADJ 1, PRON 1), چه (DET 438, NOUN 210, ADV 67, PRON 54, SCONJ 6), چند (DET 418, NOUN 21, ADJ 5), همان (DET 348, PRON 15), همین (DET 267, PRON 49, NOUN 4), برخی (DET 153, NOUN 127), چنین (DET 125, PRON 40, NOUN 18)
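The ambiguity listings above can be reproduced with a per-lemma tally of tags. The following is a minimal sketch, not the script that generated this page: the function names and the `(lemma, tag)` input shape are assumptions made for illustration. It sorts lemmas by their DET count, which is the order the listings above appear to follow, and relies on `Counter.most_common()` to order the tags within each lemma by frequency.

```python
from collections import Counter, defaultdict


def tag_counts_by_lemma(tokens):
    """Tally how often each lemma occurs with each tag.

    tokens: iterable of (lemma, tag) pairs (a hypothetical input shape).
    """
    counts = defaultdict(Counter)
    for lemma, tag in tokens:
        counts[lemma][tag] += 1
    return counts


def ambiguous_lemmas(tokens, upos="DET"):
    """Return lemmas seen with `upos` and at least one other tag,
    sorted by their `upos` count in descending order."""
    counts = tag_counts_by_lemma(tokens)
    ambiguous = [
        (lemma, tags)
        for lemma, tags in counts.items()
        if upos in tags and len(tags) > 1
    ]
    return sorted(ambiguous, key=lambda item: -item[1][upos])


# Toy data, not taken from the treebank.
toy = (
    [("آن", "PRON")] * 2
    + [("آن", "DET")]
    + [("هر", "DET")] * 3
    + [("هر", "NOUN")]
    + [("کتاب", "NOUN")]
)

for lemma, tags in ambiguous_lemmas(toy):
    listing = ", ".join(f"{tag} {n}" for tag, n in tags.most_common())
    print(f"{lemma} ({listing})")
```

The same function applies to word forms instead of lemmas if `(form, tag)` pairs are passed in.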

Morphology

The form / lemma ratio of DET is 0.950000 (the average of all parts of speech is 1.486663).
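The reported ratio matches the counts given earlier on this page: 57 DET types divided by 60 DET lemmas is 0.95. A minimal sketch of that computation follows, assuming the ratio is the number of distinct forms over the number of distinct lemmas for the tag; the function name and the `(form, lemma, tag)` input shape are hypothetical.

```python
def form_lemma_ratio(tokens, upos):
    """Return distinct forms / distinct lemmas for one tag.

    tokens: iterable of (form, lemma, tag) triples (a hypothetical input shape).
    """
    forms, lemmas = set(), set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Toy data: one lemma with two forms, plus one lemma with a single form.
toy = [
    ("آن", "آن", "DET"),
    ("ان", "آن", "DET"),
    ("هر", "هر", "DET"),
    ("کتاب", "کتاب", "NOUN"),
]
print(form_lemma_ratio(toy, "DET"))  # 3 forms / 2 lemmas

# The counts reported on this page: 57 DET types over 60 DET lemmas.
print(f"{57 / 60:.6f}")
```

Under this definition the ratio can drop below 1 when the same form is shared by more than one lemma.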

The three top-ranked lemmas by number of forms each have 2 forms: “آن” (آن, ان), “این” (این, ین) and “همه” (همه, همهٔ).

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 10 different relations: det (10273; 98% instances), nmod (126; 1% instances), conj (11; 0% instances), obl (10; 0% instances), obl:arg (7; 0% instances), nsubj (5; 0% instances), amod (2; 0% instances), obj (2; 0% instances), root (1; 0% instances), xcomp (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (10235; 98% instances), ADJ (80; 1% instances), PROPN (29; 0% instances), NUM (27; 0% instances), PRON (18; 0% instances), ADV (16; 0% instances), DET (15; 0% instances), VERB (12; 0% instances), INTJ (3; 0% instances), ADP (1; 0% instances), [root, no tag] (1; 0% instances), SCONJ (1; 0% instances)
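Counts like the two lists above can be derived from a CoNLL-U file by reading the UPOS, HEAD and DEPREL columns. The sketch below is an illustration under stated assumptions, not the generator used for this page: `read_sentences` and `attachment_stats` are hypothetical helpers, the parser handles only plain 10-column token lines, and the sample sentence is invented.

```python
from collections import Counter


def read_sentences(text):
    """Yield sentences as lists of 10-column rows.

    Comment lines, multiword-token ranges (e.g. 1-2) and empty nodes
    (e.g. 1.1) are skipped.
    """
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            if sentence:
                yield sentence
                sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():
                sentence.append(cols)
    if sentence:
        yield sentence


def attachment_stats(text, upos="DET"):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sentence in read_sentences(text):
        tag_of = {cols[0]: cols[3] for cols in sentence}  # ID -> UPOS
        for cols in sentence:
            if cols[3] == upos:
                relations[cols[7]] += 1  # DEPREL column
                # HEAD 0 is the artificial root, which has no tag;
                # it is recorded here as an empty string.
                parent_tags[tag_of.get(cols[6], "")] += 1
    return relations, parent_tags


# Invented two-token sample; columns are joined with tabs as CoNLL-U requires.
ROWS = [
    ["1", "این", "این", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "کتاب", "کتاب", "NOUN", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "# text = این کتاب\n" + "\n".join("\t".join(r) for r in ROWS) + "\n"

relations, parent_tags = attachment_stats(SAMPLE)
print(relations.most_common())
print(parent_tags.most_common())
```

Percentages can then be obtained by dividing each count by the total number of nodes with the tag.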

10373 (99%) DET nodes are leaves.

50 (0%) DET nodes have one child.

11 (0%) DET nodes have two children.

4 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.
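The child-degree figures above amount to a histogram of how many dependents each DET node has. A minimal sketch of that tally follows; the function name and the `(id, head, upos)` input shape are assumptions for illustration, and the two sentences are invented.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="DET"):
    """Map each child count to the number of `upos` nodes having that many children.

    sentences: list of sentences, each a list of (id, head, upos) triples
    (a hypothetical input shape; head 0 marks the root).
    """
    histogram = Counter()
    for sentence in sentences:
        # Number of children per node ID within this sentence.
        children = Counter(head for _node_id, head, _tag in sentence)
        for node_id, _head, tag in sentence:
            if tag == upos:
                histogram[children[node_id]] += 1
    return histogram


# Invented data: a leaf DET, and a DET that heads two other nodes.
toy = [
    [(1, 2, "DET"), (2, 0, "NOUN")],
    [(1, 2, "ADV"), (2, 0, "DET"), (3, 2, "NOUN")],
]
print(sorted(child_degree_histogram(toy).items()))

# Consistency check on the figures reported on this page.
print(10373 + 50 + 11 + 4)        # all DET nodes
print(50 * 1 + 11 * 2 + 4 * 3)    # all children of DET nodes
```

Because the highest child degree is 3, the "three or more" bucket holds nodes with exactly three children, so the DET nodes have 84 children in total, which agrees with the sums of the two child listings below; the node counts likewise add up to the 10438 DET tokens.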

Children of DET nodes are attached using 14 different relations: case (17; 20% instances), obl (11; 13% instances), advmod (10; 12% instances), cc (10; 12% instances), det (10; 12% instances), nmod (7; 8% instances), punct (6; 7% instances), conj (3; 4% instances), acl (2; 2% instances), cop (2; 2% instances), dep (2; 2% instances), obl:arg (2; 2% instances), nummod (1; 1% instances), xcomp (1; 1% instances)

Children of DET nodes belong to 10 different parts of speech: ADP (22; 26% instances), DET (15; 18% instances), NOUN (13; 15% instances), ADV (11; 13% instances), CCONJ (10; 12% instances), PUNCT (6; 7% instances), VERB (3; 4% instances), AUX (2; 2% instances), ADJ (1; 1% instances), NUM (1; 1% instances)