
DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Persian)

There is 1 DET lemma (7%), 35 DET types (0%) and 3561 DET tokens (2%). Out of 15 observed tags, DET ranks 6th in number of lemmas, 11th in number of types and 10th in number of tokens.
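As a rough illustration of how lemma, type and token counts per tag could be derived, here is a minimal sketch that reads CoNLL-U text. The sample sentences and the function name `tag_statistics` are hypothetical and are not taken from UD_Persian; the sketch also assumes that a "type" is simply a distinct word form.

```python
from collections import defaultdict

# Tiny hypothetical CoNLL-U fragment (10 tab-separated columns per token line).
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "1\tاین\t_\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tکتاب\t_\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = demo-2",
    "1\tهر\t_\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tروز\t_\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
])


def tag_statistics(conllu_text):
    """Return {tag: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


stats = tag_statistics(SAMPLE)
```

In this sample every lemma is the placeholder `_`, so each tag ends up with a single lemma regardless of how many forms it has, which mirrors the situation reported for DET above.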

The only DET lemma: _ (the CoNLL-U placeholder for an unspecified lemma)

The 10 most frequent DET types: این، آن، هر، همان، همین، برخی، تمام، تمامی، دیگر، بعضی

The only ambiguous lemma: _ (NOUN 57475, ADP 17533, VERB 16902, ADJ 13589, PUNCT 13442, CONJ 8218, PRON 5772, SCONJ 5160, ADV 4150, DET 3561, NUM 3406, PART 2569, AUX 772, X 253, INTJ 69)

The 10 most frequent ambiguous types: این (DET 2372, PRON 487, CONJ 1), آن (PRON 591, DET 366, NOUN 3), همان (DET 128, PRON 8, NOUN 1), همین (DET 119, PRON 18), برخی (DET 43, PRON 41), تمام (DET 42, ADJ 38, PRON 1), دیگر (ADJ 204, ADV 39, DET 21, NOUN 1), بعضی (PRON 64, DET 20), سراسر (DET 17, ADV 1), دین (NOUN 48, DET 16, X 1)

Morphology

The form / lemma ratio of DET is 35.000000 (the average of all parts of speech is 1071.133333).
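The DET ratio follows directly from the counts reported above (35 forms, 1 lemma). A minimal sketch of the arithmetic, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Number of distinct forms divided by number of distinct lemmas."""
    return n_forms / n_lemmas


# 35 DET types and 1 DET lemma, as reported in the treebank statistics.
det_ratio = form_lemma_ratio(35, 1)
print(f"{det_ratio:.6f}")  # 35.000000
```

Because the single lemma is the placeholder `_`, the ratio here equals the number of DET forms and says little about actual inflectional variety.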

The 1st highest number of forms (35) was observed with the lemma “_”: آن, آن‌ها, اون, این, اینهمه, اینگونه, برخی, بعض, بعضی, تعدادی, تمام, تمامی, تنها, تک‌تک, دان, دین, دیگر, سراسر, سری, فلان, هر, همان, همه, همهٔ, همچین, همین, هیچگونه, چنان, چنین, چگونه, کدام, کلیه, کلیهٔ, ین, یک‌یک.

DET occurs with 1 feature: fa-feat/PronType (19; 1% instances)

DET occurs with 1 feature-value pair: PronType=Ind

DET occurs with 2 feature combinations. The most frequent feature combination is _ (3542 tokens). Examples: این، آن، هر، همان، همین، برخی، تمام، تمامی، دیگر، همهٔ

Relations

DET nodes are attached to their parents using 11 different relations: fa-dep/det (3469; 97% instances), fa-dep/det:predet (36; 1% instances), fa-dep/mark (18; 1% instances), fa-dep/nsubj (15; 0% instances), fa-dep/advmod (6; 0% instances), fa-dep/dobj (6; 0% instances), fa-dep/nmod:poss (5; 0% instances), fa-dep/mwe (3; 0% instances), fa-dep/conj (1; 0% instances), fa-dep/parataxis (1; 0% instances), fa-dep/root (1; 0% instances)
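The percentages in the list above can be reproduced from the raw counts. The sketch below assumes that each share is the count divided by the total number of DET tokens, rounded to the nearest whole percent; that rounding rule is an assumption inferred from the figures, not something the page states.

```python
# Relation counts for DET nodes, copied from the statistics above.
det_relations = {
    "det": 3469,
    "det:predet": 36,
    "mark": 18,
    "nsubj": 15,
    "advmod": 6,
    "dobj": 6,
    "nmod:poss": 5,
    "mwe": 3,
    "conj": 1,
    "parataxis": 1,
    "root": 1,
}

total = sum(det_relations.values())  # should equal the number of DET tokens

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in det_relations.items()}

for rel, n in det_relations.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to 3561, the number of DET tokens, so every DET node is covered by exactly one incoming relation. Relations with fewer than about 18 instances round down to 0%.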

Parents of DET nodes belong to 10 different parts of speech: NOUN (3444; 97% instances), ADJ (27; 1% instances), VERB (27; 1% instances), ADV (24; 1% instances), NUM (22; 1% instances), PRON (12; 0% instances), ADP (2; 0% instances), CONJ (1; 0% instances), DET (1; 0% instances), ROOT (1; 0% instances)

3511 (99%) DET nodes are leaves.

47 (1%) DET nodes have one child.

1 (0%) DET node has two children.

2 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.
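Child-degree figures like these can be obtained by counting, for each node, how many other nodes name it as their head. The following is a minimal sketch over a hypothetical four-token sentence; the tuple layout and the function name `child_degree_histogram` are illustrative assumptions.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="DET"):
    """Map child count -> number of `tag` nodes with that many children.

    `sentence` is a list of (token_id, upos, head_id) tuples; head_id 0 is the root.
    """
    n_children = Counter(head for _, _, head in sentence)
    # Counter returns 0 for nodes that never appear as a head (leaves).
    return Counter(n_children[tid] for tid, upos, _ in sentence if upos == tag)


# Hypothetical sentence: token 1 is a leaf DET, token 3 is a DET with one child.
demo = [
    (1, "DET", 2),
    (2, "NOUN", 0),
    (3, "DET", 2),
    (4, "NUM", 3),
]
hist = child_degree_histogram(demo)

# Sanity check on the reported figures: the four degree classes cover all DET tokens.
reported_total = 3511 + 47 + 1 + 2
```

For a whole treebank the histogram would be accumulated sentence by sentence, since token IDs restart at 1 in each sentence.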

Children of DET nodes are attached using 12 different relations: fa-dep/mwe (23; 41% instances), fa-dep/nummod (13; 23% instances), fa-dep/case (5; 9% instances), fa-dep/nmod:poss (5; 9% instances), fa-dep/ccomp (2; 4% instances), fa-dep/nsubj (2; 4% instances), fa-dep/acl:relcl (1; 2% instances), fa-dep/amod (1; 2% instances), fa-dep/aux (1; 2% instances), fa-dep/cop (1; 2% instances), fa-dep/det (1; 2% instances), fa-dep/punct (1; 2% instances)

Children of DET nodes belong to 9 different parts of speech: NOUN (22; 39% instances), NUM (14; 25% instances), PART (5; 9% instances), VERB (5; 9% instances), CONJ (4; 7% instances), ADJ (3; 5% instances), ADP (1; 2% instances), DET (1; 2% instances), PUNCT (1; 2% instances)

