Treebank Statistics: UD_English-ParTUT: POS Tags: DET
There are 33 DET lemmas (1%), 36 DET types (0%) and 5316 DET tokens (11%).
Out of 17 observed tags, DET ranks 11th by number of lemmas, 11th by number of types and 4th by number of tokens.
The 10 most frequent DET lemmas: the, a, this, his, their, its, any, all, us, that
The 10 most frequent DET types: the, a, this, his, an, their, its, any, these, all
The 10 most frequent ambiguous lemmas: a (DET 953, X 1), this (DET 337, PRON 90, ADJ 1), his (DET 249, PRON 6), any (DET 79, ADV 1), all (DET 60, PRON 42, ADV 5), us (DET 60, PRON 36), that (SCONJ 325, PRON 217, DET 59, ADJ 2), no (DET 58, ADV 8), such (DET 50, ADJ 29), you (PRON 131, DET 50)
The 10 most frequent ambiguous types: a (DET 789, X 1), this (DET 221, PRON 42, ADJ 1), his (DET 224, PRON 6), any (DET 73, ADV 1), these (DET 62, PRON 6), all (DET 56, PRON 34, ADV 5), our (DET 54, PRON 1), no (DET 41, ADV 8), such (DET 37, ADJ 28), that (SCONJ 324, PRON 173, DET 34)
- a
  - DET 789: ( 1 ) Everyone has the right to a nationality .
  - X 1: It merely prolongs transitional rules by postponing deadlines , deletes provisions which are no longer applicable , and lays down the procedures for a ) carrying out the ad hoc transportation of dangerous goods and b ) enacting less stringent national regulations , in particular for the transport of very small amounts of dangerous goods within strictly defined local areas .
- this
  - DET 221: We are badly behind now in this matter .
  - PRON 42: Mr Berenguer Fuster , we shall check all this .
  - ADJ 1: 19 ) Chapter 6 of the Commission ‘s “ Communication on Environmental Agreements at Community level within the Framework of the Action Plan on the Simplification and Improvement of the Regulatory Environment “ could provide useful guidance when assessing self-regulation by industry in the context of this Directive .
- his
- any
  - DET 73: Such credit may be implemented in any reasonable manner ;
  - ADV 1: In this context , I should like to make a request and ask the Commissioner responsible , who is with us here today , to table an appropriate text as soon as possible with a view to continuing to make it safer for traffic to transit tunnels in the future , so that we in Europe do not have to experience any more such disasters on this scale .
- these
- all
- our
- no
- such
- that
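The notion of an ambiguous type used above (a word form observed with DET and with at least one other tag) can be illustrated with a minimal sketch. It assumes CoNLL-U input with ten tab-separated columns and compares forms case-sensitively; whether the published counts fold case is not stated here, so that choice is an assumption. The sample sentences and the function names `tag_counts_by_form` and `ambiguous_forms` are hypothetical and not part of any UD tooling.

```python
from collections import Counter, defaultdict

# Tiny hand-made CoNLL-U sample (hypothetical, not copied from the treebank).
SAMPLE = """\
1\tthis\tthis\tDET\t_\t_\t2\tdet\t_\t_
2\tmatter\tmatter\tNOUN\t_\t_\t0\troot\t_\t_

1\tcheck\tcheck\tVERB\t_\t_\t0\troot\t_\t_
2\tthis\tthis\tPRON\t_\t_\t1\tobj\t_\t_
"""


def tag_counts_by_form(conllu_text):
    """Map each word form to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        form, upos = cols[1], cols[3]
        counts[form][upos] += 1
    return counts


def ambiguous_forms(counts, tag="DET"):
    """Return forms seen with `tag` and with at least one other tag."""
    return {form: dict(tags) for form, tags in counts.items()
            if tag in tags and len(tags) > 1}


result = ambiguous_forms(tag_counts_by_form(SAMPLE))
```

Replacing column 2 (FORM) with column 3 (LEMMA) in the same loop would give the corresponding lemma-level ambiguity counts.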
Morphology
The form / lemma ratio of DET is 1.090909 (the average of all parts of speech is 1.205397).
The highest number of forms observed for a single DET lemma is 2, shared by three lemmas: “a” (a, an), “that” (that, those) and “this” (these, this).
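The form / lemma ratio is consistent with dividing the number of DET types by the number of DET lemmas (36 / 33 ≈ 1.090909). A minimal sketch of that arithmetic follows; it assumes tokens are available as (form, lemma, tag) triples and that a type is a distinct word form. The sample tokens and the function names are hypothetical.

```python
from collections import defaultdict


def forms_per_lemma(tokens, tag="DET"):
    """Collect the set of distinct forms observed for each lemma of `tag`."""
    forms = defaultdict(set)
    for form, lemma, upos in tokens:
        if upos == tag:
            forms[lemma].add(form)
    return forms


def form_lemma_ratio(forms):
    """Number of distinct forms divided by number of distinct lemmas."""
    if not forms:
        raise ValueError("no tokens with the requested tag")
    n_types = len(set().union(*forms.values()))
    return n_types / len(forms)


# Hypothetical sample; the PRON token is ignored when tag="DET".
tokens = [
    ("the", "the", "DET"),
    ("a", "a", "DET"),
    ("an", "a", "DET"),
    ("these", "this", "DET"),
    ("this", "this", "DET"),
    ("this", "this", "PRON"),
]
det_forms = forms_per_lemma(tokens)
ratio = form_lemma_ratio(det_forms)  # 5 forms over 3 lemmas

# The figure reported for the treebank follows from its own counts.
reported = round(36 / 33, 6)
```

Sorting `det_forms` by the size of each set would yield the lemmas with the most forms, as listed above.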
DET occurs with 5 features: PronType (5315; 100% instances), Definite (3927; 74% instances), Number (1530; 29% instances), Poss (640; 12% instances), ExtPos (5; 0% instances)
DET occurs with 15 feature-value pairs: Definite=Def, Definite=Ind, ExtPos=ADV, ExtPos=PRON, Number=Plur, Number=Sing, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot
DET occurs with 21 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art (2953 tokens).
Examples: the
Relations
DET nodes are attached to their parents using 14 different relations: det (4631; 87% instances), nmod:poss (639; 12% instances), det:predet (17; 0% instances), advmod (7; 0% instances), nmod (5; 0% instances), obl (4; 0% instances), fixed (3; 0% instances), obj (3; 0% instances), nsubj (2; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), iobj (1; 0% instances), parataxis (1; 0% instances), root (1; 0% instances)
Parents of DET nodes belong to 11 different parts of speech: NOUN (5055; 95% instances), PROPN (161; 3% instances), PRON (35; 1% instances), ADJ (33; 1% instances), VERB (14; 0% instances), X (6; 0% instances), ADV (5; 0% instances), NUM (3; 0% instances), SYM (2; 0% instances), INTJ (1; 0% instances), no part of speech, presumably the artificial root of the one DET attached as root (1; 0% instances)
5295 (100%) DET nodes are leaves.
15 (0%) DET nodes have one child.
4 (0%) DET nodes have two children.
2 (0%) DET nodes have three or more children.
The highest child degree of a DET node is 8.
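The leaf and child-degree counts can be derived from the HEAD column alone. The following is a minimal sketch under the assumption that each sentence is given as (token id, head id, tag) triples with head 0 marking the root; the sample tree and the function name `child_degree_histogram` are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="DET"):
    """Count how many nodes of `tag` have 0, 1, 2, ... children."""
    # Number of children per node id: every token votes for its head.
    n_children = Counter(head for _, head, _ in sentence)
    # A Counter returns 0 for ids that never appear as a head (leaves).
    return Counter(n_children[tid] for tid, _, upos in sentence
                   if upos == tag)


# Hypothetical four-token tree: token 1 (DET) is the root and heads token 4,
# which in turn heads tokens 2 and 3.
sentence = [
    (1, 0, "DET"),
    (2, 4, "ADP"),
    (3, 4, "DET"),
    (4, 1, "NOUN"),
]
hist = child_degree_histogram(sentence)  # one leaf DET, one DET with a child

# Sanity check on the reported figures: the degree classes cover all tokens.
total = 5295 + 15 + 4 + 2
```

Summing the per-sentence histograms with `Counter` addition gives corpus-level figures, and the largest key in the summed histogram is the highest child degree.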
Children of DET nodes are attached using 12 different relations: case (9; 26% instances), punct (7; 20% instances), fixed (5; 14% instances), nmod (3; 9% instances), advmod (2; 6% instances), aux (2; 6% instances), conj (2; 6% instances), advcl (1; 3% instances), cop (1; 3% instances), mark (1; 3% instances), nsubj (1; 3% instances), obl (1; 3% instances)
Children of DET nodes belong to 10 different parts of speech: ADP (11; 31% instances), PUNCT (7; 20% instances), ADJ (5; 14% instances), AUX (3; 9% instances), NOUN (3; 9% instances), ADV (2; 6% instances), PRON (1; 3% instances), PROPN (1; 3% instances), SCONJ (1; 3% instances), VERB (1; 3% instances)