Treebank Statistics: UD_English-EWT: POS Tags: DET
There are 21 DET
lemmas (0%), 43 DET
types (0%) and 20098 DET
tokens (8%).
Out of 17 observed tags, the rank of DET
is: 14 in number of lemmas, 15 in number of types and 6 in number of tokens.
The 10 most frequent DET
lemmas: the, a, this, all, some, any, no, that, another, every
The 10 most frequent DET
types: the, a, this, an, all, some, any, no, that, these
The 10 most frequent ambiguous lemmas: a (DET 5353, NUM 7, NOUN 4, X 3, ADV 1), this (DET 1104, PRON 555, ADV 5, NOUN 1), all (DET 563, ADV 86, NOUN 3), some (DET 430, ADV 2), any (DET 404, ADV 8), no (DET 323, INTJ 57, ADV 31, NOUN 2), that (SCONJ 1166, PRON 1111, DET 297, ADV 19), every (DET 114, ADV 1), what (PRON 506, DET 77), both (CCONJ 58, DET 55, ADV 14)
The 10 most frequent ambiguous types: the (DET 9860, PRON 9, ADP 2, PART 1), a (DET 4542, ADP 7, NUM 6, NOUN 4, ADV 2, X 2, ADJ 1, AUX 1, CCONJ 1, PART 1), this (DET 761, PRON 372, ADV 5, NOUN 1), an (DET 592, CCONJ 3), all (DET 502, ADV 87, NOUN 3), some (DET 398, ADV 2, PRON 1), any (DET 360, ADV 10, PRON 2), no (DET 259, INTJ 38, ADV 26, NOUN 2, VERB 2), that (SCONJ 1157, PRON 967, DET 192, ADV 19, ADP 1), these (DET 169, PRON 22)
- the
- a
- DET 4542: Read the entire article ; there ‘s a punchline , too .
- ADP 7: Big deal kind a stuff .
- NUM 6: 2 ) I would like to say on a island with an a ) all inclusive resort ( if possible ) , and a beach front room
- NOUN 4: Top range of bike , cheap prices , excellent a +++
- ADV 2: Also , any tour recommendations would be very helpful a well .
- X 2: A la guerre c’est comme a la guerre !
- ADJ 1: there will be talent and opportunity a plenty on the market soon .
- AUX 1: yea i guess but rabbits a easily escape a pen or another rabbit could get in there and that rabbit could be the opposite gender .
- CCONJ 1: But word of advice if you ‘re get your girlfriend a laptop make sure it s a good brand a not something like DELL , Acer , Asus , eMachines etc .
- PART 1: I feel X - BOX is a very smooth system i own it like 3 years , it s very compatible to previous versions and mostly important i was very comfortable with the User Interface and the JOYSTICK …. coz you do nt wan a hold a joystick that gives you discomfort .
- this
- an
- all
- some
- any
- no
- that
- SCONJ 1157: It is rumored that North Korea has at least a couple nuclear weapons .
- PRON 967: Right now that seems to be the US , EU , and IAEA .
- DET 192: I have sent your question re on line trading to that area .
- ADV 19: it ‘s passable as a pub , but the pizza is not that great .
- ADP 1: Dr Greenwalt fixed my neck from a snowboard injury and was way more effective that a regular doctor .
- these
Morphology
The form / lemma ratio of DET
is 2.047619 (the average of all parts of speech is 1.236432).
The 1st highest number of forms (10) was observed with the lemma “the”: da, he, te, teh, the, then, ther, thes, to, tttthhhhh.
The 2nd highest number of forms (4) was observed with the lemma “a”: a, aa, an, and.
The 3rd highest number of forms (4) was observed with the lemma “some”: $ome, sm, some, ssome.
DET
occurs with 8 features: PronType (20093; 100% instances), Definite (16379; 81% instances), Number (1400; 7% instances), Typo (21; 0% instances), ExtPos (18; 0% instances), NumForm (9; 0% instances), NumType (9; 0% instances), Abbr (3; 0% instances)
DET
occurs with 18 feature-value pairs: Abbr=Yes
, Definite=Def
, Definite=Ind
, ExtPos=ADV
, ExtPos=PRON
, NumForm=Word
, NumType=Frac
, Number=Plur
, Number=Sing
, PronType=Art
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Rcp
, PronType=Rel
, PronType=Tot
, Typo=Yes
DET
occurs with 18 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art
(11017 tokens).
Examples: the
Relations
DET
nodes are attached to their parents using 25 different relations: det (19427; 97% instances), det:predet (221; 1% instances), nsubj (100; 0% instances), obj (89; 0% instances), obl (81; 0% instances), nmod (37; 0% instances), conj (31; 0% instances), root (23; 0% instances), nsubj:outer (17; 0% instances), advmod (15; 0% instances), nsubj:pass (15; 0% instances), reparandum (13; 0% instances), appos (5; 0% instances), obl:npmod (4; 0% instances), nmod:npmod (3; 0% instances), obl:agent (3; 0% instances), advcl (2; 0% instances), fixed (2; 0% instances), parataxis (2; 0% instances), vocative (2; 0% instances), xcomp (2; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), iobj (1; 0% instances), nmod:poss (1; 0% instances)
Parents of DET
nodes belong to 12 different parts of speech: NOUN (17792; 89% instances), PROPN (1428; 7% instances), ADJ (361; 2% instances), VERB (265; 1% instances), PRON (96; 0% instances), NUM (53; 0% instances), SYM (29; 0% instances), ADV (27; 0% instances), (23; 0% instances), DET (15; 0% instances), AUX (6; 0% instances), INTJ (3; 0% instances)
19711 (98%) DET
nodes are leaves.
243 (1%) DET
nodes have one child.
111 (1%) DET
nodes have two children.
33 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 9.
Children of DET
nodes are attached using 23 different relations: nmod (209; 35% instances), case (129; 21% instances), punct (68; 11% instances), acl:relcl (42; 7% instances), cc (28; 5% instances), advmod (24; 4% instances), conj (24; 4% instances), fixed (19; 3% instances), cop (16; 3% instances), nsubj (14; 2% instances), advcl (6; 1% instances), mark (5; 1% instances), discourse (3; 0% instances), parataxis (3; 0% instances), amod (2; 0% instances), aux (2; 0% instances), det (2; 0% instances), appos (1; 0% instances), csubj (1; 0% instances), nmod:npmod (1; 0% instances), nmod:poss (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)
Children of DET
nodes belong to 16 different parts of speech: NOUN (157; 26% instances), ADP (129; 21% instances), PUNCT (68; 11% instances), PRON (57; 9% instances), VERB (50; 8% instances), ADJ (36; 6% instances), CCONJ (28; 5% instances), ADV (20; 3% instances), AUX (18; 3% instances), DET (15; 2% instances), PART (8; 1% instances), PROPN (7; 1% instances), NUM (3; 0% instances), SCONJ (3; 0% instances), INTJ (2; 0% instances), SYM (1; 0% instances)