This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ta/pos issue tracker

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Tamil)

There are 7 DET lemmas (0%), 21 DET types (1%) and 108 DET tokens (1%). Out of 14 observed tags, the rank of DET is: 12 in number of lemmas, 11 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: இந்த, அந்த, எந்த, மிக, அதிகம், அந்தந்த, ஒரு

The 10 most frequent DET types: இந்த, அந்த, இந்தப், எந்த, அந்தப், அந், இந்தத், மிக, மிகப், அதிக

The 10 most frequent ambiguous lemmas: இந்த (DET 60, PRON 2, NOUN 1), அந்த (DET 28, PRON 1), அதிகம் (DET 2, PRON 2), ஒரு (ADJ 21, DET 1)

The 10 most frequent ambiguous types: இந்த (DET 47, PRON 2), அந்தத் (DET 2, PRON 1), ஒரு (ADJ 19, DET 1)

Morphology

The form / lemma ratio of DET is 3.000000 (the average of all parts of speech is 1.557992).

The 1st highest number of forms (7) was observed with the lemma “அந்த”: அந், அந்த, அந்தக், அந்தச், அந்தத், அந்தப், அப்.

The 2nd highest number of forms (6) was observed with the lemma “இந்த”: இச், இந், இந்த, இந்தத், இந்தப், இப்.

The 3rd highest number of forms (3) was observed with the lemma “மிக”: மிக, மிகச், மிகப்.

DET occurs with 1 features: NumType (10; 9% instances)

DET occurs with 1 feature-value pairs: NumType=Card

DET occurs with 2 feature combinations. The most frequent feature combination is _ (98 tokens). Examples: இந்த, அந்த, இந்தப், எந்த, அந்தப், அந், இந்தத், அந்தக், அந்தச், அந்தத்

Relations

DET nodes are attached to their parents using 1 different relations: det (108; 100% instances)

Parents of DET nodes belong to 3 different parts of speech: NOUN (97; 90% instances), ADJ (7; 6% instances), PROPN (4; 4% instances)

108 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]