ADJ: adjective
Definition
Adjectives are words that typically modify nouns and specify their properties or attributes. They may also function as predicates, as in the example below.
Example: [bg] Колата е зелена / Kolata e zelena (The car is green.)
The ADJ tag is intended for ordinary adjectives only. See DET for determiners and NUM for numerals.
In Bulgarian, the BulTreeBank tags that map to the ADJ tag are:
- A# (adjective)
Example: [bg] добър / dobar (good), 7-годишен / 7-godishen (seven-year-old)
- H# (family name adjective)
Example: [bg] Иванова книга / Ivanova kniga (Ivan’s book)
- Mo# (ordinal numeral)
Example: [bg] втори / vtori (second)
- V#car# (present participle)
Example: [bg] идващ / idvasht (coming)
- V#cv# (past passive participle)
Example: [bg] намерен / nameren (found)
- V#cao# (past perfective participle)
Example: [bg] направил / napravil (made)
Note that the symbol '#' in the tags above is a placeholder for an arbitrary number of features. These features are part of the BulTreeBank tag but are irrelevant for the mapping to the Universal tag, so they are suppressed here.
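The mapping listed above can be sketched as a small pattern matcher. This is a minimal illustration, not the conversion code actually used for UD_Bulgarian. It assumes that '#' may stand for any run of characters, including none. The function names `pattern_to_regex` and `maps_to_adj` are hypothetical, and the sample tags in the usage lines are illustrative only.

```python
import re

# BulTreeBank tag patterns listed above; '#' stands for suppressed features.
ADJ_PATTERNS = ["A#", "H#", "Mo#", "V#car#", "V#cv#", "V#cao#"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Turn a '#'-pattern into a regex (assumption: '#' matches any run of characters)."""
    parts = [re.escape(part) for part in pattern.split("#")]
    return re.compile(".*".join(parts))


ADJ_REGEXES = [pattern_to_regex(p) for p in ADJ_PATTERNS]


def maps_to_adj(tag: str) -> bool:
    """Return True if the whole tag matches one of the patterns listed for ADJ."""
    return any(rx.fullmatch(tag) is not None for rx in ADJ_REGEXES)


# Illustrative tags only; they are not quoted from the treebank.
print(maps_to_adj("Amsi"))         # adjective-like tag
print(maps_to_adj("Mofsi"))        # ordinal numeral-like tag
print(maps_to_adj("Mcfsi"))        # cardinal numeral-like tag, not covered by 'Mo#'
print(maps_to_adj("Vpitcv--smi"))  # participle-like tag containing 'cv'
```

Matching on substrings such as `cv` is deliberately naive. A real converter would more likely decode the positional features of each tag instead of relying on wildcard patterns.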
Treebank Statistics (UD_Bulgarian)
There are 3121 ADJ lemmas (20%), 6326 ADJ types (23%) and 13589 ADJ tokens (9%).
Out of 16 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 5 in number of tokens.
The 10 most frequent ADJ lemmas: нов, друг, български, голям, народен, пръв, държавен, европейски, цял, втори
The 10 most frequent ADJ types: други, народното, българската, нова, другите, нови, европейската, последните, 2001, друг
The 10 most frequent ambiguous lemmas: нов (ADJ 301, PROPN 3), български (ADJ 229, ADV 2), голям (ADJ 229, PROPN 1), европейски (ADJ 124, ADV 1), политически (ADJ 107, ADV 3), икономически (ADJ 54, ADV 2), следвам (ADJ 52, VERB 14), мина-(се) (ADJ 50, VERB 37), стар (ADJ 49, PROPN 4), син (ADJ 44, NOUN 20)
The 10 most frequent ambiguous types: 2001 (ADJ 42, NUM 1), 2000 (ADJ 40, NUM 12, PROPN 4), български (ADJ 30, ADV 2), политически (ADJ 33, ADV 3), 1 (NUM 53, ADJ 29, PROPN 1), II (ADJ 18, PROPN 1), останалите (ADJ 10, VERB 1), европейски (ADJ 13, ADV 1), Южна (ADJ 14, PROPN 1), свързани (ADJ 13, VERB 3)
Morphology
The form / lemma ratio of ADJ is 2.026914 (the average of all parts of speech is 1.728233).
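The ratio above can be reproduced from a CoNLL-U file by dividing the number of distinct word forms of a part of speech by its number of distinct lemmas. The following is a minimal sketch under that assumption. The helper names `conllu_line` and `form_lemma_ratio` are hypothetical, and the sample sentences are made up rather than taken from UD_Bulgarian.

```python
def conllu_line(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns are set to '_')."""
    cols = [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    return "\t".join(cols)


# Made-up sample of three two-word sentences separated by blank lines.
SAMPLE = "\n".join([
    conllu_line(1, "голяма", "голям", "ADJ", 2, "amod"),
    conllu_line(2, "кола", "кола", "NOUN", 0, "root"),
    "",
    conllu_line(1, "големи", "голям", "ADJ", 2, "amod"),
    conllu_line(2, "коли", "кола", "NOUN", 0, "root"),
    "",
    conllu_line(1, "нова", "нов", "ADJ", 2, "amod"),
    conllu_line(2, "кола", "кола", "NOUN", 0, "root"),
])


def form_lemma_ratio(conllu_text: str, upos: str) -> float:
    """Distinct forms divided by distinct lemmas for one UPOS tag."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column, case-sensitive
            lemmas.add(cols[2])  # LEMMA column
    return len(forms) / len(lemmas) if lemmas else 0.0


print(form_lemma_ratio(SAMPLE, "ADJ"))
print(form_lemma_ratio(SAMPLE, "NOUN"))
```

With the counts reported in the statistics above, 6326 types over 3121 lemmas gives 6326 / 3121, which rounds to 2.026914.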
The highest number of forms (25) was observed with the lemma “голям”: големи, големите, големия, големият, голям, голяма, голямата, голямо, най-големи, най-големите, най-големия, най-големият, най-голям, най-голяма, най-голямата, най-голямо, най-голямото, по-големи, по-големите, по-големия, по-голям, по-голяма, по-голямата, по-голямо, по-голямото.
The 2nd highest number of forms (21) was observed with the lemma “добър”: Най-добра, добра, добрата, добри, добрите, добрият, добро, доброто, добър, най-добрата, най-добри, най-добрите, най-добрия, най-добрият, най-доброто, най-добър, по-добра, по-добри, по-добрият, по-добро, по-добър.
The 3rd highest number of forms (17) was observed with the lemma “висок”: висок, висока, високата, високи, високите, високия, високо, високото, най-висок, най-високата, най-високите, най-високо, по-висок, по-висока, по-високи, по-високите, по-високо.
ADJ occurs with 10 features: bg-feat/Number (13351; 98% instances), bg-feat/Definite (13292; 98% instances), bg-feat/Degree (11557; 85% instances), bg-feat/Gender (9449; 70% instances), bg-feat/Aspect (1472; 11% instances), bg-feat/VerbForm (1472; 11% instances), bg-feat/Voice (1472; 11% instances), bg-feat/NumType (895; 7% instances), bg-feat/Tense (519; 4% instances), bg-feat/Case (24; 0% instances)
ADJ occurs with 19 feature-value pairs: Aspect=Imp, Aspect=Perf, Case=Voc, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Ord, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Part, Voice=Act, Voice=Pass
ADJ occurs with 128 feature combinations.
The most frequent feature combination is Definite=Ind|Degree=Pos|Number=Plur (1788 tokens).
Examples: други, нови, различни, големи, български, добри, народни, подобни, финансови, военни
Relations
ADJ nodes are attached to their parents using 20 different relations: bg-dep/amod (11863; 87% instances), bg-dep/conj (444; 3% instances), bg-dep/root (383; 3% instances), bg-dep/dobj (312; 2% instances), bg-dep/nmod (189; 1% instances), bg-dep/nsubj (144; 1% instances), bg-dep/ccomp (85; 1% instances), bg-dep/iobj (55; 0% instances), bg-dep/advcl (36; 0% instances), bg-dep/acl (32; 0% instances), bg-dep/csubj (13; 0% instances), bg-dep/nsubjpass (11; 0% instances), bg-dep/xcomp (11; 0% instances), bg-dep/vocative (3; 0% instances), bg-dep/csubjpass (2; 0% instances), bg-dep/discourse (2; 0% instances), bg-dep/appos (1; 0% instances), bg-dep/compound (1; 0% instances), bg-dep/nummod (1; 0% instances), bg-dep/remnant (1; 0% instances)
Parents of ADJ nodes belong to 10 different parts of speech: NOUN (11701; 86% instances), VERB (738; 5% instances), ROOT (383; 3% instances), ADJ (372; 3% instances), PROPN (326; 2% instances), NUM (24; 0% instances), DET (18; 0% instances), ADV (14; 0% instances), PRON (11; 0% instances), PART (2; 0% instances)
11189 (82%) ADJ nodes are leaves.
1071 (8%) ADJ nodes have one child.
512 (4%) ADJ nodes have two children.
817 (6%) ADJ nodes have three or more children.
The highest child degree of an ADJ node is 11.
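The child-degree figures above count, for every ADJ node, how many other nodes name it as their head. A minimal sketch of that count for a single sentence follows. The function name `child_degree_histogram` and the tuple layout are hypothetical, and the part-of-speech tags in the sample analysis are illustrative rather than copied from the treebank.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="ADJ"):
    """Map child count -> number of nodes with that count, for one UPOS tag.

    `sentence` is a list of (node_id, upos, head_id) tuples; head_id 0 is the root.
    """
    # Number of children of each node: how often its id appears as a head.
    children = Counter(head for _, _, head in sentence)
    histogram = Counter()
    for node_id, node_upos, _ in sentence:
        if node_upos == upos:
            histogram[children[node_id]] += 1  # a missing key counts as 0 (a leaf)
    return histogram


# "Колата е зелена ." with the predicate adjective as root (illustrative analysis).
predicative = [(1, "NOUN", 3), (2, "VERB", 3), (3, "ADJ", 0), (4, "PUNCT", 3)]
# "голяма кола" with the adjective as a leaf modifier of the noun.
attributive = [(1, "ADJ", 2), (2, "NOUN", 0)]

print(child_degree_histogram(predicative))
print(child_degree_histogram(attributive))
```

Summing such histograms over all sentences would give the leaf, one-child, two-children and three-or-more buckets. In the figures above those buckets add up to 11189 + 1071 + 512 + 817 = 13589, the total number of ADJ tokens.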
Children of ADJ nodes are attached using 26 different relations: bg-dep/punct (1234; 22% instances), bg-dep/nmod (842; 15% instances), bg-dep/cop (584; 10% instances), bg-dep/advmod (575; 10% instances), bg-dep/nsubj (479; 9% instances), bg-dep/conj (458; 8% instances), bg-dep/det (381; 7% instances), bg-dep/cc (369; 7% instances), bg-dep/case (272; 5% instances), bg-dep/mark (86; 2% instances), bg-dep/neg (66; 1% instances), bg-dep/expl (58; 1% instances), bg-dep/advcl (52; 1% instances), bg-dep/aux (52; 1% instances), bg-dep/discourse (27; 0% instances), bg-dep/acl (22; 0% instances), bg-dep/csubj (18; 0% instances), bg-dep/iobj (12; 0% instances), bg-dep/auxpass (5; 0% instances), bg-dep/dobj (5; 0% instances), bg-dep/nsubjpass (3; 0% instances), bg-dep/amod (1; 0% instances), bg-dep/nummod (1; 0% instances), bg-dep/parataxis (1; 0% instances), bg-dep/remnant (1; 0% instances), bg-dep/vocative (1; 0% instances)
Children of ADJ nodes belong to 15 different parts of speech: PUNCT (1234; 22% instances), NOUN (1064; 19% instances), VERB (733; 13% instances), PRON (635; 11% instances), ADV (590; 11% instances), ADJ (372; 7% instances), CONJ (366; 7% instances), ADP (269; 5% instances), PROPN (108; 2% instances), SCONJ (80; 1% instances), INTJ (72; 1% instances), PART (60; 1% instances), AUX (13; 0% instances), DET (6; 0% instances), NUM (3; 0% instances)