Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Kurmanji-MG: POS Tags: `NOUN`

There are 1019 NOUN lemmas (52%), 1482 NOUN types (50%) and 2653 NOUN tokens (26%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: sal, ode, nav, tişt, ser, car, dest, Dr., dem, gund

The 10 most frequent NOUN types: sala, oda, Dr., xweha, navê, zirbavê, gund, serê, zimanê, kurdan

The 10 most frequent ambiguous lemmas: nav (NOUN 47, ADP 9), ser (ADP 44, NOUN 27), dest (NOUN 26, X 1), kurdî (ADJ 14, NOUN 13), gotin (VERB 70, NOUN 11), xwarin (NOUN 11, VERB 4), mirin (NOUN 9, VERB 7), hûr (NOUN 7, ADJ 1), zanîn (VERB 19, NOUN 6), başûr (NOUN 5, ADJ 1)

The 10 most frequent ambiguous types: navê (NOUN 18, VERB 2), nav (NOUN 14, ADP 9), kurdî (NOUN 11, ADJ 10), dest (NOUN 12, X 1), caran (NOUN 11, ADV 4), dema (NOUN 11, SCONJ 4), hûr (NOUN 6, ADJ 1), ser (ADP 44, NOUN 7), cihê (NOUN 6, ADJ 1), bin (NOUN 5, ADP 2, AUX 1)

navê
- NOUN 18: Di vê navê de min dengekî din seh kir .
- VERB 2: Holmes : Min ji te tu tişt navê .
nav
- NOUN 14: Holmes serê xwe hejand û got : ev nav ji min re ne jî nenas e .
- ADP 9: Lewra ev gund di nav daristana darên mêşe de cih digire .
kurdî
- NOUN 11: Ji ber vê yekê her çiqas ew bi kurdî jî binivîse li tirkî jî dinivîse .
- ADJ 10: Sala 1940’ê dest bi nivîsandina helbestên kurdî dike .
dest
- NOUN 12: Ji cane xweha min dikir ko tukesî jî dest ne da bûye .
- X 1: Ji bo dest pê kirina şerî çekdarî derbazî Rojhilata Navîn dibe .
caran
- NOUN 11: Çend caran bi destên xwe oda diktor şanî da .
- ADV 4: Gelek caran jî bi şêwazê vegotina nivîskar derbas dibe .
dema
- NOUN 11: Di vê demê de dema şerê cihanê ê yekemin bû .
- SCONJ 4: Lê dema ku mirov bêje Mîrê Botan , hingî jî , mirov behsa mîrê ku li Cizîra Botan mîr e dike .
hûr
- NOUN 6: Stonêr : Tebîb cane wê qelaşt , lê hûr lê _ tu şop jehrê xuya ne kir .
- ADJ 1: Holmes bi pertavsoja xwe jî li wan hûr bû ; lê jê _ jî tu netîce ne xiste destên xwe .
ser
- ADP 44: Ez rabûm ser xwe û derketim lîwanê .
- NOUN 7: Em tev de ji qesirê derketin û di ser çîmênên mêrgê re digeriyan .
cihê
- NOUN 6: Holmes : Belê ev gotin di cihê xwe de ye ; heye ko …
- ADJ 1: Ev her sê ode jî ji hev cihê , yanî ne qulêrî hev in .
bin
- NOUN 5: Lê ew jî di bin barê rehneke giran de ye .
- ADP 2: Di dema derketine Îslamê de Amed di bin destê Bizansan de bû .
- AUX 1: Heger hin nirxên xwînê ne li cih bin , wekî sîmptoman xwe bi nexweşiyan dide der .

Morphology

The form / lemma ratio of NOUN is 1.454367 (the average of all parts of speech is 1.510518).

The 1st highest number of forms (8) was observed with the lemma “dem”: dema, deman, deme, demek, demeke, demekê, demê, demên.

The 2nd highest number of forms (7) was observed with the lemma “mar”: mar, maran, mare, marekî, marê, marên, mêr.

The 3rd highest number of forms (7) was observed with the lemma “ode”: oda, odan, ode, odeke, odeyên, odê, odêyen.

NOUN occurs with 5 features: Case (2621; 99% instances), Gender (2621; 99% instances), Number (2621; 99% instances), Definite (2311; 87% instances), PronType (305; 11% instances)

NOUN occurs with 11 feature-value pairs: Case=Acc, Case=Con, Case=Nom, Case=Voc, Definite=Def, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind

NOUN occurs with 34 feature combinations. The most frequent feature combination is Case=Con|Definite=Def|Gender=Fem|Number=Sing (593 tokens). Examples: sala, oda, xweha, dema, mirina, dibistana, cara, nava, bandora, diya

Relations

NOUN nodes are attached to their parents using 24 different relations: nmod (865; 33% instances), nmod:poss (470; 18% instances), nsubj (436; 16% instances), obj (269; 10% instances), conj (154; 6% instances), root (121; 5% instances), compound:lvc (86; 3% instances), obl:dat (83; 3% instances), fixed (38; 1% instances), flat (24; 1% instances), ccomp (23; 1% instances), parataxis (17; 1% instances), appos (14; 1% instances), acl (11; 0% instances), obl (10; 0% instances), xcomp (10; 0% instances), case (7; 0% instances), advcl (3; 0% instances), compound (3; 0% instances), discourse (3; 0% instances), mark (3; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech: VERB (1441; 54% instances), NOUN (838; 32% instances), (121; 5% instances), ADJ (68; 3% instances), AUX (44; 2% instances), ADP (41; 2% instances), NUM (40; 2% instances), PROPN (40; 2% instances), PRON (12; 0% instances), ADV (2; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), DET (1; 0% instances), X (1; 0% instances)

690 (26%) NOUN nodes are leaves.

962 (36%) NOUN nodes have one child.

455 (17%) NOUN nodes have two children.

546 (21%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 9.

Children of NOUN nodes are attached using 29 different relations: case (1149; 29% instances), nmod:poss (918; 23% instances), punct (323; 8% instances), amod (293; 7% instances), det (201; 5% instances), nmod (174; 4% instances), conj (166; 4% instances), cop (159; 4% instances), nsubj (119; 3% instances), cc (115; 3% instances), nummod (68; 2% instances), acl (67; 2% instances), flat (51; 1% instances), advmod (28; 1% instances), appos (24; 1% instances), mark (21; 1% instances), dep (12; 0% instances), advcl (10; 0% instances), parataxis (8; 0% instances), ccomp (6; 0% instances), fixed (6; 0% instances), discourse (5; 0% instances), obj (3; 0% instances), obl:dat (3; 0% instances), aux (2; 0% instances), advmod:neg (1; 0% instances), compound (1; 0% instances), csubj (1; 0% instances), obl (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (1161; 30% instances), NOUN (838; 21% instances), PRON (360; 9% instances), PUNCT (323; 8% instances), ADJ (300; 8% instances), DET (202; 5% instances), PROPN (172; 4% instances), AUX (163; 4% instances), NUM (132; 3% instances), CCONJ (113; 3% instances), VERB (105; 3% instances), SCONJ (23; 1% instances), ADV (15; 0% instances), PART (13; 0% instances), X (8; 0% instances), INTJ (4; 0% instances), SYM (3; 0% instances)

Treebank Statistics: UD_Kurmanji-MG: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Kurmanji-MG: POS Tags: `NOUN`