EmoInHindi: A Multi-label Emotion and Intensity Annotated Dataset in Hindi for Emotion Recognition in Dialogues

JavaScript isn't enabled in your browser, so this file can't be opened. Enable and reload.

a) This work is accepted in Language Resources and Evaluation Conference (LREC) 2022.
b) This dataset is intended only for non-commercial, educational and/or research purposes only.
c) For access to the dataset and any associated queries, please reach us at iitpainlpmlresourcerequest@gmail.com / priyanshu528priya@gmail.com / gopendra.99@gmail.com
d) The dataset is allowed to be used in any publication, only upon citation.

BibTex:

@inproceedings{singh-etal-2022-emoinhindi,
title = "{E}mo{I}n{H}indi: A Multi-label Emotion and Intensity Annotated Dataset in {H}indi for Emotion Recognition in Dialogues",
author = "Singh, Gopendra Vikram and
Priya, Priyanshu and
Firdaus, Mauajama and
Ekbal, Asif and
Bhattacharyya, Pushpak",
booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
month = jun,
year = "2022",
address = "Marseille, France",
publisher = "European Language Resources Association",
url = "https://aclanthology.org/2022.lrec-1.627",
pages = "5829--5837",
abstract = "The long-standing goal of Artificial Intelligence (AI) has been to create human-like conversational systems. Such systems should have the ability to develop an emotional connection with the users, consequently, emotion recognition in dialogues has gained popularity. Emotion detection in dialogues is a challenging task because humans usually convey multiple emotions with varying degrees of intensities in a single utterance. Moreover, emotion in an utterance of a dialogue may be dependent on previous utterances making the task more complex. Recently, emotion recognition in low-resource languages like Hindi has been in great demand. However, most of the existing datasets for multi-label emotion and intensity detection in conversations are in English. To this end, we propose a large conversational dataset in Hindi named EmoInHindi for multi-label emotion and intensity recognition in conversations containing 1,814 dialogues with a total of 44,247 utterances. We prepare our dataset in a Wizard-of-Oz manner for mental health and legal counselling of crime victims. Each utterance of dialogue is annotated with one or more emotion categories from 16 emotion labels including neutral and their corresponding intensity. We further propose strong contextual baselines that can detect the emotion(s) and corresponding emotional intensity of an utterance given the conversational context.",

}

Email *

Name *

Affiliation (Department/Institute/University you belong to) *

You are *

Undergraduate Student

Postgraduate Student

Ph.D. Student

Faculty

Other:

Address of correspondence *

Contact Information *

How did you come to know about the dataset? *

Describe briefly how you intend to use this dataset? (minimum characters 350) *

Accept Terms *

I UNDERTAKE TO ATTRIBUTE THE CREATORS OF THIS RESOURCE IN ANY WORKS (PUBLICATIONS, PRESENTATIONS OR OTHER PUBLIC DISSEMINATION OF WORK) UTILIZING THE DATA SET AND ALSO AGREE NOT TO DISSEMINATE THE DATASET WITHOUT PRIOR PERMISSION OF APPROPRIATE AUTHORITIES

Required

Submit

Clear form

Never submit passwords through Google Forms.

This content is neither created nor endorsed by Google. Report Abuse - Terms of Service - Privacy Policy

Forms