EmoInHindi: A Multi-label Emotion and Intensity Annotated Dataset in Hindi for Emotion Recognition in Dialogues
a) This work is accepted in Language Resources and Evaluation Conference (LREC) 2022.
b) This dataset is intended only for non-commercial, educational and/or research purposes only.  
c) For access to the dataset and any associated queries, please reach us at iitpainlpmlresourcerequest@gmail.com /  priyanshu528priya@gmail.com / gopendra.99@gmail.com
d) The dataset is allowed to be used in any publication, only upon citation.

BibTex:

@inproceedings{singh-etal-2022-emoinhindi,
    title = "{E}mo{I}n{H}indi: A Multi-label Emotion and Intensity Annotated Dataset in {H}indi for Emotion Recognition in Dialogues",
    author = "Singh, Gopendra Vikram  and
      Priya, Priyanshu  and
      Firdaus, Mauajama  and
      Ekbal, Asif  and
      Bhattacharyya, Pushpak",
    booktitle = "Proceedings of the Thirteenth Language Resources and Evaluation Conference",
    month = jun,
    year = "2022",
    address = "Marseille, France",
    publisher = "European Language Resources Association",
    url = "https://aclanthology.org/2022.lrec-1.627",
    pages = "5829--5837",
    abstract = "The long-standing goal of Artificial Intelligence (AI) has been to create human-like conversational systems. Such systems should have the ability to develop an emotional connection with the users, consequently, emotion recognition in dialogues has gained popularity. Emotion detection in dialogues is a challenging task because humans usually convey multiple emotions with varying degrees of intensities in a single utterance. Moreover, emotion in an utterance of a dialogue may be dependent on previous utterances making the task more complex. Recently, emotion recognition in low-resource languages like Hindi has been in great demand. However, most of the existing datasets for multi-label emotion and intensity detection in conversations are in English. To this end, we propose a large conversational dataset in Hindi named EmoInHindi for multi-label emotion and intensity recognition in conversations containing 1,814 dialogues with a total of 44,247 utterances. We prepare our dataset in a Wizard-of-Oz manner for mental health and legal counselling of crime victims. Each utterance of dialogue is annotated with one or more emotion categories from 16 emotion labels including neutral and their corresponding intensity. We further propose strong contextual baselines that can detect the emotion(s) and corresponding emotional intensity of an utterance given the conversational context.",
}  
Sign in to Google to save your progress. Learn more
Email *
Name *
Affiliation (Department/Institute/University you belong to) *
You are *
Address of correspondence *
Contact Information *
How did you come to know about the dataset? *
Describe briefly how you intend to use this dataset? (minimum characters 350) *
Accept Terms *
Required
Submit
Clear form
Never submit passwords through Google Forms.
This content is neither created nor endorsed by Google. Report Abuse - Terms of Service - Privacy Policy