DSRP Seminar - Can we connect Vision and Language using Graphs?
Speaker: Mr Brandon Birmingham

Date and time: Wednesday 16th December, 12 pm
Venue: Zoom (https://universityofmalta.zoom.us/j/99201168004?pwd=TE9tMkZWVHN3YVRUZTlvZ0QzMnhHdz09)

A long-standing goal of Artificial Intelligence is to have agents capable of understanding and interpreting the visual world using natural language. The advancements in computing power and the sheer amount of visual and linguistic data available today helps in getting closer to this quest. Research at the intersection of Computer Vision and Natural Language Processing is currently booming and the automatic generation of image captions has recently gained a lot of popularity. Several ideas and architectures have been proposed to machine generate human-like sentences that describe images, but all are short of reaching human-level quality. The focus of this talk is to specifically explore how the graph data structure can be used to connect the vision and language modalities in the context of image caption generation and how such graph-based models compare with the current state-of-the-art deep learning based models.

The Data Science Research Platform (DSRP) at the University of Malta conducts research in the interdisciplinary field of data science. The scope of the group is to use signal processing, machine learning and statistics to develop innovative techniques and to extract useful knowledge from various data sources in an effective manner to benefit the wider public.

For more information about the DSRP, please visit: https://www.um.edu.mt/platform/dsrp

To receive notifications about future events organized by the DSRP, please subscribe to our mailing list: https://groups.google.com/a/um.edu.mt/d/forum/events.dsrp

Sign in to Google to save your progress. Learn more
Email *
Name and surname *
Department *
Submit
Clear form
Never submit passwords through Google Forms.
This form was created inside of University of Malta. Report Abuse