On May 22, 2023. starting at 10:00 a.m. organized by the Croatian Competence Center for HPC (https://www.hpc-cc.hr/) with technical support of the Faculty of Electrical Engineering, Computing and Information Technologies of the Josip Juraj Stossmayer University of Osijek (member institution of the Center and project partner) and in cooperation with the Centre of Research Excellence for Data Science and Cooperative Systems (ACROSS–DataScience), within the project "National Competence Centers in the Framework of EuroHPC Phase 2 - EuroCC 2", Digital Europe (
https://www.eurocc-access.eu/), will be held in person (in the room T3-27, FERIT Osijek, Kneza Trpimira 2b, Osijek)) and on-line, workshop "Big Data Analytics and Stream Processing on Apache
Spark". The leader of the workshop is Krešimir Pripužić, a Full Professor at the Faculty of
Electrical Engineering and Computing, University of Zagreb,
Croatia (UNIZG-FER).
Please, express your interest for the workshop participation by entering your name, surname, full name of the institution/company, e-mail and form of participation (in person, on-line) by using this form available at https://forms.gle/18yqEHikok4DNonr7 no later than 18 May 2023 (EoD).
You will receive an invitation to the workshop with a link for participation in time before the workshop.
For any additional questions, you can contact Professor Goran Martinović, Ph.D. via goran.martinovic@ferit.hr.
About the workshop:
Summary: Companies collect enormous amounts of data about their
customers, suppliers, and operations, while billions of connected
devices on the Internet of things (IoT) and we as individuals
additionally produce vast amounts of data. Therefore, we have
witnessed an exponential growth in the amount of newly created
data for more than a decade. We define Big Data as data that
either contains greater variety, arrives in increasing volumes or
is produced with higher velocity. Many different open-source
platforms have been developed recently for dealing with mentioned
challenges using cluster computing, such as the Apache Hadoop YARN
(MapReduce2), Apache Spark, Apache Flink, Apache Storm, etc.
Probably the most popular among them are the Apache Hadoop YARN
and Apache Spark. This workshop briefly presents the Apache Spark
platform and then demonstrates its analytics and stream processing
capabilities. After that, during a hands-on session the attendants
will learn to use the Apache Spark for processing both the
unstructured and structured Big Data.
Biography: Krešimir Pripužić is a Full Professor at the Faculty of
Electrical Engineering and Computing, University of Zagreb,
Croatia (UNIZG-FER), where he leads the Data Streams Laboratory.
He has been affiliated with the Department of Telecommunications
at UNIZGFER since 2003. He received his diploma degree in
electrical engineering with a major in telecommunications and
informatics from the UNIZG-FER in 2003. In 2005 he started his
Ph.D. studies at UNIZG-FER, which he successfully finished in 2010
by defending his dissertation. As a part of his Ph.D. studies, he
spent academic year 2006-2007 at the Distributed Information
Systems Laboratory at EPFL (Ecole Polytechnique Fédérale de
Lausanne), Switzerland, as a scholarship holder of the Swiss
Government scholarship for university, fine arts and music schools
for foreign students. He has co-authored over 40 scientific
journal and conference papers. His main research interests are
large-scale distributed systems, algorithms and data structures,
big data analytics, data stream processing and internet of things.