<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">
    jsea
   </journal-id>
   <journal-title-group>
    <journal-title>
     Journal of Software Engineering and Applications
    </journal-title>
   </journal-title-group>
   <issn pub-type="epub">
    1945-3116
   </issn>
   <issn publication-format="print">
    1945-3124
   </issn>
   <publisher>
    <publisher-name>
     Scientific Research Publishing
    </publisher-name>
   </publisher>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="doi">
    10.4236/jsea.2025.181002
   </article-id>
   <article-id pub-id-type="publisher-id">
    jsea-140021
   </article-id>
   <article-categories>
    <subj-group subj-group-type="heading">
     <subject>
      Articles
     </subject>
    </subj-group>
    <subj-group subj-group-type="Discipline-v2">
     <subject>
      Computer Science 
     </subject>
     <subject>
       Communications
     </subject>
    </subj-group>
   </article-categories>
   <title-group>
    Audiovisual Art Event Classification and Outreach Based on Web Extracted Data
   </title-group>
   <contrib-group>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Andreas
      </surname>
      <given-names>
       Giannakoulopoulos
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Minas
      </surname>
      <given-names>
       Pergantis
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Aristeidis
      </surname>
      <given-names>
       Lamprogeorgos
      </given-names>
     </name>
    </contrib>
    <contrib contrib-type="author" xlink:type="simple">
     <name name-style="western">
      <surname>
       Stella
      </surname>
      <given-names>
       Lampoura
      </given-names>
     </name>
    </contrib>
   </contrib-group> 
   <aff id="affnull">
    <addr-line>
     aDepartment of Audio and Visual Arts, Ionian University, Corfu, Greece
    </addr-line> 
   </aff> 
   <pub-date pub-type="epub">
    <day>
     06
    </day> 
    <month>
     01
    </month>
    <year>
     2025
    </year>
   </pub-date> 
   <volume>
    18
   </volume> 
   <issue>
    01
   </issue>
   <fpage>
    24
   </fpage>
   <lpage>
    43
   </lpage>
   <history>
    <date date-type="received">
     <day>
      20,
     </day>
     <month>
      December
     </month>
     <year>
      2024
     </year>
    </date>
    <date date-type="published">
     <day>
      14,
     </day>
     <month>
      December
     </month>
     <year>
      2024
     </year> 
    </date> 
    <date date-type="accepted">
     <day>
      14,
     </day>
     <month>
      January
     </month>
     <year>
      2025
     </year> 
    </date>
   </history>
   <permissions>
    <copyright-statement>
     © Copyright 2014 by authors and Scientific Research Publishing Inc. 
    </copyright-statement>
    <copyright-year>
     2014
    </copyright-year>
    <license>
     <license-p>
      This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
     </license-p>
    </license>
   </permissions>
   <abstract>
    The World Wide Web provides a wealth of information about everything, including contemporary audio and visual art events, which are discussed on media outlets, blogs, and specialized websites alike. This information may become a robust source of real-world data, which may form the basis of an objective data-driven analysis. In this study, a methodology for collecting information about audio and visual art events in an automated manner from a large array of websites is presented in detail. This process uses cutting edge Semantic Web, Web Search and Generative AI technologies to convert website documents into a collection of structured data. The value of the methodology is demonstrated by creating a large dataset concerning audiovisual events in Greece. The collected information includes event characteristics, estimated metrics based on their text descriptions, outreach metrics based on the media that reported them, and a multi-layered classification of these events based on their type, subjects and methods used. This dataset is openly provided to the general and academic public through a Web application. Moreover, each event’s outreach is evaluated using these quantitative metrics, the results are analyzed with an emphasis on classification popularity and useful conclusions are drawn concerning the importance of artistic subjects, methods, and media.
   </abstract>
   <kwd-group> 
    <kwd>
     Web Data Extraction
    </kwd> 
    <kwd>
      Art Events
    </kwd> 
    <kwd>
      Classification
    </kwd> 
    <kwd>
      Artistic Outreach
    </kwd> 
    <kwd>
      Online Media
    </kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <sec id="s1">
   <title>1. Introduction and State-of-the-Art</title>
   <p>Over the years, the World Wide Web has emerged as the primary means of communication and information sharing, resulting in significant shifts in how people understand the process of consuming information <xref ref-type="bibr" rid="scirp.140021-1">
     [1]
    </xref>. In this contemporary landscape, information about artistic works and cultural heritage artifacts has also migrated into the digital realm through digitization <xref ref-type="bibr" rid="scirp.140021-2">
     [2]
    </xref>. To provide users with an engaging experience when interacting with content related to art and culture, it is crucial to comprehend their behavioral patterns <xref ref-type="bibr" rid="scirp.140021-3">
     [3]
    </xref> and to utilize that insight in crafting a user experience (UX) that caters to the distinct needs of both the users <xref ref-type="bibr" rid="scirp.140021-4">
     [4]
    </xref> and the content itself. This necessity is particularly heightened in outlets presenting vast amounts of information, as the user interface (UI) is an essential instrument to mitigate user confusion <xref ref-type="bibr" rid="scirp.140021-5">
     [5]
    </xref> and enhance engagement by prioritizing simplicity and familiarity.</p>
   <p>It is safe to say that a key aspect of sharing cultural events, including various audiovisual art projects, is their visibility on the Internet <xref ref-type="bibr" rid="scirp.140021-6">
     [6]
    </xref>. Online media platforms, cultural websites, blogs, social media, and other digital repositories offer abundant information about art events. However, news coverage available on the web is frequently both scattered and short-lived, influenced by the differing interests of traditional media and user-generated content <xref ref-type="bibr" rid="scirp.140021-7">
     [7]
    </xref> <xref ref-type="bibr" rid="scirp.140021-8">
     [8]
    </xref>, as well as the popularity of algorithm-driven news distribution systems <xref ref-type="bibr" rid="scirp.140021-9">
     [9]
    </xref>. To gather more meaningful insights into the traits of modern audiovisual art projects, it is essential to extract information from a variety of sources. Additionally, the process of pinpointing and gathering details regarding a specific event from different websites can result in a well-informed assessment of its potential reach, which may subsequently affect its cultural significance <xref ref-type="bibr" rid="scirp.140021-10">
     [10]
    </xref>.</p>
   <p>In order to collect large-scale information from the Web in an automated manner, it is essential to programmatically analyze the structure of Web pages and their metadata, as they appear in social media data-graphs or Semantic Web integrations <xref ref-type="bibr" rid="scirp.140021-11">
     [11]
    </xref>. Additionally, this data must be recorded in a manner that takes advantage of the benefits of Semantic Web ontologies and data models. The Semantic Web arose as an effort to enhance how information is structured online to improve its ability to convey meanings in a manner that digital systems can interpret. This process includes the systematic arrangement of data into standardized, subject-oriented hierarchies known as ontologies <xref ref-type="bibr" rid="scirp.140021-12">
     [12]
    </xref>. Ontologies serve as a means of comprehending and organizing web information and lie at the heart of the Semantic Web, facilitating the development of a versatile digital space that acts not just as a collection of relevant data but as a dynamic, intelligent, and efficient environment with new opportunities for advanced applications and services <xref ref-type="bibr" rid="scirp.140021-13">
     [13]
    </xref>. Although computers do not “understand” information like humans do, they can efficiently relate and manage it so that the outcome becomes meaningful for human users <xref ref-type="bibr" rid="scirp.140021-14">
     [14]
    </xref>. Scholars in Web science are continuously working on research and technological advancements to actualize the Semantic Web concept <xref ref-type="bibr" rid="scirp.140021-15">
     [15]
    </xref>. For instance, the RDF/XML standard has shown to be challenging to apply in scenarios with high complexity <xref ref-type="bibr" rid="scirp.140021-16">
     [16]
    </xref>, whereas newer technologies such as N-Triplets <xref ref-type="bibr" rid="scirp.140021-17">
     [17]
    </xref>, Turtle <xref ref-type="bibr" rid="scirp.140021-18">
     [18]
    </xref>, and JSON-LD <xref ref-type="bibr" rid="scirp.140021-19">
     [19]
    </xref> offer more readable structures, enabling better grammar and smoother mapping from XML syntax to RDF. As a result, the technologies capable of realizing the essential Semantic Web are evolving and will continue to develop, moving closer to achieving semantic understanding of data by machines.</p>
   <p>Going one step further, Generative Artificial Intelligence (GenAI) can be used to both analyze html documents and derive structured information from their texts. AI technologies have a long history of successful application on text structuring and summarization <xref ref-type="bibr" rid="scirp.140021-20">
     [20]
    </xref> which has become even more powerful with the advances in AI and the introduction of the transformer architecture, which gave birth to GenAI as we know it <xref ref-type="bibr" rid="scirp.140021-21">
     [21]
    </xref>.</p>
   <p>Mining information from the Web may lead to large objective datasets which are quintessential in statistical analysis. In recent years, there has been a growing interest among researchers around big data and computational engineering. Big data is defined as an extensive collection of datasets that conventional data processing systems cannot analyze; instead, they demand innovative automated technologies for tasks like capturing, storing, distributing, managing, and analyzing <xref ref-type="bibr" rid="scirp.140021-22">
     [22]
    </xref>. Online news items are an example of such datasets. This vast and varied information is unmanageable through manual processing, which involves breaking it down into sentences or words to analyze and extract valuable insights <xref ref-type="bibr" rid="scirp.140021-23">
     [23]
    </xref>. Additionally, significant attention from researchers has been directed toward the application of big data, highlighting its role as a crucial indicator of a company’s efficiency and productivity <xref ref-type="bibr" rid="scirp.140021-24">
     [24]
    </xref>. This use of big data has also found its way into social sciences and culture in the form of cultural analytics which may be defined as “the analysis of massive cultural data sets and flows using computational and visualization techniques” <xref ref-type="bibr" rid="scirp.140021-25">
     [25]
    </xref>.</p>
   <p>In this study, modern innovative technologies are utilized to create a large objective dataset from the Web concerning audiovisual art events as they are presented by online media outlets and other related websites. Via the use of programmatic Web Search, Semantic Web analysis and the power of GenAI structured information is collected about the various aspects of an event’s online presence. This information is quantified and processed to derive objective measurements, which are used for statistical analysis. This article presents the detailed methodology of both data collection and an analysis of real-world data regarding contemporary works of audio and visual art in Greece. The collected data are openly provided to the general public through an innovative Web application which focuses on user experience and functionality <xref ref-type="bibr" rid="scirp.140021-26">
     [26]
    </xref> available at <xref ref-type="bibr" rid="scirp.140021-https://repo.artdata.gr/">
     https://repo.artdata.gr/
    </xref> and through an extended application programming interface. The analysis focuses on the classification of the collected events based on the event type, the methods and techniques used in the event as well as the events’ thematic elements. By investigating both the popularity of specific classification tags and the popularity and perceived cultural outreach of individual events described by these tags, the study demonstrates how objective Web data may be collected, analyzed and used to reach significant conclusions about the importance of various artistic subjects, methods and media in contemporary culture. The following research questions concerning the Greek audiovisual event landscape were established as a means to guide the research direction:</p>
   <p>RQ1. What are the most popular classifications of audiovisual art events regarding event type, methods, and subjects?</p>
   <p>RQ2. What are the classifications of audiovisual art events regarding event type, methods and subjects that are most influential based on event outreach index?</p>
  </sec><sec id="s2">
   <title>2. Methodology</title>
   <p>The algorithm for data extraction is made up of three primary components. The initial phase involves searching through media websites, which, after being processed, results in a collection of audiovisual events. The second and most important component includes an iterative process that seeks additional publications related to each of these events on the internet, transforming the content of these publications from code and free text found on any webpage into a graph of structured data that is integrated into the recorded information about the event in the system’s database. Additionally, during this phase, publications that are extracted but do not relate to an event contribute to the formation of new events for further investigation. Lastly, the third component investigates the websites where the publications retrieved during the event inquiry originated and gathers information from them.</p>
   <sec id="s2_1">
    <title>2.1. Event Data Web Extraction Algorithm</title>
    <p>Before classifying and analyzing the outreach of an audiovisual event, it is important to generate a record highlighting its characteristics. The data extraction algorithm begins by collecting initial events, discovering them from means that distribute these events online. It starts from specific web pages that publish art and culture events, programmatically extracting URLs of relevant publications through web mining. Once a list of publication URLs is generated, the algorithm retrieves the content of each individual web page. The information is then integrated into a data graph that fits the system’s data model, utilizing semantic web technologies to identify web page structure. GenAI methods are employed to analyze the texts and to extract structured data and summaries, which are stored in the system’s relational database. These structured data include classification tags based on event type, themes, and media. The next step involves inferring events from these publications. The algorithm compares the data with existing event records to determine if a publication is related to an already documented event. If a match is found, the publication information is integrated into the event details. If not, a new event record is created, increasing the total number of recorded events, and preparing the analysis of the outreach of each event.</p>
    <p>The discovery of the full range of a specific event’s classification tags and the analysis of its Web outreach starts by utilizing a search engine API to locate more publications related to the event’s title and location. The API provides distinct URLs that contain pertinent content, which will be the basis for further examination. Following this, the algorithm gathers additional information from these web pages. By employing GenAI, it determines if the content pertains to an audiovisual event. If it confirms this, the text is converted into a structured data graph which includes classification tags and recorded in the database as a new publication entry. The newly added publication is subsequently assessed for its relevance to the original event using spatial and temporal derived from the event’s title and description. If it is deemed relevant, the information and classification tags are integrated into the event; if not, they are added to the database to be investigated later. This procedure is repeated for every event in the database, and upon completion, a final step of publication comparison and media exploration is carried out.</p>
    <p>Every publication that results from the search for related materials using the search engine API, as outlined above, comes from a distinct website that might not be part of the original list of websites from which the initial publications were obtained. In this scenario, the algorithm retrieves the main web page of that website and looks for information regarding the media outlet, using the semantic data supplied by the web page. The media exploration process establishes a fundamental record of the content provider in the system’s database, which is then further enhanced with details about the outlet by human editors. This additional data includes information on its popularity and social media presence, which is valuable for exploring the extent of event dissemination.</p>
   </sec>
   <sec id="s2_2">
    <title>2.2. Outreach Analysis Algorithm</title>
    <p>To analyze the extensive data gathered and stored in the system’s database, it was essential to quantify the distinct characteristics associated with the impact of each event. The intrinsic characteristics of the events, which can be measured directly, played a crucial role in this process, such as the total number of publications related to each event, or the count of contributors involved. Furthermore, by examining the textual content of these publications, quantitative assessments concerning the extent of each production and the target audience it addresses were derived with the assistance of GenAI. Concurrently, attention was also directed toward the attributes of the various media outlets that disseminate information about each event, particularly focusing on quantitative measures of their popularity, like website traffic statistics and social media presence size. Lastly, the popularity of the tags used to describe and categorize each event was assessed by considering the number of other collected events that share similar classification tags.</p>
    <p>A key component of the algorithm designed to estimate the dissemination of different audiovisual art events was assessing the corresponding popularity of the online media outlets and websites that covered these events. This popularity was determined by the online visibility of the websites, which was characterized by their overall monthly traffic, backlinks, organic search rankings, ranking status in Greece, and their domain authority. Moreover, the presence of the media on various social media platforms and the size of their follower or subscriber counts on official accounts was also investigated. The following attributes were used in these estimates, each with a different weight:</p>
    <p>In terms of Web popularity, the total number of visitors to the website was considered the most crucial factor and therefore received the highest weight. Backlinks serve as a significant indicator of how integrated the website is within a broader informational network, and for this reason, it was also assigned a substantial weight. This was succeeded by the quantity of organic results, which pertain solely to visits originating from search engines. Lastly, both the Greek ranking and domain authority can reflect a domain’s local or topic-specific significance, and for this reason, they were included as well, though with lesser weights. All the metrics mentioned were sourced through third-party services like Similar Web and Neil Patel.</p>
    <p>Regarding social media popularity, the platforms chosen were Facebook, YouTube, Instagram, TikTok, X (Twitter), and LinkedIn. The first four are among the most visited according to SimilarWeb, while X (Twitter) was selected for its crucial role in spreading news and LinkedIn for its relevance in the business sector. Platforms that primarily focus on messaging, such as WhatsApp, WeChat, and Messenger, or those with limited presence in Greece, like eastern platforms (Douyin, Baidu, etc.), were excluded. YouTube received the highest rating due to its significance in delivering audiovisual content, followed closely by the well-established and widely used Facebook. X (Twitter) ranked next in importance because of its function as a news distributor. Although TikTok’s audiovisual characteristics are significant, its narrower audience base resulted in a moderate score. Lastly, Instagram, with its restrictions on URL linking, and LinkedIn, which is focused on commerce, are also notable but less so than the others.</p>
    <p>Moving from media outlet popularity to specific event outreach, a series of measurements were devised to quantitatively describe each individual audiovisual art event. These metrics included the weighted harmonic mean of the Web popularity and the Social Media popularity of each website that reported on the event, the total number of publications discovered during the web search process, the total number of contributors identified from the event publications’ texts, the intended audience size and the estimated production cost as inferred with the help of GenAI through the textual descriptions of the event on the various media outlets and finally the harmonically weighted sum of events that shared the same classification tags as the investigated event based on type, methods and subject.</p>
    <p>The various metrics used for both the estimation of media outlet popularity and event outreach were normalized using logarithmic and z-score methods, depending on the number of outliers detected in each metric. The total popularity indexes were calculated using weighted means which gave increasing importance to the various metrics in accordance with the order they are presented in the list above. Finally, all the event related metrics were also consolidated into a singular event outreach metric using an equivalent weighted average methodology. The different quantitative metrics for each event and their weights are as follows:</p>
    <p>Popularity through Web and social media, which involves various objective characteristics of the different media outlets, was identified as the most critical aspect, closely followed by the number of publications, as it serves as a valuable objective indicator of outreach. The frequency of each classification tag was assessed in a way that ensures that overall the various classifications’ significance was considered moderately important. The values derived from publication texts were rated just slightly lower, as they provide additional insights, but may be subjective or inconsistent due to the nature of GenAI. Lastly, the metric with the least weight was the number of contributors, which offers some intrinsic value in reflecting the scale of an event, but is less reliable since media outlets often fail to report every contributor.</p>
    <p>This singular index can be used as a strong indicator of individual event popularity to draw conclusions regarding the relationship between the various classification tags, their own popularity as well as the popularity of the events they describe.</p>
    <p>
     <xref ref-type="fig" rid="fig1">
      Figure 1
     </xref> presents a detailed flowchart of the outreach analysis algorithm.</p>
   </sec>
  </sec><sec id="s3">
   <title>3. Results</title>
   <p>During a data collection period that took place in two separate phases in the Spring and Summer of 2024 over 16,500 different audiovisual art events in Greece were discovered. These events were reported in over 30 k unique webpages from over 2.4 k different media outlets and websites. The generation of classification tags based on the texts of these discovered publications lead to over 1000 different tags describing the type of the events, over 1700 different tags describing the methods and techniques used in the events and over 4500 different tags describing the events’ themes and subject matter. <xref ref-type="table" rid="table1">
     Table 1
    </xref> presents a summary of the collected information.</p>
   <fig id="fig1" position="float">
    <label>Figure 1</label>
    <caption>
     <title>Figure 1. Outreach analysis algorithm flowchart.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId22.jpeg?20250120044740" />
   </fig>
   <table-wrap id="table1">
    <label>
     <xref ref-type="table" rid="table1">
      Table 1
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.140021-"></xref>Table 1. Summary of the web extraced information.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td custom-top-td aleft" width="50.00%"><p style="text-align:left"></p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="50.00%"><p style="text-align:left"># of occurrences</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="50.00%"><p style="text-align:center">Events</p></td> 
      <td class="custom-top-td acenter" width="50.00%"><p style="text-align:center">16,501</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="50.00%"><p style="text-align:center">Publications</p></td> 
      <td class="acenter" width="50.00%"><p style="text-align:center">30,803</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="50.00%"><p style="text-align:center">Media</p></td> 
      <td class="acenter" width="50.00%"><p style="text-align:center">2456</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="50.00%"><p style="text-align:center">Type tags</p></td> 
      <td class="acenter" width="50.00%"><p style="text-align:center">1149</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="50.00%"><p style="text-align:center">Method tags</p></td> 
      <td class="acenter" width="50.00%"><p style="text-align:center">1775</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td acenter" width="50.00%"><p style="text-align:center">Subject tags</p></td> 
      <td class="custom-bottom-td acenter" width="50.00%"><p style="text-align:center">4698</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <p>In order to gain insights into the popularity of the various classification tags the total sum of events which were described using one such tag was calculated. This led to a ranking based on how plentiful were the events which belonged to that specific tag. <xref ref-type="table" rid="table2">
     Table 2
    </xref> presents the top 20 most popular tags describing an event’s type. Since the data extraction process involved Greek events and publications mainly in the Greek language, the tag names have been translated into English for the reader’s benefit.</p>
   <table-wrap id="table2">
    <label>
     <xref ref-type="table" rid="table2">
      Table 2
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.140021-"></xref>Table 2. Top 20 most popular event type tags.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td custom-top-td aleft" width="20.17%"><p style="text-align:left">Ranking</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="39.99%"><p style="text-align:left">Tag name</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="47.48%"><p style="text-align:left"># of occurrences</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="20.17%"><p style="text-align:center">1</p></td> 
      <td class="custom-top-td acenter" width="39.99%"><p style="text-align:center">Music</p></td> 
      <td class="custom-top-td acenter" width="47.48%"><p style="text-align:center">8247</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">2</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Concert</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">4922</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">3</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Culture</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">4397</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">4</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Theater</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">4155</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">5</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Cinema</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">1151</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">6</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Festival</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">1002</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">7</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Art</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">921</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">8</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Comedy</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">767</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">9</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Exhibition</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">680</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">10</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Classical Music</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">663</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">11</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Visuals</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">650</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">12</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Dance</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">647</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">13</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Documentary</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">502</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">14</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Live Performance</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">494</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">15</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Cultural Event</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">488</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">16</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">History</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">469</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">17</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Performance</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">435</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">18</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Feature</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">370</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="20.17%"><p style="text-align:center">19</p></td> 
      <td class="acenter" width="39.99%"><p style="text-align:center">Visual Arts</p></td> 
      <td class="acenter" width="47.48%"><p style="text-align:center">326</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td acenter" width="20.17%"><p style="text-align:center">20</p></td> 
      <td class="custom-bottom-td acenter" width="39.99%"><p style="text-align:center">Drama</p></td> 
      <td class="custom-bottom-td acenter" width="47.48%"><p style="text-align:center">321</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <p>The distribution of events over these tags is presented more clearly in <xref ref-type="fig" rid="fig2">
     Figure 2
    </xref> in the form of a pie chart. According to the chart music related events are dominant in the landscape of audiovisual art events, with the first most popular tags “music” and “concert” culminating in over 40% of the events. The performative arts follow with “theater” and “cinema” being in the top 5 most popular tags alongside the more generic tag of “culture”. The top 20 is populated with other general types of art events such as “exhibitions”, “visual art” and “dance”, but also includes more specific subcategories such as “comedy”, “classical music” and “drama”.</p>
   <fig id="fig2" position="float">
    <label>Figure 2</label>
    <caption>
     <title>Figure 2. Pie chart of the distribution of type tags.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId23.jpeg?20250120044740" />
   </fig>
   <p>Going beyond simple event type categorization, the top 20 tags describing the means, methods and techniques used during the various audiovisual art events are presented in <xref ref-type="table" rid="table3">
     Table 3
    </xref>.</p>
   <table-wrap id="table3">
    <label>
     <xref ref-type="table" rid="table3">
      Table 3
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.140021-"></xref>Table 3. Top 20 most popular event method tags.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td custom-top-td aleft" width="18.75%"><p style="text-align:left">Ranking</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="37.14%"><p style="text-align:left">Tag name</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="44.11%"><p style="text-align:left"># of occurrences</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="18.75%"><p style="text-align:center">1</p></td> 
      <td class="custom-top-td acenter" width="37.14%"><p style="text-align:center">Live Music</p></td> 
      <td class="custom-top-td acenter" width="44.11%"><p style="text-align:center">4650</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">2</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Theatrical Performance</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">2461</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">3</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Music</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">2014</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">4</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Direction</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1668</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">5</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Acting</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1461</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">6</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Dance</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1433</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">7</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Concert</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1382</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">8</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Vocals</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">697</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">9</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Song Performance</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">673</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">10</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Piano</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">646</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">11</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Painting</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">646</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">12</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Singing</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">558</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">13</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Narration</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">555</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">14</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Musical Performance</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">549</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">15</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Guitar</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">523</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">16</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Dramaturgy</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">480</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">17</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Choir</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">478</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">18</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Discussion</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">468</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">19</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Drums</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">450</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td acenter" width="18.75%"><p style="text-align:center">20</p></td> 
      <td class="custom-bottom-td acenter" width="37.14%"><p style="text-align:center">Speech</p></td> 
      <td class="custom-bottom-td acenter" width="44.11%"><p style="text-align:center">384</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <p>How the various tags relating to audiovisual art methods are distributed over the discovered events is presented more clearly in <xref ref-type="fig" rid="fig3">
     Figure 3
    </xref> in the form of a pie chart.</p>
   <fig id="fig3" position="float">
    <label>Figure 3</label>
    <caption>
     <title>Figure 3. Pie chart of the distribution of method tags.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId24.jpeg?20250120044740" />
   </fig>
   <p>Continuing the trend observed in <xref ref-type="fig" rid="fig2">
     Figure 2
    </xref>, the more musical methods are dominant with “live music” getting first place with over 20% of the events and accompanied by other related tags such as “music”, “concert”, “vocals” and more. The performative art methods and means also play a dominant role with “theatrical performance”, “direction” and “acting” all appearing in the top 5. The chart also includes the performance of various musical instruments such as “piano”, “guitar” and “drums”, as well as methods of storytelling, including “dance”, “dramaturgy”, “narration” and “speech”. “Painting” is also present in the top 20 as a representative of the fine arts.</p>
   <p>Finally, the third category of classification tags included the themes and subjects of the events. <xref ref-type="table" rid="table4">
     Table 4
    </xref> presents the top 20 most popular subject tags and the number of events described by each tag.</p>
   <p>The distribution of the above tags can be presented in a more comprehensive manner through the pie chart available in <xref ref-type="fig" rid="fig4">
     Figure 4
    </xref>.</p>
   <table-wrap id="table4">
    <label>
     <xref ref-type="table" rid="table4">
      Table 4
     </xref></label>
    <caption>
     <title>
      <xref ref-type="bibr" rid="scirp.140021-"></xref>Table 4. Top 20 most popular event subject tags.</title>
    </caption>
    <table class="MsoTableGrid custom-table" border="0" cellspacing="0" cellpadding="0"> 
     <tr> 
      <td class="custom-bottom-td custom-top-td aleft" width="18.75%"><p style="text-align:left">Ranking</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="37.14%"><p style="text-align:left">Tag name</p></td> 
      <td class="custom-bottom-td custom-top-td aleft" width="44.11%"><p style="text-align:left"># of occurrences</p></td> 
     </tr> 
     <tr> 
      <td class="custom-top-td acenter" width="18.75%"><p style="text-align:center">1</p></td> 
      <td class="custom-top-td acenter" width="37.14%"><p style="text-align:center">Music</p></td> 
      <td class="custom-top-td acenter" width="44.11%"><p style="text-align:center">2559</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">2</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Greek Music</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1515</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">3</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Concert</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1406</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">4</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Cultural Heritage</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1211</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">5</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Culture</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">1010</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">6</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Musical Tradition</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">905</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">7</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Classical Music</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">855</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">8</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">History</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">768</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">9</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Art</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">600</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">10</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Social Issues</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">582</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">11</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Love</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">526</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">12</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Personal Relationships</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">501</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">13</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Comedy</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">482</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">14</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Friendship</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">441</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">15</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Romance</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">437</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">16</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Artistic Expression</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">429</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">17</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Tradition</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">416</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">18</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Folk Music</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">364</p></td> 
     </tr> 
     <tr> 
      <td class="acenter" width="18.75%"><p style="text-align:center">19</p></td> 
      <td class="acenter" width="37.14%"><p style="text-align:center">Live</p></td> 
      <td class="acenter" width="44.11%"><p style="text-align:center">354</p></td> 
     </tr> 
     <tr> 
      <td class="custom-bottom-td acenter" width="18.75%"><p style="text-align:center">20</p></td> 
      <td class="custom-bottom-td acenter" width="37.14%"><p style="text-align:center">Cinema</p></td> 
      <td class="custom-bottom-td acenter" width="44.11%"><p style="text-align:center">354</p></td> 
     </tr> 
    </table>
   </table-wrap>
   <fig id="fig4" position="float">
    <label>Figure 4</label>
    <caption>
     <title>Figure 4. Pie chart of the distribution of subject tags.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId25.jpeg?20250120044741" />
   </fig>
   <p>As presented in <xref ref-type="fig" rid="fig4">
     Figure 4
    </xref> the importance of musical events is conveyed by the multitude of musical styles that are dominant in the top 20 subject tags. Going beyond the very general “music” and “concert” we get “Greek music”, “musical tradition”, “classical music” and “Laïko music” (which is a very prevalent blend of folk and popular music in Greece) <xref ref-type="bibr" rid="scirp.140021-27">
     [27]
    </xref>. “Culture” and “cultural heritage” are also popular tags and alongside “history” and “tradition” indicate a tendency for many events to focus thematically on Greece’s rich cultural past. Themes and subjects popular in performative art are also included in the chart with “social issues”, “love”, “romance”, “friendship” and “personal relationships” all being prevalent in the top 20 most popular subject tags.</p>
   <p>
    <xref ref-type="table" rid="tableTables 2-4">
     Tables 2-4
    </xref> and <xref ref-type="fig" rid="figFigures 2-4">
     Figures 2-4
    </xref> give an interesting glimpse on the landscape of audiovisual arts in contemporary Greece, based on the popularity of the various classification tags that were mined from texts reporting on audiovisual art events on the Web. These real-world data represent an objective reality as reported through the web. Music, performative art, and other forms of expression are all represented in various measurable degrees. But the number of events for each specific classification tag cannot be the only indicator. The difference in popularity, outreach and cultural impact between various events also plays a significant role.</p>
   <p>Out of the total 16,501 events a total of 2685 were fully investigated in terms of outreach using the web search algorithm to collect exhaustive information about them as described in the methodology section and using the outreach analysis algorithm to calculate their various outreach metrics. The culmination of these outreach measurements in one singular index allowed us to gauge the popularity of each event. For each classification tag that appeared in at least ten different fully investigated events an average outreach index was calculated. The threshold of ten events was established in order to have a representative sample of events before calculating the average. <xref ref-type="fig" rid="fig5">
     Figure 5
    </xref> presents the top 20 more impactful event type as a bar chart of the average outreach index of each type.</p>
   <fig id="fig5" position="float">
    <label>Figure 5</label>
    <caption>
     <title>Figure 5. Bar chart of the type tags with the highest average outreach index.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId26.jpeg?20250120044740" />
   </fig>
   <p>Through the bar chart of the top 20 type tags with the highest average outreach index a series of highly impactful event type subcategories are established. Musical event genres such as “pop”, “rock”, “classical”, “opera” and “Greek music” all get higher places than the wider general tags like “music” got, indicating a higher outreach of these specific genres. Additionally, “audiovisual art” as a tag makes a strong appearance indicating a higher outreach for events with a clearer artistic identity. At the same time niche types like “improvisation” or “animation” make an appearance, and this is rounded up by some more general themes like “charity” or “LGBTQ+” issue related events.</p>
   <p>Moving on to the audiovisual event classification tags for methods, techniques and media, <xref ref-type="fig" rid="fig6">
     Figure 6
    </xref> presents the top 20 tags with the highest average outreach index as a bar chart.</p>
   <fig id="fig6" position="float">
    <label>Figure 6</label>
    <caption>
     <title>Figure 6. Bar chart of the method tags with the highest average outreach index.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId27.jpeg?20250120044741" />
   </fig>
   <p>As made clear by the bar chart, more technical presentation methods and techniques utilizing “audiovisual effects”, “visual effects”, “audiovisual media” and “electronic instruments” present a high average outreach index. Moreover, general technical aspects of an event are also highly important with “sound effects”, “stage presentation”, “musical arrangement” and “instrumental performance” making an appearance in the top spots. The high production values related to the use of modern technology and high technical implementation appear to have a big impact on the outreach of the events.</p>
   <p>Finally, through <xref ref-type="fig" rid="fig7">
     Figure 7
    </xref>, we take a glimpse into the top 20 event classification tags for subjects and themes with the highest average outreach index in the form of a bar chart.</p>
   <p>On the matter of subject classification tags, seasonal or annual themes have the highest average outreach index. The annual “Eurovision” song contest with its multitude of preliminary, semifinal, and final events appears in the top spot, followed by “New Year’s”, “celebration”, “anniversary” and “anniversary concert” in the top 20, indicating the high outreach of seasonal events. The tag “Maria Callas” which also appears on the top 20 list is related to a multitude of events in tribute to 100 years from the artist’s birth, which also adds to the importance of anniversary related events. Music themes revolving around the “artist collaboration”, the “artistic collaboration” and “music career” point to a high outreach for events that have added significance due to scope or multitude of participants, while popular musical genres such as “pop”, “pop music” and “Greek songs” also allow for high outreach. Some tags related to “Greek culture”, such as “Greek tradition”, and “musical heritage” also make an appearance.</p>
   <fig id="fig7" position="float">
    <label>Figure 7</label>
    <caption>
     <title>Figure 7. Bar chart of the method tags with the highest average outreach index.</title>
    </caption>
    <graphic mimetype="image" position="float" xlink:type="simple" xlink:href="https://html.scirp.org/file/9303375-rId28.jpeg?20250120044741" />
   </fig>
  </sec><sec id="s4">
   <title>4. Discussion</title>
   <p>The tables and figures presented in the results section clearly show that the popularity of the classification tag itself and the popularity of the events that are described by such a tag in terms of outreach on the World Wide Web are significantly different and as such they present a challenge in terms of understanding the importance of each tag. This will be further discussed in this section.</p>
   <sec id="s4_1">
    <title>4.1. RQ1. What Are the Most Popular Classifications of Audiovisual Art Events Regarding Event Type, Methods, and Subjects?</title>
    <p>Classification tag popularity is a good indicator of how often an event that is described by that tag occurs in the audiovisual art event landscape. Tags that are popular represent types, methods and subjects of events that are commonplace, and that artists and event organizers are more interested in presenting to the audience. From the objective quantitative data collected in this study from the Web regarding contemporary audiovisual art events, it was made apparent that music dominated the event type classification tags, followed closely by the performing arts. In terms of techniques and methods musical events are represented by live singing and instrument performance, while the performing arts use acting, dramaturgy and direction as their essential tools. In terms of themes, musical genres such as pop, rock, classical and laiko are popular, while well established story telling themes involving love, friendship and romance dominate in the performative arts.</p>
    <p>Since the emergence of Web 2.0, music and musical events have played a predominant role in cultural dissemination on the Internet <xref ref-type="bibr" rid="scirp.140021-28">
      [28]
     </xref>. Participatory content diffusion combined with the popularity of musical events in younger and more dynamic audiences <xref ref-type="bibr" rid="scirp.140021-29">
      [29]
     </xref>, has become a driving force behind the commonplace organization of such events and their multilayered Web presence that was in this research through real-world Web data. In an equivalent manner the performing arts are also utilizing Web 2.0 and attracting more dynamic audiences <xref ref-type="bibr" rid="scirp.140021-30">
      [30]
     </xref>, thus establishing themselves a strong Web presence and an important place in the audiovisual art landscape.</p>
    <p>From a musical standpoint genre popularity and the rise of pop music as a trend setter in the contemporary music landscape has been well established <xref ref-type="bibr" rid="scirp.140021-31">
      [31]
     </xref> <xref ref-type="bibr" rid="scirp.140021-32">
      [32]
     </xref> and is supported by the findings of this study as well. On the other hand, the worldwide importance of rap and hip hop <xref ref-type="bibr" rid="scirp.140021-31">
      [31]
     </xref> is not apparent in this research and is replaced instead by a slightly stronger representation of rock and an extraordinarily strong representation of Greek pop, folk and laiko music <xref ref-type="bibr" rid="scirp.140021-27">
      [27]
     </xref>, thus highlighting the peculiarities of the contemporary Greek musical landscape. In the field of theater and cinema, the importance of themes closely related to personal relationships and feelings including love, romance, friendship as established by the popularity of the relevant tags in this research, is based on the importance of the emotional experience conveyed through the performing arts, which act as a link between the artist and the audience and their vital interdependency <xref ref-type="bibr" rid="scirp.140021-33">
      [33]
     </xref>.</p>
   </sec>
   <sec id="s4_2">
    <title>4.2. RQ2. What Are the Classifications of Audiovisual Art Events Regarding Event Type, Methods and Subjects That Are Most Influential Based on Event Outreach Index?</title>
    <p>Going one step further this research focused not only on the estimated popularity of the various classification tags, but moreover on the outreach index of the events described by each such tag. With the use of objective real-world quantitative metrics derived from the Web the outreach index of each individual event was established and for each classification tag an average outreach index was calculated, indicating how impactful the various events that are described by that tag are. In the study of tat average outreach, general popular classification tags were replaced by specific tags that though their outreach indicate fewer but more influential events. In terms of event types, the importance of highly influential musical genres was made apparent, combined with an added importance to a distinct artistic identity. When it came to methods and techniques the high outreach of events that take advantage of technology in production was established, while the study of subject tags outlined the importance of the celebrational, annual and seasonal nature of audiovisual art events.</p>
    <p>Globally important musical genres such as pop and rock displayed high influence as observed in other studies <xref ref-type="bibr" rid="scirp.140021-31">
      [31]
     </xref> <xref ref-type="bibr" rid="scirp.140021-32">
      [32]
     </xref>, but were closely followed by genres closer to Greek idiosyncrasy as well less popular genres like classical and opera music, thus establishing how highly influential events of these genres are <xref ref-type="bibr" rid="scirp.140021-34">
      [34]
     </xref> despite being fewer. In terms of methods the high ranking of tags related to audio and visual effects and high production value was made apparent. Technologies and new media have a continuously significant importance in the preforming arts worldwide <xref ref-type="bibr" rid="scirp.140021-35">
      [35]
     </xref> and are creating a new paradigm that attracts and impacts wide audiences. Finally, the study of subject tags showed a shift from commonplace themes to tags of a more temporal nature. Seasonal and annual events revolving around celebrations have been observed to be important social traditions that have important influence in local societies <xref ref-type="bibr" rid="scirp.140021-36">
      [36]
     </xref> and this is backed up by the findings of this study.</p>
   </sec>
  </sec><sec id="s5">
   <title>5. Conclusions</title>
   <p>This study focuses on the use of contemporary real-world data, extracted from the Web through an advanced algorithm utilizing semantic web, search engine and generative AI technologies, for the purpose of classifying audiovisual art events. Using objective quantitative metrics, the event data were distilled into a singular web outreach index, which quantified their perceived impact based on their online presence. Using the events classification in terms of event type, methods and techniques, and thematic subjects, the popularity of various classification tags was explored. Going one step further, the impact of the events described by each tag was estimated using each event’s outreach index. This process was used to discover important classification tags in terms of potential influence.</p>
   <p>The analysis of the popularity of various classification tags within the current context of audiovisual art events in Greece, using a data-driven quantitative approach, objectively validated the significance of music and performative arts, alongside the popularity of particular musical genres and narrative themes focused on interpersonal relationships. Furthermore, examining the average outreach index for each classification tag highlighted the relevance of Greek musical heritage, the benefits of utilizing modern technology in performances, and the additional value associated with events that are annual, seasonal, or commemorative in nature. In addition to the insights discussed, the findings of the study encompass the methodology used, as well as the dataset collected, both of which can be applied similarly to yield data-driven discoveries to other research questions with different goals.</p>
   <p>The focus of both the data extraction process and the classification and outreach analysis is the various events’ online presence. The information available on the Web allows an objective numerical representation of the events but offers limited insight about events that may be important and culturally impactful, but disproportionately promoted online. This limitation of the study is somewhat mitigated by the extraction of information from user generated content, blogs, reviews etc., but remains important. For this reason, it is optimal to use data-driven quantitative analysis in tandem with qualitative expert analysis to combine findings and reach safer conclusions.</p>
   <p>In the future, the collected dataset may be used in addressing issues beyond classification, looking into geographical or temporal attributes of audiovisual art events. Moreover, the analysis may move from a descriptive statistical discussion to the use of computational statistics and machine learning in order to gain deeper insight. In the next phase of this research, techniques such as clustering and topic modeling will be used to gain better understanding of the dataset, based both on its quantitative characteristics and its descriptive texts.</p>
   <p>Obtaining a clearer understanding of the landscape of audiovisual art events using real-world quantitative metrics, in tandem with a critical qualitative approach of the wider field of art and culture, can provide the basis for more informed decision making from, the artists, the organizers and the audience. This clearer approach may act for the benefit of everyone involved, leading to more appealing, interesting, and lucrative events and to a deeper relationship between the artistic product and the general public.</p>
  </sec><sec id="s6">
   <title>Funding</title>
   <p>This research was funded by the Hellenic Foundation for Research &amp; Innovation, within the framework of the “2nd Call for H.F.R.I.’s Research Projects to Support Faculty Members &amp; Researchers” (project number 3607).</p>
  </sec>
 </body><back>
  <ref-list>
   <title>References</title>
   <ref id="scirp.140021-ref1">
    <label>1</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Berners-Lee, T., Cailliau, R., Luotonen, A., Nielsen, H.F. and Secret, A. (2023) The World-Wide Web. In: Seneviratne, O. and Hendler, J., Eds., Linking the World’s Information: Essays on Tim Berners-Lee’s Invention of the World Wide Web, ACM, 51-65. &gt;https://doi.org/10.1145/3591366.3591373
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref2">
    <label>2</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jensen, K.B. and Craig, R.T. (2016) The International Encyclopedia of Communication Theory and Philosophy. Wiley.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref3">
    <label>3</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Pergantis, M., Varlamis, I., Kanellopoulos, N.G. and Giannakoulopoulos, A. (2023) Searching Online for Art and Culture: User Behavior Analysis. Future Internet, 15, Article 211. &gt;https://doi.org/10.3390/fi15060211
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref4">
    <label>4</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Barifah, M., Landoni, M. and Eddakrouri, A. (2020) Evaluating the User Experience in a Digital Library. Proceedings of the Association for Information Science and Technology, 57, e280. &gt;https://doi.org/10.1002/pra2.280
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref5">
    <label>5</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Gaona-García, P.A., Martin-Moncunill, D. and Montenegro-Marin, C.E. (2017) Trends and Challenges of Visual Search Interfaces in Digital Libraries and Repositories. The Electronic Library, 35, 69-98. &gt;https://doi.org/10.1108/el-03-2015-0046
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref6">
    <label>6</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Leibovitz, T., Roig, A. and Sánchez-Navarro, J. (2015) Up Close and Personal: Exploring the Bonds between Promoters and Backers in Audiovisual Crowdfunded Projects. In Bennett, L., Chin, B. and Jones, B., Eds., Crowdfunding the Future Media Industries, Ethics, and Digital Society, Peter Lang, 15-30.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref7">
    <label>7</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Maier, S. (2010) All the News Fit to Post? Comparing News Content on the Web to Newspapers, Television, and Radio. Journalism &amp; Mass Communication Quarterly, 87, 548-562. &gt;https://doi.org/10.1177/107769901008700307
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref8">
    <label>8</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Boczkowski, P.J. and Mitchelstein, E. (2013) The News Gap: When the Information Preferences of the Media and the Public Diverge. The MIT Press. &gt;https://doi.org/10.7551/mitpress/9780262019835.001.0001
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref9">
    <label>9</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Martens, B., Aguiar, L., GGmez, E. and Mueller-Langer, F. (2018) The Digital Transformation of News Media and the Rise of Disinformation and Fake News. SSRN Electronic Journal. &gt;https://doi.org/10.2139/ssrn.3164170
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref10">
    <label>10</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Nadotti, L. and Vannoni, V. (2019) Cultural and Event Tourism: An Interpretative Key for Impact Assessment. Eastern Journal of European Studies, 10, 115-131.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref11">
    <label>11</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Giannakoulopoulos, A., Pergantis, M., Konstantinou, N., Kouretsis, A., Lamprogeorgos, A. and Varlamis, I. (2022) Estimation on the Importance of Semantic Web Integration for Art and Culture Related Online Media Outlets. Future Internet, 14, Article 36. &gt;https://doi.org/10.3390/fi14020036
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref12">
    <label>12</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Jacquette, D. (2014) Ontology. Routledge.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref13">
    <label>13</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     O’Leary, D. (2005) Review: Ontologies: A Silver Bullet for Knowledge Management and Electronic Commerce. The Computer Journal, 48, 498-498. &gt;https://doi.org/10.1093/comjnl/bxh100
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref14">
    <label>14</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Berners-Lee, T., Hendler, J. and Lassila, O. (2001) The Semantic Web. Scientific American, 284, 34-43. &gt;https://doi.org/10.1038/scientificamerican0501-34
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref15">
    <label>15</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Berners-Lee, T., Hall, W., Hendler, J.A., O’Hara, K., Shadbolt, N. and Weitzner, D.J. (2006) A Framework for Web Science. Foundations and Trends® in Web Science, 1, 1-130. &gt;https://doi.org/10.1561/1800000001
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref16">
    <label>16</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Gandon, F. and Schreiber, G. (n.d.) RDF 1.1 XML Syntax. &gt;https://www.w3.org/TR/rdf-syntax-grammar/
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref17">
    <label>17</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Beckett, D., Carothers, G. and Seaborne, A. (2014) RDF 1.1 N-Triples: A Line-Based Syntax for an RDF Graph. W3C Recommendation. 
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref18">
    <label>18</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Beckett, D., Berners-Lee, T., Prud’hommeaux, E. and Carothers, G. (2014) RDF 1.1 Turtle. World Wide Web Consortium, 18-31.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref19">
    <label>19</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Sporny, M., Longley, D., Kellogg, G., Lanthaler, M. and Lindström, N. (2020) JSON-LD 1.1. W3C Recommendation. 
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref20">
    <label>20</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Maybury, M.T. (1995) Generating Summaries from Event Data. Information Processing &amp; Management, 31, 735-751. &gt;https://doi.org/10.1016/0306-4573(95)00025-c
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref21">
    <label>21</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref22">
    <label>22</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Bhosale, H.S. and Gadekar, D.P. (2014) A Review Paper on Big Data and Hadoop. International Journal of Scientific and Research Publications, 4, 1-7.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref23">
    <label>23</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     von Bloh, J., Broekel, T., Özgun, B. and Sternberg, R. (2019) New(s) Data for Entrepreneurship Research? An Innovative Approach to Use Big Data on Media Coverage. Small Business Economics, 55, 673-694. &gt;https://doi.org/10.1007/s11187-019-00209-x
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref24">
    <label>24</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Rosenbush, S. and Totty, M. (2013) How Big Data Is Changing the Whole Equation for Business. Wall Street Journal.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref25">
    <label>25</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Manovich, L. (2016) The Science of Culture? Social Computing, Digital Humanities and Cultural Analytics. Journal of Cultural Analytics, 1, 1-15. &gt;https://doi.org/10.22148/16.004
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref26">
    <label>26</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Giannakoulopoulos, A., Pergantis, M., Lamprogeorgos, A., Lampoura, S., Limniati, L., (2024) Presenting Web-Extracted Contemporary Audiovisual Arts Events with a Focus on User Experience (UX). EUTIC 2024 (Under Publication).
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref27">
    <label>27</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Economou, L. (2018) Sentiment, Memory, and Identity in Greek Laiko Music (1945-1967) In: Tragaki, D., Ed., Made in Greece, Routledge, 17-27. &gt;https://doi.org/10.4324/9781315749075-3
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref28">
    <label>28</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Beer, D. (2008) Making Friends with Jarvis Cocker: Music Culture in the Context of Web 2.0. Cultural Sociology, 2, 222-241. &gt;https://doi.org/10.1177/1749975508091034
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref29">
    <label>29</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Robinson, R. (2016) Music Festivals and the Politics of Participation. Routledge. &gt;https://doi.org/10.4324/9781315596785
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref30">
    <label>30</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Coleman, L.J., Jain, A., Bahnan, N. and Chene, D. (2019) Marketing the Performing Arts: Efficacy of Web 2.0 Social Networks. Journal of Marketing Development and Competitiveness, 13, 23-28.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref31">
    <label>31</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Petitbon, A.M. and Hitchcock, D.B. (2022) What Kind of Music Do You Like? A Statistical Analysis of Music Genre Popularity over Time. Journal of Data Science, 20, 168-187. &gt;https://doi.org/10.6339/22-jds1040
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref32">
    <label>32</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Fossi, J., Dzwonkowski, A. and Othman, S. (2021) Analyzing Music Genre Popularity. In: Arai, K., Ed., Intelligent Computing, Springer International Publishing, 284-294. &gt;https://doi.org/10.1007/978-3-030-80126-7_22
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref33">
    <label>33</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Soltani, M. (2014) The Essence of Eddie: A Reflection of the Interplay between Art, Relationships, and Emotion. Academic Psychiatry, 38, 751-751. &gt;https://doi.org/10.1007/s40596-014-0227-3
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref34">
    <label>34</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Getz, E. (2015) Why Classical Music Still Matters. University of California Press.
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref35">
    <label>35</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Hebert, D., Kallio, A.A. and Odendaal, A. (2012) Not So Silent Night: Tradition, Transformation and Cultural Understandings of Christmas Music Events in Helsinki, Finland. Ethnomusicology Forum, 21, 402-423. &gt;https://doi.org/10.1080/17411912.2012.721525
    </mixed-citation>
   </ref>
   <ref id="scirp.140021-ref36">
    <label>36</label>
    <mixed-citation publication-type="other" xlink:type="simple">
     Todi, C. (2019) The Metamorphosis of Performing Arts. Theatrical Colloquia, 9, 173-186. &gt;https://doi.org/10.2478/tco-2019-0004
    </mixed-citation>
   </ref>
  </ref-list>
 </back>
</article>