<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">INFEDU</journal-id>
      <journal-title-group>
        <journal-title>Informatics in Education</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2335-8971</issn>
      <issn pub-type="ppub">1648-5831</issn>
      <publisher>
        <publisher-name>VU</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">INFEDU.2025.21</article-id>
      <article-id pub-id-type="doi">10.15388/infedu.2025.21</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Provenance-based Data Analysis in Block-based Programming for Application Development</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Arruda</surname>
            <given-names>Naira</given-names>
          </name>
          <email xlink:href="mailto:naira.alice@ifsc.edu.br">naira.alice@ifsc.edu.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_000"/>
        </contrib>
        <aff id="j_INFEDU_aff_000">Xanxerê Campus, Federal Institute of Education, Science and Technology of Santa Catarina, Brazil</aff>
        <contrib contrib-type="author">
          <name>
            <surname>de Lima</surname>
            <given-names>Matheus Roberto</given-names>
          </name>
          <email xlink:href="mailto:matheusroberto@id.uff.br">matheusroberto@id.uff.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_001"/>
        </contrib>
        <aff id="j_INFEDU_aff_001">Institute of Computing, Universidade Federal Fluminense, Brazil</aff>
        <contrib contrib-type="author">
          <name>
            <surname>Martins</surname>
            <given-names>Simone</given-names>
          </name>
          <email xlink:href="mailto:simone@ic.uff.br">simone@ic.uff.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_002"/>
          <xref ref-type="corresp" rid="cor3">∗∗∗</xref>
        </contrib>
        <aff id="j_INFEDU_aff_002">Institute of Computing, Universidade Federal Fluminense, Brazil</aff>
        <contrib contrib-type="author">
          <name>
            <surname>de Oliveira</surname>
            <given-names>Daniel</given-names>
          </name>
          <email xlink:href="mailto:danielcmo@ic.uff.br">danielcmo@ic.uff.br</email>
          <xref ref-type="aff" rid="j_INFEDU_aff_003"/>
        </contrib>
        <aff id="j_INFEDU_aff_003">Institute of Computing, Universidade Federal Fluminense, Brazil</aff>
      </contrib-group>
      <author-notes>
        <corresp id="cor3"><label>∗∗∗</label>Corresponding author.</corresp>
      </author-notes>
      <volume>24</volume>
      <issue>3</issue>
      <fpage>431</fpage>
      <lpage>470</lpage>
      <pub-date pub-type="epub">
        <day>26</day>
        <month>10</month>
        <year>2025</year>
      </pub-date>
      <permissions>
        <copyright-year>2025</copyright-year>
        <copyright-holder>Vilnius University</copyright-holder>
        <license license-type="open-access">
          <license-p>Open access article under the CC BY license.</license-p>
        </license>
      </permissions>
      <abstract>
        <p>Teaching programming to elementary and high school students is important for developing problem-solving and logical reasoning skills. Block-based programming frameworks, such as Scratch and Kodular, have gained popularity for introducing programming concepts in an engaging and more didactic manner. However, these frameworks lack structured tools for analysing student learning processes, which makes it difficult to track progress, identify challenges, and understand student behaviour during application development. This manuscript presents EduPROV, a provenance-based approach that extracts, structures, and analyses student actions from log files generated by block-based programming frameworks. By storing this data in a queryable format, EduPROV supports the identification of learning bottlenecks, tracking programming trajectories, and can help refine teaching strategies. EduPROV was evaluated in a study with elementary and high school students from three schools in southern Brazil, using Kodular as the block-based programming framework. The results show that provenance analysis helps reveal student behaviour, contributing to more informed and effective programming education.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>provenance data</kwd>
        <kwd>block-based programming</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
