Modern software projects are more than just the code that comprises them: teams follow specific development processes; the code runs on servers or mobile phones and produces runtime logs; users talk about the software in forums like StackOverflow and GitHub and rate the product in app stores. The software is part of a collection of similar applications and depends on external code or service API’s to deliver its functionality. Modern software teams need data to make informed decisions that enable continuous, feedback-driven improvement.

At the Software Analytics lab, we work to make software analytics a core asset for software development teams. Our research touches topics such as computer-supported collaborative work (CSCW), big data systems, software engineering processes, software reliability, software analysis, machine learning, and data science.

Currently, we focus on the following 2 research lines, even though we are always open to new ideas:

  • Engineering for (software) analytics: creating platforms for data ingestion, integration and querying in a streaming fashion. Related projects:

    • AI4Fintech Making large software-based organizations more efficient.
    • Codefeedr A platform to ingest and process software analytics data in a streaming fashion
    • GHTorrent Collects all data from the GitHub event API
  • Software ecosystems: We build ecosystem-wide, versioned call graphs out of package networks to make studies such as precise security vulnerability tracking, software license tracking, data-based API evolution, etc possible.

    • FASTEN A platform for analyzing dependency management services at the call graph level granularity


(Some) Members of the Software Analytics Lab in May 2020. Left to right: Elvan Kula, Georgios Gousios, Maliheh Izadi, Mehdi Keshani, Amir Mir, Joseph Hejderup

The following people are part of the Software Analytics lab:

  • Diomidis Spinellis (Lab leader)
  • Arie van Deursen (Leader of the Software Engineering group)
  • Georgios Gousios (Lab leader emeritus)

  • Joseph Hejderup (PhD student) working on ecosystem analysis/tics
  • Mehdi Keshani (PhD student), working on scaling static analyses
  • Elvan Kula (PhD student, also with ING), working on analytics for software process optimization
  • Chandra Maddila (PhD student, also with Microsoft), working on tools for software engineering
  • Amir Mir (PhD student), working on making Python better through Machine Learning


The lab collaborates with the following organizations:

Student collaborators

SAL is always open to hosting brilliant MSc/BSc students to work on the exiting topics we offer. Currently, SAL has the privilege to host the following students:

  • Wouter Zorgdrager (MSc student). Working on FASTEN, system administration

  • Roberta Gismondi (BSc student). Working on ML-based auto-completion for Python
  • Evaldas Latoškinas (BSc student). Working on type prediction for Python
  • Mihhail Sokolov (BSc student). Working on FASTEN
  • Edoardo Lanzini (BSc student). Working on FASTEN


The following people were part of the Software Analytics lab:

  • Ayushi Rastogi (Postdoc, now assist prof at U Groningen)
  • Enrique Larios (Postdoc, now postdoc at U Vic)
  • Chushu Gao (visitor, now at SIG)
  • Moritz Beller (Postdoc, now at Facebook)
  • Maria Kechagia (Postdoc, now at UCL)
  • Maliheh Izadi (visitor from Sharif University of Technology, now postdoc at SERG)
  • Xunhui Zhang (visitor from NUDT, China)

  • Ilya Grishkov (BSc student). Worked on FASTEN
  • Konstantinos Triantafyllou (MSc intern from ETH Zürich). Worked on call graph generator for Rust


The Software Analytics Lab has received funding from:

  • NWO
  • European Commission
  • Microsoft
  • Facebook
  • ING