Crowd-Sourced Exemplars for Developing Research Software Skills in STEM

James DesLauriers, Chris Cooling, John Pinney, Liam Gao, Katerina Michalickova and Deslauriers, J. 2025. Crowd-Sourced Exemplars for Developing Research Software Skills in STEM. 17th International Conference on Education and New Learning Technologies. Palma, Spain 30 Jun - 02 Jul 2025 IATED Digital Library. https://doi.org/10.21125/edulearn.2025.2184

TitleCrowd-Sourced Exemplars for Developing Research Software Skills in STEM
AuthorsJames DesLauriers, Chris Cooling, John Pinney, Liam Gao, Katerina Michalickova and Deslauriers, J.
TypeConference paper
Abstract

As scientific research becomes more data-intensive, STEM students supporting research projects or working on a masters project or doctoral thesis have had to learn to approach and solve problems computationally. These students come from varied backgrounds, though, and few have a formal education in computer science or software engineering. As a consequence, the area of research software training has become an important topic in academia and research, with organisations like The Carpentries and the Software Sustainability Institute leading the way in teaching best practices in software and data.

The Research Computing and Data Science (RCDS) team, situated in the Early Career Researcher Institute (ECRI) at Imperial College London, delivers short courses for postgraduate Masters and PhD students. Our portfolio of courses is varied, ranging from programming languages to the use of specific data science packages within a language, to more general topics like version control, documentation, and best practices in software engineering. Courses are typically delivered in the style of a workshop, with tutors teaching by example and frequent breaks for practise with exercises.

Short courses like the ones offered by RCDS or the Carpentries work well to equip students with the fundamental skills to get started with a programming language or software library. As learners move beyond the basics, however, such short courses become less effective and so should eventually be substituted by other approaches. Inspired by the cognitive apprenticeship model, in both research software, and general software engineering training, real-life projects and code examples are one favoured approach for supporting the transition from novice to proficient.

This paper will introduce the ReCoDE (Research Computing and Data Science Exemplars) project at Imperial College London, led by the RCDS team and supported by the central Research Software Engineering (RSE) team. ReCoDE is a collection of user-contributed code examples based on the actual research of PhD students at Imperial, open to the public for independent study. PhD students pitch their project ideas in a rolling open call, and those selected undertake paid part-time work, collaborating with an RCDS and RSE team member over three months to develop an exemplar. The output of this collaboration is an online resource to guide students through a research software project, demonstrating best practices in data science and software engineering.

We describe and detail the process and approach to selecting, developing, publishing and evaluating exemplars, which could be applied more generally to curating user-generated content in other educational settings. We discuss the free, open-source tools and platforms we use to build and host the documentation, exemplar catalogue, and general information that makes ReCoDE available to the public. Finally, we conclude with our plans for the future of the project.

Year2025
Conference17th International Conference on Education and New Learning Technologies
PublisherIATED Digital Library
Publisher's version
File Access Level
Open (open metadata and files)
Publication dates
PublishedJun 2025
JournalEDULEARN25 Proceedings
Journal citationpp. 8422-8427
ISSN2340-1117
ISBN9788409742189
Digital Object Identifier (DOI)https://doi.org/10.21125/edulearn.2025.2184
Web address (URL)https://doi.org/10.21125/edulearn.2025.2184

Related outputs

Preprint: Towards a Decentralised Application-Centric Orchestration Framework in the Cloud-Edge Continuum (full version)
Ullah, A., Markus, A., Aslam, H. I., Kiss, T., Kovacs, J., Deslauriers, J., Murphy, A.L., Wamg, Y. and Kao, O. 2025. Preprint: Towards a Decentralised Application-Centric Orchestration Framework in the Cloud-Edge Continuum (full version). arXiv. https://doi.org/10.48550/arXiv.2504.00761

Towards a Decentralised Application-Centric Orchestration Framework in the Cloud-Edge Continuum
Ullah, A., Markus, A., Aslan, H. I., Kiss, T., Kovács, J., Deslauriers, J., Murphy, A. L., Wang, Y. and Kao, O 2025. Towards a Decentralised Application-Centric Orchestration Framework in the Cloud-Edge Continuum. ICFEC 2025. Tromso, Norway 19 - 22 May 2025 IEEE .

Automated Generation of Deployment Descriptors for Managing Microservices-based Applications in the Cloud to Edge Continuum
Deslauriers, J., Kovács, J., Kiss, T., Stork, A., Pena Serna, S. and Ullah, A. 2025. Automated Generation of Deployment Descriptors for Managing Microservices-based Applications in the Cloud to Edge Continuum. Future Generation Computer Systems. 166 107628. https://doi.org/10.1016/j.future.2024.107628

You & AI: A Research Computing Hackathon
James DesLauriers, Katerina Michalickova, John Pinney, Liam Gao and Chris Cooling 2024. You & AI: A Research Computing Hackathon. 2024 36th International Conference on Software Engineering Education and Training (CSEE&T). Würzburg, Germany 29 Jul - 01 Aug 2024 https://doi.org/10.1109/cseet62301.2024.10663012

Decentralised Orchestration of Microservices in the Cloud-to-Edge Continuum
Kiss, T., Ullah, A., Kovacs, J., Deslauriers, J., Terstyanszky, G. and Tusa, F. 2024. Decentralised Orchestration of Microservices in the Cloud-to-Edge Continuum. 16th International Workshop on Science Gateways (IWSG2024). Tolouse, France 18 - 20 Jun 2024 Zenodo. https://doi.org/10.5281/zenodo.13863564

Orchestration in the Cloud-to-Things Compute Continuum: Taxonomy, Survey and Future Directions
Ullah, A., Kiss, T., Kovacs, J., Tusa, F., Deslauriers, J., Dagdeviren, H., Arjun, R. and Hamzeh, H. 2023. Orchestration in the Cloud-to-Things Compute Continuum: Taxonomy, Survey and Future Directions. Journal of Cloud Computing: Advances, Systems and Applications. 12 (135). https://doi.org/10.1186/s13677-023-00516-5

Toward a reference architecture based science gateway framework with embedded e‐learning support
Pierantoni, G., Kiss, T., Bolotov, A., Kagialis, D., James DesLauriers, Ullah, A., Chen, H., Chan You Fee, D., Dang, H., Kovacs, J., Belehaki, A., Herekakis, T., Tsagouri, I. and Gesing, S. 2023. Toward a reference architecture based science gateway framework with embedded e‐learning support. Concurrency and Computation: Practice and Experience. 35 (18) e6872. https://doi.org/10.1002/cpe.6872

Everyday orchestration with Docker on Kubernetes
Deslauriers, J., Arjun, R., Kovács, J. and Kiss, T. 2022. Everyday orchestration with Docker on Kubernetes. IWSG 2022. Trento, Italy 15 - 17 Jun 2022

Interoperable Data Analytics Reference Architectures Empowering Digital-Twin-Aided Manufacturing
Marosi, A.C., Márk Emodi, Hajnal, A., Lovas, R., Kiss, T., Valerie Poser, Antony, J., Bergweiler, S., Hamzeh, H., Deslauriers, J. and Kovacs, J. 2022. Interoperable Data Analytics Reference Architectures Empowering Digital-Twin-Aided Manufacturing. Future Internet. 14 (4) e114. https://doi.org/10.3390/fi14040114

Abstractions of Abstractions: Metadata to Infrastructure-as-Code
Deslauriers, J., Kovacs, J. and Kiss, T. 2022. Abstractions of Abstractions: Metadata to Infrastructure-as-Code. FIST 2022 - 1st International Workshop on the Foundations of Infrastructure Specification and Testing, In conjunction with the 19TH IEEE International Conference on Software Architecture (ICSA 2022). On-line 12 - 15 Mar 2022 IEEE . https://doi.org/10.1109/icsa-c54293.2022.00051

MiCADO-Edge: Towards an Application-level Orchestrator for the Cloud-to-Edge Computing Continuum
Ullah, A., Dagdeviren, H., Arjun, R., Deslauriers, J., Kiss, T. and Bowden, J. 2021. MiCADO-Edge: Towards an Application-level Orchestrator for the Cloud-to-Edge Computing Continuum. Journal of Grid Computing. 19 47 (2021). https://doi.org/10.1007/s10723-021-09589-5

Dynamic Composition and Automated Deployment of Digital Twins for Manufacturing
Deslauriers, J., Kiss, T. and Kovacs, J. 2021. Dynamic Composition and Automated Deployment of Digital Twins for Manufacturing. 13th International Workshop on Science Gateways. Virtual event 10 - 11 Jun 2021 CEUR Workshop Proceedings.

Cloud apps to‐go: Cloud portability with TOSCA and MiCADO
Deslauriers, J., Kiss, T., Ariyattu, R.C., Dang, H., Ullah, A., Bowden, J., Krefting, D., Pierantoni, G. and Terstyanszky, G. 2020. Cloud apps to‐go: Cloud portability with TOSCA and MiCADO. Concurrency and Computation: Practice and Experience. 33 (19). https://doi.org/10.1002/cpe.6093

Innovations in Simulation: Experiences with Cloud-based Simulation Experimentation
Taylor, S.J.E., Anagnostou, A., Abubakar, N.T., Kiss, T., Deslauriers, J., Terstyanszky, G., Kacsuk, P., Kovacs, J., Kite, S., Pattison, G. and Petry, J. 2020. Innovations in Simulation: Experiences with Cloud-based Simulation Experimentation. Bae, K. H., Feng, B., Kim, S., Lazarova-Molnar, S., Zheng, Z., Roeder, T. and Thiesing, R. (ed.) Winter Simulation Conference 2020. Orlando, Florida, USA 13 - 16 Dec 2020 IEEE . https://doi.org/10.1109/WSC48552.2020.9383971

Towards a Cloud Native Big Data Platform using MiCADO
MOSA, A., Kiss, T., Pierantoni, G., Deslauriers, J., Kagialis, D. and Terstyanszky, G. 2020. Towards a Cloud Native Big Data Platform using MiCADO. ISPDC 2020. on-line event (originally Warsaw, Poland) 05 - 08 Jul 2020 IEEE . https://doi.org/10.1109/ISPDC51135.2020.00025

Science Gateways with Embedded Ontology-based E-learning Support
Kiss, T., Bolotov, A., Pierantoni, G., Deslauriers, J., MOSA, A., Kagialis, D., Terstyanszky, G. and Chan You Fee, D. 2020. Science Gateways with Embedded Ontology-based E-learning Support. Gesing, S., Taylor, I. and Barclay, I (ed.) 12th International Workshop on Science Gateways. On-line 10 - 11 Jun 2020 CEUR Workshop Proceedings.

Describing and Processing Topology and Quality of Service Parameters of Applications in the Cloud
Pierantoni, G., Kiss, T., Terstyanszky, G., Deslauriers, J., Gesmier, G. and Dang, H. 2020. Describing and Processing Topology and Quality of Service Parameters of Applications in the Cloud. Journal of Grid Computing. 18, p. 761–778. https://doi.org/10.1007/s10723-020-09524-0

Towards a Deadline-Based Simulation Experimentation Framework Using Micro-Services Auto-Scaling Approach
Anagnostou, A., Taylor, S.J.E., Abubakar, N.T., Kiss, T., Deslauriers, J., Gesmier, G., Terstyanszky, G., Kacsuk, P. and Kovacs, J. 2019. Towards a Deadline-Based Simulation Experimentation Framework Using Micro-Services Auto-Scaling Approach. Mustafee, N., Bae, K.-H.G., Lazarova-Molnar, S., Rabe, M., Szabo, C., Haas, P. and Son, Y-J. (ed.) Winter Simulation Conference 2019. Gaylord National Resort & Conference Center National Harbor, Maryland 08 - 11 Dec 2019 IEEE . https://doi.org/10.1109/wsc40007.2019.9004882

A Cloud-agnostic Queuing System to Support the Implementation of Deadline-based Application Execution Policies
Kiss, T., Deslauriers, J., Gesmier, G., Terstyanszky, G., Pierantoni, G., Abu Oun, O., Taylor, S.J.E., Anagnostou, A. and Kovacs, J. 2019. A Cloud-agnostic Queuing System to Support the Implementation of Deadline-based Application Execution Policies. Future Generation Computer Systems. 101, pp. 99-111. https://doi.org/10.1016/j.future.2019.05.062

Enabling modular design of an application-level auto-scaling and orchestration framework using tosca-based application description templates
Deslauriers, J., Kiss, T., Pierantoni, G., Gesmier, G. and Terstyanszky, G. 2019. Enabling modular design of an application-level auto-scaling and orchestration framework using tosca-based application description templates. Stankovski, V and Gesing, S. (ed.) 11th International Workshop on Science Gateways, IWSG 2019. Ljubljana, Slovenia 12 - 14 Jun 2019 CEUR Workshop Proceedings.

Automated Scalability of Cloud Services and Jobs
Kiss, T., Terstyanszky, G., Abu Oun, O., Deslauriers, J., Gesmier, G., Pierantoni, G., Kovacs, J., Kacsuk, P., Nagy, E and Farkas, A 2018. Automated Scalability of Cloud Services and Jobs. Gesing, S. and Atkinson, M. (ed.) 10th International Workshop on Science Gateways, IWSG 2018. Edinburgh UK 13 - 15 Jun 2018

Flexible Deployment of Social Media Analysis Tools, Flexible, Policy-Oriented and Multi-Cloud deployment of Social Media Analysis Tools in the COLA Project
Pierantoni, G., Kiss, T., Gesmier, G., Deslauriers, J., Terstyanszky, G. and Martin Rapun, J.M. 2018. Flexible Deployment of Social Media Analysis Tools, Flexible, Policy-Oriented and Multi-Cloud deployment of Social Media Analysis Tools in the COLA Project. Gesing, S. and Atkinson, M. (ed.) 10th International Workshop on Science Gateways, IWSG 2018. Edinburgh UK 13 - 15 Jun 2018 CEUR Workshop Proceedings.

Permalink - https://westminsterresearch.westminster.ac.uk/item/x1829/crowd-sourced-exemplars-for-developing-research-software-skills-in-stem


Share this

Usage statistics

40 total views
50 total downloads
These values cover views and downloads from WestminsterResearch and are for the period from September 2nd 2018, when this repository was created.