DataScience4NP
Data Science for non-programmers
Description

The objective of this project is to explore the use of visual programming paradigms to enable non-programmers to be part of the Data Science workforce.

In contrast to existing approaches, which require programming, Scientific Workflow Management Systems (SWMS) can become an alternative to support the visual programming of data science projects. Such systems (e.g. Taverna and Kepler) use a simple graphical, graph-based structure to develop applications.

This simplicity has shown to be suitable in several scientific areas such as bioinformatics, geophysics, and climate analysis. Despite the success of SWMS in data intensive research, they did not reach a state where non-programmers data scientists can use them. They still require some programming and scripting skills to code individual processing tasks. That is why research teams using those systems are usually composed of scientists and software developers.

We propose to extend current SWMS to support the parameterization of generic prebuild workflow templates. Workflow templates capture the processing tasks of data science projects. A template can be seen as a formalized best practice that data scientists can use to solve common data analysis challenges. Templates are developed by multidisciplinary teams of experts and reused by non-programmer data scientists. Parameterized workflows have been used successfully in the field of enterprise computing since 1970 to increase software reuse (e.g. SAP’s parameterized workflows to automate business process models). We claim that the same type of benefits can be obtained by parameterizing scientific workflow templates.

Internal Partners
Research Centre Research Group Role in Project Begin Date End Date
ISTAR-Iscte Software Systems Engineering Partner 2019-09-17 2019-12-30
External Partners

No records found.

Project Team
Name Affiliation Role in Project Begin Date End Date
Fernando Brito e Abreu Professor Associado (DCTI); Integrated Researcher (ISTAR-Iscte); Local Coordinator 2019-09-17 2019-12-30
Carlos Serrão Professor Associado (DTDA); Integrated Researcher (ISTAR-Iscte); Researcher 2019-09-17 2019-12-30
João Caldeira Associate Researcher (ISTAR-Iscte); Researcher 2019-09-17 2019-12-30
João Pedro Oliveira Professor Associado (DCTI); Integrated Researcher (IT-Iscte); Researcher 2019-09-17 2019-12-30
José Américo Alves Sustelo Rio Associate Researcher (ISTAR-Iscte); Researcher 2019-09-17 2019-12-30
José Pereira dos Reis Professor Auxiliar (DCTI); Associate Researcher (ISTAR-Iscte); Researcher 2019-09-17 2019-12-30
Project Fundings
Reference/Code Funding DOI Funding Type Funding Program Funding Amount (Global) Funding Amount (Local) Begin Date End Date
PT2020 – SAICT –PTDC/ICDT -- Contract PT2020 - PTDC/ICDT - Portugal 1 1 2019-09-17 2019-12-30
Publication Outputs

No records found.

Related Research Data Records

No records found.

Related References in the Media

No records found.

Other Outputs

No records found.

Project Files

No records found.

With the objective to increase the research activity directed towards the achievement of the United Nations 2030 Sustainable Development Goals, the possibility of associating scientific projects with the Sustainable Development Goals is now available in Ciência_Iscte. These are the Sustainable Development Goals identified for this project. For more detailed information on the Sustainable Development Goals, click here.

Data Science for non-programmers
2019-09-17
2019-12-30