Attention

The DEEP platform is sunsetting.

The DEEP-Hybrid-DataCloud project project has ended and its plaftform and software are being decomissioned during 2023, as they have been superseeded by the AI4EOSC platform and the AI4OS software stack.

Please refer to the following links for further information:

AI4OS software distribution documentation

AI4EOSC project

AI4EOSC platformt

DEEP Modules Template

To simplify the development of new modules, and make the integration of your model with the DEEPaaS API easier, we provide a standard template for modules.

There are different versions of this template:

master: this is what 99% of users are probably looking for. Simple, minimal template, with the minimum requirements to integrate your code in DEEP.
child-module: this is a fork of the master branch specifically tailored to users performing a retraining of an existing module. It only creates a Docker repo whose container is based on an existing module’s Docker image.
advanced: this is a more advanced template. It makes more assumptions on how to structure projects and adds more files than those strictly needed for integration. Unless you are looking for some specific feature, you are probably safer using master.

Create your project based on the template

Based on the version of the template you choose you will be asked to answer a number of questions, which might include:

git_base_url: Remote URL to host your new git repositories (e.g. https://github.com/deephdc ).
project_name: Project name, to be added after "git_base_url" (see above)”, (aka <your_project> in the following).
author_name: Author name(s) (and/or your organization/company/team). If many, separate by comma.
author_email: E-Mail(s) of main author(s) (or contact person). If many, separate by comma.
description: Short description of the project.
app_version: Application version (expects X.Y.Z (Major.Minor.Patch)).
open_source_license: Choose open source license, default is MIT. More info.
docker_baseimage: Docker image your Dockerfile starts from (FROM <docker_baseimage>) (don’t provide the tag here), (e.g. tensorflow/tensorflow ).
baseimage_cpu_tag: CPU tag for the baseimage, e.g. 2.9.1. Has to match python3!
baseimage_gpu_tag: GPU tag for the baseimage, e.g. 2.9.1-gpu. Has to match python3! Sometimes baseimage_cpu_tag and baseimage_gpu_tag are the same (for example in Pytorch). In Tensorflow they are different.
failure_notify: whether you want to receive updates if your model fails to build.

Based on your answers, we will fill the template and create two repositories (linked to your git_base_url):

~/your_project: this is where the code of your module goes
~/DEEP-OC-your_project: this is where the Docker files to build your module go. It also has the metadata of your module that will be shown in the Marketplace.

Each repository has two branches: master (to commit stable changes) and test (to test features without disrupting your users).

Via User Interface

Go to the Template creation webpage. You will need an EGI Check-In account to access to this webpage, so if you don’t have one already, you will need to create it.

Then select which version of the template you want and answer the questions. Click on Generate and you will be able to download a .zip file with both repositories.

Via Terminal

Useful video demos

Data science cookiecutter template

You will need to install cookiecutter and then run it as follows:

$ pip install cookiecutter
$ cookiecutter https://github.com/deephdc/cookiecutter-deep --checkout master

You are first provided with an [Info] line with information about the parameter. And in the next line you configure this parameter. Once all questions are answered, the two repositories will be created.

Project structure

Based on the on the branch you choose, the template will create different files, being master the most minimal option (see above). The content of these files is populated based on your answer to the questions.

Master branch

<your_project>
##############
├── LICENSE                <- License file
│
├── README.md              <- The top-level README for developers using this project.
│
├── requirements.txt       <- The requirements file for reproducing the analysis
│                              environment (`pip freeze > requirements.txt`)
│
├── setup.py, setup.cfg    <- makes project pip installable (`pip install -e .`) so
│                             {{cookiecutter.repo_name}} can be imported
│
├── {{cookiecutter.__repo_name}}    <- Source code for use in this project.
│   │
│   ├── __init__.py        <- Makes {{cookiecutter.repo_name}} a Python module
│   │
│   ├── api.py             <- Main script for the integration with DEEP API
│   │
│   ├── misc.py            <- Misc functions that were helpful across projects
│   │
│   └── tests              <- Scripts to perform code testing
│
└── Jenkinsfile            <- Describes basic Jenkins CI/CD pipeline


DEEP-OC-<your_project>
######################
├─ Dockerfile             <- Describes main steps on integration of DEEPaaS API and
│                            <your_project> application in one Docker image
│
├─ Jenkinsfile            <- Describes basic Jenkins CI/CD pipeline
│
├─ LICENSE                <- License file
│
├─ README.md              <- README for developers and users.
│
└─ metadata.json          <- Defines information propagated to the DEEP Marketplace

Child-module branch

DEEP-OC-<your_project>
######################
├─ Dockerfile             <- Describes main steps on integration of DEEPaaS API and
│                            <your_project> application in one Docker image
│
├─ Jenkinsfile            <- Describes basic Jenkins CI/CD pipeline
│
├─ LICENSE                <- License file
│
├─ README.md              <- README for developers and users.
│
└─ metadata.json          <- Defines information propagated to the DEEP Marketplace

Advanced branch

<your_project>
##############
├── LICENSE
├── README.md              <- The top-level README for developers using this project.
├── data
│   └── raw                <- The original, immutable data dump.
│
├── docs                   <- A default Sphinx project; see sphinx-doc.org for details
│
├── models                 <- Trained and serialized models, model predictions, or model
│                             summaries
│
├── notebooks              <- Jupyter notebooks. Naming convention is a number
│                             (for ordering), the creator's initials (if many
│                             user development), and a short `_` delimited
│                             description.
│                             e.g.`1.0-jqp-initial_data_exploration.ipynb`.
│
├── references             <- Data dictionaries, manuals, and all other explanatory
│                             materials.
│
├── reports                <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures            <- Generated graphics and figures to be used in reporting
│
├── requirements.txt       <- The requirements file for reproducing the analysis
│                             environment, (`pip freeze > requirements.txt`)
│
├── test-requirements.txt  <- The requirements file for the test environment
│
├── setup.py               <- makes project pip installable (pip install -e .) so
│                             {{cookiecutter.repo_name}} can be imported
│
├── {{cookiecutter.__repo_name}}    <- Source code for use in this project.
│   │
│   ├── __init__.py        <- Makes {{cookiecutter.repo_name}} a Python module
│   │
│   ├── dataset            <- Scripts to download or generate data
│   │   └── make_dataset.py
│   │
│   ├── features           <- Scripts to turn raw data into features for modeling
│   │   └── build_features.py
│   │
│   ├── models             <- Scripts to train models and make predictions
│   │   └── deep_api.py    <- Main script for the integration with DEEP API
│   │
│   ├── tests              <- Scripts to perform code testing
│   │
│   └── visualization      <- Scripts to create exploratory and results oriented
│       └── visualize.py      visualizations
│
└── tox.ini                <- tox file with settings for running tox; see tox.testrun.org

DEEP-OC-<your_project>
######################
├─ Dockerfile             <- Describes main steps on integration DEEPaaS API and
│                            <your_project> application in one Docker image
│
├─ Jenkinsfile            <- Describes basic Jenkins CI/CD pipeline
│
├─ LICENSE                <- License file
│
├─ README.md              <- README for developers and users.
│
├─ docker-compose.yml     <- Allows running the application with various configurations
│                            via docker-compose
│
└─ metadata.json          <- Defines information propagated to the DEEP Marketplace