Setup via container

This section describes how to set up Kassiopeia using a Docker image as container.

Docker image 

Docker is a way to store programs together with operating systems in so-called images. Instances of these images can be run - then they are called containers. The philosophy behind Docker is that one tries to not store any relevant information in containers, so whenever a new version of an image is available, one can just start a new container from the new image and immediately continue working without any configuration. To achieve this, e.g. folders from outside can be mounted into the container.

Docker/Podman/Apptainer(/Singularity)

There are many ways to run containers. On desktop machines, docker and podman are the most widespread and the following commands from this readme can be used with both. Docker is more robust, Podman in root-less mode is safer if you don’t fully trust the images you run (see here).

On HPC environments, Apptainer(/Singularity) may be available instead.

Provided images 

For Kassiopeia, multiple Docker images are provided:

ghcr.io/katrin-experiment/kassiopeia/minimal:main for a minimal image, just containing enough software to run Kassiopeia. Best for non-interactive use-cases, e.g. in HPC environments.

ghcr.io/katrin-experiment/kassiopeia/full:main for a full image containing a JupyterLab server for full convenience.

You can download and name them the following way:

# Download the image
docker pull ghcr.io/katrin-experiment/kassiopeia/full:main

# Give the image a short name
docker tag ghcr.io/katrin-experiment/kassiopeia/full:main kassiopeia_full

It is also possible to build the images yourself. That is described in section Building the docker image.

Running a docker container 

Warning

Files created inside containers may be lost after stopping the container. Ensure that you store important data to a permanent location!

Run on HPC infrastructure (Apptainer/Singularity)

Some HPC-Clusters prefer the use of Apptainer or Singularity over Docker. Apptainer is a fork of Singularity, both can be used similarly. They support Docker images, which can be used following these steps:

Load Apptainer/Singularity module if applicable. Example from the NEMO cluster: module load tools/singularity/3.11

Create Container file by executing singularity build kassiopeia.sif docker://ghcr.io/katrin-experiment/kassiopeia/full:main

Run Container by executing singularity run kassiopeia.sif bash

For automatic jobs, commands may be packaged into a shell script and run like singularity run kassiopeia.sif script.sh.

Run locally (docker/podman)

To run Kassiopeia applications from the Docker image, you can now start a container by running e.g.:

docker run --rm -it \
  -v /path/on/host:/home/parrot \
  -p 44444:44444 \
  kassiopeia_full

Here, the --rm option automatically removes the container after running it, saving disk space.

This implies that files saved and changes done inside the container won’t be stored after exiting the container. Therefore using a persistent storage outside of the container like /path/on/host (see below) is important. Another possibility on how to make persistent changes to a Docker container can be found in the section Customizing Docker containers.

Note

Theoretically, one can also create named containers using docker create instead of docker run. This has the downside that it makes it harder to swap containers for a newer version as one can easily get into modifying the container significantly. Before doing that, one should consider the approach shown in the section Customizing Docker containers, which in practically all cases should be the preferred option.

-it lets the application run as interactive terminal session.

-v maps /home/parrot inside the container to /path/on/host outside. /path/on/host has to be switched to a path of your choice on your machine.

If /home/parrot shall be writable and the container is run rootless, file write permissions for the user and group ids of the parrot user inside the container have to be taken into account. If Podman is used and the current user has uid=1000 and gid=1000 (defined at the top of the Dockerfile), this is as simple as using --userns=keep-id in the create command. More information on that can be found in the section Using an existing directory.

The argument -p 44444:44444 maps the port 44444 from inside the container (right) to outside the container (left). This is only needed if you want to use jupyter lab.

Depending on the image you chose, the above will start a shell or jupyter lab using the previously built kassiopeia image. From this shell, you can run any Kassiopeia commands. Inside the container, Kassiopeia is installed to /kassiopeia/install. The script kasperenv.sh is executed at the beginning, so all Kassiopeia executables are immediately available at the command line.

File structure of the container 

/home/parrot        # The default user's home directory inside the container.
                    # Used in the examples here for custom code, shell scripts,
                    # output files and other work. Mounted from host, therefore also
                    # available if the container is removed.

/kassiopeia         # Kassiopeia related files
|
+-- install         # The Kassiopeia installation directory ($KASPERSYS).
|     |
|     +-- config
|     +-- bin
|     +-- lib
|     .
|     .
|
+-- build           # The Kassiopeia build directory.
|                   # Only available on target `build`.
|
+-- code            # The Kassiopeia code. If needed, this directory has to be
                    # mounted from the host using '-v'.

Listing and removing existing containers 

To see a list of all running and stopped containers, run:

docker ps -a

To stop an existing, running container, find its name with the above command and run:

docker stop containername

To remove an existing container, run:

docker rm containername

This also cleans up any data that is only stored inside the container.

Running applications directly 

As an alternative to starting a shell in an interactive container, you can also run any Kassiopeia executable directly from the Docker command:

docker run --rm kassiopeia_minimal \
 Kassiopeia /kassiopeia/install/config/Kassiopeia/Examples/DipoleTrapSimulation.xml -batch

Note

Some of the example simulations (and other configuration files) will show some kind of visualization of the simulation results, using ROOT or VTK for display. Because graphical applications are not supported in Docker by default, this will lead to a crash with the error bad X server connection or similar.

To avoid this, one can pass the -b or -batch flag to Kassiopeia and other Kassiopeia applications. This will prevent opening any graphical user interfaces. See below for information on how to use graphical applications.

Setting up persistent storage 

Docker containers do not have any persistent storage by default. In order to keep any changed or generated files inside your container, you should provide a persistent volume or mount a location from your local harddisk inside the conainter. Both approaches are outlined below.

Using a persistent volume 

A persistent storage volume can be added by modifying the docker run command. The storage volume can be either an actual volume that is managed by Docker, or a local path that is mapped into the container.

To use a persistent Docker volume named kassiopeia-output, use the flag:

-v kassiopeia-output:/kassiopeia/install/output

You can add multiple volumes for other paths, e.g. providing separate volumes kassiopeia-log and kassiopeia-cache for the log and cache paths.

Using an existing directory 

To use an existing directory on the host system instead, use:

-v /path/on/host:/path/in/container

Note

This command assumes that the local path /path/on/host already exists.

The option to use a local path is typically easier to use, because it’s easy to share files between the host system and the Docker container.

If you use a rootless container and the mount will be used to write data to it, you have to take care about permissions. In Podman, this can e.g. be done by calling create with the --userns flag. As used with --userns=keep-id, group and user ids of non-root users inside the container equal those outside the container. The gid and uid of the parrot user inside the container have to be adapted to your user outside the container, as e.g. given by the output of the id command. This can be done by building using the arguments --build-arg KASSIOPEIA_GID=<VALUE> and --build-arg KASSIOPEIA_UID=<VALUE> like this:

podman build \
--build-arg KASSIOPEIA_GID=$(id -g) \
--build-arg KASSIOPEIA_UID=$(id -u) \
--target full -t kassiopeia_full .

Adapting the example from section Running a docker container, an exemplary rootless podman container could then be started like this:

podman run -it --userns=keep-id \
 -v /path/on/host:/home/parrot \
 -p 44444:44444 \
 kassiopeia_full

If e.g. only members of a specific group have write access to the files, make sure that the user inside the container is part of an identical group.

Running graphical applications 

Using kassiopeia_full 

With the VNC (Desktop) link in the launcher, a desktop environment can be opened. When afterwards applications with GUI are launched - e.g. through a terminal available from the launcher - the GUI is shown in the desktop environment.

Note that launching a GUI requires first opening the desktop environment. In case the connection is breaks, you can reload the VNC connection by clicking the Reload button on the top right of the VNC (Desktop) tab.

Using kassiopeia_minimal 

The Docker container does not allow to run any graphical applications directly. This is because inside the container there is no X11 window system available, so any window management must be passed down to the host system. It is therefore possible to run graphical applications if the host system provides an X11 environment, which is typically the case on Linux and MacOS systems.

To do this, one needs to pass additional options:

docker run -e DISPLAY -v /tmp/.X11-unix:/tmp/.X11-unix --rm kassiopeia_minimal \
  Kassiopeia /kassiopeia/install/config/Kassiopeia/Examples/DipoleTrapSimulation.xml

In addition, it is often necessary to set up X11 so that network connections from the local machine are allowed. This is needed so that applications running inside Docker can access the host’s X11 server. The following command should be executed once before docker run:

xhost local:docker

Note

For security reasons, do not run this command on shared computer systems!

Root access 

Note that in nearly any case, there should be no need for actual root access to an active container. Use the information from section Customizing Docker containers instead. If you are developing with Docker, there may be reasons to install software lateron anyways, in which case you can get a root shell by running the container with the additional option --name myKassiopeia and then executing:

podman exec -u 0 -it myKassiopeia bash

Customizing Docker containers 

If e.g. the software pre-installed via the pre-defined images is not enough, you can prepare them further by building upon already built container images. For this, create a new file called Dockerfile in a directory of your choice. An example of how it could look like, given an already built container kassiopeia_minimal:

Dockerfile
FROM kassiopeia_minimal

# Switch to root to gain privileges
ä Note: No password needed!
USER root

# Run a few lines in the shell to update everything and install nano.
# Cleaning up /packages at the end to reduce the size of the resulting
# container.
RUN dnf update -y \
 && dnf install -y nano \
 && rm /packages

# Switch back to parrot user
# USER parrot

Now you can build this and give it a custom tag: docker build -t custom_kassiopeia_minimal. From now on, you can use custom_kassiopeia_minimal instead of kassiopeia_minimal to have access to nano.

Building the docker image 

To create a Docker image from this Dockerfile, download the Kassiopeia sources (e.g. using git clone as described in Downloading the code). Then change into the directory where the Dockerfile is located, and run one of these commands:

Minimal (bare Kassiopeia installation)

docker build --target minimal -t kassiopeia_minimal .

for an image with only the bare Kassiopeia installation. If no other command is specified, it starts into a bash. This image can directly be used in applications where container size matters, e.g. if the container image has to be spread to a high amount of computation clients. Because of its smaller size, this target is also useful as a base image of e.g. an application-taylored custom Dockerfile.

Full (for personal use)

docker build --target full -t kassiopeia_full .

for an image containing jupyter lab for a simple web interface, multiple terminals and Python notebooks. If no other command is specified, it starts into jupyter lab at container port 44444. If started with the command bash, it can also be used like the minimal container.

This will pull a Fedora base image, set up the Kassiopeia dependencies (including ROOT and VTK), and create a Kassiopeia installation that is built from the local sources. If you use git, this will use the currently checked out branch. If you need a more recent Kassiopeia version, update the sources before you build the container (e.g. by fecthing remote updates via git or by switching to a different branch).

When building these container images, the .git folder is not copied, meaning the resulting Kassiopeia installation e.g. can’t show the build commit and branch when sourcing kasperenv.sh. To build the containers with knowledge of the used git version, one can use:

docker build --target minimal -t kassiopeia_minimal --build-arg KASSIOPEIA_GIT_BRANCH=<branch name here> --build-arg KASSIOPEIA_GIT_COMMIT=<first 9 characters of commit id here>

or to automate getting the branch and commit names:

docker build --target minimal -t kassiopeia_minimal --build-arg KASSIOPEIA_GIT_BRANCH=$(git rev-parse --abbrev-ref HEAD) --build-arg KASSIOPEIA_GIT_COMMIT=$(git rev-parse --short HEAD)

The Docker build will use half of the available CPU cores to speed up the process. A typical build will take about 30 mins and the resulting Docker image is about 2.5 GB (minimal) / 3 GB (full) in size.

Important

On Windows, make sure to use the Linux line endings on all files in the Kassiopeia project.

Re-Building Kassiopeia with Docker 

As a user, to get a new release, re-build your Docker image as described in Building the docker image. This ensures a clean build with the correct root and boost versions and applies Docker configuration changes.

But if you work on Kassiopeia code, re-building everything can be tedious and you might want to recompile only the parts of Kassiopeia you changed, and for this re-use the current build folder. To do this with Docker, you first need an image that still contains the build folder, which is done by building the build image:

docker build --target build -t kassiopeia_build .

Now you can build to a custom build and install path on your host:

docker run --rm \
 -v /path/on/host:/home/parrot \
 -e kassiopeia_dir_code='...' \
 -e kassiopeia_dir_build='...' \
 -e kassiopeia_dir_install='...' \
 kassiopeia_build \
 /kassiopeia/code/setup.sh "Release" "/kassiopeia/install" "/kassiopeia/build"

The three dots after kassiopeia_dir_build and kassiopeia_dir_install have to be replaced by paths relative to /path/on/host where you want your build and install directories to be.

If the build and install directories are empty, they are initialized to the content your kassiopeia_build image has for these folders.

Additionally, the install directory includes a python directory containing local Python packages, set via the environment variable PYTHONUSERBASE.

To run a kassiopeia_minimal or kassiopeia_full container with the new Kassiopeia installation, just use the correct mapping for /home/parrot and provide kassiopeia_dir_install as in

-v /path/on/host:/home/parrot \
-e kassiopeia_dir_install='...' \

. To have more than one mapping - e.g. a mapping /path_one/on/host to data and /path/on/host/to/install that contains your new installation directory, you could map both to your container e.g. using:

-v /path_one/on/host:/home/parrot/dir_one \
-v /path/on/host/to/install:/home/parrot/custom_install \
-e kassiopeia_dir_install='custom_install' \

It is just important that if you provide $kassiopeia_dir_install, at the position of /home/parrot/$kassiopeia_dir_install, your custom installation can be found.

If you use --userns=keep-id on your main container, you also need to use it on this container.

You can also replace "Release" with a build type of your choice, like "RelWithDebInfo" for debugging.