Please enable JavaScript to view this site.

GeoDict User Guide 2025

Neural Network Training

To train a neural network, GeoDict-AI uses the UNet method Ronneberger and Fischer, 2015, which implements an image-to-image transformation. When illustrated as below, the UNet has a U-shape (hence the name) with a convolutional constricting and a deconvolutional expanding branch on the left and right respectively. The constricting branch analyzes and simplifies the input structure to features. The expanding branch uses these features to decide, for example, which of the input voxels to label as binder.

The UNet always works on a fixed input window size and produces results for a smaller output window which is centered within the input window. In order to analyze a given 3D structure, which is usually larger than the input window, the window is automatically shifted over the whole structure and the network is applied at each window location in order to obtain results for the whole domain.

In the diagram, the given size for this input window is 52 voxels in X-, Y- and Z-direction. The window contents are encoded in the left branch of the UNet to an abstract feature map. This feature map is then decoded within the right branch of the UNet to obtain the output image.

The diagram below shows a UNet with depth=2. The depth is defined by the number of max pool operations / deconvolutions leading from layer to layer. Thus, a depth of 2 results in three layers.

hmtoggle_arrow0Operations in one layer of the UNet

hmtoggle_arrow0Leaving a layer with Max Pool

hmtoggle_arrow0End of the last layer

hmtoggle_arrow0Moving up with deconvolution

hmtoggle_arrow0Concatenate features with high resolution cutouts

hmtoggle_arrow0Weights and Optimizers

©2025 created by Math2Market GmbH / Imprint / Privacy Policy