To train a neural network, GeoDict-AI uses the UNet architecture (Ronneberger et al., 2015), which implements an image-to-image transformation. As illustrated below, the UNet has a U-shape (hence the name), with a convolutional constricting branch on the left and a deconvolutional expanding branch on the right. The constricting branch analyzes and simplifies the input structure into features. The expanding branch uses these features to decide, for example, which of the input voxels to label as binder.
The UNet always works on a fixed input window size and produces results for a smaller output window centered within the input window. To analyze a given 3D structure, which is usually larger than the input window, the window is automatically shifted across the whole structure and the network is applied at each window location, yielding results for the whole domain.
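The tiling of the domain by output windows can be made concrete with a minimal sketch. The function name and the exact boundary handling are illustrative assumptions; the 52-voxel input / 12-voxel output sizes anticipate the depth-2 example worked out later in this section:

```python
def output_window_origins(axis_size, input_win=52, output_win=12):
    """Origins of the output windows along one axis, so that the output
    windows tile the whole domain. Each input window extends `margin`
    voxels beyond its output window on every side, providing the context
    the convolutions need (boundary handling is an assumption here)."""
    margin = (input_win - output_win) // 2   # 20 voxels of context per side
    return margin, list(range(0, axis_size, output_win))

margin, origins = output_window_origins(100)
print(margin)        # 20
print(origins[:4])   # [0, 12, 24, 36]
```

The same origins apply independently along the Y- and Z-axes, so the network is evaluated once per 3D grid position.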
In the diagram, the given size for this input window is 52 voxels in X-, Y- and Z-direction. The window contents are encoded in the left branch of the UNet to an abstract feature map. This feature map is then decoded within the right branch of the UNet to obtain the output image.
The diagram below shows a UNet with depth=2. The depth is defined by the number of max-pool operations (and corresponding deconvolutions) leading from one layer to the next. Thus, a depth of 2 results in three layers.
In each layer of the UNet, two convolutions are applied to the current cutout, encoding it into discriminative features. The first convolution in the first layer encodes the input image into a given number of features, or feature maps, describing the patterns in the cutout. In the diagram above there are 16 features in the first layer.
A convolution analyzes a cutout using kernels. A kernel is a feature detector, i.e., a 3x3x3 voxel box with a defined pattern, for example to recognize edges. In the figure below, three exemplary kernels are shown. The features are learned by the network, so each trained network produces its own set of features.
Kernel Examples
For each kernel, the convolution encodes the considered structure into one feature map. For this, the kernel scans the cutout voxel by voxel, and for each 3x3x3 voxel group it measures the similarity between the kernel and the underlying image cutout using the dot product between the kernel values and the image values. The resulting so-called feature map indicates where the feature is found in the input image. For the given network topology, we have 16 features in the first layer, so this process is repeated for each feature/kernel, resulting in 16 different feature maps.
As the context information from surrounding voxels has to be taken into account for the convolution, the kernel has to stay within the bounds of the input window. Thus, after each convolution the resulting cutout decreases, losing the border voxels. In the UNet diagram above, this results in a cutout size of 48x48x48 voxels after the two convolutions in the first layer.
In the figure below, one kernel is applied to a 4x4x4 cutout, resulting in a 2x2x2 feature map where large values correspond to a strong detection of the feature. Regions where the feature is not present will result in low or even negative values in the feature map.
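The dot-product scan described above can be sketched as a "valid" 3D convolution in NumPy. This is a minimal illustration, not GeoDict-AI's actual implementation; like most CNN frameworks, it computes a cross-correlation (the kernel is not flipped):

```python
import numpy as np

def conv3d_valid(volume, kernel):
    """Slide the kernel over the volume and take the dot product at each
    position. The output shrinks by (kernel size - 1) per dimension."""
    k = kernel.shape[0]
    out_size = volume.shape[0] - k + 1
    out = np.empty((out_size,) * 3)
    for x in range(out_size):
        for y in range(out_size):
            for z in range(out_size):
                out[x, y, z] = np.sum(volume[x:x+k, y:y+k, z:z+k] * kernel)
    return out

cutout = np.random.rand(4, 4, 4)   # 4x4x4 cutout as in the figure
kernel = np.random.rand(3, 3, 3)   # one 3x3x3 feature detector
fmap = conv3d_valid(cutout, kernel)
print(fmap.shape)                  # (2, 2, 2)
```

Applying this twice to a 52x52x52 window yields the 48x48x48 cutout mentioned above, since each 3x3x3 convolution removes one border voxel on each side.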
The next layer of the UNet is then reached with a Max Pool operation. Max pooling halves each spatial dimension by replacing each 2x2x2 block of voxel values by a single voxel containing the maximum value within the block. In our example, this means that the second layer starts with a cutout of 24x24x24 voxels.
Max-Pool Operation
This provides the neural network with a limited form of translational invariance, meaning that it becomes tolerant to slight variations in the location of detected features at the expense of spatial accuracy. We will see later how the UNet is able to compensate for that.
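The 2x2x2 max-pool halving can be sketched with a NumPy reshape trick (illustrative only; it assumes even spatial dimensions):

```python
import numpy as np

def max_pool_2x2x2(volume):
    """Replace each 2x2x2 block by its maximum, halving every dimension."""
    s = volume.shape[0] // 2
    # split each axis of size 2s into (s, 2): axes 0/2/4 index the blocks,
    # axes 1/3/5 index the positions within each block
    blocks = volume.reshape(s, 2, s, 2, s, 2)
    return blocks.max(axis=(1, 3, 5))

vol = np.arange(4 ** 3, dtype=float).reshape(4, 4, 4)
pooled = max_pool_2x2x2(vol)
print(pooled.shape)     # (2, 2, 2)
print(pooled[0, 0, 0])  # 21.0 : the maximum of vol[0:2, 0:2, 0:2]
```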
In each new layer of the UNet the first convolution doubles the number of features. Thus, the input image is encoded into feature representations at multiple different levels. This is followed by a second convolution with the same number of output features.
At the end of the last layer, we start moving up the right (expanding) branch of the UNet, which progressively increases the resolution of the image while at the same time reducing the number of features, essentially mirroring the transformation seen in the left (constricting) branch.
As we move up the right branch, the max-pool operation is replaced by a so-called deconvolution (transposed convolution), which doubles the spatial resolution and halves the number of feature maps. In detail, a deconvolution first applies an up-sampling to the features: each voxel is expanded into 8 voxels with the same value, doubling the dimensions in all three directions.
Up-sampling
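The voxel-duplication up-sampling can be sketched with `np.repeat` (illustrative only):

```python
import numpy as np

def upsample_2x(volume):
    """Nearest-neighbour up-sampling: each voxel becomes a 2x2x2 block
    of identical values, doubling every spatial dimension."""
    return volume.repeat(2, axis=0).repeat(2, axis=1).repeat(2, axis=2)

fmap = np.array([[[1.0, 2.0], [3.0, 4.0]],
                 [[5.0, 6.0], [7.0, 8.0]]])   # 2x2x2 feature map
up = upsample_2x(fmap)
print(up.shape)                   # (4, 4, 4)
print(up[0, 0, 0], up[1, 1, 1])   # 1.0 1.0 : both come from the same source voxel
```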
Then, a regular convolution concretizes the features by "painting" the voxels of a feature. Here, smaller kernels are used, i.e., 2x2x2 voxel boxes with defined patterns. The feature values are then multiplied with the kernel values as shown in the figure below; the example uses a kernel containing only the values -1 and 1. The kernel is applied to each 2x2x2 voxel group in the input feature map, and the dot product gives the value at the corresponding position in the new feature map.
Convolution of an up-sampled feature map
In this case, however, to keep the volume dimensions, zeros are appended to the input feature map on three sides before it is multiplied with the kernel (padding).
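Combining the up-sampled feature map with the padded 2x2x2 convolution gives a sketch like the following. Whether the zeros are appended at the low- or high-index end of each dimension is an implementation detail we assume here:

```python
import numpy as np

def deconv_conv_step(up, kernel):
    """2x2x2 convolution after up-sampling. One plane of zeros is appended
    per dimension (padding) so the output keeps the up-sampled size.
    The padding side is an assumption for illustration."""
    padded = np.pad(up, ((0, 1), (0, 1), (0, 1)))   # zeros on three sides
    n = up.shape[0]
    out = np.empty_like(up)
    for x in range(n):
        for y in range(n):
            for z in range(n):
                out[x, y, z] = np.sum(padded[x:x+2, y:y+2, z:z+2] * kernel)
    return out

up = np.ones((4, 4, 4))                            # up-sampled feature map
kernel = np.array([[[-1.0, 1.0], [1.0, -1.0]],
                   [[1.0, -1.0], [-1.0, 1.0]]])    # only -1 and 1, as in the figure
out = deconv_conv_step(up, kernel)
print(out.shape)      # (4, 4, 4) -- dimensions are preserved
print(out[0, 0, 0])   # 0.0 : the +1s and -1s cancel on the all-ones input
```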
Afterwards, the resulting feature maps are concatenated with the high-resolution cutouts from the last features of the same layer in the left branch, doubling the number of feature maps. This way, small, high-resolution features are combined with abstract, low-resolution features carrying context information. This combination helps the network build a more precise output, i.e., to "paint" the voxels correctly.
Since the cutout in the right branch is still convolved twice in each layer, the final output window is smaller than the input window. In our example with a UNet of depth 2 and a window size of 52x52x52 voxels, the resulting cutout has a size of 12x12x12 voxels. This last cutout is, in fact, an image again instead of a feature map.
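The window-size arithmetic (52 → 12 for depth 2) can be traced with a small helper, a sketch under the stated assumptions: two valid 3x3x3 convolutions per layer, and a deconvolution whose padded 2x2x2 convolution preserves the up-sampled size:

```python
def unet_output_size(input_size, depth=2, kernel=3):
    """Trace the spatial window size through the UNet described above."""
    loss = 2 * (kernel - 1)      # two valid convolutions per layer lose 2*(k-1)
    size = input_size
    for _ in range(depth):       # constricting branch
        size -= loss             # two 3x3x3 convolutions: e.g. 52 -> 48
        size //= 2               # max pool:               e.g. 48 -> 24
    size -= loss                 # two convolutions in the bottom layer
    for _ in range(depth):       # expanding branch
        size *= 2                # up-sampling (padded 2x2x2 conv keeps size)
        size -= loss             # two valid convolutions
    return size

print(unet_output_size(52))   # 12
```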
At this point we have described the topology of the neural network, but the specific values within the convolutional kernels are not yet specified. These so-called weights are determined during the training process using an optimizer such as Adam (Kingma and Ba, 2015). In our supervised learning setup, we specify pairs of input and desired output images. The optimizer applies the neural network to a given input image, computes the difference between the output and the desired output (the so-called loss), and adjusts the weights of the neural network to reduce that error. This process is repeated iteratively until the loss converges to an acceptable value.
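The weight-update idea can be illustrated on a toy one-parameter model with plain gradient descent. This is a deliberately minimal sketch: real UNet training optimizes millions of kernel weights with Adam, but the loop structure is the same:

```python
import numpy as np

# Toy supervised setup: one scalar "weight" w scales the input, the loss
# is the mean squared error against the desired output, and gradient
# descent adjusts w to reduce it. The "true" weight is 3.
rng = np.random.default_rng(0)
x = rng.random(100)          # input "image" (flattened)
target = 3.0 * x             # desired output
w = 0.0                      # initial weight
lr = 0.5                     # learning rate
for _ in range(100):
    pred = w * x                              # apply the "network"
    loss = np.mean((pred - target) ** 2)      # compute the loss
    grad = np.mean(2 * (pred - target) * x)   # dLoss/dw
    w -= lr * grad                            # adjust the weight
print(round(w, 3))   # converges to 3.0
```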