Snappy: Where to start?

Good afternoon,

I am a new ESA Trainee and I would like to implement some of the processes of S2-tbx/sen2cor into a online platform. As a programming language I am going to use python. Are there beginner guides, handbooks or well documented scripts available?

Until now I found:

  1. How to use the SNAP API from Python:
    https://senbox.atlassian.net/wiki/display/SNAP/How+to+use+the+SNAP+API+from+Python
    -> I could install snappy properly and run “snappy first contact”

  2. Scripts
    https://github.com/senbox-org/snap-examples/tree/master/snap-engine-python-script
    https://github.com/senbox-org/snap-examples/tree/master/snap-engine-python-operator
    -> I looked at them but they are quite complicated for a beginner like me, especially without basic comments. :wink:

Thank you for your help.
Andreas

3 Likes

You made already a good start. :+1:
Some more examples can be found in the snappy dir.
There is an examples and a tests directory. In those directories you will find more or less simple scripts.
Beside what you have already found there is no beginners guide or handbook. Sorry for disappointing you.
The developer guide, you found it already in the wiki, is still under construction.
However, feel free to ask questions here in the forum as much as you like.

1 Like

Thank you @marpet, I found the examples you mentioned and they are really helpful. I costumized “snappy_subset.py” to run with “S-2” and added some new input arguments. It works well but there are two INFO message and especially the second one irritates me:

INFO: org.esa.s2tbx.dataio.s2.ortho.S2OrthoProduct10MReaderPlugIn: Building product reader 10M - EPSG:32632
INFO: org.esa.s2tbx.dataio.s2.l1c.L1cMetadata: Skipping tile S2A_OPER_MTD_L1C_TL_SGS__20160113T143043_A002922_T31TGJ.xml because it has crs EPSG:32631 instead of requested EPSG:32632

I saw it already when running the “snappy 1st contact” script with different S-2 datasets. When I open the “end” products in SNAP it does not look like any tile was not processed. Has this any impacts? If so, do I need to reproject everything before running any S-2 dataset to not get this message anymore?

I have an other question regarding python/snappy.

I wrote a small code to reproject S-2 data.

import snappy
from snappy import ProductIO

ReprojectOp = snappy.jpy.get_type(‘org.esa.snap.core.gpf.common.reproject.ReprojectionOp’)

in_file = ‘C:/S2/…/S2A_OPER_…20160113T103004.dim’
out_file = 'C:/S2/…/S2A_OPER
…_20160113T103004_reproj.dim’

product = ProductIO.readProduct(in_file)

op = ReprojectOp()
op.setSourceProduct(product)
op.setParameter(‘crs’,‘EPSG:21781’)
op.setParameter(‘resampling’,‘Nearest’)

sub_product = op.getTargetProduct()

ProductIO.writeProduct(sub_product, out_file, “BEAM-DIMAP”)

As much as I saw it works fine. But I am curious about “setParameter(…)”. Is there a list or anything alike where I could see what kind of parameters are available and which ones are mandatory to run a operator? And also which values are valid (e.g. i used ‘nearest’ instead of ‘Nearest’ and got an error) for a certain parameter?

Thank you for your help and best regards from Frascati.
Andreas

The problem is that S2 data can cross several UTM zones. This is not yet well handled by SNAP and the S2 reader.
When you read a product you can also specify the product format.

ProductIO.readProduct(file, formatName)

This way you can force the S2 reader to open a certain UTM zone. For example

ProductIO.readProduct(file, "SENTINEL-2-MSI-10M-UTM31N")
ProductIO.readProduct(file, "SENTINEL-2-MSI-10M-UTM32N")

If you have opened them you can e.g. create a mosaic of both.

1 Like

To get documentation about the operator the easiest way is to use the gpt command line tool
Call

gpt -h Reproject

And you will get the list of parameters and a description for each.
When you simply call

gpt -h

You will get a list of all available operators.

The way you call the operator is valid but you should call
setParameterDefaultValues() after creating the operator. This will ensure that the default value is assigned to the parameters if there is any.
However, there is a more preferred way of calling an operator

parameters = HashMap()
parameters.put('crs', 'EPSG:21781')
parameters.put('resampling', 'Nearest')
reprojProduct = GPF.createProduct('Reproject', parameters, product)
2 Likes

Thank you for your hint. gpt -h Reproject works well.

I also tried to use the “more preferred way of calling an operator” but I get an error:

I tested ‘reproject’, ‘Reproject’ and ‘ReprojectOp’ but nothing worked. What could be the cause?

Is there a way to get all possible formatName which are used in the current S-2 dataset before using ProductIO.readProduct(…)?

Sorry I forgot to say that you have to call the following code once.

GPF.getDefaultInstance().getOperatorSpiRegistry().loadOperatorSpis()

Afterwards the operators shall be found.

Sorry, I don’t know this. This is somewhere encoded in the file names and in the xml.
Maybe @Nicolas or @jmalik can jump in here and help you further.

For your Information:

I tried to run it as you described it:

ProductIO.readProduct(file, “SENTINEL-2-MSI-10M-UTM31N”)

But with that I am getting a RuntimeError:

RuntimeError: no matching Java method overloads found

I could solve it while running the code as explained in: [SNAP-343] - JIRA

Good morning,

I have some questions regarding snappy and Mosaicing:

  1. I read out all files with their CRS and save them in a list:


products =

products.append(self.readProduct_func(srcPath,formatName))

  1. Afterward I would like to do a mosaicing:

import snappy
from snappy import GPF
from snappy import jpy

HashMap = jpy.get_type(‘java.util.HashMap’)
GPF.getDefaultInstance().getOperatorSpiRegistry().loadOperatorSpis()

in_param = {‘crs’: ‘EPSG:4326’}
parameters = HashMap()

for para_name in in_param:
…parameters.put(para_name,in_param[para_name])

reproj_data = GPF.createProduct(‘Mosaic’, parameters, products)

But if I run it I will get a RuntimeError:

RuntimeError: org.esa.snap.core.gpf.OperatorException

What could be the cause? How to solve it?

Also I saw in a previous beam-forum (Link) that there was also a parameter “variables” to select the bands which should be processed. In SNAP it is mandatory to select them but I can not find it in “gpt -h Mosaic”. Did I oversee something?

Hi,

the exception message must be longer as just the one line you posted . It would be helpful to know this error message.
However, I think it complains about the missing variables definition what you already have assumed.
The variables are not directly listed in the parameters list when you call gpt -h Mosaic. The reason is that it is are more complex parameter and it cannot be specified on the command line. It can only be specified within a xml file or via the the API. The variables and the conditions are shown in the example xml on the command line when you type gpt -h Mosaic.
You can use it in your code as follows:

variables = MosaicOp.Variable('name', 'expression'), MosaicOp.Variable('name2', 'expression2')
parameters.put('variables', variables)

Don’t take the syntax to serious. I’m not a python expert.

Hi,

that is the thing: This is the whole error message! I may have fixed the code, but I get the Java-Out-of-Memory-Error when running two S-2 datasets. So I can not be sure. Will try to test it on the VM, as soon as I get to run it.

Good afternoon,
Again, I have a question regarding SNAPPY. I would like to compute a mosaic and would like to set the “northBound”, “southBound”,… automatically to the overall boundingbox of the input files. Is it possible to read them out of an input raster (e.g. GeoTiff, S2)? If so, how to do it?

I could not find something within the ProductIO.readProduct(…) . Did I miss something?

Secondly is it also possible to get the Pixelsize/GSD of a file, to set a appropriate pixelSizeX/pixelSizeY?

Cheers,
Andreas

You can do the following:

sceneUL = PixelPos(0 + 0.5f, 0 + 0.5f);
sceneUR = PixelPos(product.getSceneRasterWidth() - 1 + 0.5f, 0 + 0.5f);
sceneLL = PixelPos(0 + 0.5f, product.getSceneRasterHeight() - 1 + 0.5f);
sceneLR = PixelPos(product.getSceneRasterWidth() - 1 + 0.5f, product.getSceneRasterHeight() - 1 + 0.5f);

this way you have the corner pixels. Afterwards you can do:

gp_ul = geoCoding.getGeoPos(sceneUL, gp);
gp_ur = geoCoding.getGeoPos(sceneUR, gp);
gp_ll = geoCoding.getGeoPos(sceneLL, gp);
gp_lr = geoCoding.getGeoPos(sceneLR, gp);

From these geo-positions you can retrieve lat and lon.

gp_lr.getLat()
gp_lr.getLon()

Compute the min and max for lat and lon and then you have the results.
When you have the lat and lon values you can compute the lat/lon width/height and divide it by the pixel width/height.
This way you get the pixelSizeX/Y

Good afternoon,

Thank you once again. I could successfully implement it.

But now I have two questions regarding this PixelSize and Mosaicing.

As you mentioned above I can read out the S2 data with their different UTM zones and afterward create a mosaic of both of them. Now I would like to keep the same resolution (e.g. 10,20 or 60m). I have a test file in the area of the Siachen Glacier (~36°N, India-Pakistan). If I did not make a huge mistake, then 10m in this latitude should be ~0.0001° (PixelSizeY, East-West) and ~0.00009° (PixelSizeX, North-South). But if I want to run the Mosaicing with such values I will receive an error RuntimeError: java.lang.RuntimeException: Cannot construct DataBuffer. . If I run the same script with 0.001° (~90m, which is also the minimum Value in SNAP) it works fine. Is there a reason why it is not possible to have a better resolution?

Cheers,
Andreas

Hi

How much memory do you have? Sounds like that you do not have enough.

Hi,

I tested it on my Desktop (8 GB RAM) and on my Laptop (16 GB RAM) with 4 DEM (Geotiff, each ~25 MB) and I got the same result:

Using 0.0010 as input: it works fine
Using 0.0009 as input: RuntimeError: java.lang.RuntimeException: Cannot construct DataBuffer.

You can try to set the JVM memory.
Have a look into the snappy directory. There is a jpyconfig.py file. Change ‘jvm_maxmem = None’ to jvm_maxmem = 6G for example.