MultiImageSamplingRate¶
Compute sampling rate for an input set of images.
Description¶
The application computes sampling rates for a set of input images. Before calling this application, each pair of image and training vectors has to be analysed with the application PolygonClassStatistics. The statistics file is then used to compute the sampling rates for each class in each image. Several types of sampling are implemented. Each one is a combination of a mono-image strategy and a multi-image mode. The mono-image strategies are:
smallest (default): select the same number of sample in each class so that the smallest one is fully sampled.
constant: select the same number of samples N in each class (with N below or equal to the size of the smallest class).
byclass: set the required number for each class manually, with an input CSV file (first column is class name, second one is the required samples number).
The multi-image modes (mim) are proportional, equal and custom. The custom mode lets the user choose the distribution of samples among the images. The different behaviours are described below. Ti(c) and Ni(c) refers respectively to the total number and needed number of samples in image i for class c. Let’s call L the total number of images.
strategy = all
Same behaviour for all modes: take all samples
strategy = constant: let’s call M the global number of samples required per class. For each image i and each class c:
if mim = proportional, then Ni( c ) = M * Ti( c ) / sum_k( Tk(c) )
if mim = equal , then Ni( c ) = M / L
if mim = custom , then Ni( c ) = Mi where Mi is the custom requested number of samples for image i
strategy = byClass : let’s call M(c) the global number of samples for class c). For each image i and each class c:
if mim = proportional, then Ni( c ) = M(c) * Ti( c ) / sum_k( Tk(c) )
if mim = equal , then Ni( c ) = M(c) / L
if mim = custom , then Ni( c ) = Mi(c) where Mi(c) is the custom requested number of samples for image i and class c
strategy = percent : For each image i and each class c:
if mim = proportional, then Ni( c ) = p * Ti( c ) where p is the global percentage of samples
if mim = equal , then Ni( c ) = p * sum_k(Tk(c)]/L where p is the global percentage of samples
if mim = custom , then Ni( c ) = p(i) * Ti(c) where p(i) is the percentage of samples for image i. c
strategy = total : For each image i and each class c:
if mim = proportional, then Ni( c ) = total * (sum_k(Ti(k))/sum_kl(Tl(k))) * (Ti(c)/sum_k(Ti(k))) where total is the total number of samples specified.
if mim = equal , then Ni( c ) = (total / L) * (Ti(c)/sum_k(Ti(k))) where total is the total number of samples specified.
if mim = custom , then Ni( c ) = total(i) * (Ti(c)/sum_k(Ti(k))) where total(i) is the total number of samples specified for image i.
strategy = smallest class
if mim = proportional, then the smallest class size (computed globally) is used for the strategy constant+proportional.
if mim = equal , then the smallest class size (computed globally) is used for the strategy constant+equal.
if mim = custom , then the smallest class is computed and used for each image separately.
Parameters¶
Input statistics -il filename1 filename2...
Mandatory
List of statistics files for each input image.
Output sampling rates -out filename [dtype]
Mandatory
Output filename storing sampling rates (CSV format with class name, required samples, total samples, and rate). The given filename will be used with a suffix to indicate the corresponding input index (for instance: rates.csv will give rates_1.csv, rates_2.csv, …).
Sampling strategy -strategy [byclass|constant|smallest|percent|total|all]
Default value: smallest
Set samples count for each class
Set samples count for each classSet the same samples counts for all classes
Set the same samples counts for all classesSet same number of samples for all classes, with the smallest class fully sampled
Set same number of samples for all classes, with the smallest class fully sampledUse a percentage of the samples available for each class
Use a percentage of the samples available for each classSet the total number of samples to generate, and use class proportions.
Set the total number of samples to generate, and use class proportions.Use all samples
Use all samples
Set samples count for each class options¶
Number of samples by class -strategy.byclass.in filename1 filename2...
Mandatory
Number of samples by class (CSV format with class name in 1st column and required samples in the 2nd).In the case of the custom multi-image mode, several inputs may be given for each image.
Set the same samples counts for all classes options¶
Number of samples for all classes -strategy.constant.nb string
Mandatory
Number of samples for all classes.In the case of the custom multi-image mode, several values can be given for each image.
Use a percentage of the samples available for each class options¶
The percentage(s) to use -strategy.percent.p string
Mandatory
The percentage(s) to use In the case of the custom multi-image mode, several values can be given for each image.
Set the total number of samples to generate, and use class proportions. options¶
The number of samples to generate -strategy.total.v string
Mandatory
The number of samples to generateIn the case of the custom multi-image mode, several values can be given for each image.
Multi-Image Mode -mim [proportional|equal|custom]
Default value: proportional
Proportional
Split the required number of samples proportionallyequal
Equal split of the required number of samplesCustom
Split the number of samples based on the user’s choice.
Examples¶
From the command-line:
otbcli_MultiImageSamplingRate -il stats_1.xml stats_2.xml -out rates.csv -strategy smallest -mim proportional
From Python:
import otbApplication
app = otbApplication.Registry.CreateApplication("MultiImageSamplingRate")
app.SetParameterString("out", "rates.csv")
app.SetParameterString("strategy","smallest")
app.SetParameterString("mim","proportional")
app.ExecuteAndWriteOutput()